You are here: Home > Briefing Amazon Knowledge > AWS Certified Big Data - Specialty

PrepAway - Latest Free Exam Questions & Answers

Category: AWS Certified Big Data – Specialty

Exam AWS Certified Big Data – Specialty

Which approach meets the requirement for a centralized metadata layer?

adminFebruary 5, 2020 Leave a comment

A company has several teams of analysts. Each team of analysts has their own cluster. The teams need to run SQL queries using Hive, Spark-SQL, and Presto with Amazon EMR. The company needs to enable a centralized metadata layer to expose the Amazon S3 objects as tables to the analysts. Which approach meets the requirement […]

Which DynamoDB table scheme is most efficient to support these queries?

adminFebruary 5, 2020 Leave a comment

A customer has an Amazon S3 bucket. Objects are uploaded simultaneously by a cluster of servers from multiple streams of data. The customer maintains a catalog of objects uploaded in Amazon S3 using an Amazon DynamoDB table. This catalog has the following fileds: StreamName, TimeStamp, and ServerName, from which ObjectName can be obtained. The customer […]

What is the most reliable and fault-tolerant technique to get each website to send data to Amazon Kinesis with

adminFebruary 5, 2020 Leave a comment

A web-hosting company is building a web analytics tool to capture clickstream data from all of the websites hosted within its platform and to provide near-real-time business intelligence. This entire system is built on AWS services. The web-hosting company is interested in using Amazon Kinesis to collect this data and perform sliding window analytics. What […]

Which recommendation should an administrator provide?

adminFebruary 5, 2020 Leave a comment

A large grocery distributor receives daily depletion reports from the field in the form of gzip archives od CSV files uploaded to Amazon S3. The files range from 500MB to 5GB. These files are processed daily by an EMR job. Recently it has been observed that the file sizes vary, and the EMR jobs take […]

In which two circumstances would choosing EVEN distribution be most appropriate? (Choose two.)

adminFebruary 5, 2020 Leave a comment

An administrator needs to design a strategy for the schema in a Redshift cluster. The administrator needs to determine the optimal distribution style for the tables in the Redshift schema. In which two circumstances would choosing EVEN distribution be most appropriate? (Choose two.) A. When the tables are highly denormalized and do NOT participate in […]

Which option allows Company A to do clustering in the AWS Cloud and meet the legal requirement of maintaining

adminFebruary 5, 2020 Leave a comment

Company A operates in Country X. Company A maintains a large dataset of historical purchase orders that contains personal data of their customers in the form of full names and telephone numbers. The dataset consists of 5 text files, 1TB each. Currently the dataset resides on-premises due to legal requirements of storing personal data in-country. […]

In which three circumstances would choosing Key-based distribution be most appropriate? (Select three.)

adminFebruary 5, 2020 Leave a comment

An administrator needs to design a distribution strategy for a star schema in a Redshift cluster. The administrator needs to determine the optimal distribution style for the tables in the Redshift schema. In which three circumstances would choosing Key-based distribution be most appropriate? (Select three.) A. When the administrator needs to optimize a large, slowly […]

How should this control mapping be achieved using AWS?

adminFebruary 5, 2020 Leave a comment

A data engineer chooses Amazon DynamoDB as a data store for a regulated application. This application must be submitted to regulators for review. The data engineer needs to provide a control framework that lists the security controls from the process to follow to add new users down to the physical controls of the data center, […]

Which AWS service strategy is best for this use case?

adminFebruary 5, 2020 Leave a comment

A new algorithm has been written in Python to identify SPAM e-mails. The algorithm analyzes the free text contained within a sample set of 1 million e-mails stored on Amazon S3. The algorithm must be scaled across a production dataset of 5 PB, which also resides in Amazon S3 storage. Which AWS service strategy is […]

What is the most efficient architecture strategy for this purpose?

adminFebruary 5, 2020 Leave a comment

A data engineer in a manufacturing company is designing a data processing platform that receives a large volume of unstructured data. The data engineer must populate a well-structured star schema in Amazon Redshift. What is the most efficient architecture strategy for this purpose? A. Transform the unstructured data using Amazon EMR and generate CSV data. […]

Page 2 of 2«12

Move Up

Get 50% Discount on All Your Purchases
at PrepAway.com - Latest Exam Questions

This is ONE TIME OFFER

Enter your email address to receive your 50% off dicount code:

SPECIAL OFFER: GET 50% OFF

Use Discount Code:

Briefing Amazon Knowledge

Free Amazon study guide

Category: AWS Certified Big Data – Specialty

Which approach meets the requirement for a centralized metadata layer?

Which DynamoDB table scheme is most efficient to support these queries?

What is the most reliable and fault-tolerant technique to get each website to send data to Amazon Kinesis with

Which recommendation should an administrator provide?

In which two circumstances would choosing EVEN distribution be most appropriate? (Choose two.)

Which option allows Company A to do clustering in the AWS Cloud and meet the legal requirement of maintaining

In which three circumstances would choosing Key-based distribution be most appropriate? (Select three.)

How should this control mapping be achieved using AWS?

Which AWS service strategy is best for this use case?

What is the most efficient architecture strategy for this purpose?