PrepAway - Latest Free Exam Questions & Answers

Which process will accomplish all three objectives?

You have acquired a new data source of millions of customer records, and you’ve loaded this data into
HDFS. Prior to analysis, you want to change all customer registration dates to the same date format,
make all addresses uppercase, and remove all customer names (for anonymization). Which
process will accomplish all three objectives?


A.
Adapt the data cleansing module in Mahout to your data, and invoke the Mahout library when
you run your analysis.

B.
Pull this data into an RDBMS using Sqoop and scrub the records using stored procedures.

C.
Write a script that receives records on stdin, corrects them, and then writes them to stdout.
Then, invoke this script in a map-only Hadoop Streaming job.

D.
Write a MapReduce job with a mapper to change words to uppercase and to reduce different
forms of dates to a single form.
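
The kind of script option C describes can be sketched as follows. This is only an illustration under stated assumptions: the record layout (tab-separated name, registration date, address), the list of incoming date formats, and the helper names `normalize_date` and `clean_record` are all hypothetical, since the question does not specify them.

```python
#!/usr/bin/env python3
"""Sketch of a Hadoop Streaming mapper: reads records on stdin, writes cleaned records to stdout."""
import sys
from datetime import datetime

# Assumption: each input line is  name <TAB> registration_date <TAB> address
# Assumption: registration dates arrive in one of these formats.
DATE_FORMATS = ("%Y-%m-%d", "%m/%d/%Y", "%d.%m.%Y")
TARGET_FORMAT = "%Y-%m-%d"


def normalize_date(raw):
    """Return the date in TARGET_FORMAT, or the stripped input if no format matches."""
    text = raw.strip()
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text, fmt).strftime(TARGET_FORMAT)
        except ValueError:
            continue
    return text


def clean_record(line):
    """Drop the name, normalize the date, uppercase the address.

    Returns None for lines that do not have exactly three fields.
    """
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 3:
        return None
    _name, reg_date, address = fields  # the name is discarded for anonymization
    return "\t".join((normalize_date(reg_date), address.upper()))


def main(stdin=sys.stdin, stdout=sys.stdout):
    for line in stdin:
        cleaned = clean_record(line)
        if cleaned is not None:
            stdout.write(cleaned + "\n")


if __name__ == "__main__":
    main()
```

Such a script would be passed to Hadoop Streaming as the `-mapper`, with the number of reduce tasks set to zero (for example via `-D mapreduce.job.reduces=0`) so that the job is map-only. It can also be tried locally by piping a sample file through it.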

