PrepAway - Latest Free Exam Questions & Answers

Category: CCD-333

Exam CCD-333: Cloudera Certified Developer for Apache Hadoop

Determine the difference between setting the number of reducers to zero.

You write a MapReduce job to process 100 files in HDFS. Your MapReduce algorithm uses
TextInputFormat and the IdentityReducer: the mapper applies a regular expression over input
values and emits key-value pairs with the key consisting of the matching text, and the value
containing the filename and byte offset. Determine the difference between setting the number of
reducers to zero.

which of the following interfaces is most likely to reduce the amount of intermediate data transferred across

You’ve written a MapReduce job that will process 500 million input records and generate 500
million key-value pairs. The data is not uniformly distributed. Your MapReduce job will create a
significant amount of intermediate data that it needs to transfer between mappers and reducers
which is a potential bottleneck. A custom implementation of which of the following interfaces is
most likely to reduce the amount of intermediate data transferred across the network?


Page 5 of 6« First...23456