PrepAway - Latest Free Exam Questions & Answers

Category: CCD-410

Exam CCD-410: Cloudera Certified Developer for Apache Hadoop

which method in the Mapper you should use to implement code for reading the file and populating the associativ

You want to populate an associative array in order to perform a map-side join. You’ve decided to
put this information in a text file, place that file into the DistributedCache and read it in your
Mapper before any records are processed.
Indentify which method in the Mapper you should use to implement code for reading the file and
populating the associative array?

which interface is most likely to reduce the amount of intermediate data transferred across the network?

You’ve written a MapReduce job that will process 500 million input records and generated 500
million key-value pairs. The data is not uniformly distributed. Your MapReduce job will create a
significant amount of intermediate data that it needs to transfer between mappers and reduces
which is a potential bottleneck. A custom implementation of which interface is most likely to reduce
the amount of intermediate data transferred across the network?

which invocation correctly passes.mapred.job.name with a value of Example to Hadoop?

You need to run the same job many times with minor variations. Rather than hardcoding all job
configuration options in your drive code, you’ve decided to have your Driver subclass
org.apache.hadoop.conf.Configured and implement the org.apache.hadoop.util.Tool interface.
Indentify which invocation correctly passes.mapred.job.name with a value of Example to Hadoop?


Page 6 of 6« First...23456