PrepAway - Latest Free Exam Questions & Answers

Tag: Exam CCD-333

Exam CCD-333: Cloudera Certified Developer for Apache Hadoop

Which resources could you expect to be likely bottlenecks?

You need to create a job that does frequency analysis on input data. You will do this by writing a
Mapper that uses TextInputFormat and splits each value (a line of text from an input file) into
individual characters. For each of these characters, you will emit the character as a key and
an IntWritable as the value. Since this will produce proportionally more intermediate data than
input data, which resources could you expect to be likely bottlenecks?
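
To make the data growth concrete, here is a minimal sketch in plain Java of what the Mapper described above would emit. It deliberately avoids the Hadoop API so it can run on its own: the class name CharFrequencySketch and its methods are hypothetical, and the size estimate (UTF-8 bytes of the key plus 4 bytes for the integer value) is an assumption that ignores Hadoop's real serialization overhead.

```java
import java.nio.charset.StandardCharsets;
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for the Mapper in the question; not Hadoop code.
public class CharFrequencySketch {

    // Mimics one map() call: emit a (character, 1) pair for every
    // character (UTF-16 code unit) in the input line.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> out = new ArrayList<>();
        for (int i = 0; i < line.length(); i++) {
            out.add(new SimpleEntry<>(String.valueOf(line.charAt(i)), 1));
        }
        return out;
    }

    // Assumed size model: key bytes in UTF-8 plus 4 bytes for the int value.
    static long intermediateBytes(List<Map.Entry<String, Integer>> pairs) {
        long total = 0;
        for (Map.Entry<String, Integer> pair : pairs) {
            total += pair.getKey().getBytes(StandardCharsets.UTF_8).length + 4;
        }
        return total;
    }

    public static void main(String[] args) {
        String line = "hadoop";
        List<Map.Entry<String, Integer>> pairs = map(line);
        long inputBytes = line.getBytes(StandardCharsets.UTF_8).length;
        System.out.println("input bytes:        " + inputBytes);
        System.out.println("emitted pairs:      " + pairs.size());
        System.out.println("intermediate bytes: " + intermediateBytes(pairs));
    }
}
```

Under this rough model, each input character becomes a key plus a value, so the map output is several times larger than the input. In MapReduce, map output is written to local disk and then copied to the reducers over the network, so that is where the extra volume is handled.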

