Briefing Cloudera Knowledge

which resources could you expect to be likely bottlenecks?

You need to create a job that does frequency analysis on input data. You will do this by writing a
Mapper that uses TextInputForma and splits each value (a line of text from an input file) into
individual characters. For each one of these characters, you will emit the character as a key and
as IntWritable as the value. Since this will produce proportionally more intermediate data than
input data, which resources could you expect to be likely bottlenecks?

A.
Processor and RAM

B.
Processor and disk I/O

C.
Disk I/O and network I/O

D.
Processor and network I/O