Where does a MapReduce job store the intermediate data output from Mappers?

seenagapeJuly 14, 2016

PrepAway - Latest Free Exam Questions & Answers

A.
On the underlying filesystem of the local disk machine on which the JobTracker ran.

B.
In HDFS, in the job’s output directory.

C.
In HDFS, in temporary directory defined mapred.tmp.dir.

D.
On the underlying filesystem of the local disk of the machine on which the Mapper ran.

E.
Stores on the underlying filesystem of the local disk of the machine on which the Reducer.

Explanation:
The mapper output (intermediate data) is stored on the Local file system (NOT
HDFS) of each individual mapper nodes. This is typically a temporary directory location which can
be setup in config by the hadoop administrator. The intermediate data is cleaned up after the
Hadoop Job completes.
Reference:
24 Interview Questions & Answers for Hadoop MapReduce developers , Where is the
Mapper Output (intermediate kay-value data) stored ?

Get 50% Discount on All Your Purchases
at PrepAway.com - Latest Exam Questions

This is ONE TIME OFFER

Enter your email address to receive your 50% off dicount code:

SPECIAL OFFER: GET 50% OFF

Use Discount Code:

Briefing Cloudera Knowledge

Free Cloudera Study Guide

Where does a MapReduce job store the intermediate data output from Mappers?

One Comment on “Where does a MapReduce job store the intermediate data output from Mappers?”

Leave a Reply Cancel reply