Briefing Cloudera Knowledge

where does the Mapper place the intermediate data of each Map Task?

During the execution of a MapReduce v2 (MRv2) job on YARN, where does the Mapper place the
intermediate data of each Map Task?

A.
The Mapper stores the intermediate data on the node running the Job’s ApplicationMaster so
that it is available to YARN ShuffleService before the data is presented to the Reducer

B.
The Mapper stores the intermediate data in HDFS on the node where the Map tasks ran in the
HDFS /usercache/&(user)/apache/application_&(appid) directory for the user who ran the job

C.
The Mapper transfers the intermediate data immediately to the reducers as it is generated by
the Map Task

D.
YARN holds the intermediate data in the NodeManager’s memory (a container) until it is
transferred to the Reducer

E.
The Mapper stores the intermediate data on the underlying filesystem of the local disk in the
directories yarn.nodemanager.locak-DIFS