Briefing Cloudera Knowledge

where does the Mapper place the intermediate data each Map task?

During the execution of a MapReduce v2 (MRv2) job on YARN, where does the Mapper place the
intermediate data each Map task?

A.
The Mapper stores the intermediate data on the mode running the job’s ApplicationMaster so
that is available to YARN’s ShuffleService before the data is presented to the Reducer

B.
The Mapper stores the intermediate data in HDFS on the node where the MAP tasks ran in the
HDFS /usercache/&[user]sppcache/application_&(appid) directory for the user who ran the job

C.
YARN holds the intermediate data in the NodeManager’s memory (a container) until it is
transferred to the Reducers

D.
The Mapper stores the intermediate data on the underlying filesystem of the local disk in the
directories yarn.nodemanager.local-dirs

E.
The Mapper transfers the intermediate data immediately to the Reducers as it generated by the
Map task