Briefing Cloudera Knowledge

where does the Mapper place the intermediate data each Map task?

During the execution of a MapReduce v2 (MRv2) job on YARN, where does the Mapper
place the intermediate data each Map task?

A.
The Mapper stores the intermediate data on the underlying filesystem of the local disk in
the directories yarn.nodemanager.local-dirs

B.
The Mapper transfers the intermediate data immediately to the Reducers as it generated
by the Map task

C.
The Mapper stores the intermediate data on the mode running the job’s
ApplicationMaster so that is available to YARN’s ShuffleService before the data is
presented to the Reducer

D.
The Mapper stores the intermediate data in HDFS on the node where the MAP tasks ran
in the HDFS /usercache/&[user]sppcache/application_&(appid) directory for the user who
ran the job

E.
YARN holds the intermediate data in the NodeManager’s memory (a container) until it is
transferred to the Reducers

Explanation: