MapReduce v2 (MRv2/YARN) splits which major functions of the JobTracker into separate
daemons? Select two.
You need to move a file titled “weblogs” into HDFS. When you try to copy the file, you can’t. You
know you have ample space on your DataNodes. Which action should you take to relieve this
situation and store more files in HDFS?
In the reducer, the MapReduce API provides you with an iterator over Writable values. What does
calling the next() method return?
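As a rough illustration of the behavior this question probes, the sketch below mimics in Python how Hadoop's reducer-side iterator hands back the same Writable instance on every call, refilled with the next value. The `IntBox` class and the `reusing_iterator` function are hypothetical stand-ins invented for this example and are not part of any Hadoop API.

```python
import copy


class IntBox:
    """Hypothetical stand-in for a mutable Writable such as IntWritable."""

    def __init__(self, value=0):
        self.value = value


def reusing_iterator(raw_values):
    """Yield one shared IntBox, overwritten with each successive value."""
    box = IntBox()
    for v in raw_values:
        box.value = v
        yield box


# Keeping references without copying: every entry is the same object,
# so all of them show the last value written.
kept = list(reusing_iterator([1, 2, 3]))
kept_values = [b.value for b in kept]

# Copying each value before storing it preserves the individual values.
copied = [copy.copy(b) for b in reusing_iterator([1, 2, 3])]
copied_values = [b.value for b in copied]
```

Under these assumptions, code that needs to retain values across iterations has to copy them first, which is the practical consequence of the object reuse.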
What types of algorithms are difficult to express in MapReduce v1 (MRv1)?
Analyze each scenario below and identify which best describes the behavior of the default
partitioner.
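For reference, Hadoop's default partitioner (`HashPartitioner`) assigns a key to a reducer by masking the key's hash code to a non-negative value and taking it modulo the number of reduce tasks. The Python sketch below imitates that arithmetic. It assumes a `java.lang.String`-style hash as an illustrative stand-in (Hadoop's `Text` type computes its hash differently), and the function names are hypothetical.

```python
def java_string_hash(s):
    """Imitate java.lang.String.hashCode() for BMP characters (32-bit signed)."""
    h = 0
    for ch in s:
        h = (31 * h + ord(ch)) & 0xFFFFFFFF
    # Reinterpret the unsigned 32-bit result as a signed Java int.
    return h - 0x100000000 if h >= 0x80000000 else h


def hash_partition(key, num_reduce_tasks):
    """Mirror (hashCode & Integer.MAX_VALUE) % numReduceTasks."""
    return (java_string_hash(key) & 0x7FFFFFFF) % num_reduce_tasks


# Every occurrence of the same key lands in the same partition.
partitions = [hash_partition(k, 4) for k in ["ab", "cd", "ab"]]
```

The point of the sketch is that the assignment depends only on the key's hash and the reducer count, so identical keys always reach the same reducer, while nothing guarantees an even spread across reducers.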
Which best describes what the map method accepts and emits?
Workflows expressed in Oozie can contain:
In a large MapReduce job with m mappers and n reducers, how many distinct copy operations will
there be in the sort/shuffle phase?
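One way to reason about this count is to enumerate the transfers directly. The sketch below assumes every mapper produces one output partition for every reducer, so each reducer fetches from each mapper; `shuffle_copies` is a hypothetical helper written for this illustration.

```python
def shuffle_copies(m, n):
    """List every (mapper, reducer) transfer, assuming each reducer
    fetches its partition from each mapper's output."""
    return [(mapper, reducer) for mapper in range(m) for reducer in range(n)]


copies = shuffle_copies(3, 2)
copy_count = len(copies)
```

Under that assumption the number of copy operations is the product of the two counts, m × n, rather than their sum.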
What is a SequenceFile?
You want to perform analysis on a large collection of images. You want to store this data in HDFS
and process it with MapReduce, but you also want to give your data analysts and data scientists
the ability to process the data directly from HDFS with an interpreted high-level programming
language like Python. Which format should you use to store this data in HDFS?