which best describes the behavior of the default partitioner?
Analyze each scenario below and indentify which best describes the behavior of the default
partitioner?
Which action should you take to relieve this situation and store more files in HDFS?
You need to move a file titled “weblogs” into HDFS. When you try to copy the file, you can’t. You
know you have ample space on your DataNodes. Which action should you take to relieve this
situation and store more files in HDFS?
how many distinct copy operations will there be in the sort/shuffle phase?
In a large MapReduce job with m mappers and n reducers, how many distinct copy operations will
there be in the sort/shuffle phase?
Workflows expressed in Oozie can contain:
Workflows expressed in Oozie can contain:
what the map method accepts and emits?
Which best describes what the map method accepts and emits?
When can a reduce class also serve as a combiner without affecting the output of a MapReduce program?
When can a reduce class also serve as a combiner without affecting the output of a MapReduce
program?
Which format should you use to store this data in HDFS?
You want to perform analysis on a large collection of images. You want to store this data in HDFS
and process it with MapReduce but you also want to give your data analysts and data scientists
the ability to process the data directly from HDFS with an interpreted high-level programming
language like Python. Which format should you use to store this data in HDFS?
Which mode of operation in Hadoop allows you to most closely simulate a production cluster while using a sing
You want to run Hadoop jobs on your development workstation for testing before you submit them
to your production cluster. Which mode of operation in Hadoop allows you to most closely simulate
a production cluster while using a single machine?
how many Mappers will run?
Your cluster’s HDFS block size in 64MB. You have directory containing 100 plain text files, each of
which is 100MB in size. The InputFormat for your job is TextInputFormat. Determine how many
Mappers will run?
Which of the following best describes the workings of TextInputFormat?
Which of the following best describes the workings of TextInputFormat?