In a large MapReduce job with m mappers and r reducers, how many distinct copy operations will
there be in the sort/shuffle phase?
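As a rough way to reason about this question: in the shuffle, each reducer fetches its own partition from each mapper's output. The sketch below assumes every mapper emits data for every reducer partition, so the count is an upper bound. The function name is hypothetical.

```python
def shuffle_copy_count(num_mappers: int, num_reducers: int) -> int:
    """Upper bound on shuffle copies, assuming each reducer
    fetches one partition from every mapper."""
    return num_mappers * num_reducers


# Example with illustrative values: 100 mappers, 10 reducers.
print(shuffle_copy_count(100, 10))  # 1000
```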
What happens in a MapReduce job when you set the number of reducers to one?
In the standard word count MapReduce algorithm, why might using a combiner reduce the overall
Job running time?
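A small simulation can illustrate the effect the question points at. This is plain Python, not Hadoop code, and the input splits and function names are made up for illustration. A combiner pre-aggregates each mapper's output locally, so fewer (word, count) pairs have to cross the network during the shuffle, while the final counts stay the same.

```python
from collections import Counter


def map_words(split):
    """Mapper: emit (word, 1) for every word in the input split."""
    return [(word, 1) for word in split.split()]


def combine(pairs):
    """Combiner: sum counts per word within a single mapper's output."""
    counts = Counter()
    for word, n in pairs:
        counts[word] += n
    return list(counts.items())


def reduce_counts(mapper_outputs):
    """Reducer: sum counts per word across all mapper outputs."""
    total = Counter()
    for pairs in mapper_outputs:
        for word, n in pairs:
            total[word] += n
    return dict(total)


# Hypothetical input splits, one per mapper.
splits = ["the cat and the hat", "the dog and the log"]

without_combiner = [map_words(s) for s in splits]
with_combiner = [combine(pairs) for pairs in without_combiner]

shuffled_without = sum(len(pairs) for pairs in without_combiner)
shuffled_with = sum(len(pairs) for pairs in with_combiner)

print(shuffled_without, shuffled_with)  # 10 8
```

The saving grows with the number of repeated words per split. With real text, where common words repeat heavily, the reduction in shuffled data is usually much larger than in this toy input.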
Your cluster has 10 DataNodes, each with a single 1 TB hard drive. You utilize all your disk
capacity for HDFS, reserving none for MapReduce. You implement default replication settings.
What is the storage capacity of your Hadoop cluster (assuming no compression)?
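The arithmetic behind this question can be sketched as follows, assuming the HDFS default replication factor of 3 and ignoring filesystem overhead and any non-DFS reserved space. The function name is hypothetical.

```python
def usable_capacity_tb(num_datanodes: int, disk_tb: float, replication: int = 3) -> float:
    """Approximate usable HDFS capacity: raw capacity divided by
    the replication factor (overheads are ignored)."""
    raw_tb = num_datanodes * disk_tb
    return raw_tb / replication


# 10 DataNodes with 1 TB each, default replication of 3.
print(round(usable_capacity_tb(10, 1.0), 2))  # 3.33
```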
Which of the following statements best describes how a large (100 GB) file is stored in HDFS?
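For a sense of scale: HDFS splits a file into fixed-size blocks and replicates each block across DataNodes. The sketch below counts the blocks for a 100 GB file. The block size is an assumption, since it is configurable: 64 MB was the default in Hadoop 1.x and 128 MB in Hadoop 2 and later. Sizes are treated as binary units (1 GB = 1024 MB), and the function name is hypothetical.

```python
import math


def block_count(file_size_mb: int, block_size_mb: int) -> int:
    """Number of HDFS blocks needed for a file; the last block may be partial."""
    return math.ceil(file_size_mb / block_size_mb)


file_size_mb = 100 * 1024  # 100 GB expressed in MB

print(block_count(file_size_mb, 64))   # 1600
print(block_count(file_size_mb, 128))  # 800
```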
Which of the following describes how a client reads a file from HDFS?
You need to create a GUI application to help your company’s salespeople add and edit customer
information. Would HDFS be appropriate for this customer information file?
Which two of the following are valid statements? (Choose two)
Which command does Hadoop offer to discover missing or corrupt HDFS data?
On a cluster running MapReduce v1 (MRv1), the value of the
mapred.tasktracker.map.tasks.maximum configuration parameter in the mapred-site.xml file
should be set to: