how many distinct copy operations will there be in the sort/shuffle phase?
In a large MapReduce job with m mappers and r reducers, how many distinct copy operations will
there be in the sort/shuffle phase?
Combiners Increase the efficiency of a MapReduce program because:
Combiners Increase the efficiency of a MapReduce program because:
Which interface should your class implement?
You are developing a combiner that takes as input Text keys, IntWritable values, and emits Text
keys, Intwritable values. Which interface should your class implement?
what would another user see when trying to access this file?
You use the hadoop fs –put command to write a 300 MB file using an HDFS block size of 64 MB.
Just after this command has finished writing 200 MB of this file, what would another user see
when trying to access this file?
Which of the following describes how a client reads a file from HDFS?
Which of the following describes how a client reads a file from HDFS?
Which two of the following are valid statements?
Which two of the following are valid statements? (Choose two)
why might using a combiner reduce the overall Job running time?
In the standard word count MapReduce algorithm, why might using a combiner reduce the overall
Job running time?
What happens in a MapReduce job when you set the number of reducers to one?
What happens in a MapReduce job when you set the number of reducers to one?
What is the storage capacity of your Hadoop cluster (assuming no compression)?
Your cluster has 10 DataNodes, each with a single 1 TB hard drive. You utilize all your disk
capacity for HDFS, reserving none for MapReduce. You implement default replication settings.
What is the storage capacity of your Hadoop cluster (assuming no compression)?
Which of the following statements best describes how a large (100 GB) file is stored in HDFS?
Which of the following statements best describes how a large (100 GB) file is stored in HDFS?