Which MapReduce daemon instantiates user code, and executes map and reduce tasks on a cluster running MapReduc
Which MapReduce daemon instantiates user code, and executes map and reduce tasks on a
cluster running MapReduce v1 (MRv1)?
What is the recommended disk configuration for slave nodes in your Hadoop cluster with 6 x 2 TB hard drives?
What is the recommended disk configuration for slave nodes in your Hadoop cluster with 6 x 2 TB
hard drives?
How would you tune your io.sort.mb value to achieve maximum memory to disk I/O ratio?
You observe that the number of spilled records from map tasks for exceeds the number of map
output records. You child heap size is 1 GB and your io.sort.mb value is set to 100MB. How would
you tune your io.sort.mb value to achieve maximum memory to disk I/O ratio?
you need to configure to run on your master nodes?
You configure Hadoop cluster with both MapReduce frameworks, MapReduce v1 (MRv1) and
MapReduce v2 (MRv2/YARN). Which two MapReduce (computational) daemons do you need to
configure to run on your master nodes?
What is the maximum number of NameNodes daemon you should run on you cluster in order to avoid a “split-brai
You configure you cluster with HDFS High Availability (HA) using Quorum-Based storage. You do
not implement HDFS Federation.
What is the maximum number of NameNodes daemon you should run on you cluster in order to
avoid a “split-brain” scenario with your NameNodes?
Identify two features/issues that MapReduce v2 (MRv2/YARN) is designed to address:
Identify two features/issues that MapReduce v2 (MRv2/YARN) is designed to address:
What happens when client tries to write a file to/reports/myreport.txt?
You set up the Hadoop cluster using NameNode Federation. One NameNode manages the/users
namespace and one NameNode manages the/data namespace. What happens when client tries
to write a file to/reports/myreport.txt?
how much data will you be able to store?
Your Hadoop cluster has 25 nodes with a total of 100 TB (4 TB per node) of raw disk space
allocated HDFS storage. Assuming Hadoop’s default configuration, how much data will you be
able to store?
which daemon makes HDFS unavailable on a cluster running MapReduce v1 (MRv1)?
The failure of which daemon makes HDFS unavailable on a cluster running MapReduce v1
(MRv1)?
The most important consideration for slave nodes in a Hadoop cluster running production jobs that require shor
The most important consideration for slave nodes in a Hadoop cluster running production jobs that
require short turnaround times is: