On a cluster running MapReduce v1 (MRv1), the value of the
mapred.tasktracker.map.tasks.maximum configuration parameter in the mapred-site.xml file
should be set to:
Which command does Hadoop offer to discover missing or corrupt HDFS data?
What does CDH packaging do on install to facilitate Kerberos security setup?
Your Hadoop cluster contains nodes in three racks. Which scenario results if you leave the
dfs.hosts property in the NameNode’s configuration file empty (blank)?
What metadata is stored on a DataNode when a block is written to it?
Your cluster implements HDFS High Availability (HA). Your two NameNodes are named nn01 and
nn02. What occurs when you execute the command:
hdfs haadmin -failover nn01 nn02
You’ve configured your cluster with HDFS Federation. One NameNode manages the /data
namespace and another NameNode manages the /reports namespace. How do you configure a
client machine to access both the /data and the /reports directories on the cluster?
For each job, the Hadoop framework generates task log files. Where are Hadoop’s task log files
stored?
Compare the hardware requirements of the NameNode with those of the DataNodes in a Hadoop
cluster running MapReduce v1 (MRv1):
Which three processes does HDFS High Availability (HA) enable on your cluster?