Identify two features/issues that YARN is designed to address:
Identify two features/issues that YARN is designed to address:
What processes must you do if you are running a Hadoop cluster with a single NameNode and six DataNodes…
What processes must you do if you are running a Hadoop cluster with a single NameNode and six
DataNodes, and you want to change a configuration parameter so that it affects all six DataNodes.
Which workloads benefit the most from faster network fabric?
You are planning a Hadoop cluster and considering implementing 10 Gigabit Ethernet as the
network fabric. Which workloads benefit the most from faster network fabric?
How does this alter HDFS block storage?
A slave node in your cluster has four 2TB hard drives installed (4 x 2TB). The DataNode is
configured to store HDFS blocks on the disks. You set the value of the dfs.datanode.du.reserved
parameter to 100GB. How does this alter HDFS block storage?
Which configuration should you set?
Your cluster is running MapReduce version 2 (MRv2) on YARN. Your ResourceManager is
configured to use the FairScheduler. Now you want to configure your scheduler such that a new
user on the cluster can submit jobs into their own queue application submission. Which
configuration should you set?
Which configuration should you set?
Your cluster is running MapReduce vserion 2 (MRv2) on YARN. Your ResourceManager is
configured to use the FairScheduler. Now you want to configure your scheduler such that a new
user on the cluster can submit jobs into their own queue application submission. Which
configuration should you set?
How does this alter HDFS block storage?
A slave node in your cluster has 4 TB hard drives installed (4 x 2TB). The DataNode is configured
to store HDFS blocks on all disks. You set the value of the dfs.datanode.du.reserved parameter to
100 GB. How does this alter HDFS block storage?
What is the cause of the error?
A user comes to you, complaining that when she attempts to submit a Hadoop job, it fails. There is
a directory in HDFS named /data/input. The Jar is named j.jar, and the driver class is named
DriverClass. She runs command:
hadoop jar j.jar DriverClass /data/input/data/output
The error message returned includes the line:
PrivilegedActionException as:training (auth:SIMPLE)
cause.apache.hadoop.mapreduce.lib.input.InvalidInputException: Input path does not exits: file
:/data/input
What is the cause of the error?
What do you have to do on the cluster to allow the worker node to join, and start sorting HDFS blocks?
You have installed a cluster HDFS and MapReduce version 2 (MRv2) on YARN. You have no
dfs.hosts entry(ies) in your hdfs-site.xml configuration file. You configure a new worker node by
setting fs.default.name in its configuration files to point to the NameNode on your cluster, and you
start the DataNode daemon on that worker node. What do you have to do on the cluster to allow
the worker node to join, and start sorting HDFS blocks?
What two processes must you do if you are running a Hadoop cluster with a single NameNode and six DataNodes
What two processes must you do if you are running a Hadoop cluster with a single NameNode
and six DataNodes, and you want to change a configuration parameter so that it affects all six
DataNodes.