Which three processes does HDFS High Availability (HA) enable on your cluster?
Which three processes does HDFS High Availability (HA) enable on your cluster?
How do you configure a client machine to access both the /data and the /reports directories on the cluster?
You’ve configured your cluster with HDFS Federation. One NameNode manages the /data
namesapace and another Name/Node manages the /reports namespace. How do you configure a
client machine to access both the /data and the /reports directories on the cluster?
What occurs when you execute the command: Hdfs haadmin -failover nn01 nn02
Your cluster implements HDFS High Availability (HA). You two NameNodes are named nn01 and
nn02. What occurs when you execute the command:
Hdfs haadmin -failover nn01 nn02
Compare the hardware requirements of the NameNode with that of the DataNodes in a Hadoop cluster running MapRe
Compare the hardware requirements of the NameNode with that of the DataNodes in a Hadoop
cluster running MapReduce v1 (MRv1):
Where are Hadoop’s task log files stored?
For each job, the Hadoop framework generates task log files. Where are Hadoop’s task log files
stored?
Identify four characteristics of a 300MB file that has been written to HDFS with block size of 128MB and all o
Identify four characteristics of a 300MB file that has been written to HDFS with block size of
128MB and all other Hadoop defaults unchanged?
Identify which two daemons typically run each slave node in a Hadoop cluster running MapReduce v1 (MRv1)
Identify which two daemons typically run each slave node in a Hadoop cluster running MapReduce
v1 (MRv1)
What does this tell you about the file?
In HDFS, you view a file with rw-r–r– set as its permissions. What does this tell you about the file?
What are two ways to determine available HDFS space in your cluster?
You are a Hadoop cluster with a NameNode on host mynamenode. What are two ways to
determine available HDFS space in your cluster?
How will the Fair’ Scheduler handle these two Jobs?
You has a cluster running with the Fail Scheduler enabled. There are currently no jobs running on
the cluster you submit a job A, so that only job A is running on the cluster. A while later, you
submit job B. Now job A and Job B are running on the cluster al the same time. How will the Fair’
Scheduler handle these two Jobs?