Where are Hadoop’s task log files stored?
For each job, the Hadoop framework generates task log files. Where are Hadoop’s task log files
stored?
Compare the hardware requirements of the NameNode with that of the DataNodes in a Hadoop cluster running MapRe
Compare the hardware requirements of the NameNode with that of the DataNodes in a Hadoop
cluster running MapReduce v1 (MRv1):
What are two ways to determine available HDFS space in your cluster?
You are a Hadoop cluster with a NameNode on host mynamenode. What are two ways to
determine available HDFS space in your cluster?
you need to deploy at a minimum to store one year’s worth of data.
You are planning a Hadoop duster, and you expect to be receiving just under 1TB of data per
week which will be stored on the cluster, using Hadoop’s default replication. You decide that your
slave nodes will be configured with 4 x 1TB disks.
Calculate how many slave nodes you need to deploy at a minimum to store one year’s worth of
data.
Which scheduler would you deploy to ensure that your cluster allows short jobs to finish within a reasonable t
Which scheduler would you deploy to ensure that your cluster allows short jobs to finish within a
reasonable time without starving long-running jobs?
which best describes how the Hadoop Framework distributes block writes into HDFS from a Reducer outputting a 1
Your Hadoop cluster has 12 slave nodes, a block size set to 64MB, and a replication factor of
three.
Choose which best describes how the Hadoop Framework distributes block writes into HDFS from
a Reducer outputting a 150MB file?
How will the Fair’ Scheduler handle these two Jobs?
You has a cluster running with the Fail Scheduler enabled. There are currently no jobs running on
the cluster you submit a job A, so that only job A is running on the cluster. A while later, you
submit job B. Now job A and Job B are running on the cluster al the same time. How will the Fair’
Scheduler handle these two Jobs?
In the context of configuring a Hadoop cluster for HDFS High Availability (HA), ‘fencing’ refers to:
In the context of configuring a Hadoop cluster for HDFS High Availability (HA), ‘fencing’ refers to:
which scenario would it be most appropriate to consider using faster (e.g 10 Gigabit) Ethernet as the network
Under which scenario would it be most appropriate to consider using faster (e.g 10 Gigabit)
Ethernet as the network fabric for your Hadoop cluster?
What determines the number of Reduces that run a given MapReduce job on a cluster running MapReduce v1 (MRv1)?
What determines the number of Reduces that run a given MapReduce job on a cluster running
MapReduce v1 (MRv1)?