What command do you enter?
What do you need to do when adding a new slave node to a cluster?
Assuming a cluster running HDFS, MapReduce version 2 (MRv2) on YARN with all settings at
their default, what do you need to do when adding a new slave node to a cluster?
Table schemas in Hive are:
What is the maximum number of NameNode daemons you should run on your cluster in order to avoid a “split-brain” scenario?
Assuming you’re not running HDFS Federation, what is the maximum number of NameNode
daemons you should run on your cluster in order to avoid a “split-brain” scenario with your
NameNode when running HDFS High Availability (HA) using Quorum-based storage?
What should you do?
You have a 20 node Hadoop cluster, with 18 slave nodes and 2 master nodes running HDFS High
Availability (HA). You want to minimize the chance of data loss in your cluster. What should you
do?
Where are Hadoop task log files stored?
For each YARN job, the Hadoop framework generates task log files. Where are Hadoop task log
files stored?
What is the purpose of ZooKeeper in such a configuration?
You decide to create a cluster which runs HDFS in High Availability mode with automatic failover,
using Quorum-based Storage. What is the purpose of ZooKeeper in such a configuration?
How will the Fair Scheduler handle these two jobs?
You have a cluster running with the Fair Scheduler enabled. There are currently no jobs running on
the cluster, and you submit Job A, so that only Job A is running on the cluster. A while later, you
submit Job B. Now Job A and Job B are running on the cluster at the same time. How will the Fair
Scheduler handle these two jobs?
Where does the Mapper place the intermediate data of each Map task?
During the execution of a MapReduce v2 (MRv2) job on YARN, where does the Mapper place the
intermediate data of each Map task?
What should you do?
Each node in your Hadoop cluster, running YARN, has 64GB memory and 24 cores. Your
yarn-site.xml has the following configuration:
<property>
  <name>yarn.nodemanager.resource.memory-mb</name>
  <value>32768</value>
</property>
<property>
  <name>yarn.nodemanager.resource.cpu-vcores</name>
  <value>12</value>
</property>
You want YARN to launch no more than 16 containers per node. What should you do?
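The arithmetic behind this question can be sketched as follows. This is a minimal illustration, not the scheduler's actual logic: it assumes containers are limited by memory only, and that every container is rounded up to at least yarn.scheduler.minimum-allocation-mb (a real YARN property whose stock default is 1024 MB). The function names are hypothetical helpers introduced only for this example.

```python
def max_containers_per_node(node_memory_mb: int, min_allocation_mb: int) -> int:
    """Upper bound on concurrent containers when memory is the only limit.

    Assumption: each container occupies at least min_allocation_mb, so the
    node can hold at most node_memory_mb // min_allocation_mb of them.
    """
    if node_memory_mb <= 0 or min_allocation_mb <= 0:
        raise ValueError("memory values must be positive")
    return node_memory_mb // min_allocation_mb


def min_allocation_for_cap(node_memory_mb: int, container_cap: int) -> int:
    """Smallest minimum allocation (MB) that keeps a node at or under the cap."""
    if node_memory_mb <= 0 or container_cap <= 0:
        raise ValueError("arguments must be positive")
    # Ceiling division, so the resulting bound never exceeds container_cap.
    return -(-node_memory_mb // container_cap)


if __name__ == "__main__":
    node_mb = 32768  # yarn.nodemanager.resource.memory-mb from the question
    print(max_containers_per_node(node_mb, 1024))  # with the 1024 MB default
    print(min_allocation_for_cap(node_mb, 16))     # allocation needed for a cap of 16
```

Under these assumptions, the 32768 MB given to the NodeManager allows up to 32 containers at the default minimum allocation, and raising the minimum allocation to 2048 MB brings the bound down to 16. The 64GB of physical memory and the vcore setting do not enter this memory-only calculation.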