What is the maximum amount of virtual memory allocated for each map before YARN will kill its Container?
Your cluster’s mapped-site.xml includes the following parameters
<name>mapreduce.map.memory.mb</name> <value>4096<value/>
<name>mapreduce.reduce.memory,mb</name> <value>8192</value> And your cluster’s
yarn-site.xml includes the following parameters
<name>yarn.nodemanager/vmen-pmem-ratio</name> <value>2.1</value> What is the
maximum amount of virtual memory allocated for each map before YARN will kill its
Container?
Which Linux commands help you to identify whether swapping is occurring?
You suspect that your NameNode is incorrectly configured, and is swapping memory to
disk. Which Linux commands help you to identify whether swapping is occurring?
What is the result when you execute: hadoop jar samplejar.jar MyClass on a client machine?
Your cluster is configured with HDFS and MapReduce version 2 (MRv2) on YARN. What is
the result when you execute: hadoop jar samplejar.jar MyClass on a client machine?
Which three basic configuration parameters must you set to migrate your cluster from MapReduce1 (MRv1) to MapR
Which three basic configuration parameters must you set to migrate your cluster from
MapReduce1 (MRv1) to MapReduce v2 (MRv2)?
how do you increase JVM heap property to 3GB to optimize performance?
You are running a Hadoop cluster with MapReduce version 2 (MRv2) on YARN. You
consistently see that MapReduce map tasks on your cluster are running slowly because of
excessive garbage collection of JVM, how do you increase JVM heap property to 3GB to
optimize performance?
Which is the most efficient process to gather these web server logs into your Hadoop cluster for analysis?
You want to understand more about how users browse you public website. For example,
you want to know which pages they visit prior to placing an order. You have a server farm of
200 web servers hosting your website. Which is the most efficient process to gather these
web server logs into your Hadoop cluster for analysis?
What occurs when you execute the command: hdfs haadmin –failover nn01 nn02
Your cluster implements HDFS High Availability (HA). Your two NameNodes are named
nn01 and nn02. What occurs when you execute the command: hdfs haadmin –failover nn01
nn02
Can you configure a worker node to run a NodeManager daemon but not a DataNode daemon and still have a functio
Your Hadoop cluster is configured with HDFS and MapReduce version 2 (MRv2) on YARN.
Can you configure a worker node to run a NodeManager daemon but not a DataNode
daemon and still have a function cluster?
Which YARN process runs as “controller O” of a submitted job and is responsible for resource requests?
Which YARN process runs as “controller O” of a submitted job and is responsible for
resource requests?
How will the Fair Scheduler handle these two jobs?
You have a cluster running with the Fair Scheduler enabled. There are currently no jobs
running on the cluster, and you submit a job A, so that only job A is running on the cluster. A
while later, you submit Job B. now job A and Job B are running on the cluster at the same
time. How will the Fair Scheduler handle these two jobs?