What would another user see when they look in the directory?
On a cluster running CDH 5.0 or above, you use the hadoop fs -put command to write a
300 MB file into a previously empty directory using an HDFS block size of 64 MB. Just after
this command has finished writing 200 MB of the file, what would another user see when they
look in the directory?
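The block arithmetic behind this scenario can be sketched as follows. This is only an illustration of how a 300 MB file maps onto 64 MB blocks and how far the write has progressed at 200 MB. It does not model what a directory listing shows, and the helper names block_layout and progress are hypothetical.

```python
import math

MB = 1024 * 1024

def block_layout(file_size, block_size):
    """Return (number_of_blocks, size_of_last_block) for a file of file_size bytes."""
    blocks = math.ceil(file_size / block_size)
    last = file_size - (blocks - 1) * block_size
    return blocks, last

def progress(bytes_written, block_size):
    """Return (completed_blocks, bytes_written_into_the_current_block)."""
    return divmod(bytes_written, block_size)

total_blocks, last_block = block_layout(300 * MB, 64 * MB)
done, partial = progress(200 * MB, 64 * MB)
print(total_blocks, last_block // MB)  # 5 44
print(done, partial // MB)             # 3 8
```

So the finished file occupies five blocks (four full ones and a 44 MB tail), and at the 200 MB mark three blocks are complete with the fourth in progress.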
What is the maximum amount of virtual memory allocated for each map before YARN will kill its Container?
Your cluster’s mapred-site.xml includes the following parameters:
<name>mapreduce.map.memory.mb</name> <value>4096</value>
<name>mapreduce.reduce.memory.mb</name> <value>8192</value>
And your cluster’s yarn-site.xml includes the following parameter:
<name>yarn.nodemanager.vmem-pmem-ratio</name> <value>2.1</value>
What is the maximum amount of virtual memory allocated for each map task before YARN
will kill its Container?
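When the NodeManager's virtual memory check is enabled, the limit for a container is its physical memory allocation multiplied by yarn.nodemanager.vmem-pmem-ratio. A minimal sketch of that arithmetic, using the values quoted in the question (the function name vmem_limit_mb is hypothetical):

```python
def vmem_limit_mb(container_mb, vmem_pmem_ratio):
    """Virtual memory ceiling (MB) for a container of container_mb physical MB."""
    return container_mb * vmem_pmem_ratio

# Values taken from the question's mapred-site.xml and yarn-site.xml.
map_limit = vmem_limit_mb(4096, 2.1)
reduce_limit = vmem_limit_mb(8192, 2.1)

print(round(map_limit, 1))     # 8601.6
print(round(reduce_limit, 1))  # 17203.2
```

Note that the reduce setting does not affect the map limit. Each task type is checked against its own container size.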
Which Linux commands help you to identify whether swapping is occurring?
You suspect that your NameNode is incorrectly configured, and is swapping memory to
disk. Which Linux commands help you to identify whether swapping is occurring?
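As background, tools such as vmstat report swap activity (its si and so columns), and the kernel exposes the underlying counters pswpin and pswpout in /proc/vmstat. The sketch below compares two snapshots of those counters. The sample text is made up for illustration, and the helper names are hypothetical.

```python
def swap_counters(vmstat_text):
    """Extract the pswpin/pswpout counters from /proc/vmstat-style text."""
    counters = {}
    for line in vmstat_text.splitlines():
        parts = line.split()
        if len(parts) == 2 and parts[0] in ("pswpin", "pswpout"):
            counters[parts[0]] = int(parts[1])
    return counters

def swapped_between(before, after):
    """True if either swap counter increased between the two snapshots."""
    return any(after.get(k, 0) > before.get(k, 0) for k in ("pswpin", "pswpout"))

# Made-up snapshots; on a real host, read /proc/vmstat twice a few seconds apart.
SAMPLE_BEFORE = "nr_free_pages 1000\npswpin 10\npswpout 20\n"
SAMPLE_AFTER = "nr_free_pages 900\npswpin 10\npswpout 35\n"

before = swap_counters(SAMPLE_BEFORE)
after = swap_counters(SAMPLE_AFTER)
print(swapped_between(before, after))  # True
```

A rising counter between snapshots means pages moved to or from swap during that interval. A large but static value only shows that swapping happened at some point since boot.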
What is the result when you execute: hadoop jar samplejar.jar MyClass on a client machine?
Your cluster is configured with HDFS and MapReduce version 2 (MRv2) on YARN. What is
the result when you execute: hadoop jar samplejar.jar MyClass on a client machine?
Which three basic configuration parameters must you set to migrate your cluster from MapReduce1 (MRv1) to MapR
Which three basic configuration parameters must you set to migrate your cluster from
MapReduce1 (MRv1) to MapReduce v2 (MRv2)?
How do you increase the JVM heap size to 3 GB to optimize performance?
You are running a Hadoop cluster with MapReduce version 2 (MRv2) on YARN. You
consistently see that MapReduce map tasks on your cluster are running slowly because of
excessive JVM garbage collection. How do you increase the JVM heap size to 3 GB to
optimize performance?
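In MRv2 the JVM options for map tasks are carried by the mapreduce.map.java.opts property, and the heap is normally kept below the container size given by mapreduce.map.memory.mb. The sketch below builds the option string and checks that relationship. The 4096 MB container size is an assumption for illustration, not something stated in this question, and the helper names are hypothetical.

```python
def heap_opts(heap_mb):
    """Build the JVM max-heap flag for a heap of heap_mb megabytes."""
    return f"-Xmx{heap_mb}m"

def heap_fits(heap_mb, container_mb):
    """The heap must leave room in the container for non-heap JVM memory."""
    return heap_mb < container_mb

# Assumed container size of 4096 MB; 3 GB heap = 3072 MB.
settings = {
    "mapreduce.map.memory.mb": "4096",
    "mapreduce.map.java.opts": heap_opts(3072),
}

print(settings["mapreduce.map.java.opts"])  # -Xmx3072m
print(heap_fits(3072, 4096))                # True
```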
Which is the most efficient process to gather these web server logs into your Hadoop cluster for analysis?
You want to understand more about how users browse your public website. For example,
you want to know which pages they visit prior to placing an order. You have a server farm of
200 web servers hosting your website. Which is the most efficient process to gather these
web server logs into your Hadoop cluster for analysis?
What occurs when you execute the command: hdfs haadmin -failover nn01 nn02
Your cluster implements HDFS High Availability (HA). Your two NameNodes are named
nn01 and nn02. What occurs when you execute the command: hdfs haadmin -failover nn01
nn02
Can you configure a worker node to run a NodeManager daemon but not a DataNode daemon and still have a functio
Your Hadoop cluster is configured with HDFS and MapReduce version 2 (MRv2) on YARN.
Can you configure a worker node to run a NodeManager daemon but not a DataNode
daemon and still have a functional cluster?
Which YARN process runs as “container 0” of a submitted job and is responsible for resource requests?
Which YARN process runs as “container 0” of a submitted job and is responsible for
resource requests?