you need to do in order to run on the cluster and submit jobs from the command line of the gateway machine?
You have a Hadoop cluster running HDFS, and a gateway machine external to the cluster from
which clients submit jobs. What do you need to do in order to run on the cluster and submit jobs
from the command line of the gateway machine?
What does CDH packaging do on install to facilitate Kerberos security setup?
What does CDH packaging do on install to facilitate Kerberos security setup?
Which method should you tell that developer to implement?
You have converted your Hadoop cluster from a MapReduce 1 (MRv1) architecture to a
MapReduce 2 (MRv2) on YARN architecture. Your developers are accustomed to specifying map
and reduce tasks (resource allocation) tasks when they run jobs. A developer wants to know how
specify to reduce tasks when a specific job runs. Which method should you tell that developer to
implement?
Which is the most efficient process to gather these web server across logs into your Hadoop cluster analysis?
You want to understand more about how users browse your public website. For example, you
want to know which pages they visit prior to placing an order. You have a server farm of 200 web
servers hosting your website. Which is the most efficient process to gather these web server
across logs into your Hadoop cluster analysis?
What should you do?
You are upgrading a Hadoop cluster from HDFS and MapReduce version 1 (MRv1) to one running
HDFS and MapReduce version 2 (MRv2) on YARN. You want to set and enforce a block of
128MB for all new files written to the cluster after the upgrade. What should you do?