Which three basic configuration parameters must you set to migrate your cluster from MapReduce 1 (MRv1) to Map
Which three basic configuration parameters must you set to migrate your cluster from MapReduce
1 (MRv1) to MapReduce V2 (MRv2)?
Which data serialization system gives the flexibility to do this?
You need to analyze 60,000,000 images stored in JPEG format, each of which is approximately 25
KB. Because you Hadoop cluster isn’t optimized for storing and processing many small files, you
decide to do the following actions:
1. Group the individual images into a set of larger files
2. Use the set of larger files as input for a MapReduce job that processes them directly with python
using Hadoop streaming.
Which data serialization system gives the flexibility to do this?
Identify two features/issues that YARN is designated to address:
Identify two features/issues that YARN is designated to address:
Which YARN daemon or service monitors a Controller’s per-application resource using (e.g., memory CPU)?
Which YARN daemon or service monitors a Controller’s per-application resource using (e.g.,
memory CPU)?
Which is the default scheduler in YARN?
Which is the default scheduler in YARN?
Which YARN process run as “container 0” of a submitted job and is responsible for resource qrequests?
Which YARN process run as “container 0” of a submitted job and is responsible for resource
qrequests?
Which scheduler would you deploy to ensure that your cluster allows short jobs to finish within a reasonable t
Which scheduler would you deploy to ensure that your cluster allows short jobs to finish within a
reasonable time without starting long-running jobs?
What is the result when you execute: hadoop jar SampleJar MyClass on a client machine?
Your cluster is configured with HDFS and MapReduce version 2 (MRv2) on YARN. What is the
result when you execute: hadoop jar SampleJar MyClass on a client machine?
Which ecosystem project should you use to perform these actions?
You are working on a project where you need to chain together MapReduce, Pig jobs. You also
need the ability to use forks, decision points, and path joins. Which ecosystem project should you
use to perform these actions?
Which process instantiates user code, and executes map and reduce tasks on a cluster running MapReduce v2 (MRv
Which process instantiates user code, and executes map and reduce tasks on a cluster running
MapReduce v2 (MRv2) on YARN?