You are running a Hadoop cluster with a NameNode on host mynamenode. What are two ways to
determine available HDFS space in your cluster?
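For reference, two common command-line ways to check this (assuming the MRv2-era `hdfs` CLI; older clusters use `hadoop dfsadmin` and `hadoop fs` instead):

```
$ hdfs dfsadmin -report    # configured capacity, DFS used, and DFS remaining, per DataNode and in total
$ hdfs dfs -df -h /        # filesystem size, used, and available space in human-readable units
```

The NameNode web UI on mynamenode reports the same capacity figures.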
You have recently converted your Hadoop cluster from a MapReduce 1 (MRv1) architecture to a
MapReduce 2 (MRv2) on YARN architecture. Your developers are accustomed to specifying the
number of map and reduce tasks (resource allocation) when they run jobs. A developer wants to
know how to specify the number of reduce tasks when a specific job runs. Which method should
you tell the developers to implement?
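For context, a driver that goes through ToolRunner/GenericOptionsParser accepts the MRv2 property on the command line; the jar and path names below are placeholders:

```
$ hadoop jar myapp.jar DriverClass -D mapreduce.job.reduces=8 /data/input /data/output
```

The same count can be set inside the driver with Job.setNumReduceTasks(8) before the job is submitted.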
Your Hadoop cluster contains nodes in three racks. You have not configured the dfs.hosts
property in the NameNode's configuration file. What is the result?
You are running a Hadoop cluster with MapReduce version 2 (MRv2) on YARN. You consistently
see that MapReduce map tasks on your cluster are running slowly because of excessive JVM
garbage collection. How do you increase the JVM heap size to 3 GB to optimize performance?
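As a point of reference, a 3 GB map-task heap would be expressed in mapred-site.xml roughly as follows (the 4096 MB container size is an assumption; it only needs to exceed the heap):

```
<property>
  <name>mapreduce.map.java.opts</name>
  <value>-Xmx3072m</value>
</property>
<property>
  <name>mapreduce.map.memory.mb</name>
  <value>4096</value>
</property>
```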
You have a cluster running with the FIFO scheduler enabled. You submit a large job A to the cluster,
which you expect to run for one hour. Then you submit job B, which you expect to run for only a
couple of minutes.
You submit both jobs with the same priority.
Which two best describe how the FIFO Scheduler arbitrates the cluster resources for the jobs and
their tasks?
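For reference, the FIFO scheduler referred to here is selected in yarn-site.xml (most distributions default to the Capacity or Fair scheduler instead):

```
<property>
  <name>yarn.resourcemanager.scheduler.class</name>
  <value>org.apache.hadoop.yarn.server.resourcemanager.scheduler.fifo.FifoScheduler</value>
</property>
```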
A user comes to you, complaining that when she attempts to submit a Hadoop job, it fails. There is
a directory in HDFS named /data/input. The JAR is named j.jar, and the driver class is named
DriverClass.
She runs the command:
hadoop jar j.jar DriverClass /data/input /data/output
The error message returned includes the line:
PriviledgedActionException as:training (auth:SIMPLE)
cause:org.apache.hadoop.mapreduce.lib.input.InvalidInputException:
Input path does not exist: file:/data/input
What is the cause of the error?
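A hint for reading the message: the file: scheme shows the path was resolved against the local filesystem rather than HDFS, which happens when the client runs without a core-site.xml that names the default filesystem. A minimal fragment (the port 8020 is an assumption, common on Hadoop 2):

```
<property>
  <name>fs.defaultFS</name>
  <value>hdfs://mynamenode:8020</value>
</property>
```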
Your company stores user profile records in an OLTP database. You want to join these records
with web server logs you have already ingested into the Hadoop file system. What is the best way
to obtain and ingest these user records?
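For context, the standard tool for this kind of OLTP-to-HDFS transfer is Apache Sqoop; the connect string, table name, and target directory below are placeholders:

```
$ sqoop import \
    --connect jdbc:mysql://dbhost/crm \
    --table user_profiles \
    --target-dir /data/user_profiles
```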
Which two are features of Hadoop’s rack topology?
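For context, rack awareness is usually supplied by a topology script named in the net.topology.script.file.name property; the IP ranges and rack names below are assumptions for illustration:

```shell
# Hypothetical topology script: Hadoop invokes it with DataNode addresses
# and expects one rack path per argument on stdout.
cat > /tmp/topology.sh <<'EOF'
#!/bin/sh
for node in "$@"; do
  case "$node" in
    10.1.1.*) echo /rack1 ;;
    10.1.2.*) echo /rack2 ;;
    *)        echo /default-rack ;;
  esac
done
EOF
chmod +x /tmp/topology.sh
/tmp/topology.sh 10.1.1.5 10.1.2.7 192.168.0.9   # prints one rack path per node
```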
The Hadoop framework provides a mechanism for coping with machine issues such as faulty
configuration or impending hardware failure. MapReduce detects that one or a number of
machines are performing poorly and starts more copies of a map or reduce task. All the tasks run
simultaneously, and the output of the task that finishes first is used. This is called:
What is the disadvantage of using multiple reducers with the default HashPartitioner and
distributing your workload across your cluster?
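To illustrate the mechanism behind the question: HashPartitioner sends each record to reducer hash(key) mod numReducers, so every occurrence of a heavily repeated key lands on the same reducer no matter how many reducers exist. A small sketch using cksum as a stand-in hash:

```shell
# Each distinct key maps deterministically to one of 3 "reducers";
# a skewed key distribution therefore produces a skewed reducer load.
for key in alice alice alice alice bob carol; do
  h=$(printf '%s' "$key" | cksum | cut -d' ' -f1)
  echo "key=$key -> reducer $((h % 3))"
done
```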