- Part 39

Which two daemons need to be installed on your cluster’s master nodes?

seenagape, February 13, 2016 (5 comments)

You are configuring your cluster to run HDFS and MapReduce v2 (MRv2) on YARN. Which
two daemons need to be installed on your cluster’s master nodes?

How would you tune your io.sort.mb value to achieve maximum memory to disk I/O ratio?

seenagape, February 13, 2016 (3 comments)

You observed that the number of spilled records from Map tasks far exceeds the number of
map output records. Your child heap size is 1GB and your io.sort.mb value is set to
1000MB. How would you tune your io.sort.mb value to achieve maximum memory to disk
I/O ratio?
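The figures in this question can be put into a small sketch. It does not select an answer; it only shows the two quantities the question turns on: how many times the average record was spilled, and how much of the child heap is left once the sort buffer is reserved. The counter values and the function names below are hypothetical, and 1GB is assumed to mean 1024MB.

```python
def spill_ratio(spilled_records: int, map_output_records: int) -> float:
    """Average number of times each map output record was written to disk."""
    if map_output_records <= 0:
        raise ValueError("map_output_records must be positive")
    return spilled_records / map_output_records


def heap_headroom_mb(child_heap_mb: int, io_sort_mb: int) -> int:
    """Heap left for the task itself after the sort buffer is reserved."""
    return child_heap_mb - io_sort_mb


# Hypothetical counter values, chosen only to illustrate a ratio above 1.
ratio = spill_ratio(spilled_records=3_000_000, map_output_records=1_000_000)

# Settings from the question: 1GB child heap (assumed 1024MB), io.sort.mb = 1000.
headroom = heap_headroom_mb(child_heap_mb=1024, io_sort_mb=1000)

print(f"spill ratio: {ratio:.1f}")
print(f"heap headroom: {headroom} MB")
```

A ratio above 1 means that records were written to disk more than once on average, which is the symptom the question describes.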

Which best describes how you determine when the last checkpoint happened?

seenagape, February 13, 2016 (7 comments)

You are running a Hadoop cluster with a NameNode on host mynamenode, a secondary
NameNode on host mysecondarynamenode and several DataNodes. Which best
describes how you determine when the last checkpoint happened?

What does CDH packaging do on install to facilitate Kerberos security setup?

seenagape, February 13, 2016 (3 comments)

What does CDH packaging do on install to facilitate Kerberos security setup?

Which is the most efficient process to gather these web server access logs into your Hadoop cluster for analysis?

seenagape, February 13, 2016 (5 comments)

You want to understand more about how users browse your public website. For example,
you want to know which pages they visit prior to placing an order. You have a server farm of
200 web servers hosting your website. Which is the most efficient process to gather these
web server access logs into your Hadoop cluster for analysis?

Which three basic configuration parameters must you set to migrate your cluster from MapReduce 1 (MRv1) to Map

seenagape, February 13, 2016 (4 comments)

Which three basic configuration parameters must you set to migrate your cluster from
MapReduce 1 (MRv1) to MapReduce V2 (MRv2)?

Which data serialization system gives the flexibility to do this?

seenagape, February 13, 2016 (6 comments)

You need to analyze 60,000,000 images stored in JPEG format, each of which is
approximately 25 KB. Because your Hadoop cluster isn’t optimized for storing and
processing many small files, you decide to do the following actions: 1. Group the individual
images into a set of larger files 2. Use the set of larger files as input for a MapReduce job
that processes them directly with Python using Hadoop streaming. Which data serialization
system gives the flexibility to do this?
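To get a feel for the scale in this question, the arithmetic can be sketched as follows. It does not select an answer. The 128 MB container-file size is an assumption made for illustration, as is reading 25 KB as 25 × 1024 bytes; neither comes from the question, and the function names are hypothetical.

```python
import math

KB = 1024
MB = 1024 * KB


def total_bytes(file_count: int, avg_file_bytes: int) -> int:
    """Total payload size of all small files combined."""
    return file_count * avg_file_bytes


def container_files_needed(payload_bytes: int, container_bytes: int) -> int:
    """Number of larger files needed if each holds up to container_bytes."""
    return math.ceil(payload_bytes / container_bytes)


# Figures from the question: 60,000,000 images of roughly 25 KB each.
payload = total_bytes(60_000_000, 25 * KB)

# Assumed target size for each grouped file (not stated in the question).
containers = container_files_needed(payload, 128 * MB)

print(f"total payload: {payload} bytes")
print(f"grouped files at 128 MB each: {containers}")
```

Under these assumptions, grouping reduces tens of millions of input files to a count in the low tens of thousands, which is the motivation for step 1 in the question.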


Author: seenagape


Identify two features/issues that YARN is designed to address:

Which YARN daemon or service monitors a Container’s per-application resource usage (e.g., memory, CPU)?

Which is the default scheduler in YARN?