Table schemas in Hive are:
For each YARN job, the Hadoop framework generates task log files. Where are Hadoop task log
files stored?
You have a cluster running with the Fair Scheduler enabled. There are currently no jobs running on
the cluster, and you submit Job A, so that only Job A is running on the cluster. A while later, you
submit Job B. Now Job A and Job B are running on the cluster at the same time. How will the Fair
Scheduler handle these two jobs?
Each node in your Hadoop cluster, running YARN, has 64GB memory and 24 cores. Your
yarn-site.xml has the following configuration:
<property>
<name>yarn.nodemanager.resource.memory-mb</name>
<value>32768</value>
</property>
<property>
<name>yarn.nodemanager.resource.cpu-vcores</name>
<value>12</value>
</property>
You want YARN to launch no more than 16 containers per node. What should you do?
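As a worked illustration of how these limits interact (a sketch, not the exam's answer key): the number of containers per node is bounded by the available memory divided by the smallest container the scheduler will allocate, so with 32768 MB available, a 2048 MB minimum yields at most 32768 / 2048 = 16 containers. The property below is a standard YARN setting; the value shown is an assumption chosen to make the arithmetic work out:

```
<!-- Sketch: 32768 MB per node / 2048 MB minimum allocation = at most 16 containers -->
<property>
<name>yarn.scheduler.minimum-allocation-mb</name>
<value>2048</value>
</property>
```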
You want the node to swap Hadoop daemon data from RAM to disk only when absolutely
necessary. What should you do?
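One common approach, offered as a hedged sketch rather than a definitive answer: lower the Linux kernel's vm.swappiness setting, so the kernel swaps pages out only under real memory pressure. A persistent entry in /etc/sysctl.conf might look like:

```
# /etc/sysctl.conf -- swap only when absolutely necessary
# (the value 1 is an illustrative choice; 0 disables swapping entirely on older kernels)
vm.swappiness = 1
```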
You are configuring your cluster to run HDFS and MapReduce v2 (MRv2) on YARN. Which two
daemons need to be installed on your cluster's master nodes?
You observed that the number of spilled records from Map tasks far exceeds the number of map
output records. Your child heap size is 1GB and your io.sort.mb value is set to 1000MB. How
would you tune your io.sort.mb value to achieve maximum memory to disk I/O ratio?
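For context (an assumption-laden sketch, not the answer key): io.sort.mb sets the size of the map-side sort buffer, and that buffer must fit inside the child JVM heap. A 1000 MB buffer inside a 1 GB heap leaves almost no working memory for the task itself, which can force extra spills on its own. A mapred-site.xml fragment with an illustrative, more conservative value might look like:

```
<!-- Illustrative value only; tune against your heap size and spill counters -->
<property>
<name>io.sort.mb</name>
<value>200</value>
</property>
```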
You are running a Hadoop cluster with a NameNode on host mynamenode, a secondary
NameNode on host mysecondarynamenode and several DataNodes.
Which best describes how you determine when the last checkpoint happened?
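Two ways to inspect checkpoint age, sketched under the assumption of a Hadoop 2 cluster (the directory path below is an illustrative default, not a known fact about this cluster): the NameNode web UI reports the last checkpoint, and the modification time of the newest fsimage file in the secondary NameNode's checkpoint directory records when it was written.

```shell
# Illustrative only; the checkpoint directory path is an assumption.
# Newest fsimage file's mtime = time of the last checkpoint.
ssh mysecondarynamenode 'ls -lt /var/lib/hadoop-hdfs/dfs/namesecondary/current/fsimage_*'
```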
What does CDH packaging do on install to facilitate Kerberos security setup?
You want to understand more about how users browse your public website. For example, you
want to know which pages they visit prior to placing an order. You have a server farm of 200 web
servers hosting your website. Which is the most efficient process to gather these web server
logs into your Hadoop cluster for analysis?
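One widely used ingestion pattern for this scenario is Apache Flume. The fragment below is a minimal agent sketch; the agent, source, channel, sink, and path names are illustrative assumptions, not a production configuration:

```
# Flume agent sketch (all names are illustrative assumptions)
agent1.sources = weblogs
agent1.channels = mem
agent1.sinks = hdfs-sink

# Tail the web server access log as an event source
agent1.sources.weblogs.type = exec
agent1.sources.weblogs.command = tail -F /var/log/httpd/access_log
agent1.sources.weblogs.channels = mem

# Buffer events in memory between source and sink
agent1.channels.mem.type = memory

# Write events into HDFS for later analysis
agent1.sinks.hdfs-sink.type = hdfs
agent1.sinks.hdfs-sink.hdfs.path = /user/flume/weblogs
agent1.sinks.hdfs-sink.channel = mem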