- Part 11

which best describes the file access rules in HDFS if the file has a single block that is stored on data nodes

seenagapeFebruary 13, 2017 2 comments

A client application creates an HDFS file named foo.txt with a replication factor of 3. Identify which
best describes the file access rules in HDFS if the file has a single block that is stored on data
nodes A, B and C?

What is the preferred way to pass a small number of configuration parameters to a mapper or reducer?

seenagapeFebruary 12, 2017 5 comments

What is the preferred way to pass a small number of configuration parameters to a mapper or
reducer?

Which of the following would you use?

seenagapeFebruary 11, 2017 3 comments

You need a distributed, scalable, data Store that allows you random, realtime read/write access to
hundreds of terabytes of data. Which of the following would you use?

What is the best way to accomplish this?

seenagapeFebruary 9, 2017 10 comments

To process input key-value pairs, your mapper needs to lead a 512 MB data file in memory. What
is the best way to accomplish this?

How many times will the Reducer’s reduce method be invoked?

seenagapeFebruary 7, 2017 2 comments

You have written a Mapper which invokes the following five calls to the OutputColletor.collect
method:
output.collect (new Text (“Apple”), new Text (“Red”) ) ;
output.collect (new Text (“Banana”), new Text (“Yellow”) ) ;
output.collect (new Text (“Apple”), new Text (“Yellow”) ) ;
output.collect (new Text (“Cherry”), new Text (“Red”) ) ;
output.collect (new Text (“Apple”), new Text (“Green”) ) ;
How many times will the Reducer’s reduce method be invoked?

which best describes when the reduce method is first called in a MapReduce job?

seenagapeFebruary 5, 2017 10 comments

Determine which best describes when the reduce method is first called in a MapReduce job?

which is the correct way of submitting the job to the cluster?

seenagapeFebruary 4, 2017 4 comments

Given a Mapper, Reducer, and Driver class packaged into a jar, which is the correct way of
submitting the job to the cluster?

Which of the following utilities allows you to create and run MapReduce jobs with any executable or script as

seenagapeFebruary 3, 2017 3 comments

Which of the following utilities allows you to create and run MapReduce jobs with any executable
or script as the mapper and/or the reducer?

Identify the Hadoop daemon on which the Hadoop framework will look for an available slot schedule a MapReduce

seenagapeFebruary 1, 2017 2 comments

Your client application submits a MapReduce job to your Hadoop cluster. Identify the Hadoop
daemon on which the Hadoop framework will look for an available slot schedule a MapReduce
operation.

which two resources should you expect to be bottlenecks?

seenagapeJanuary 31, 2017 5 comments

You need to create a job that does frequency analysis on input data. You will do this by writing a
Mapper that uses TextInputFormat and splits each value (a line of text from an input file) into
individual characters. For each one of these characters, you will emit the character as a key and
an InputWritable as the value. As this will produce proportionally more intermediate data than input
data, which two resources should you expect to be bottlenecks?

Page 11 of 71« First «...9 101112 13...20 30 40...»Last »

Get 50% Discount on All Your Purchases
at PrepAway.com - Latest Exam Questions

This is ONE TIME OFFER

Enter your email address to receive your 50% off dicount code:

SPECIAL OFFER: GET 50% OFF

Use Discount Code:

Briefing Cloudera Knowledge

Free Cloudera Study Guide

Author: seenagape