- Part 17

For each input key-value pair, mappers can emit:

seenagape, November 12, 2016 (5 comments)

For each input key-value pair, mappers can emit:
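As an illustration of what "emit" means here, the following is a minimal plain-Python sketch, not Hadoop code. It models a mapper as a generator that may produce zero, one, or several key-value pairs for a single input pair. The function name `grep_mapper` and the regular expression are hypothetical and chosen only for this example.

```python
import re

def grep_mapper(offset, line, pattern=r"\b\w+ing\b"):
    """Toy mapper: yield one (matched_text, offset) pair per regex match.

    The input pair is (byte offset, line of text). The number of emitted
    pairs depends entirely on the input: it can be zero or many.
    """
    for match in re.finditer(pattern, line):
        yield match.group(0), offset

# No match in the line: the mapper emits nothing for this input pair.
no_pairs = list(grep_mapper(0, "no match here"))

# Two matches in the line: the mapper emits two pairs for one input pair.
two_pairs = list(grep_mapper(10, "running and jumping"))
```

Note that the emitted key (the matched text) has a different meaning from the input key (the byte offset); the sketch makes no claim about how a real job would configure its types.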

For each intermediate key, each reducer task can emit:

seenagape, November 11, 2016 (8 comments)

For each intermediate key, each reducer task can emit:

Determine the difference between setting the number of reducers to one and setting the number of reducers to zero.

seenagape, November 10, 2016 (3 comments)

You write a MapReduce job to process 100 files in HDFS. Your MapReduce algorithm uses
TextInputFormat and the IdentityReducer: the mapper applies a regular expression over input
values and emits key-value pairs with the key consisting of the matching text, and the value
containing the filename and byte offset. Determine the difference between setting the number of
reducers to one and setting the number of reducers to zero.

What happens in a MapReduce job when you set the number of reducers to zero?

seenagape, November 9, 2016 (4 comments)

What happens in a MapReduce job when you set the number of reducers to zero?
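The effect of the reducer count can be sketched with a toy model in plain Python. This is not Hadoop code: `run_job`, the byte-sum partitioner, and the sample data are all assumptions made for illustration, and the reduce step is modelled as an identity pass-through. With zero reducers the job is map-only, so there is no shuffle or sort and each map task writes its own output. With one or more reducers, the pairs are partitioned and sorted by key before being written.

```python
from collections import defaultdict

def run_job(splits, mapper, num_reducers):
    """Toy model: return a list of output 'files', each a list of (key, value)."""
    map_outputs = [[kv for rec in split for kv in mapper(rec)] for split in splits]
    if num_reducers == 0:
        # Map-only job: no shuffle, no sort; one output per map task.
        return map_outputs
    # Shuffle: route each pair to a reducer (hypothetical byte-sum partitioner),
    # then sort every partition by key. The reducer here is an identity step.
    partitions = defaultdict(list)
    for output in map_outputs:
        for key, value in output:
            partitions[sum(key.encode()) % num_reducers].append((key, value))
    return [sorted(partitions[i]) for i in range(num_reducers)]

def mapper(record):
    # Hypothetical mapper: emit the lower-cased record with a count of 1.
    yield record.lower(), 1

splits = [["b", "a"], ["C", "a"], ["d"]]   # three splits, so three map tasks
map_only = run_job(splits, mapper, num_reducers=0)
one_reducer = run_job(splits, mapper, num_reducers=1)
```

In this model `map_only` holds three unsorted outputs, one per map task, while `one_reducer` holds a single output whose pairs are sorted by key.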

In writing a MapReduce program to accomplish this, can you take advantage of a combiner?

seenagape, November 7, 2016 (3 comments)

You have a large dataset of key-value pairs, where the keys are strings, and the values are
integers. For each unique key, you want to identify the largest integer. In writing a MapReduce
program to accomplish this, can you take advantage of a combiner?
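The scenario can be sketched in plain Python, under the assumption that the per-key maximum is computed by one function used both as combiner and as reducer. That reuse is safe for `max` because the operation is associative and commutative: taking the maximum of per-mapper maxima gives the same result as taking the maximum of all values. The names `max_reduce`, `group`, and `run`, and the sample data, are hypothetical.

```python
from collections import defaultdict

def max_reduce(key, values):
    """Return (key, largest value); usable as both combiner and reducer."""
    return key, max(values)

def group(pairs):
    """Group values by key, as the framework would before calling reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def run(map_outputs, use_combiner):
    """Return (final result, number of pairs sent from mappers to reducers)."""
    shuffled = []
    for output in map_outputs:
        if use_combiner:
            # Pre-aggregate locally on each mapper's output.
            output = [max_reduce(k, vs) for k, vs in group(output).items()]
        shuffled.extend(output)
    result = dict(max_reduce(k, vs) for k, vs in group(shuffled).items())
    return result, len(shuffled)

map_outputs = [
    [("a", 3), ("a", 7), ("b", 1)],   # output of mapper 1
    [("a", 5), ("b", 9), ("b", 2)],   # output of mapper 2
]
plain, sent_plain = run(map_outputs, use_combiner=False)
combined, sent_combined = run(map_outputs, use_combiner=True)
```

Both runs produce the same per-key maxima; the run with the combiner sends fewer pairs across the simulated shuffle.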

How many key-value pairs will there be in each file?

seenagape, November 5, 2016 (6 comments)

If you run the word count MapReduce program with m mappers and r reducers, how many output
files will you get at the end of the job? And how many key-value pairs will there be in each file?
Assume k is the number of unique words in the input files.
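The relationship between m, r, and k can be explored with a small plain-Python model. It is a sketch under stated assumptions: each input string stands for one mapper's split, each reducer writes one output file, and a CRC32-based partitioner (a stand-in chosen here for determinism, not Hadoop's default) assigns each word to exactly one reducer.

```python
import zlib
from collections import Counter

def word_count(splits, r):
    """Toy word count: return r 'output files', one Counter per reducer."""
    files = [Counter() for _ in range(r)]
    for split in splits:                      # one iteration per mapper
        for word in split.split():
            # Hypothetical partitioner: the same word always maps to the
            # same reducer, so each unique word lands in exactly one file.
            files[zlib.crc32(word.encode()) % r][word] += 1
    return files

splits = ["the quick brown fox", "the lazy dog", "the fox"]   # m = 3 mappers
files = word_count(splits, r=2)
k = len({word for split in splits for word in split.split()})  # unique words
pairs_per_file = [len(f) for f in files]
```

In this model the number of output files equals r regardless of m, and the pairs in all files together add up to k. How the k pairs are divided among the files depends on the partitioner, so the sketch does not assume an even split.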

Which of the following interfaces is most likely to reduce the amount of intermediate data transferred across the network?

seenagape, November 3, 2016 (3 comments)

You’ve written a MapReduce job that will process 500 million input records and generate 500
million key-value pairs. The data is not uniformly distributed. Your MapReduce job will create a
significant amount of intermediate data that it needs to transfer between mappers and reducers,
which is a potential bottleneck. A custom implementation of which of the following interfaces is
most likely to reduce the amount of intermediate data transferred across the network?

Which statement best describes the data path of intermediate key-value pairs (i.e., output of the mappers)?

seenagape, November 2, 2016 (3 comments)

Which statement best describes the data path of intermediate key-value pairs (i.e., output of the
mappers)?

Would HDFS be appropriate for this customer information file?

seenagape, November 1, 2016 (3 comments)

You need to create a GUI application to help your company’s salespeople add and edit customer
information. Would HDFS be appropriate for this customer information file?

When is the reduce method first called in a MapReduce job?

seenagape, October 30, 2016 (3 comments)

When is the reduce method first called in a MapReduce job?

Briefing Cloudera Knowledge

Free Cloudera Study Guide

Author: seenagape
