How are keys and values presented and passed to the reducers during a standard sort and shuffle phase of MapRe

seenagapeMay 29, 2015

How are keys and values presented and passed to the reducers during a standard sort and shuffle
phase of MapReduce?

PrepAway - Latest Free Exam Questions & Answers

A.
Keys are presented to reducer in sorted order; values for a given key are not sorted.

B.
Keys are presented to reducer in sorted order; values for a given key are sorted in ascending
order.

C.
Keys are presented to a reducer in random order; values for a given key are not sorted.

D.
Keys are presented to a reducer in random order; values for a given key are sorted in
ascending order.

Explanation:
Reducer has 3 primary phases:
1. Shuffle
The Reducer copies the sorted output from each Mapper using HTTP across the network.
2. Sort
The framework merge sorts Reducer inputs by keys (since different Mappers may have output the
same key).
The shuffle and sort phases occur simultaneously i.e. while outputs are being fetched they are
merged.
SecondarySort
To achieve a secondary sort on the values returned by the value iterator, the application should
extend the key with the secondary key and define a grouping comparator. The keys will be sorted
using the entire key, but will be grouped using the grouping comparator to decide which keys and
values are sent in the same call to reduce.
3. Reduce
In this phase the reduce(Object, Iterable, Context) method is called for each <key, (collection of
values)> in the sorted inputs.
The output of the reduce task is typically written to a RecordWriter via
TaskInputOutputContext.write(Object, Object).
The output of the Reducer is not re-sorted.
Reference: org.apache.hadoop.mapreduce, Class
Reducer<KEYIN,VALUEIN,KEYOUT,VALUEOUT>

5 Comments on “How are keys and values presented and passed to the reducers during a standard sort and shuffle phase of MapRe”

madhuri says:

August 10, 2015 at 10:56 pm

Vishal says:

August 24, 2015 at 2:14 pm

Henry says:

September 9, 2015 at 7:43 pm

A. Each of the mappers partition their intermediate data into as many reducers and than sorts on keys for each of the partitions. Each reducer receives intermediate data from as many mappers, and than sorts on key for all the partitions for a local total sort within the reducer. To achieve global total sort on the key, you have to write a custom partitioner than ensures that all keys for partition1 < all keys for partion2 < all keys for partionN

Nagaraju says:

November 16, 2015 at 1:56 pm

A.
The keys are sorted but the values for the Keys are not sorted.

rutuparn says:

April 25, 2016 at 1:58 pm

madhuri says:

August 10, 2015 at 10:56 pm

D

0

0

Vishal says:

August 24, 2015 at 2:14 pm

A

0

0

Henry says:

September 9, 2015 at 7:43 pm

A. Each of the mappers partition their intermediate data into as many reducers and than sorts on keys for each of the partitions. Each reducer receives intermediate data from as many mappers, and than sorts on key for all the partitions for a local total sort within the reducer. To achieve global total sort on the key, you have to write a custom partitioner than ensures that all keys for partition1 < all keys for partion2 < all keys for partionN

0

0

Nagaraju says:

November 16, 2015 at 1:56 pm

A.
The keys are sorted but the values for the Keys are not sorted.

0

0

rutuparn says:

April 25, 2016 at 1:58 pm

A

0

0

Get 50% Discount on All Your Purchases
at PrepAway.com - Latest Exam Questions

This is ONE TIME OFFER

Enter your email address to receive your 50% off dicount code:

SPECIAL OFFER: GET 50% OFF

Use Discount Code:

Briefing Cloudera Knowledge

Free Cloudera Study Guide

How are keys and values presented and passed to the reducers during a standard sort and shuffle phase of MapRe

5 Comments on “How are keys and values presented and passed to the reducers during a standard sort and shuffle phase of MapRe”

Leave a Reply Cancel reply