On a cluster running MapReduce v2 (MRv2) on YARN, a MapReduce job is given a directory of 10 plain text files as its input. Each file is made up of 3 HDFS blocks. How many Mappers will run?

A. We cannot say; the number of Mappers is determined by the ResourceManager
B. We cannot say; the number of Mappers is determined by the developer
C. 30
D. 3
E. 10
F. We cannot say; the number of Mappers is determined by the ApplicationMaster
C is the correct answer – one Mapper per input split, and by default one split per block, so 30 blocks means 30 Mappers.
C is the correct answer.
Answer: C
Each block gets its own Mapper by default: 3 blocks per file × 10 files = 30 Mappers.
The number of Mappers is based on the number of input splits (which is decided by the InputFormat), so it depends on the developer. I think B is the right one.
The number of Mappers depends on the input split size, which by default equals the block size, so here it will be 30 Mappers. However, the developer has the option to override this parameter and set the input split size explicitly, which can change the number of Mappers for the job.
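For illustration, a minimal sketch of how a driver could override the split size using the MRv2 (org.apache.hadoop.mapreduce) API; the 256 MB value is an example of my own, not anything from the question:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;

public class SplitSizeDriver {
    public static void main(String[] args) throws Exception {
        Job job = Job.getInstance(new Configuration(), "split-size-demo");
        // Raise the minimum split size above the block size so a single
        // split can span several blocks of the same file, cutting the
        // Mapper count. (256 MB is an illustrative value.)
        FileInputFormat.setMinInputSplitSize(job, 256L * 1024 * 1024);
        FileInputFormat.setMaxInputSplitSize(job, 256L * 1024 * 1024);
        FileInputFormat.addInputPath(job, new Path(args[0]));
        // ... set Mapper/Reducer classes, output path, then job.waitForCompletion(true)
    }
}

With a setting like this, a 3-block file whose total size is under 256 MB would be read by a single Mapper, so the 30-Mapper answer only holds under the default split settings.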
The default split size is one block; 30 blocks hence 30 Mappers, unless the developer configures the split size otherwise. "C".
I think the context here is that "1 file takes 3 blocks" means the default replication factor of 3. Hence one Mapper per file, so it's 10.
It should be E. People choose C because they forget the file actually contains only 1 block; the other 2 are replicas.
No, it didn't say the replication factor is 3, and what if a file genuinely needs 3 blocks? The answer should be C.
What is the correct answer? C or E?
It's C. Each file is 3 blocks, meaning 10 × 3 = 30; the 3 blocks are the file's actual size, not replicas.
Mappers are instantiated based on the number of input splits, not the number of blocks.
Answer is E.
The number of input splits equals the number of Mappers.
Answer: E
E. Since there's no information about the split size, it could be larger than the block size.
The number of Mappers depends on the number of splits; however, since splits never cross file boundaries, a file smaller than the split size still gets one Mapper of its own. That is the reason a large number of small files is not recommended.
The properties that determine the split size, and their default values, are as follows:
mapred.min.split.size=1 (in bytes)
mapred.max.split.size=Long.MAX_VALUE
dfs.block.size=64 MB
The split size is calculated as:
inputSplitSize = max(minimumSize, min(maximumSize, blockSize))
# of Mappers = totalInputSize / inputSplitSize
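To make that concrete, here is a small self-contained Java sketch of the same calculation (the computeSplitSize logic mirrors FileInputFormat's formula); the 64 MB block size comes from the default above, the 10-files-of-3-blocks layout from the question, and it assumes each file fills its 3 blocks exactly:

public class MapperCountDemo {
    // Same formula as FileInputFormat.computeSplitSize()
    static long computeSplitSize(long blockSize, long minSize, long maxSize) {
        return Math.max(minSize, Math.min(maxSize, blockSize));
    }

    public static void main(String[] args) {
        long blockSize = 64L * 1024 * 1024;  // dfs.block.size default (64 MB)
        long minSize   = 1L;                 // mapred.min.split.size default
        long maxSize   = Long.MAX_VALUE;     // mapred.max.split.size default

        long splitSize = computeSplitSize(blockSize, minSize, maxSize); // = blockSize

        int files = 10, blocksPerFile = 3;
        long fileSize = blocksPerFile * blockSize; // assumes each file spans 3 full blocks
        // Splits are computed per file, so:
        long splitsPerFile = fileSize / splitSize; // = 3
        System.out.println("Mappers = " + files * splitsPerFile); // prints 30
    }
}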
I have the same idea. E
It's E, I have verified it.
C.
If you have not defined any input split size in the MapReduce program, then the default HDFS block boundary is used as the input split. There is no mention of replicas.