Which InputFormat should you use to complete the line: conf.setInputFormat (____.class) ; ?

seenagape, October 23, 2015

Given a directory of files with the following structure: line number, tab character, string:
Example:
1	abialkjfjkaoasdfjksdlkjhqweroij
2	kadfjhuwqounahagtnbvaswslmnbfgy
3	kjfteiomndscxeqalkzhtopedkfsikj
You want to send each line as one record to your Mapper. Which InputFormat should you use to
complete the line: conf.setInputFormat (____.class) ; ?


A. SequenceFileAsTextInputFormat
B. SequenceFileInputFormat
C. KeyValueFileInputFormat
D. BDBInputFormat

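Explanation:
The input is plain text in which each line holds a key, a tab character, and a value. In Hadoop, KeyValueTextInputFormat reads such files: each line becomes one record, split at the first tab into a Text key and a Text value. Of the options listed, C (KeyValueFileInputFormat) is the closest match to that class name. SequenceFileInputFormat and SequenceFileAsTextInputFormat read Hadoop's binary SequenceFile format, not plain text.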
Note:
When two MR jobs are chained, the output format for the first job should be SequenceFileOutputFormat. It stores the key/value pairs emitted by the reducer in a binary format, which the second MR job can then read back using SequenceFileInputFormat.
Reference: How to parse CustomWritable from text in Hadoop
http://stackoverflow.com/questions/9721754/how-to-parse-customwritable-from-text-in-hadoop
(see answer 1 and then see the comment #1 for it)
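
As a rough illustration of what "each line as one record" means for the tab-separated files in the question, the sketch below mimics in plain Java how Hadoop's KeyValueTextInputFormat treats a line by default: the text before the first tab becomes the key, the rest becomes the value, and a line with no tab becomes a key with an empty value. The class KeyValueLineSketch and the method splitAtFirstTab are hypothetical names made up for this example and are not part of Hadoop. In a real job written against the old org.apache.hadoop.mapred API, the framework does this work once the driver calls conf.setInputFormat(KeyValueTextInputFormat.class).

```java
import java.util.AbstractMap.SimpleImmutableEntry;
import java.util.Map;

// Hypothetical helper (not a Hadoop class): imitates the default
// key/value split that KeyValueTextInputFormat applies to each line.
class KeyValueLineSketch {

    // Split a line at the FIRST tab only; later tabs stay in the value.
    static Map.Entry<String, String> splitAtFirstTab(String line) {
        int pos = line.indexOf('\t');
        if (pos < 0) {
            // No separator: the whole line is the key, the value is empty.
            return new SimpleImmutableEntry<>(line, "");
        }
        return new SimpleImmutableEntry<>(
                line.substring(0, pos), line.substring(pos + 1));
    }

    public static void main(String[] args) {
        // Sample lines taken from the question: line number, tab, string.
        String[] lines = {
            "1\tabialkjfjkaoasdfjksdlkjhqweroij",
            "2\tkadfjhuwqounahagtnbvaswslmnbfgy",
            "3\tkjfteiomndscxeqalkzhtopedkfsikj"
        };
        for (String line : lines) {
            Map.Entry<String, String> record = splitAtFirstTab(line);
            // Each line yields one record, as a Mapper would receive it.
            System.out.println("key=" + record.getKey()
                    + " value=" + record.getValue());
        }
    }
}
```

Under this scheme the Mapper receives the line number as its key and the string as its value, one record per input line. The formats that read SequenceFiles expect binary input and would not parse these text files.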
