You use the hadoop fs -put command to write a 300 MB file using an HDFS block size of 64 MB.
Just after this command has finished writing 200 MB of this file, what would another user see
when trying to access this file?
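The block arithmetic behind this scenario can be sketched as follows. This is a minimal illustration, not HDFS code: the class and method names are hypothetical, and it only counts blocks. It does not model what a concurrent reader is actually permitted to see.

```java
public class BlockMath {
    static final long MB = 1024L * 1024L;

    // Number of blocks needed to hold fileSize bytes (the last block may be partial).
    static long totalBlocks(long fileSize, long blockSize) {
        return (fileSize + blockSize - 1) / blockSize;
    }

    // Number of blocks that are completely filled after bytesWritten bytes.
    static long completedBlocks(long bytesWritten, long blockSize) {
        return bytesWritten / blockSize;
    }

    public static void main(String[] args) {
        long blockSize = 64 * MB;
        System.out.println(totalBlocks(300 * MB, blockSize));     // 5
        System.out.println(completedBlocks(200 * MB, blockSize)); // 3
        System.out.println((200 * MB) % blockSize / MB);          // 8 (MB in the block being written)
    }
}
```

Under these numbers, 200 MB corresponds to three full 64 MB blocks (192 MB) plus 8 MB of a fourth block that is still in progress.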
Identify the tool best suited to import a portion of a relational database every day as files into
HDFS and to generate Java classes to interact with that imported data.
You have a directory named jobdata in HDFS that contains four files: _first.txt, second.txt, .third.txt,
and #data.txt. How many files will be processed by the FileInputFormat.setInputPaths() command
when it is given a Path object representing this directory?
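For context, FileInputFormat skips hidden files, meaning names that start with "_" or ".", when it lists an input directory. The sketch below reproduces only that naming rule in plain Java so it runs without Hadoop on the classpath; the class and method names are hypothetical and this is not the Hadoop implementation itself.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class HiddenFileFilterSketch {
    // Mirrors the naming rule: names starting with "_" or "." count as hidden.
    static boolean isVisible(String name) {
        return !name.startsWith("_") && !name.startsWith(".");
    }

    // Keeps only the names that the rule above treats as visible.
    static List<String> visibleFiles(List<String> names) {
        List<String> kept = new ArrayList<>();
        for (String n : names) {
            if (isVisible(n)) {
                kept.add(n);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<String> jobdata =
            Arrays.asList("_first.txt", "second.txt", ".third.txt", "#data.txt");
        System.out.println(visibleFiles(jobdata)); // [second.txt, #data.txt]
    }
}
```

Note that a leading "#" is not covered by this rule, so #data.txt is kept.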
You write a MapReduce job to process 100 files in HDFS. Your MapReduce algorithm uses
TextInputFormat: the mapper applies a regular expression over input values and emits key-value
pairs with the key consisting of the matching text, and the value containing the filename and byte
offset. Determine the difference between setting the number of reducers to one and setting the
number of reducers to zero.
A combiner reduces:
In a MapReduce job with 500 map tasks, how many map task attempts will there be?
MapReduce v2 (MRv2/YARN) splits which major functions of the JobTracker into separate
daemons? Select two.
What types of algorithms are difficult to express in MapReduce v1 (MRv1)?
In the reducer, the MapReduce API provides you with an iterator over Writable values. What does
calling the next() method return?
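Hadoop reuses Writable objects while iterating over reducer values, so holding on to a returned reference without copying it is a common pitfall. The sketch below imitates that reuse pattern in plain Java. MutableValue and ReusingIterator are hypothetical stand-ins, not Hadoop classes.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

public class ReusedValueSketch {
    // Hypothetical stand-in for a mutable Writable such as Text.
    static class MutableValue {
        String content;

        MutableValue copy() {
            MutableValue c = new MutableValue();
            c.content = content;
            return c;
        }
    }

    // Refills and returns the same instance on every call to next().
    static class ReusingIterator implements Iterator<MutableValue> {
        private final String[] data;
        private final MutableValue shared = new MutableValue();
        private int pos = 0;

        ReusingIterator(String... data) { this.data = data; }

        public boolean hasNext() { return pos < data.length; }

        public MutableValue next() {
            shared.content = data[pos++];
            return shared;
        }
    }

    // Collects the values, either keeping raw references or copying each one.
    static List<String> collect(boolean copyEach) {
        List<MutableValue> held = new ArrayList<>();
        Iterator<MutableValue> it = new ReusingIterator("a", "b", "c");
        while (it.hasNext()) {
            MutableValue v = it.next();
            held.add(copyEach ? v.copy() : v);
        }
        List<String> out = new ArrayList<>();
        for (MutableValue v : held) {
            out.add(v.content);
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(collect(false)); // [c, c, c]
        System.out.println(collect(true));  // [a, b, c]
    }
}
```

Without the copy, every stored reference points at the one shared object, which ends up holding the last value read.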
Table metadata in Hive is: