For each intermediate key, each reducer task can emit:
For each intermediate key, each reducer task can emit:
Which of the following is a data warehousing software built on top of Apache Hadoop that defines a simple SQL-
You have an employee who is a Date Analyst and is very comfortable with SQL. He would like to
run ad-hoc analysis on data in your HDFS duster. Which of the following is a data warehousing
software built on top of Apache Hadoop that defines a simple SQL-like query language well-suited
for this kind of user?
Which of the following best describes the workings of TextInputFormat?
Which of the following best describes the workings of TextInputFormat?
which best defines a SequenceFile?
Indentify which best defines a SequenceFile?
What determines how the JobTracker assigns each map task to a TaskTracker?
On a cluster running MapReduce v1 (MRv1), a TaskTracker heartbeats into the JobTracker on
your cluster, and alerts the JobTracker it has an open map task slot.
What determines how the JobTracker assigns each map task to a TaskTracker?
Workflows expressed in Oozie can contain:
Workflows expressed in Oozie can contain:
Which of the following tools should you use to accomplish this?
You need to import a portion of a relational database every day as files to HDFS, and generate
Java classes to Interact with your imported data. Which of the following tools should you use to
accomplish this?
Which of the following statements most accurately describes the relationship between MapReduce and Pig?
Which of the following statements most accurately describes the relationship between MapReduce
and Pig?
Which process describes the lifecycle of a Mapper?
Which process describes the lifecycle of a Mapper?
how many blocks the input file occupies?
In a MapReduce job, you want each of your input files processed by a single map task. How do
you configure a MapReduce job so that a single map task processes each input file regardless of
how many blocks the input file occupies?