PrepAway - Latest Free Exam Questions & Answers

Identify the tool best suited to import a portion of a relational database every day as files into HDFS, and g

Identify the tool best suited to import a portion of a relational database every day as files into
HDFS, and generate Java classes to interact with that imported data?

PrepAway - Latest Free Exam Questions & Answers

A.
Oozie

B.
Flume

C.
Pig

D.
Hue

E.
Hive

F.
Sqoop

G.
fuse-dfs

Explanation:
Sqoop (“SQL-to-Hadoop”) is a straightforward command-line tool with the following capabilities:
Imports individual tables or entire databases to files in HDFS
Generates Java classes to allow you to interact with your imported data
Provides the ability to import from SQL databases straight into your Hive data warehouse
Note:
Data Movement Between Hadoop and Relational Databases
Data can be moved between Hadoop and a relational database as a bulk data transfer, or
relational tables can be accessed from within a MapReduce map function.
Note:
* Cloudera’s Distribution for Hadoop provides a bulk data transfer tool (i.e., Sqoop) that imports
individual tables or entire databases into HDFS files. The tool also generates Java classes that
support interaction with the imported data. Sqoop supports all relational databases over JDBC,
and Quest Software provides a connector (i.e., OraOop) that has been optimized for access to

data residing in Oracle databases.
Reference: http://log.medcl.net/item/2011/08/hadoop-and-mapreduce-big-data-analytics-gartner/
(Data Movement between hadoop and relational databases, second paragraph)

8 Comments on “Identify the tool best suited to import a portion of a relational database every day as files into HDFS, and g


Leave a Reply

Your email address will not be published. Required fields are marked *