PrepAway - Latest Free Exam Questions & Answers

Which of the following tools should you use to accomplish this?

You need to import a portion of a relational database every day as files to HDFS, and generate
Java classes to interact with your imported data. Which of the following tools should you use to
accomplish this?

A. Pig
B. Hue
C. Hive
D. Flume
E. Sqoop
F. Oozie
G. fuse-dfs

Explanation:
The correct answer is E. Sqoop (“SQL-to-Hadoop”) is a command-line tool with the following
capabilities:
Imports individual tables or entire databases to files in HDFS
Generates Java classes to allow you to interact with your imported data
Provides the ability to import from SQL databases straight into your Hive data warehouse
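As a rough illustration of the daily partial import described in the question, the sketch below assembles a `sqoop import` command line in Python. The flags used (`--connect`, `--username`, `--password-file`, `--table`, `--where`, `--target-dir`, `--num-mappers`) are standard Sqoop 1 import arguments; the host, database, table, column, user, and paths are hypothetical placeholders, and the helper function names are made up for this example. An import of a table also generates a Java class for its rows as a by-product.

```python
import shlex
from datetime import date, timedelta


def build_sqoop_import_args(jdbc_url, username, password_file, table,
                            where, target_dir, num_mappers=4):
    """Return the argument list for one `sqoop import` run."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--username", username,
        "--password-file", password_file,   # keeps the password off the command line
        "--table", table,
        "--where", where,                   # import only a portion of the table
        "--target-dir", target_dir,         # HDFS directory for the imported files
        "--num-mappers", str(num_mappers),  # number of parallel map tasks
    ]


def daily_import_args(day):
    """Build the import command for the rows belonging to a single day."""
    # Every name below (host, database, table, column, paths) is hypothetical.
    next_day = day + timedelta(days=1)
    where = (f"order_date >= '{day.isoformat()}' "
             f"AND order_date < '{next_day.isoformat()}'")
    return build_sqoop_import_args(
        jdbc_url="jdbc:mysql://db.example.com/sales",
        username="etl_user",
        password_file="/user/etl/.db-password",
        table="orders",
        where=where,
        target_dir=f"/data/sales/orders/{day.isoformat()}",
    )


if __name__ == "__main__":
    args = daily_import_args(date(2024, 1, 15))
    # Print the command rather than running it; on a machine with Sqoop
    # installed, the same list could be passed to subprocess.run(args).
    print(shlex.join(args))
```

Writing each day to its own target directory is one possible design choice; it avoids Sqoop refusing to write into a directory that already exists.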
Note:
Data Movement Between Hadoop and Relational Databases
Data can be moved between Hadoop and a relational database as a bulk data transfer, or
relational tables can be accessed from within a MapReduce map function.
Cloudera’s Distribution for Hadoop provides a bulk data transfer tool (Sqoop) that imports
individual tables or entire databases into HDFS files. The tool also generates Java classes that
support interaction with the imported data. Sqoop connects to relational databases over JDBC,
and Quest Software provides a connector (OraOop) that has been optimized for access to
data residing in Oracle databases.
Reference: http://log.medcl.net/item/2011/08/hadoop-and-mapreduce-big-data-analyticsgartner/ (Data Movement Between Hadoop and Relational Databases, second paragraph)
