PrepAway - Latest Free Exam Questions & Answers

Which tool should they use?

The web analytics team uses Hadoop to process access logs. They now want to correlate this
data with structured user data residing in a production single-instance JDBC database. They
collaborate with the production team to import the data into Hadoop. Which tool should they use?

PrepAway - Latest Free Exam Questions & Answers

A.
Sqoop

B.
Pig

C.
Chukwa

D.
Scribe

One Comment on “Which tool should they use?

  1. Jack Huang says:

    Apache Sqoop(TM) is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases.

    Apache Chukwa is an open source data collection system for monitoring large distributed systems. Apache Chukwa is built on top of the Hadoop Distributed File System (HDFS) and Map/Reduce framework and inherits Hadoop’s scalability and robustness. Apache Chukwa also includes a flexible and powerful toolkit for displaying, monitoring and analyzing results to make the best use of the collected data.

    Scribe was a server for aggregating log data streamed in real-time from a large number of servers.




    0



    0

Leave a Reply