PrepAway - Latest Free Exam Questions & Answers

Which is the most efficient process to gather these web servers access logs into your Hadoop cluster for analy

You want to understand more about how users browse your public website. For example, you war
know which pages they visit prior to placing an order. You have a server farm of 200 web server
hosting your website. Which is the most efficient process to gather these web servers access logs
into your Hadoop cluster for analysis?

PrepAway - Latest Free Exam Questions & Answers

A.
Sample the web server logs web servers and copy them into HDFS using curl

B.
Channel these click streams into Hadoop using Hadoop Streaming

C.
Write a MapReduce job with the web servers for mappers and the Hadoop cluster nodes for
reducers

D.
Import all user clicks from your OLTP databases Into Hadoop using Sqoop

E.
Ingest the server web logs into HDFS using Flume


Leave a Reply

Your email address will not be published. Required fields are marked *