PrepAway - Latest Free Exam Questions & Answers

Which three distcp features can you utilize on a Hadoop cluster?

Which three distcp features can you utilize on a Hadoop cluster?

PrepAway - Latest Free Exam Questions & Answers

A.
Use distcp to copy files only between two clusters or more. You cannot use distcp to copy data
between directories inside the same cluster.

B.
Use distcp to copy HBase table files.

C.
Use distcp to copy physical blocks from the source to the target destination in your cluster.

D.
Use distcp to copy data between directories inside the same cluster.

E.
Use distcp to run an internal MapReduce job to copy files.

Explanation:
DistCp (distributed copy) is a tool used for large inter/intra-cluster copying. It uses Map/Reduce to
effect its distribution, error handling and recovery, and reporting. It expands a list of files and
directories into input to map tasks, each of which will copy a partition of the files specified in the
source list. Its Map/Reduce pedigree has endowed it with some quirks in both its semantics and
execution.
Hadoop DistCp Guide

One Comment on “Which three distcp features can you utilize on a Hadoop cluster?


Leave a Reply

Your email address will not be published. Required fields are marked *