Can you use MapReduce to perform a relational join on two large tables sharing a key?

seenagape, January 20, 2017

Can you use MapReduce to perform a relational join on two large tables sharing a key? Assume
that the two tables are formatted as comma-separated files in HDFS.

A. Yes.

B. Yes, but only if one of the tables fits into memory.

C. Yes, so long as both tables fit into memory.

D. No, MapReduce cannot perform relational operations.

E. No, but it can be done with either Pig or Hive.

Explanation:
Note:
* Join Algorithms in MapReduce
A) Reduce-side join
B) Map-side join
C) In-memory join
/ Striped variant
/ Memcached variant
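* Which join to use?
/ In order of preference: in-memory join > map-side join > reduce-side join
/ Limitations of each:
In-memory join: limited by available memory
Map-side join: both tables must be sorted and partitioned on the join key in the same way
Reduce-side join: general purpose, with no memory or sort-order requirement on either table,
so a join remains possible when neither table fits into memory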
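
The reduce-side join mentioned above can be sketched in plain Python as a local simulation of the map, shuffle, and reduce phases. This is only an illustration, not Hadoop code: the function names, the "L"/"R" tags, and the sample rows are all hypothetical, the join key is assumed to be the first comma-separated field, and the naive split on "," does not handle quoted fields.

```python
from collections import defaultdict
from itertools import product


def map_phase(lines, tag, key_index=0):
    """Emit (join_key, (tag, other_fields)) for every CSV line of one table."""
    for line in lines:
        fields = line.strip().split(",")  # assumption: no quoted commas
        key = fields[key_index]
        rest = fields[:key_index] + fields[key_index + 1:]
        yield key, (tag, rest)


def shuffle(pairs):
    """Group mapper output by key, standing in for the framework's shuffle."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Inner join: pair each left record with each right record for this key."""
    left = [rest for tag, rest in values if tag == "L"]
    right = [rest for tag, rest in values if tag == "R"]
    for l_rest, r_rest in product(left, right):
        yield ",".join([key] + l_rest + r_rest)


def reduce_side_join(left_lines, right_lines):
    """Run both tables through map, shuffle, and reduce, returning joined rows."""
    pairs = list(map_phase(left_lines, "L")) + list(map_phase(right_lines, "R"))
    groups = shuffle(pairs)
    joined_rows = []
    for key in sorted(groups):
        joined_rows.extend(reduce_phase(key, groups[key]))
    return joined_rows


# Hypothetical sample tables, keyed on the first column.
users = ["1,alice", "2,bob", "3,carol"]
orders = ["1,book", "1,pen", "3,lamp", "4,desk"]
joined = reduce_side_join(users, orders)
for row in joined:
    print(row)
```

Tagging each record with its source table is what lets a single reducer tell the two sides apart. On a real cluster the grouping is done by the framework across many reducers, so neither table has to fit into one machine's memory, although this simple version still buffers all records for one key at a time.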
