Identify two features/issues that MapReduce v2 (MRv2/YARN) is designed to address:
Identify two features/issues that MapReduce v2 (MRv2/YARN) is designed to address:
What happens when client tries to write a file to/reports/myreport.txt?
You set up the Hadoop cluster using NameNode Federation. One NameNode manages the/users
namespace and one NameNode manages the/data namespace. What happens when client tries
to write a file to/reports/myreport.txt?
how much data will you be able to store?
Your Hadoop cluster has 25 nodes with a total of 100 TB (4 TB per node) of raw disk space
allocated HDFS storage. Assuming Hadoop’s default configuration, how much data will you be
able to store?
how many regions will your RegionServers have?
You have a total of three tables stored in HBase. Exchanging catalog regions, how many regions
will your RegionServers have?
How would you design the schema?
You have two tables in existing RDBMS. One contains information about the products you sell
(name, size, color, etc.) The other contains images of the products in JPEG format. These tables
are frequently joined in queries to your database. You would like to move this data into HBase.
How would you design the schema?
Adding an additional HMaster will:
Your HBase cluster has hit a performance wall and doesn’t seem to be getting faster as you add
RegionServers. Adding an additional HMaster will:
which daemon makes HDFS unavailable on a cluster running MapReduce v1 (MRv1)?
The failure of which daemon makes HDFS unavailable on a cluster running MapReduce v1
(MRv1)?
The most important consideration for slave nodes in a Hadoop cluster running production jobs that require shor
The most important consideration for slave nodes in a Hadoop cluster running production jobs that
require short turnaround times is:
what point in the write is your data secure?
Your client is writing to a region when the RegionServer crashes. At what point in the write is your
data secure?