Will disabling block caching improve scan performance?
You want to do a full table scan on your data. You decide to disable block caching to see if this
improves scan performance. Will disabling block caching improve scan performance?
where is the data saved first?
Your client application if; writing data to a Region. By default, where is the data saved first?
What is the recommended disk configuration for slave nodes in your Hadoop cluster with 6 x 2 TB hard drives?
What is the recommended disk configuration for slave nodes in your Hadoop cluster with 6 x 2 TB
hard drives?
You need to free up disk space on your HBase cluster
You need to free up disk space on your HBase cluster. You delete all versions of your data that is
older than one week. You notice your delete has had minimal impact on your storage availability.
This is because:
Which additional feature does HBase provide to HDFS?
You have data already stored in HDFS and are considering using HBase. Which additional feature
does HBase provide to HDFS?
How would you tune your io.sort.mb value to achieve maximum memory to disk I/O ratio?
You observe that the number of spilled records from map tasks for exceeds the number of map
output records. You child heap size is 1 GB and your io.sort.mb value is set to 100MB. How would
you tune your io.sort.mb value to achieve maximum memory to disk I/O ratio?
you need to configure to run on your master nodes?
You configure Hadoop cluster with both MapReduce frameworks, MapReduce v1 (MRv1) and
MapReduce v2 (MRv2/YARN). Which two MapReduce (computational) daemons do you need to
configure to run on your master nodes?
What is the maximum number of NameNodes daemon you should run on you cluster in order to avoid a “split-brai
You configure you cluster with HDFS High Availability (HA) using Quorum-Based storage. You do
not implement HDFS Federation.
What is the maximum number of NameNodes daemon you should run on you cluster in order to
avoid a “split-brain” scenario with your NameNodes?
Given the following HBase table schema: Row Key, colFam_A:a, colFam_A:b, colFamB:2, colFam_B:10 A table scan w
Given the following HBase table schema:
Row Key, colFam_A:a, colFam_A:b, colFamB:2, colFam_B:10
A table scan will return the column data in which of the following sorted orders: