You have two tables in an existing RDBMS. One contains information about the products you sell
(name, size, color, etc.). The other contains images of the products in JPEG format. These tables
are frequently joined in queries to your database. You would like to move this data into HBase.
How would you design the schema?
Your HBase cluster has hit a performance wall and doesn’t seem to be getting faster as you add
RegionServers. Adding an additional HMaster will:
Your client is writing to a region when the RegionServer crashes. At what point in the write is your
For a given Column Family, you want to always retain at least one version, but expire all other
versions that are older than 5 days. Which of the following Column Family attribute settings would
you set to do this?
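The retention rule in this question (always keep at least one version, expire the others once they are older than 5 days) can be modelled in plain Java. The sketch below is a simplified illustration and not HBase code: the class `RetentionSketch` and the method `retained` are hypothetical names, and it assumes that expiry is driven by a time-to-live in seconds combined with a minimum number of versions to keep, which is how the column family `TTL` and `MIN_VERSIONS` attributes are documented to interact.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical model of "expire after a TTL, but always keep N versions".
class RetentionSketch {
    // Five days expressed in seconds (the unit assumed for the TTL here).
    static final long FIVE_DAYS_SECONDS = 5L * 24 * 60 * 60;

    // Returns the version timestamps (milliseconds) that survive, newest first.
    static List<Long> retained(List<Long> timestampsMs, long nowMs,
                               long ttlSeconds, int minVersions) {
        List<Long> sorted = new ArrayList<>(timestampsMs);
        sorted.sort(Collections.reverseOrder()); // newest version first

        List<Long> kept = new ArrayList<>();
        for (int i = 0; i < sorted.size(); i++) {
            long ageMs = nowMs - sorted.get(i);
            boolean withinTtl = ageMs <= ttlSeconds * 1000L;
            boolean neededForMinimum = i < minVersions; // newest N always stay
            if (withinTtl || neededForMinimum) {
                kept.add(sorted.get(i));
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        long dayMs = 24L * 60 * 60 * 1000;
        long now = 100 * dayMs;
        List<Long> versions = new ArrayList<>();
        versions.add(now - 10 * dayMs);
        versions.add(now - 7 * dayMs);
        // Both versions are older than 5 days, yet one is still retained.
        System.out.println(retained(versions, now, FIVE_DAYS_SECONDS, 1));
    }
}
```

With a minimum of one version, a cell whose versions are all older than 5 days still keeps its newest version, while a cell with recent versions keeps only those inside the 5-day window.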
You have a table with 5 TB of data, 10 RegionServers, and a region size of 256 MB. You want to
continue with puts to widely dispersed row ids in your table. Which of the following will improve
You are storing page view data for a large number of Web sites, each of which has many
subdomains (www.example.com, archive.example.com, beta.example.com, etc.). Your reporting
tool needs to retrieve the total number of page views for a given subdomain of a Web site. Which
of the following rowkeys should you use?
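One way to reason about the rowkey choice is to look at how candidate keys sort, since HBase stores rows in lexicographic order of rowkey. The sketch below is only an illustration under assumptions: the class `RowKeySketch`, the methods `reverseDomain` and `sortedKeys`, and the reversed-hostname layout are hypothetical choices, not something the question prescribes. It shows that reversing the dot-separated labels makes all subdomains of one site sort next to each other.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;
import java.util.TreeSet;

// Hypothetical helper for experimenting with hostname-based rowkeys.
class RowKeySketch {
    // "www.example.com" becomes "com.example.www".
    static String reverseDomain(String host) {
        List<String> labels = Arrays.asList(host.split("\\."));
        Collections.reverse(labels); // reverses the backing array in place
        return String.join(".", labels);
    }

    // Returns the reversed keys in the lexicographic order a sorted store would use.
    static List<String> sortedKeys(List<String> hosts) {
        TreeSet<String> keys = new TreeSet<>();
        for (String host : hosts) {
            keys.add(reverseDomain(host));
        }
        return new ArrayList<>(keys);
    }

    public static void main(String[] args) {
        List<String> hosts = Arrays.asList(
            "www.example.com", "archive.example.com",
            "www.other.org", "beta.example.com");
        for (String key : sortedKeys(hosts)) {
            System.out.println(key);
        }
    }
}
```

Under this layout, a lookup for one subdomain is a single-row read, and the keys for a whole site form one contiguous range.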
From within an HBase application, you would like to create a new table named weblogs. You have
started with the following Java code:
HBaseAdmin admin = new HBaseAdmin(conf);
HTableDescriptor t = new HTableDescriptor("weblogs");
Which of the following method(s) would you use next?
Your client application connects to HBase for the first time and queries the .META. table. What
information does the .META. table provide to your client application?