Which of the following configuration values determines automated splitting?
Consider going to larger regions to cut down on the total number of regions on your
cluster. Generally less Regions to manage makes for a smoother running cluster (You can always
latermanually split the big Regions should one prove hot and you want to spread the request load
over the cluster). A lower number of regions is preferred, generally in the range of 20 to lowhundreds per RegionServer. Adjust the regionsize as appropriate to achieve this number.
For the 0.90.x codebase, the upper-bound of regionsize is about 4Gb, with a default of 256Mb. For
0.92.x codebase, due to the HFile v2 change much larger regionsizes can be supported (e.g.,
You may need to experiment with this setting based on your hardware configuration and
Adjust hbase.hregion.max.filesize in your hbase-site.xml. RegionSize can also be set on a pertable basis via HTableDescriptor.
Reference:The Apache HBaseReference Guide,Bigger Regions
http://hbase.apache.org/book/important_configurations.html(22.214.171.124. Bigger regions, see the code
in the last sentence)