How many regions (per region server) do you have on average?
If it's not too bad you might just be able to increase hbase.hregion.max.filesize to 10 or 20g and bounce all the region servers.
Then as you write more data you will fill up the existing regions.
"Too bad" is fuzzy. If you approach hundreds of regions per region server you likely have a problem, depending on your read/write patterns.
From: Ted Tuttle <[EMAIL PROTECTED]>
To: "[EMAIL PROTECTED]" <[EMAIL PROTECTED]>
Cc: Development <[EMAIL PROTECTED]>
Sent: Thursday, August 28, 2014 11:19 AM
Subject: state-of-the-art method for merging regions on v0.94
We recently realized our region size is 1G and need to increase it to get our region count under control. I've done some research on merging regions and have come away confused.
There is the ops handbook:
And then there is this horror story:
Is there someone out there that has done a large scale (i.e. 10:1 reduction on 10k's of regions) merge successfully on HBase 0.94? If so, how did you do it?