HBase >> mail # user >> Region Size == Size of Compressed Store file or Actual Size of Data in Store?


Region Size == Size of Compressed Store file or Actual Size of Data in Store?
Hi All,

In one of my test clusters, I have set the region size to 1 GB and I am using
Snappy compression.
The combined size of the store files under that table is 50 GB, yet I see
around 100 regions for that table. I am assuming a compression ratio
of 50%, so the uncompressed data size would be about 100 GB.
I would like to know what the hbase.hregion.max.filesize property looks at:
the on-disk (compressed) size of the store file(s), or the uncompressed size
of the data in that region?

I created only 10 presplit regions for this table and then did a
bulk load. Why am I seeing far more regions than the roughly 50 I expected?
Is there any other setting I need to look at?
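
For reference, here is the arithmetic behind the two estimates as a rough sketch. It assumes every region fills exactly to the configured limit before splitting, which a real cluster will not do, and the 50% compression ratio is my guess rather than a measured value. The function and variable names are only illustrative.

```python
import math

GB = 1024 ** 3

def expected_regions(data_bytes: int, max_region_bytes: int) -> int:
    """Idealized region count: each region fills to the limit before splitting."""
    return math.ceil(data_bytes / max_region_bytes)

store_files_on_disk = 50 * GB  # combined size of the compressed store files
compression_ratio = 0.5        # assumed, not measured
region_limit = 1 * GB          # value set for hbase.hregion.max.filesize

# Estimated uncompressed size implied by the assumed ratio.
uncompressed = int(store_files_on_disk / compression_ratio)

# Estimate if the limit is compared against the compressed store file size.
by_compressed = expected_regions(store_files_on_disk, region_limit)

# Estimate if the limit is compared against the uncompressed data size.
by_uncompressed = expected_regions(uncompressed, region_limit)

print(by_compressed, by_uncompressed)  # prints: 50 100
```

The roughly 100 regions I see is close to the second number, but that alone does not tell me which size the property actually checks, hence the question.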

Thanks & Regards,
Anil Gupta

 