Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase, mail # user - Downside of too many HFiles

Copy link to this message
Downside of too many HFiles
Rahul Ravindran 2013-06-12, 15:14
I am trying to understand the downsides of having a large number of hfiles by having a large hbase.hstore.compactionThreshold

  This delays major compaction. However, the amount of data that needs to be read and re-written as a single hfile during major compaction will remain the same unless we have large number of deletes or expired rows

I understand the random reads will be affected since each hfile may be a candidate for the row, but is there any other downside I am missing?