-HBase scan performance decreases over time.
David Koch 2012-11-03, 15:12
Every now and then we need to flatten our cluster and re-import all data
from log files (changes in data format, etc.) Afterwards we notice a
significant increase in scan performance. As data is added and shuffled
around between region servers, performance goes down again over time (say a
couple of weeks). Are there any routine operations that one should run
manually, or settings to activate in the HBase configuration to keep the
data well distributed? We use HBase 0.92 as part of a Cloudera4 cluster.
Ted Yu 2012-11-03, 15:42
David Koch 2012-11-03, 19:50
Michael Segel 2012-11-05, 13:04
Asaf Mesika 2012-11-05, 18:14
Leonid Fedotov 2012-11-05, 18:52
Michael Segel 2012-11-05, 18:49
Ted Yu 2012-11-03, 20:14