HBase >> mail # user >> Scan performance on compressed column families


Scan performance on compressed column families
Hello,

Are scans faster when compression is enabled on a column family? The HBase
book by Lars George seems to suggest so (p. 424, "Compression" section in the
"Performance Tuning" chapter):

"... compression usually will yield overall better performance, because the
overhead of the CPU performing the compression and decompression is less
than what is required to read more data from disk."

I searched around for a bit and found this:
http://gbif.blogspot.fr/2012/02/performance-evaluation-of-hbase.html. The
author ran a series of scan performance tests on tables of up to
200 million rows and found that compression actually slowed down scans
slightly, albeit at lower CPU load.

Thank you,

/David
David Koch 2012-11-07, 13:09