HBase >> mail # user >> HBase table disk usage

Re: HBase table disk usage
Timestamps on the cells themselves?
# Versions?

On Jul 3, 2012, at 4:54 AM, Sever Fundatureanu wrote:

> Hello,
> I have a simple table with 1.5 billion rows and one column family 'F'.
> Each row key is 33 bytes and the cell values are empty. By doing the math I
> would expect this table to take up (33+1)x1.5*10^9 = 51GB. However if I do
> a "hadoop dfs -du" I get that the table takes up ~82GB. This is after
> running major compactions a couple of times. Can someone explain where this
> difference might come from?
> Regards,
> --
> Sever Fundatureanu
> Vrije Universiteit Amsterdam
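
One way to see where the gap might come from is to count the bytes HBase stores per cell, rather than only the row key and family. The sketch below assumes the standard KeyValue layout (4-byte key length, 4-byte value length, 2-byte row length, row, 1-byte family length, family, qualifier, 8-byte timestamp, 1-byte type). It also assumes an empty column qualifier and a single version per row, neither of which is stated in the thread. It ignores HFile block indexes, Bloom filters, and any per-cell sequence ID, so treat it as a rough estimate, not an exact accounting.

```python
# Rough on-disk size estimate for an HBase table with one cell per row.
# Assumptions (not from the thread): empty qualifier, one version per row,
# no compression, and no HFile index / Bloom filter overhead.

ROWS = 1_500_000_000
ROW_KEY_BYTES = 33
FAMILY_BYTES = 1       # column family 'F'
QUALIFIER_BYTES = 0    # assumed empty
VALUE_BYTES = 0        # cell values are empty


def naive_cell_bytes(row: int, family: int) -> int:
    """The estimate from the question: row key plus family name only."""
    return row + family


def keyvalue_cell_bytes(row: int, family: int, qualifier: int, value: int) -> int:
    """Bytes for one cell in the KeyValue layout, including fixed overhead."""
    key_length_field = 4
    value_length_field = 4
    row_length_field = 2
    family_length_field = 1
    timestamp = 8
    key_type = 1
    return (
        key_length_field + value_length_field
        + row_length_field + row
        + family_length_field + family
        + qualifier
        + timestamp + key_type
        + value
    )


naive_total = ROWS * naive_cell_bytes(ROW_KEY_BYTES, FAMILY_BYTES)
kv_total = ROWS * keyvalue_cell_bytes(
    ROW_KEY_BYTES, FAMILY_BYTES, QUALIFIER_BYTES, VALUE_BYTES
)

print(f"naive estimate:    {naive_total / 1e9:.1f} GB")   # 51.0 GB
print(f"KeyValue estimate: {kv_total / 1e9:.1f} GB")      # 81.0 GB
```

Under these assumptions each cell takes 54 bytes instead of 34, which gives about 81 GB and is close to the ~82GB reported by "hadoop dfs -du". A non-empty qualifier or more than one version per cell would push the figure higher.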