Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Plain View
HBase >> mail # user >> HBase table disk usage


+
Sever Fundatureanu 2012-07-03, 09:54
Copy link to this message
-
Re: HBase table disk usage
Timestamps on the cells themselves?
# Versions?

On Jul 3, 2012, at 4:54 AM, Sever Fundatureanu wrote:

> Hello,
>
> I have a simpel table with 1.5 billion rows and one column familiy 'F'.
> Each row key is 33 bytes and the cell values are void. By doing the math I
> would expect this table to take up (33+1)x1.5*10^9 = 51GB. However if I do
> a "hadoop dfs -du" I get that the table takes up ~82GB. This is after
> running major compactions a couple of times. Can someone explain where this
> difference might come from?
>
> Regards,
> --
> Sever Fundatureanu
>
> Vrije Universiteit Amsterdam
> E-mail: [EMAIL PROTECTED]
+
Sever Fundatureanu 2012-07-03, 12:17
+
Stack 2012-07-03, 12:29
+
Sever Fundatureanu 2012-07-03, 15:06
+
Anoop Sam John 2012-07-04, 03:28
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB