Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> HBase Column Family TTL and cell deletions


Copy link to this message
-
Re: HBase Column Family TTL and cell deletions
I don't know if it's right (haven't checked source just now) but according to this:

http://hbase.apache.org/book/ttl.html

Column family TTL is in seconds, not milliseconds. Could that be the problem?

(If not, we should fix that page in the ref guide.)

On Apr 3, 2013, at 5:19 PM, Ashish Nigam wrote:

Any thoughts?

Here's one cell that should have been deleted ( I am putting HBase get
repines in shell)

hbase(main):001:0> get 'service_counters','3141',{COLUMN =>
'stats-hourly:2013032014-C-U-4092276'}
13/04/03 21:57:17 WARN util.NativeCodeLoader: Unable to load native-hadoop
library for your platform... using builtin-java classes where applicable
COLUMN                                                            CELL
stats-hourly:2013032014-C-U-4092276
timestamp=1364928886015, value=\x00\x00\x00\x00\x00\x00\x00\x06

1 row(s) in 0.1040 seconds
On Wed, Apr 3, 2013 at 12:03 PM, Ashish Nigam <[EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>>wrote:

minVersion is set to 0
Here's detailed info on this CF -

{NAME => 'stats-hourly', DATA_BLOCK_ENCODING => 'NONE', BLOOMFILTER =>
'NONE', REPLICATION_SCOPE => '0', VERSIONS => '3', COMPRESSION => 'LZO',
MIN_VERSIONS => '0', TTL => '604800', KEEP_DELETED_CELLS => 'false',
BLOCKSIZE => '65536', IN_MEMORY => 'false', ENCODE_ON_DISK => 'true',
BLOCKCACHE => 'true'}

HBase version is 0.94.2-cdh4.2.0,

On Wed, Apr 3, 2013 at 11:48 AM, Ted Yu <[EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>> wrote:

Is it possible that minVersions was set for your column family ?

What version of HBase are you using ?

Thanks
On Wed, Apr 3, 2013 at 11:34 AM, Ashish Nigam <[EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>
wrote:

Hi,
I have a table "service_counters" that has service id as row id.
There is a column family "stats-hourly" that stores counters related to
that service per hour.
I have set TTL to be 7 days for the column family.

{NAME => 'service_counters', FAMILIES => [{NAME => 'stats-hourly',
COMPRESSION => 'LZO', TTL => '604800'}]}

Now the problem is that I still see columns that are more than 7 days
old.
I have also run major compactions but I still see old rows.
Any pointers as to why these columns are not being deleted?

Thanks
Ashish

NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB