Re: HBase Column Family TTL and cell deletions
I don't know if it's right (haven't checked source just now) but according to this:

http://hbase.apache.org/book/ttl.html

Column family TTL is in seconds, not milliseconds. Could that be the problem?

(If not, we should fix that page in the ref guide.)
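
To make the unit question concrete, here is a minimal sketch of what the TTL value from this thread (604800) works out to under each reading. It is plain arithmetic, not a call into HBase; the variable names are illustrative only.

```python
from datetime import timedelta

# TTL value taken from the column family descriptor quoted in this thread.
TTL_VALUE = 604800

# Interpretation 1: the value is in seconds (what the ref guide page says).
as_seconds = timedelta(seconds=TTL_VALUE)

# Interpretation 2: the value is in milliseconds (the alternative being ruled out).
as_millis = timedelta(milliseconds=TTL_VALUE)

print("as seconds:     ", as_seconds)  # 7 days, 0:00:00
print("as milliseconds:", as_millis)   # 0:10:04.800000
```

Read as seconds, 604800 is exactly the intended 7 days; read as milliseconds it would be only about ten minutes, which would make cells expire too early rather than too late.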

On Apr 3, 2013, at 5:19 PM, Ashish Nigam wrote:

Any thoughts?

Here's one cell that should have been deleted (I am pasting the HBase get
response from the shell):

hbase(main):001:0> get 'service_counters','3141',{COLUMN => 'stats-hourly:2013032014-C-U-4092276'}
13/04/03 21:57:17 WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
COLUMN                                  CELL
 stats-hourly:2013032014-C-U-4092276    timestamp=1364928886015, value=\x00\x00\x00\x00\x00\x00\x00\x06

1 row(s) in 0.1040 seconds
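
One thing that may be worth checking against this output: TTL expiry is evaluated against the cell's timestamp, not against any date encoded in the column qualifier. The sketch below compares the timestamp shown above with the time on the shell's log line. Treating that log time as UTC is an assumption (the shell's time zone is not stated in the thread), and the variable names are illustrative only.

```python
from datetime import datetime, timedelta, timezone

TTL = timedelta(seconds=604800)   # column family TTL from this thread (7 days)
CELL_TS_MS = 1364928886015        # cell timestamp from the get output, in ms since epoch

# Time of the read, from the log line "13/04/03 21:57:17".
# ASSUMPTION: interpreted as UTC; the real time zone is not given in the thread.
READ_TIME = datetime(2013, 4, 3, 21, 57, 17, tzinfo=timezone.utc)

# Convert the millisecond timestamp with integer arithmetic to avoid float rounding.
cell_time = datetime.fromtimestamp(CELL_TS_MS // 1000, tz=timezone.utc) + timedelta(
    milliseconds=CELL_TS_MS % 1000
)

age = READ_TIME - cell_time
expired = age > TTL

print("cell written at:", cell_time.isoformat())
print("age at read:    ", age)
print("older than TTL: ", expired)
```

Under that assumption the cell is only about a day old at read time, well inside the 7-day TTL, and a time-zone offset of a few hours would not change that. If the qualifier prefix 2013032014 is meant as an hour bucket, a later write to the same column (for example a counter increment) would give the cell a fresh timestamp, which might explain why it is still present.
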
On Wed, Apr 3, 2013 at 12:03 PM, Ashish Nigam wrote:

minVersions is set to 0.
Here's the detailed info on this CF:

{NAME => 'stats-hourly', DATA_BLOCK_ENCODING => 'NONE', BLOOMFILTER =>
'NONE', REPLICATION_SCOPE => '0', VERSIONS => '3', COMPRESSION => 'LZO',
MIN_VERSIONS => '0', TTL => '604800', KEEP_DELETED_CELLS => 'false',
BLOCKSIZE => '65536', IN_MEMORY => 'false', ENCODE_ON_DISK => 'true',
BLOCKCACHE => 'true'}

HBase version is 0.94.2-cdh4.2.0.

On Wed, Apr 3, 2013 at 11:48 AM, Ted Yu wrote:

Is it possible that minVersions was set for your column family?

What version of HBase are you using?

Thanks
On Wed, Apr 3, 2013 at 11:34 AM, Ashish Nigam wrote:

Hi,
I have a table "service_counters" that has service id as row id.
There is a column family "stats-hourly" that stores counters related to
that service per hour.
I have set TTL to be 7 days for the column family.

{NAME => 'service_counters', FAMILIES => [{NAME => 'stats-hourly',
COMPRESSION => 'LZO', TTL => '604800'}]}

Now the problem is that I still see columns that are more than 7 days
old.
I have also run major compactions but I still see old rows.
Any pointers as to why these columns are not being deleted?

Thanks
Ashish