HBase, mail # user - Re: Easiest way to get a random sample of keys - 2014-01-24, 23:42
Solr & Elasticsearch trainings in New York & San Francisco [more info][hide]
 Search Hadoop and all its subprojects:

Switch to Threaded View
Copy link to this message
-
Re: Easiest way to get a random sample of keys
HBase shell is a JRuby shell and wraps all Java classes in a ruby interface. You can actually use a RandomRowFilter with a 5% configuration to achieve what you need.

Regards,

Dhaval
________________________________
From: Sudarshan Kadambi (BLOOMBERG/ 731 LEXIN) <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]
Sent: Friday, 24 January 2014 6:15 PM
Subject: Easiest way to get a random sample of keys
Something like count 't1', {INTERVAL=>20} should give me every 20th row in table 't1'. Is there an easy way to get a random sample via. the shell using filters? 

 
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB