Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> Easiest way to get a random sample of keys


Copy link to this message
-
Re: Easiest way to get a random sample of keys
HBase shell is a JRuby shell and wraps all Java classes in a ruby interface. You can actually use a RandomRowFilter with a 5% configuration to achieve what you need.

Regards,

Dhaval
________________________________
From: Sudarshan Kadambi (BLOOMBERG/ 731 LEXIN) <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]
Sent: Friday, 24 January 2014 6:15 PM
Subject: Easiest way to get a random sample of keys
Something like count 't1', {INTERVAL=>20} should give me every 20th row in table 't1'. Is there an easy way to get a random sample via. the shell using filters? 

 
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB