HBase >> mail # user >> Easiest way to get a random sample of keys


Sudarshan Kadambi 2014-01-24, 23:15
Jean-Marc Spaggiari 2014-01-24, 23:29
Dhaval Shah 2014-01-24, 23:42
Re: Easiest way to get a random sample of keys
RandomRowFilter is a good idea! +1
2014/1/24 Dhaval Shah <[EMAIL PROTECTED]>

> HBase shell is a JRuby shell and wraps all Java classes in a Ruby
> interface. You can use a RandomRowFilter with its chance set to 0.05 (5%)
> to get what you need.
>
> Regards,
>
> Dhaval
>
>
> ________________________________
> From: Sudarshan Kadambi (BLOOMBERG/ 731 LEXIN) <[EMAIL PROTECTED]>
> To: [EMAIL PROTECTED]
> Sent: Friday, 24 January 2014 6:15 PM
> Subject: Easiest way to get a random sample of keys
>
>
> Something like count 't1', {INTERVAL=>20} should give me every 20th row in
> table 't1'. Is there an easy way to get a random sample via the shell
> using filters?
>
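
For anyone landing on this thread later: in the shell the suggestion above would presumably look something like scan 't1', {FILTER => org.apache.hadoop.hbase.filter.RandomRowFilter.new(0.05)}. That line has not been run here, so treat it as a starting point rather than a verified command. The plain-Ruby sketch below runs without HBase and only illustrates how the two sampling styles differ; the method names and the made-up key list are hypothetical, not part of any HBase API.

```ruby
# Plain-Ruby sketch of the two sampling styles discussed in this thread.
# It runs without HBase; `keys` stands in for the row keys of table 't1'.

# Every `interval`-th key, similar to the rows echoed by
# count 't1', {INTERVAL=>20}.
def interval_sample(keys, interval)
  keys.each_with_index.select { |_key, i| (i + 1) % interval == 0 }.map(&:first)
end

# Each key is kept independently with probability `chance`, which mirrors
# the per-row decision a RandomRowFilter makes during a scan.
def random_sample(keys, chance, rng = Random.new)
  keys.select { rng.rand < chance }
end

# Hypothetical row keys, for illustration only.
keys = (1..1000).map { |i| format('row%04d', i) }

puts interval_sample(keys, 20).first(3).inspect
puts random_sample(keys, 0.05, Random.new(42)).size
```

Note that the random version returns roughly 5% of the rows, not exactly every 20th one, so the sample size varies from run to run unless the generator is seeded.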

 
Sudarshan Kadambi 2014-01-24, 23:35