Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> Thrift Python client with regex


Copy link to this message
-
Re: Thrift Python client with regex
On Thu, Oct 18, 2012 at 7:13 PM, Norbert Burger
<[EMAIL PROTECTED]> wrote:
> We had the same question earlier.  Unfortunately the documentation is
> wrong on this account; scannerOpen resolves to either a call to
> scan.addFamily or scan.addColumn, and neither directly supports regex
> matching.
>
> Regex pattern matching against colquals is definitely supported on the
> Java side, so Thrift2 (0.94.0) is a possible solution, if you can
> upgrade.  Another approach, depending on how large your rows are,
> would be to grab the full list of cols, filter via regex on the client
> side, and then specify explicitly in scannerOpen().
>

Thanks Norbert.

Of if one of you fellas wants to put up a patch that adds the
regex'ing to thrift1, we'll commit it.

But what about '10.3.1. Filter Language' in
http://hbase.apache.org/book.html ?  Have you fellas tried it?  The
doc looks like it might be wrong regards how you open the scanner --
it seems like you pass the filter string to the thrift Scan object --
but maybe this'll work?  Let us know and if inclined, tell us how to
fix the doc.

Thanks,
St.Ack
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB