Re: Thrift Python client with regex
On Thu, Oct 18, 2012 at 7:13 PM, Norbert Burger wrote:
> We had the same question earlier.  Unfortunately the documentation is
> wrong on this account; scannerOpen resolves to either a call to
> scan.addFamily or scan.addColumn, and neither directly supports regex
> matching.
> Regex pattern matching against colquals is definitely supported on the
> Java side, so Thrift2 (0.94.0) is a possible solution, if you can
> upgrade.  Another approach, depending on how large your rows are,
> would be to grab the full list of cols, filter via regex on the client
> side, and then specify explicitly in scannerOpen().
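
A minimal sketch of the client-side approach described in the quote above, assuming you already have the full list of column names as "family:qualifier" strings. How you fetch that list from Thrift is left out, and select_columns is a hypothetical helper, not part of the HBase Thrift API:

```python
import re

def select_columns(all_columns, pattern):
    """Return the columns whose qualifier matches the regex `pattern`.

    `all_columns` is assumed to be a list of "family:qualifier" strings.
    """
    rx = re.compile(pattern)
    selected = []
    for col in all_columns:
        # Split on the first ':' only; qualifiers may contain ':' themselves.
        family, _, qualifier = col.partition(":")
        if rx.search(qualifier):
            selected.append(col)
    return selected

# Made-up column names, purely for illustration.
columns = ["cf:user_1", "cf:user_2", "cf:meta", "other:user_9"]
matching = select_columns(columns, r"^user_\d+$")
```

The resulting list (matching) is what you would then pass as the explicit column list to scannerOpen(). This only makes sense when rows are small enough that listing every column is cheap.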

Thanks Norbert.

Or if one of you fellas wants to put up a patch that adds the
regex'ing to thrift1, we'll commit it.

But what about '10.3.1. Filter Language' in
http://hbase.apache.org/book.html ?  Have you fellas tried it?  The
doc looks like it might be wrong regarding how you open the scanner --
it seems you pass the filter string to the thrift Scan object --
but maybe this'll work?  Let us know and, if inclined, tell us how to
fix the doc.
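
For anyone trying this, a rough sketch of building such a filter string in Python. The QualifierFilter / regexstring syntax and the doubling of single quotes are my reading of the Filter Language section, and qualifier_regex_filter is a hypothetical helper. How the string reaches the server (the comment below guesses at a filterString field on the Thrift scan object) is an untested assumption to check against your generated bindings:

```python
def qualifier_regex_filter(pattern):
    """Build a filter-language string matching column qualifiers by regex.

    Assumes the syntax QualifierFilter(=, 'regexstring:<pattern>') and that
    a single quote inside the argument is escaped by doubling it.
    """
    escaped = pattern.replace("'", "''")
    return "QualifierFilter(=, 'regexstring:%s')" % escaped

filter_string = qualifier_regex_filter(r"^user_\d+$")

# Untested assumption: the string would then be set on the Thrift scan
# object, e.g. something like TScan(filterString=filter_string), and that
# object passed when opening the scanner. Verify the actual field and
# method names in the Hbase.thrift shipped with your HBase version.
```

If this works against your cluster, please report back which call actually accepted the filter string so the doc can be corrected.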