Re: Scan startRow/stopRow vs. filter
Suraj Varma 2012-03-15, 19:26
According to http://hbase.apache.org/book.html#client.filter.row, it is
generally preferable to use start/stopRow rather than a RowFilter. I
believe that with a RowFilter you would be doing a full table scan ...

--Suraj

On Thu, Mar 15, 2012 at 11:48 AM, Andy Lindeman <[EMAIL PROTECTED]> wrote:
> Hi all--
>
> I was reading the source code for Pig HBaseStorage loadfunc/storefunc recently.
>
> It accepts arguments such as -gte and -lt for scanning ranges of rows;
> however, it implements them by adding a RowFilter. Something that
> basically boils down to ...
>
> scan = new Scan();
> gte_ = Bytes.toBytesBinary(Utils.slashisize(configuredOptions_.getOptionValue("gte")));
> scan.setFilter(new RowFilter(CompareOp.GREATER_OR_EQUAL,
>     new BinaryComparator(gte_)));
>
> How does this compare (in terms of equivalence and performance) to
> setting startRow on Scan .. such as ..
>
> scan = new Scan();
> scan.setStartRow(Bytes.toBytesBinary(Utils.slashisize(configuredOptions_.getOptionValue("gte"))));
>
> Thanks.
>
> --
> Andy Lindeman
> http://www.andylindeman.com/
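
To make the equivalence/performance question concrete, here is a rough sketch that models the two approaches without touching HBase at all. It assumes a table can be approximated by a sorted in-memory map (a TreeMap with String row keys standing in for byte-ordered row keys); the class name RangeScanSketch, its methods, and the sample keys are all made up for illustration and are not part of HBase or Pig. Under that assumption, both approaches return the same rows for a "gte" bound, but the start-row version seeks directly to the first qualifying key, while the filter version has to examine every row.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeMap;

// Hypothetical model only: a TreeMap stands in for a table sorted by row key.
public class RangeScanSketch {

    /** Rows returned by a scan, plus how many rows had to be examined. */
    public static final class Result {
        public final List<String> rows;
        public final int examined;

        Result(List<String> rows, int examined) {
            this.rows = rows;
            this.examined = examined;
        }
    }

    /** Models a start-row scan: seek to the first key >= startRow, then read forward. */
    public static Result scanFromStartRow(TreeMap<String, String> table, String startRow) {
        List<String> rows = new ArrayList<>(table.tailMap(startRow, true).keySet());
        // Only the rows at or after the start row are ever visited.
        return new Result(rows, rows.size());
    }

    /** Models a row filter with no start row: every row is visited and tested. */
    public static Result scanWithRowFilter(TreeMap<String, String> table, String gte) {
        List<String> rows = new ArrayList<>();
        int examined = 0;
        for (String key : table.keySet()) {
            examined++;
            if (key.compareTo(gte) >= 0) {
                rows.add(key);
            }
        }
        return new Result(rows, examined);
    }

    /** Builds a sample table with keys row-000 through row-099 (made-up data). */
    public static TreeMap<String, String> sampleTable() {
        TreeMap<String, String> table = new TreeMap<>();
        for (int i = 0; i < 100; i++) {
            table.put(String.format("row-%03d", i), "value-" + i);
        }
        return table;
    }

    public static void main(String[] args) {
        TreeMap<String, String> table = sampleTable();
        Result seek = scanFromStartRow(table, "row-090");
        Result filter = scanWithRowFilter(table, "row-090");
        System.out.println("same rows: " + seek.rows.equals(filter.rows));
        System.out.println("examined with start row: " + seek.examined);
        System.out.println("examined with row filter: " + filter.examined);
    }
}
```

In this toy model the result sets match, and the only difference is the number of rows examined. How closely a real cluster follows this picture depends on the HBase version and the filter implementation, so treat it as an illustration of the reasoning in the reply above rather than a measurement.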