Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase, mail # user - Scanner returning subset of data

Copy link to this message
RE: Scanner returning subset of data
Anoop Sam John 2013-04-09, 03:23
As Ted suggested can you see the client logs closely (RS side also)?  Is there next() call retries happening from the client side because of RPC timeouts?
In such a case this kind of issue can happen.   I doubt he hit HBASE-5974

Sent: Tuesday, April 09, 2013 2:48 AM
Subject: Re: Scanner returning subset of data

0.92.1 is pretty old. Are you able to deploy newer release, e.g.
and see if the problem can be reproduced ?

Otherwise we have two choices:
1. write a unit / integration test that shows this bug
2. see more of the region server / client logs so that further analysis can
be performed.


On Mon, Apr 8, 2013 at 2:07 PM, Randy Fox <[EMAIL PROTECTED]> wrote:

> I have a needle-in-the-haystack type scan.  I have tried to read all the
> issues with ScannerTimeoutException and LeaseException, but do have not
> seen anyone report what I am seeing.
> Running 0.92.1-cdh4.1.1.  All config wrt to timeouts and periods are
> default: 60s.
> When I run a scanner that will return few results and my cache setting is
> a bit too high for results to return in 60 seconds, i sometimes get a
> subset of results (the last few returnable rows) and no exception.  it may
> take a while to get those results.  Other times I get the LeaseException,
> the ScannerTimeoutException, or the RetriesExhaustedException. I can see
> throwExceptionIfCallerDisconne**cted in RS logs.
> The incorrect return set has me very concerned.  I can easily reproduce
> this with my own code or hbase shell.
> Any help is greatly appreciated.
> Cheers,
> Randy Fox