Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> scan is slower after bulk load


Copy link to this message
-
Re: scan is slower after bulk load
may be because bulk load writes to the same region thus putting the entire
load on a single region server.

Regards,
    Mohammad Tariq

On Mon, Nov 12, 2012 at 9:15 PM, Michael Segel <[EMAIL PROTECTED]>wrote:

> Just a guess... have you done any compactions on the table post bulk load?
>
> On Nov 12, 2012, at 8:44 AM, Marcos Ortiz <[EMAIL PROTECTED]> wrote:
>
> > Regards, Amit.
> > Did you tuned the RegionServer where you has that data range hosted?
> > Why do you say that scans are slower after a bulk load?
> > Did you test it before bulk load?
> >
> > HBase version?
> >
> > On 11/12/2012 09:39 AM, Amit Sela wrote:
> >> Hi all,
> >>
> >> Anyone has any idea why scanning over specific range in a table is about
> >> 20% slower if that data (that specific range) was just inserted into
> HBase
> >> using bulk load ?
> >>
> >> I do the bulk load programmatically with  LoadIncrementalHFiles.
> >>
> >> Thanks.
> >>
> >
> > --
> >
> > Marcos Luis Ortíz Valmaseda
> > about.me/marcosortiz <http://about.me/marcosortiz>
> > @marcosluis2186 <http://twitter.com/marcosluis2186>
> >
> >
> >
> > 10mo. ANIVERSARIO DE LA CREACION DE LA UNIVERSIDAD DE LAS CIENCIAS
> INFORMATICAS...
> > CONECTADOS AL FUTURO, CONECTADOS A LA REVOLUCION
> >
> > http://www.uci.cu
> > http://www.facebook.com/universidad.uci
> > http://www.flickr.com/photos/universidad_uci
>
>