Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> HBase random read performance


Copy link to this message
-
Re: HBase random read performance
Using bloom filter is almost mandatory there;
You might also want to try Short Circuit Reads and be sure you get 100%
data locality (major_compact your table first)
On Sat, Apr 13, 2013 at 5:16 PM, Ted Yu <[EMAIL PROTECTED]> wrote:

> Did you enable bloom filters ?
> See http://hbase.apache.org/book.html#schema.bloom
>
> Cheers
>
> On Fri, Apr 12, 2013 at 10:31 PM, Ankit Jain <[EMAIL PROTECTED]
> >wrote:
>
> > Hi All,
> >
> > We are using HBase 0.94.5 and Hadoop 1.0.4.
> >
> > We have HBase cluster of 5 nodes(5 regionservers and 1 master node). Each
> > regionserver has 8 GB RAM.
> >
> > We have loaded 25 millions records in HBase table, regions are pre-split
> > into 16 regions and all the regions are equally loaded.
> >
> > We are getting very low random read performance while performing multi
> get
> > from HBase.
> >
> > We are passing random 10000 row-keys as input, while HBase is taking
> around
> > 17 secs to return 10000 records.
> >
> > Please suggest some tuning to increase HBase read performance.
> >
> > Thanks,
> > Ankit Jain
> > iLabs
> >
> >
> >
> > --
> > Thanks,
> > Ankit Jain
> >
>

--
Adrien Mogenet
http://www.borntosegfault.com