Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> Re: Hbase random read performance


Copy link to this message
-
Re: HBase random read performance
The thread ended abruptly..Can some suggest a solution for improved random
read performance...Even we are facing a similar issue with multi-gets on a
table with growing data...
On Wed, Apr 17, 2013 at 6:03 PM, Michel Segel <[EMAIL PROTECTED]>wrote:

> Wouldn't do that... Changing block size is the last thing you want to do.
>
> First question...
>
> What is your key?
>
> Second...
> What is your record size that you are attempting to read.
>
>
> Third...
> Compare the 10k multiget versus 10k individual gets.
>
> Fourth are your random keys sorted?
> If not, try sorting them...
>
> There are a lot of issues that can affect performance ....
>
>
>
> Sent from a remote device. Please excuse any typos...
>
> Mike Segel
>
> On Apr 15, 2013, at 3:17 AM, Anoop Sam John <[EMAIL PROTECTED]> wrote:
>
> > Ankit
> >                 I guess you might be having default HFile block size
> which is 64KB.
> > For random gets a lower value will be better. Try will some thing like
> 8KB and check the latency?
> >
> > Ya ofcourse blooms can help (if major compaction was not done at the
> time of testing)
> >
> > -Anoop-
> > ________________________________________
> > From: Ankit Jain [[EMAIL PROTECTED]]
> > Sent: Saturday, April 13, 2013 11:01 AM
> > To: [EMAIL PROTECTED]
> > Subject: HBase random read performance
> >
> > Hi All,
> >
> > We are using HBase 0.94.5 and Hadoop 1.0.4.
> >
> > We have HBase cluster of 5 nodes(5 regionservers and 1 master node). Each
> > regionserver has 8 GB RAM.
> >
> > We have loaded 25 millions records in HBase table, regions are pre-split
> > into 16 regions and all the regions are equally loaded.
> >
> > We are getting very low random read performance while performing multi
> get
> > from HBase.
> >
> > We are passing random 10000 row-keys as input, while HBase is taking
> around
> > 17 secs to return 10000 records.
> >
> > Please suggest some tuning to increase HBase read performance.
> >
> > Thanks,
> > Ankit Jain
> > iLabs
> >
> >
> >
> > --
> > Thanks,
> > Ankit Jain
>

--
Thank you
Kiran Sarvabhotla

-----Even a correct decision is wrong when it is taken late