HBase >> mail # user >> Regarding rowkey


Re: Regarding rowkey
I think so, because it will avoid hotspotting. I think we have a good post
on that topic on the Sematext Blog.

Otis
--
Performance Monitoring - http://sematext.com/spm
On Sep 12, 2012 3:08 AM, "Ramasubramanian" <[EMAIL PROTECTED]> wrote:

> Hi, thanks! But for loading data into HBase, will adding a hash to the
> rowkey improve performance?
>
> Regards,
> Rams
>
> On 12-Sep-2012, at 8:38 AM, lars hofhansl <[EMAIL PROTECTED]> wrote:
>
> > It depends. If you do not need to perform range scans along (prefixes of)
> > your row keys, you can prefix the row key with a hash of the row key.
> > That will give you a more or less random distribution of the keys, so
> > writes will not hit the same region server over and over.
> >
> > You'll probably also want to presplit your table then.
> >
> > -- Lars
> >
> >
> >
> > ----- Original Message -----
> > From: Ramasubramanian <[EMAIL PROTECTED]>
> > To: [EMAIL PROTECTED]
> > Cc:
> > Sent: Tuesday, September 11, 2012 10:39 AM
> > Subject: Regarding rowkey
> >
> > Hi,
> >
> > What can be used as a rowkey to improve performance while loading into
> > HBase? Currently I am using a sequence. It takes about 11 minutes to load
> > 1 million records with 147 columns.
> >
> > Regards,
> > Rams
>
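
For readers who want to see the idea from this thread in concrete terms, here is a minimal sketch of hash-prefixed row keys together with evenly spaced split points for a presplit table. It is not taken from the thread: the function names, the choice of MD5, the 2-byte prefix, and the 4-region layout are all assumptions made for illustration, and the code only simulates which region a key would fall into rather than talking to HBase.

```python
import hashlib
from bisect import bisect_right
from collections import Counter
from typing import List


def hashed_row_key(key: bytes, prefix_len: int = 2) -> bytes:
    """Return the key prefixed with the first prefix_len bytes of its MD5 digest.

    MD5 and the prefix length are illustrative choices, not from the thread.
    """
    return hashlib.md5(key).digest()[:prefix_len] + key


def split_points(num_regions: int) -> List[bytes]:
    """Evenly spaced one-byte boundaries over the first key byte (0..255)."""
    if not 2 <= num_regions <= 256:
        raise ValueError("num_regions must be between 2 and 256")
    return [bytes([i * 256 // num_regions]) for i in range(1, num_regions)]


def region_for(key: bytes, splits: List[bytes]) -> int:
    """Index of the region whose [start, end) key range contains the key."""
    return bisect_right(splits, key)


# Simulate loading 1000 sequential keys into a table presplit into 4 regions.
splits = split_points(4)
sequential = [b"%010d" % i for i in range(1000)]
plain = Counter(region_for(k, splits) for k in sequential)
hashed = Counter(region_for(hashed_row_key(k), splits) for k in sequential)

print("sequential keys per region:", dict(plain))
print("hashed keys per region:", dict(sorted(hashed.items())))
```

In this simulation every sequential key starts with the byte for "0" and lands in the same region, while the hashed keys spread across all four. The split points would be supplied as split keys when the table is created. As Lars notes, the cost is that range scans over the original key order no longer work, since adjacent keys end up under unrelated prefixes.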