Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> Regarding rowkey


Copy link to this message
-
Re: Regarding rowkey
I think yes, because it will avoid hotspotting. I think we have a good post
on that topic on Sematext Blob.

Otis
--
Performance Monitoring - http://sematext.com/spm
On Sep 12, 2012 3:08 AM, "Ramasubramanian" <
[EMAIL PROTECTED]> wrote:

> Hi thanks! But for loading data into hbase, adding hash in rowkey will
> improve performance?
>
> Regards,
> Rams
>
> On 12-Sep-2012, at 8:38 AM, lars hofhansl <[EMAIL PROTECTED]> wrote:
>
> > It depends. If you do not need to perform rangescans along (prefixes of)
> your row keys, you can prefix the row key by a hash of the row key.
> > That will give you a more or less random distribution of the keys and
> hence not hit the same region server over and over.
> >
> > You'll probably also want to presplit your table then.
> >
> > -- Lars
> >
> >
> >
> > ----- Original Message -----
> > From: Ramasubramanian <[EMAIL PROTECTED]>
> > To: [EMAIL PROTECTED]
> > Cc:
> > Sent: Tuesday, September 11, 2012 10:39 AM
> > Subject: Regarding rowkey
> >
> > Hi,
> >
> > What can be used as rowkey to improve performance while loading into
> hbase? Currently I am having sequence. It takes some 11 odd minutes to load
> 1 million record with 147 columns.
> >
> > Regards,
> > Rams
>
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB