Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> Regarding rowkey

Copy link to this message
Re: Regarding rowkey
If you (1) load through the HTable API and (2) have multiple region servers and (3) have monotonically increasing row keys: Yes.
-- Lars
----- Original Message -----
From: Ramasubramanian <[EMAIL PROTECTED]>
Sent: Wednesday, September 12, 2012 12:07 AM
Subject: Re: Regarding rowkey

Hi thanks! But for loading data into hbase, adding hash in rowkey will improve performance?


On 12-Sep-2012, at 8:38 AM, lars hofhansl <[EMAIL PROTECTED]> wrote:

> It depends. If you do not need to perform rangescans along (prefixes of) your row keys, you can prefix the row key by a hash of the row key.
> That will give you a more or less random distribution of the keys and hence not hit the same region server over and over.
> You'll probably also want to presplit your table then.
> -- Lars
> ----- Original Message -----
> From: Ramasubramanian <[EMAIL PROTECTED]>
> Cc:
> Sent: Tuesday, September 11, 2012 10:39 AM
> Subject: Regarding rowkey
> Hi,
> What can be used as rowkey to improve performance while loading into hbase? Currently I am having sequence. It takes some 11 odd minutes to load 1 million record with 147 columns.
> Regards,
> Rams