Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase, mail # user - Regarding rowkey


Copy link to this message
-
Re: Regarding rowkey
lars hofhansl 2012-09-12, 15:22
If you (1) load through the HTable API and (2) have multiple region servers and (3) have monotonically increasing row keys: Yes.
-- Lars
----- Original Message -----
From: Ramasubramanian <[EMAIL PROTECTED]>
To: "[EMAIL PROTECTED]" <[EMAIL PROTECTED]>
Cc:
Sent: Wednesday, September 12, 2012 12:07 AM
Subject: Re: Regarding rowkey

Hi thanks! But for loading data into hbase, adding hash in rowkey will improve performance?

Regards,
Rams

On 12-Sep-2012, at 8:38 AM, lars hofhansl <[EMAIL PROTECTED]> wrote:

> It depends. If you do not need to perform rangescans along (prefixes of) your row keys, you can prefix the row key by a hash of the row key.
> That will give you a more or less random distribution of the keys and hence not hit the same region server over and over.
>
> You'll probably also want to presplit your table then.
>
> -- Lars
>
>
>
> ----- Original Message -----
> From: Ramasubramanian <[EMAIL PROTECTED]>
> To: [EMAIL PROTECTED]
> Cc:
> Sent: Tuesday, September 11, 2012 10:39 AM
> Subject: Regarding rowkey
>
> Hi,
>
> What can be used as rowkey to improve performance while loading into hbase? Currently I am having sequence. It takes some 11 odd minutes to load 1 million record with 147 columns.
>
> Regards,
> Rams