Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase, mail # user - Regarding rowkey


Copy link to this message
-
Re: Regarding rowkey
lars hofhansl 2012-09-12, 03:08
It depends. If you do not need to perform rangescans along (prefixes of) your row keys, you can prefix the row key by a hash of the row key.
That will give you a more or less random distribution of the keys and hence not hit the same region server over and over.

You'll probably also want to presplit your table then.

-- Lars

----- Original Message -----
From: Ramasubramanian <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]
Cc:
Sent: Tuesday, September 11, 2012 10:39 AM
Subject: Regarding rowkey

Hi,

What can be used as rowkey to improve performance while loading into hbase? Currently I am having sequence. It takes some 11 odd minutes to load 1 million record with 147 columns.

Regards,
Rams