Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase, mail # user - md5 hash key and splits


Copy link to this message
-
Re: md5 hash key and splits
Mohit Anchlia 2012-08-30, 04:38
On Wed, Aug 29, 2012 at 9:19 PM, Stack <[EMAIL PROTECTED]> wrote:

>  On Wed, Aug 29, 2012 at 3:56 PM, Mohit Anchlia <[EMAIL PROTECTED]>
> wrote:
> > If I use md5 hash + timestamp rowkey would hbase automatically detect the
> > difference in ranges and peforms split? How does split work in such cases
> > or is it still advisable to manually split the regions.
>

What logic would you recommend to split the table into multiple regions
when using md5 hash?
> Yes.
>
> On how split works, when a region hits the maximum configured size, it
> splits in two.
>
> Manual splitting can be useful when you know your distribution and
> you'd save on hbase doing it for you.  It can speed up bulk loads for
> instance.
>
> St.Ack
>