Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> customize partitioning of regionserver

Copy link to this message
Re: customize partitioning of regionserver
Thanks for the reply.
In my use case, I have to retrieve a range of data usually by month and
operate on them before reinserting them, so it would be nice if i could
partition by month but then I don't know how would the partition affect the
mapreduce job.

On Mon, Jan 10, 2011 at 12:48 PM, Buttler, David <[EMAIL PROTECTED]> wrote:

> Not to my knowledge.  Partitions are dynamically determined. As your table
> grows, regions become too large and are split roughly in half.  This
> prevents unbalanced regions.  Any predetermined partitioning will ultimately
> fail because you don't know your data as well as you think you do.
> Dave
> -----Original Message-----
> From: Weishung Chung [mailto:[EMAIL PROTECTED]]
> Sent: Monday, January 10, 2011 10:14 AM
> Subject: customize partitioning of regionserver
> Does HBase have the capability to partition dataset by range like the MySQL
> partitioning eg. partition the datetime, row key by month?
> Thank you.