HBase, mail # user - customize partitioning of regionserver


Re: customize partitioning of regionserver
Weishung Chung 2011-01-10, 19:02
Thanks for the reply.
In my use case, I have to retrieve a range of data, usually by month, and
operate on it before reinserting it, so it would be nice if I could
partition by month. However, I don't know how that partitioning would
affect the MapReduce job.

On Mon, Jan 10, 2011 at 12:48 PM, Buttler, David <[EMAIL PROTECTED]> wrote:

> Not to my knowledge.  Partitions are dynamically determined. As your table
> grows, regions become too large and are split roughly in half.  This
> prevents unbalanced regions.  Any predetermined partitioning will ultimately
> fail because you don't know your data as well as you think you do.
>
> Dave
>
>
> -----Original Message-----
> From: Weishung Chung [mailto:[EMAIL PROTECTED]]
> Sent: Monday, January 10, 2011 10:14 AM
> To: [EMAIL PROTECTED]
> Subject: customize partitioning of regionserver
>
> Does HBase have the capability to partition a dataset by range, like MySQL
> partitioning, e.g. partition the datetime, row key by month?
> Thank you.
>
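
For illustration only, here is a minimal sketch of how a month-sized range could be read without custom partitioning. HBase keeps rows sorted by row key, so a scan bounded by two keys reads one contiguous range. The sketch assumes a hypothetical row key layout that starts with the date as "yyyy-MM-dd"; that layout, the class name MonthRowKeys, and its helper methods are assumptions for the example, not something stated in this thread. It only computes the boundary keys and does not call the HBase client.

```java
import java.nio.charset.StandardCharsets;
import java.time.YearMonth;
import java.time.format.DateTimeFormatter;

// Hypothetical helper: computes scan boundaries for one month of data,
// assuming row keys begin with the date formatted as "yyyy-MM-dd".
class MonthRowKeys {
    private static final DateTimeFormatter PREFIX =
            DateTimeFormatter.ofPattern("uuuu-MM");

    // First possible key of the month (a scan's start row is inclusive).
    static String startRow(YearMonth month) {
        return month.format(PREFIX);
    }

    // First possible key of the following month (a scan's stop row is exclusive).
    static String stopRow(YearMonth month) {
        return month.plusMonths(1).format(PREFIX);
    }

    // Row keys are byte arrays in HBase; UTF-8 keeps the ASCII date prefix sortable.
    static byte[] toBytes(String key) {
        return key.getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        YearMonth january = YearMonth.of(2011, 1);
        // These two keys would be handed to a scan as its start and stop rows.
        System.out.println(startRow(january) + " .. " + stopRow(january));
        // prints "2011-01 .. 2011-02"
    }
}
```

With keys laid out this way, the same two boundaries could also bound the input of a MapReduce job over the table, regardless of where the regions happen to split.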