Re: Data Locality, HBase? Or Hadoop?
OK. So each time I move a region manually, I should run a minor
compaction to make sure the data becomes local too.

Got it.

Thanks,

JM

2012/12/3, Kevin O'Dell <[EMAIL PROTECTED]>:
> JM,
>
>   If you have disabled the balancer and are manually moving regions, you
> will need to run a compaction on those regions.  That is the only (logical)
> way of bringing the data local.  HDFS does not have a concept of HBase
> locality.  HBase locality is all managed through major and minor
> compactions.
>
> On Mon, Dec 3, 2012 at 10:04 AM, Jean-Marc Spaggiari <
> [EMAIL PROTECTED]> wrote:
>
>> Hi,
>>
>> I'm wondering who is taking care of the data locality. Is it hadoop? Or
>> hbase?
>>
>> Let's say I have disabled the load balancer and I'm manually moving a
>> region to a specific server. Who is going to take care that the data
>> is going to be on the same datanode as the regionserver I moved the
>> region to? Is hadoop going to see that my region is now on this region
>> server and make sure my data is moved there too? Or is hbase going to
>> ask hadoop to do it?
>>
>> Or, since I moved it manually, there is not any data locality guaranteed?
>>
>> Thanks,
>>
>> JM
>>
>
>
>
> --
> Kevin O'Dell
> Customer Operations Engineer, Cloudera
>
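
To make the point in the thread concrete, here is a small toy model in Python of why a compaction restores locality after a manual region move. It is only a sketch under stated assumptions, not HBase or HDFS code: the names `Block`, `write_blocks`, `compact` and `locality` are hypothetical, the host names are made up, and the placement rule is simplified to "first replica on the writer's host, remaining replicas on other hosts". The `compact` function models a compaction that rewrites every store file, which is what a major compaction does; a minor compaction only rewrites the files it selects, so it may bring back only part of the data.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Block:
    """One HDFS block of a store file (toy model, not a real HDFS type)."""
    size: int
    replicas: frozenset  # hosts holding a replica of this block


def write_blocks(sizes, writer_host, cluster_hosts, replication=3):
    """Simplified placement: first replica on the writer, the rest elsewhere."""
    others = [h for h in cluster_hosts if h != writer_host][: replication - 1]
    return [Block(s, frozenset([writer_host, *others])) for s in sizes]


def locality(blocks, host):
    """Fraction of bytes that have a replica on `host`."""
    total = sum(b.size for b in blocks)
    if total == 0:
        return 1.0
    return sum(b.size for b in blocks if host in b.replicas) / total


def compact(blocks, region_host, cluster_hosts):
    """Rewrite all data from the host that currently serves the region."""
    return write_blocks([b.size for b in blocks], region_host, cluster_hosts)


hosts = ["rs1", "rs2", "rs3", "rs4", "rs5"]  # hypothetical hosts

store = write_blocks([64, 64, 32], "rs1", hosts)  # region served by rs1
before_move = locality(store, "rs1")

# Region is moved to rs5 by hand; the blocks stay where they were written.
after_move = locality(store, "rs5")

# The new region server rewrites the files, so the first replica is local.
store = compact(store, "rs5", hosts)
after_compact = locality(store, "rs5")

print(before_move, after_move, after_compact)
```

In this model the move itself changes nothing on disk; only the rewrite performed by the new region server brings the data local. In the HBase shell the corresponding steps would be the `move` command followed by `compact` or `major_compact` on the region.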