Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase, mail # user - 0.90.3


Copy link to this message
-
Re: 0.90.3
Jack Levin 2011-05-25, 02:03
"HBase uses the local hostname to self-report it's IP address."

using 'hostname' as authoritative name for regionserver is what caused
all of the confusion, hostname usually not governed by name resolution
(/etc/hosts, dns),  some users may call their servers something other
than whats in dns, so hbase will break for them if they do.  Better
idea would be to check eth0 for IP, get reverse dns name for it, and
use that.

just my small two cents.

-Jack

On Tue, May 24, 2011 at 6:02 PM, Jean-Daniel Cryans <[EMAIL PROTECTED]> wrote:
> Zookeeper doesn't query addresses, it's all done in HBase which in
> turn stores it in ZK.
>
> Also http://hbase.apache.org/book.html#dns
>
> J-D
>
> On Tue, May 24, 2011 at 4:37 PM, Jack Levin <[EMAIL PROTECTED]> wrote:
>> figured it out... the /etc/hosts file has ip to name, was used by
>> zookeeper was *.prod.imageshack.com, while hostname was
>> imgXX.imageshack.us... use by Regionserver/Master -  Ideally, all
>> three components should source hostnames form same place, whether its
>> hostname or /etc/hosts (or dns), etc... it gotta be consistent,
>> otherwise aliases end up screwing things up and people will end up
>> guessing why things don't work.
>>
>> -Jack
>>
>> On Tue, May 24, 2011 at 4:04 PM, Jack Levin <[EMAIL PROTECTED]> wrote:
>>> img645.prod.imageshack.us and img645.imageshack.us are both point to
>>> the same IP.
>>>
>>> -Jack
>>>
>>> On Tue, May 24, 2011 at 3:50 PM, Jack Levin <[EMAIL PROTECTED]> wrote:
>>>> looks like our balancer is on:
>>>>
>>>> hbase(main):001:0> balance_switch true
>>>> true
>>>> 0 row(s) in 0.3700 seconds
>>>>
>>>> I simply kill PID for RS, and it stays on the list with regions
>>>> assigned, and master does not know about it.
>>>>
>>>> So it still does not work.
>>>>
>>>> -Jack
>>>>
>>>> On Tue, May 24, 2011 at 3:43 PM, Dave Latham <[EMAIL PROTECTED]> wrote:
>>>>> Are you using the graceful_stop script?
>>>>>
>>>>> In 0.90.3 the bin/graceful_stop.sh script was updated to disable the
>>>>> master's balancer.  However, it doesn't seem that anything re-enables it, so
>>>>> if you're using it you need to re-enable it on your own.  See the book for
>>>>> more details:
>>>>> http://hbase.apache.org/book.html#decommission
>>>>>
>>>>> Dave
>>>>>
>>>>> On Tue, May 24, 2011 at 3:33 PM, Jack Levin <[EMAIL PROTECTED]> wrote:
>>>>>
>>>>>> just put new hbase version on our test cluster. and been testing it...
>>>>>> so far if I shutdown an RS, master does not reassign its regions, and
>>>>>> we remain inconsistent forerver, likewise when new RS is up, it does
>>>>>> not get regions assigned to it, this is the master log:
>>>>>>
>>>>>>
>>>>>> 2011-05-24 15:30:57,724 DEBUG
>>>>>> org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher:
>>>>>> master:60000-0x1302094818900a4-0x1302094818900a4 Received ZooKeeper
>>>>>> Event, type=NodeDeleted, state=SyncConnected,
>>>>>> path=/hbase/rs/img645.prod.imageshack.com,60020,1306276075768
>>>>>> 2011-05-24 15:30:57,724 INFO
>>>>>> org.apache.hadoop.hbase.zookeeper.RegionServerTracker: RegionServer
>>>>>> ephemeral node deleted, processing expiration
>>>>>> [img645.prod.imageshack.com,60020,1306276075768]
>>>>>> 2011-05-24 15:30:57,724 INFO
>>>>>> org.apache.hadoop.hbase.zookeeper.RegionServerTracker: No HServerInfo
>>>>>> found for img645.prod.imageshack.com,60020,1306276075768
>>>>>> 2011-05-24 15:30:57,726 DEBUG
>>>>>> org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher:
>>>>>> master:60000-0x1302094818900a4-0x1302094818900a4 Received ZooKeeper
>>>>>> Event, type=NodeChildrenChanged, state=SyncConnected, path=/hbase/rs
>>>>>> 2011-05-24 15:31:03,330 DEBUG
>>>>>> org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher:
>>>>>> master:60000-0x1302094818900a4-0x1302094818900a4 Received ZooKeeper
>>>>>> Event, type=NodeChildrenChanged, state=SyncConnected, path=/hbase/rs
>>>>>> 2011-05-24 15:31:03,338 DEBUG
>>>>>> org.apache.hadoop.hbase.zookeeper.ZKUtil:
>>>>>> master:60000-0x1302094818900a4-0x1302094818900a4 Retrieved 32 byte(s)
>>>>>> of data from znode