Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Zookeeper >> mail # user >> server cannot join quorum


Copy link to this message
-
Re: server cannot join quorum
Hi,

Can you attach zoo.cfg files and logs from all the nodes? It might be also
worth verifying that zk2 and zk3 are able to talk to zk1 (not
firewall/ip/networking issues).

On Fri, Jan 7, 2011 at 6:02 AM, Alexis Midon <[EMAIL PROTECTED]> wrote:

> Hi there,
>
> I have a cluster of 3 machines, running zookeeper 3.3.1.
> zk1 fails to join the quorum while zk2 and zk3 interact correctly. zk1 is
> stuck in the election loop. See the log below.
> I checked the config files, the connectivity between the machines. I can't
> find anything wrong.
>
> Any ideas?
>
> thanks in advance,
>
> alexis
>
> 2011-01-07 00:14:23,156 - DEBUG
> [QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumPeer@551] - Initializing leader
> election protocol...
> 2011-01-07 00:14:23,157 - INFO
>  [QuorumPeer:/0:0:0:0:0:0:0:0:2181:FastLeaderElection@649] - New election.
> My id =  1, Proposed zxid = 0
> 2011-01-07 00:14:23,158 - DEBUG [WorkerSender Thread:QuorumCnxManager@346]
> -
> Opening channel to server 2
> 2011-01-07 00:14:23,159 - DEBUG [WorkerReceiver
> Thread:FastLeaderElection$Messenger$WorkerReceiver@214] - Receive new
> notification message. My id = 1
> 2011-01-07 00:14:23,160 - INFO
>  [QuorumPeer:/0:0:0:0:0:0:0:0:2181:FastLeaderElection@689] - Notification:
> 1, 0, 1, 1, LOOKING, LOOKING, 1
> 2011-01-07 00:14:23,160 - DEBUG
> [QuorumPeer:/0:0:0:0:0:0:0:0:2181:FastLeaderElection@495] - id: 1,
> proposed
> id: 1, zxid: 0, proposed zxid: 0
> 2011-01-07 00:14:23,161 - DEBUG
> [QuorumPeer:/0:0:0:0:0:0:0:0:2181:FastLeaderElection@717] - Adding vote:
> From = 1, Proposed leader = 1, Porposed zxid = 0, Proposed epoch = 1
> 2011-01-07 00:14:23,162 - INFO  [WorkerSender Thread:QuorumCnxManager@162]
> -
> Have smaller server identifier, so dropping the connection: (2, 1)
> 2011-01-07 00:14:23,162 - DEBUG [WorkerSender Thread:QuorumCnxManager@346]
> -
> Opening channel to server 3
> 2011-01-07 00:14:23,172 - INFO  [WorkerSender Thread:QuorumCnxManager@162]
> -
> Have smaller server identifier, so dropping the connection: (3, 1)
> 2011-01-07 00:14:23,365 - DEBUG
> [QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@391] - Queue size: 1
> 2011-01-07 00:14:23,366 - DEBUG
> [QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@391] - Queue size: 1
> 2011-01-07 00:14:23,366 - DEBUG
> [QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@346] - Opening channel
> to
> server 2
> 2011-01-07 00:14:23,367 - INFO
>  [QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@162] - Have smaller
> server identifier, so dropping the connection: (2, 1)
> 2011-01-07 00:14:23,367 - DEBUG
> [QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@346] - Opening channel
> to
> server 3
> 2011-01-07 00:14:23,378 - INFO
>  [QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@162] - Have smaller
> server identifier, so dropping the connection: (3, 1)
> 2011-01-07 00:14:23,378 - INFO
>  [QuorumPeer:/0:0:0:0:0:0:0:0:2181:FastLeaderElection@683] - Notification
> time out: 400
> 2011-01-07 00:14:23,785 - DEBUG
> [QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@391] - Queue size: 1
> 2011-01-07 00:14:23,785 - DEBUG
> [QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@391] - Queue size: 1
> 2011-01-07 00:14:23,786 - DEBUG
> [QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@346] - Opening channel
> to
> server 2
> 2011-01-07 00:14:26,786 - INFO
>  [QuorumPeer:/0:0:0:0:0:0:0:0:2181:QuorumCnxManager@162] - Have smaller
> server identifier, so dropping the connection: (2, 1)
> ...
>