Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Plain View
Zookeeper, mail # user - zk keeps disconnecting and reconnecting


+
Jun Rao 2011-08-23, 21:58
+
Ted Dunning 2011-08-23, 23:07
+
Jun Rao 2011-08-23, 23:24
+
Mahadev Konar 2011-08-24, 03:42
+
Patrick Hunt 2011-08-25, 16:34
+
Jun Rao 2011-08-29, 04:34
+
Jun Rao 2011-08-29, 16:39
+
Fournier, Camille F. 2011-08-29, 17:50
+
Mahadev Konar 2011-08-29, 18:10
+
Fournier, Camille F. 2011-08-29, 19:38
+
Mahadev Konar 2011-08-29, 19:45
+
Benjamin Reed 2011-08-30, 23:45
+
Camille Fournier 2011-08-30, 23:50
+
Jun Rao 2011-08-31, 01:48
Copy link to this message
-
Re: zk keeps disconnecting and reconnecting
kishore g 2011-08-31, 17:02
Here is a simple test case that reproduces this error.
 public void testChroot() throws Exception
    {
        Watcher watcher = new Watcher()
        {
            @Override
            public void process(WatchedEvent event)
            {
                System.out.println("Event:" + event);
            }
        };
        ZooKeeper zk = new ZooKeeper("localhost:2181/foo", 6000, watcher);
        //uncommenting this line will not cause infinite connect/disconnect
        //zk.create("/", new byte[0], Ids.OPEN_ACL_UNSAFE,
CreateMode.PERSISTENT);

        zk.exists("/", true);
        System.out.println("Stop the server and restart it when you see this
message");
        Thread.currentThread().join();
    }

As pointed out earlier in this thread, setting an watch on a non-existent
path triggers this.

Is some one working on a patch for 961 and the issues described in this
thread. Any pointers on what needs to be fixed for both the issues? I can
take a look and submit a patch if I can fix it.

thanks,
Kishore G

On Tue, Aug 30, 2011 at 6:48 PM, Jun Rao <[EMAIL PROTECTED]> wrote:

> I was also wondering why our clients get disconnected in the first place
> since the ZK servers are all up. The following are the logs when the first
> disconnect happens. Does anyone know why the client can't seem to connect
> to
> most servers? Also, is  "Session 0x1320765b0ac002e for server nulll"
> normal?
> Thanks,
>
> 2011/08/29 07:33:51.824 INFO [ClientCnxn]
> [main-SendThread(esv4-app27.stg:12913)] [kafka] Unable to read additional
> data from server sessionid 0x1320765b0ac002f, likely server h
> as closed socket, closing socket connection and attempting
> reconnect2011/08/29 07:33:51.824 INFO [ClientCnxn]
> [main-SendThread(esv4-app27.stg:12913)] [kafka] Unable to read additional
> data from server sessionid 0x1320765b0ac002e, likely server h
> as closed socket, closing socket connection and attempting reconnect
> 2011/08/29 07:33:51.990 INFO [ZkClient] [main-EventThread] [kafka]
> zookeeper
> state changed (Disconnected)2011/08/29 07:33:52.019 INFO [ZkClient]
> [main-EventThread] [kafka] zookeeper state changed (Disconnected)
> 2011/08/29 07:33:52.092 INFO [ClientCnxn]
> [main-SendThread(esv4-app27.stg:12913)] [kafka] Opening socket connection
> to
> server esv4-app29.stg/172.18.98.89:12913
> 2011/08/29 07:33:52.093 INFO [ClientCnxn]
> [main-SendThread(esv4-app29.stg:12913)] [kafka] Socket connection
> established to esv4-app29.stg/172.18.98.89:12913, initiating session
> 2011/08/29 07:33:52.094 INFO [ClientCnxn]
> [main-SendThread(esv4-app29.stg:12913)] [kafka] Unable to read additional
> data from server sessionid 0x1320765b0ac002f, likely server h
> as closed socket, closing socket connection and attempting reconnect
> 2011/08/29 07:33:52.652 INFO [ClientCnxn]
> [main-SendThread(esv4-app27.stg:12913)] [kafka] Opening socket connection
> to
> server esv4-app28.stg/172.18.98.101:12913
> 2011/08/29 07:33:52.652 INFO [ClientCnxn]
> [main-SendThread(esv4-app28.stg:12913)] [kafka] Socket connection
> established to esv4-app28.stg/172.18.98.101:12913, initiating
> session2011/08/29 07:33:53.075 INFO [ClientCnxn]
> [main-SendThread(esv4-app28.stg:12913)] [kafka] Unable to read additional
> data from server sessionid 0x1320765b0ac002e, likely server h
> as closed socket, closing socket connection and attempting reconnect
> 2011/08/29 07:33:53.108 INFO [ClientCnxn]
> [main-SendThread(esv4-app29.stg:12913)] [kafka] Opening socket connection
> to
> server esv4-app28.stg/172.18.98.101:12913
> 2011/08/29 07:33:53.108 INFO [ClientCnxn]
> [main-SendThread(esv4-app28.stg:12913)] [kafka] Socket connection
> established to esv4-app28.stg/172.18.98.101:12913, initiating session
> 2011/08/29 07:33:53.109 INFO [ClientCnxn]
> [main-SendThread(esv4-app28.stg:12913)] [kafka] Unable to read additional
> data from server sessionid 0x1320765b0ac002f, likely server h
> as closed socket, closing socket connection and attempting reconnect
> 2011/08/29 07:33:53.577 INFO [ClientCnxn]
+
Fournier, Camille F. 2011-09-01, 14:06