Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Plain View
Zookeeper, mail # user - issue with nutch-gora+hbase+zookeeper


Copy link to this message
-
issue with nutch-gora+hbase+zookeeper
kaveh minooie 2013-02-22, 20:31
Hi everyone

I am having this problem for couple of days now and would appreciate any
idea or suggestion that any one might have. I am using nutch 2.x with
hbase. due to nutch requirement I need to use older version of hbase (I
am using 0.90.6  over hadoop 1.1.1 with 10 nodes with zookeepr 3.5.0
[trunk] )

hbase seems to be running fine, thou I appreciate if someone can show me
how I can actually test it systematically. but I seem to be able to
create and read data from hbase, but when I run any nutch command
something very similar to this happens as soon as the job starts
running. this is for example, the output of nutch inject

13/02/22 12:07:30 INFO mapred.JobClient:  map 0% reduce 0%
13/02/22 12:07:52 INFO mapred.JobClient: Task Id :
attempt_201302191325_0013_m_000000_0, Status : FAILED
org.apache.gora.util.GoraException:
org.apache.hadoop.hbase.ZooKeeperConnectionException: HBase is able to
connect to ZooKeeper but the connection closes immediately. This could
be a sign that the server has too many connections (30 is the default).
Consider inspecting your ZK server logs for that error and then make
sure you are reusing HBaseConfiguration as often as you can. See
HTable's javadoc for more information.
at
org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:167)
at
org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:118)
at
org.apache.gora.mapreduce.GoraOutputFormat.getRecordWriter(GoraOutputFormat.java:88)
at
org.apache.hadoop.mapred.MapTask$NewDirectOutputCollector.<init>(MapTask.java:628)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:753)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Unknown Source)
at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1136)
at org.apache.hadoop.mapred.Child.main(Child.java:249)
Caused by: org.apache.hadoop.hbase.ZooKeeperConnectionException: HBase
is able to connect to ZooKeeper but the connection closes immediately.
This could be a sign that the server has too many connections (30 is the
default). Consider inspecting your ZK server logs for that error and
then make sure you are reusing HBaseConfiguration as often as you can.
See HTable's javadoc for more information.
at
org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.<init>(ZooKeeperWatcher.java:156)
at
org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.getZooKeeperWatcher(HConnectionManager.java:1265)
at
org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.setupZookeeperTrackers(HConnectionManager.java:526)
at
org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.<init>(HConnectionManager.java:516)
at
org.apache.hadoop.hbase.client.HConnectionManager.getConnection(HConnectionManager.java:173)
at org.apache.hadoop.hbase.client.HBaseAdmin.<init>(HBaseAdmin.java:93)
at org.apache.gora.hbase.store.HBaseStore.initialize(HBaseStore.java:108)
at
org.apache.gora.store.DataStoreFactory.initializeDataStore(DataStoreFactory.java:102)
at
org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:161)
... 10 more
Caused by: org.apache.zookeeper.KeeperException$ConnectionLossException:
KeeperErrorCode = ConnectionLoss for /hbase
at org.apache.zookeeper.KeeperException.create(KeeperException.java:99)
at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1237)
at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1265)
at
org.apache.hadoop.hbase.zookeeper.ZKUtil.createAndFailSilent(ZKUtil.java:931)
at
org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher.<init>(ZooKeeperWatcher.java:134)
... 18 more

Now I know that I am not running out of connection. for one thing I have
increased the number of connection to 200 in zoo.cfg, and also here is
what is in the zookeeper log file around that time:

2013-02-22 12:07:27,704 [myid:] - INFO
[NIOServerCxnFactory.AcceptThread:0.0.0.0/0.0.0.0:2181:NIOServerCnxnFactory$AcceptThread@289]
- Accepted socket connection from /127.0.0.1:55073
2013-02-22 12:07:27,707 [myid:] - INFO
[NIOWorkerThread-3:ZooKeeperServer@810] - Client attempting to establish
new session at /127.0.0.1:55073
2013-02-22 12:07:27,720 [myid:] - INFO
[SyncThread:0:ZooKeeperServer@566] - Established session
0x13d037b8e6b0016 with negotiated timeout 40000 for client /127.0.0.1:55073
2013-02-22 12:07:27,945 [myid:] - INFO
[NIOServerCxnFactory.AcceptThread:0.0.0.0/0.0.0.0:2181:NIOServerCnxnFactory$AcceptThread@289]
- Accepted socket connection from /127.0.0.1:55075
2013-02-22 12:07:27,946 [myid:] - INFO
[NIOWorkerThread-2:ZooKeeperServer@810] - Client attempting to establish
new session at /127.0.0.1:55075
2013-02-22 12:07:27,953 [myid:] - INFO
[SyncThread:0:ZooKeeperServer@566] - Established session
0x13d037b8e6b0017 with negotiated timeout 40000 for client /127.0.0.1:55075
2013-02-22 12:07:28,010 [myid:] - INFO  [ProcessThread(sid:0
cport:-1)::PrepRequestProcessor@533] - Processed session termination for
sessionid: 0x13d037b8e6b0017
2013-02-22 12:07:28,011 [myid:] - INFO
[NIOWorkerThread-6:NIOServerCnxn@1000] - Closed socket connection for
client /127.0.0.1:55075 which had sessionid 0x13d037b8e6b0017
2013-02-22 12:08:14,005 [myid:] - WARN
[NIOWorkerThread-7:NIOServerCnxn@362] - Unable to read additional data
from client sessionid 0x13d037b8e6b0016, likely client has closed socket
2013-02-22 12:08:14,005 [myid:] - INFO
[NIOWorkerThread-7:NIOServerCnxn@1000] - Closed socket connection for
client /127.0.0.1:55073 which had sessionid 0x13d037b8e6b0016
2013-02-22 12:08:48,000 [myid:] - INFO
[SessionTracker:ZooKeeperServer@304] - Expiring session
0x13d037b8e6b0016, timeout of 40000ms exceeded
2013-02-22 12:08:48,0
+
kaveh minooie 2013-02-23, 04:57