Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> could not start HMaster

Copy link to this message
答复: could not start HMaster
Is there any complain in HDFS log ?
发送时间: 2012年10月16日 4:35
主题: RE: could not start HMaster

No, I don't think so. This is a dedicated testing machine and no automatic cleaning up on the /tmp folder...



-----Original Message-----
From: Jimmy Xiang [mailto:[EMAIL PROTECTED]]
Sent: Monday, October 15, 2012 1:32 PM
Subject: Re: could not start HMaster

Is your /tmp folder cleaned up automatically and some files are gone?


On Mon, Oct 15, 2012 at 12:26 PM,  <[EMAIL PROTECTED]> wrote:
> Hi,
> I set up a single node HBase server on top of Hadoop and it has been working fine with most of my testing scenarios such as creating tables and inserting data. Just during the weekend, I accidentally left a testing script running that inserts about 67 rows every min for three days. Today when I looked at the environment, I found out that HBase master could not be started anymore. Digging into the logs, I could see that starting from the second day, HBase first got an exception as follows:
> 2012-10-13 13:05:07,367 INFO org.apache.hadoop.hbase.regionserver.wal.HLog: Roll /tmp/hbase-root/hbase/.logs/sflow-linux02.santanet.dell.com,47137,1348606516541/sflow-linux02.santanet.dell.com%2C47137%2C1348606516541.1350155105992, entries=7981, filesize=3754556.  for /tmp/hbase-root/hbase/.logs/sflow-linux02.santanet.dell.com,47137,1348606516541/sflow-linux02.santanet.dell.com%2C47137%2C1348606516541.1350158707364
> 2012-10-13 13:05:07,367 INFO org.apache.hadoop.hbase.regionserver.wal.HLog: moving old hlog file /tmp/hbase-root/hbase/.logs/sflow-linux02.santanet.dell.com,47137,1348606516541/sflow-linux02.santanet.dell.com%2C47137%2C1348606516541.1348606520442 whose highest sequenceid is 4 to /tmp/hbase-root/hbase/.oldlogs/sflow-linux02.santanet.dell.com%2C47137%2C1348606516541.1348606520442
> 2012-10-13 13:05:07,379 FATAL org.apache.hadoop.hbase.regionserver.HRegionServer: ABORTING region server sflow-linux02.santanet.dell.com,47137,1348606516541: IOE in log roller
> java.io.FileNotFoundException: File file:/tmp/hbase-root/hbase/.logs/sflow-linux02.santanet.dell.com,47137,1348606516541/sflow-linux02.santanet.dell.com%2C47137%2C1348606516541.1348606520442 does not exist.
>        at org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:397)
>        at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:213)
>        at org.apache.hadoop.fs.FileUtil.copy(FileUtil.java:163)
>        at org.apache.hadoop.fs.RawLocalFileSystem.rename(RawLocalFileSystem.java:287)
>        at org.apache.hadoop.fs.ChecksumFileSystem.rename(ChecksumFileSystem.java:428)
>        at org.apache.hadoop.hbase.regionserver.wal.HLog.archiveLogFile(HLog.java:825)
>        at org.apache.hadoop.hbase.regionserver.wal.HLog.cleanOldLogs(HLog.java:708)
>        at org.apache.hadoop.hbase.regionserver.wal.HLog.rollWriter(HLog.java:603)
>        at org.apache.hadoop.hbase.regionserver.LogRoller.run(LogRoller.java:94)
>        at java.lang.Thread.run(Thread.java:662)
> Then SplitLogManager kept splitting the logs for about two days:
> 2012-10-13 13:05:09,061 WARN org.apache.zookeeper.server.NIOServerCnxn: caught end of stream exception
> EndOfStreamException: Unable to read additional data from client sessionid 0x139ff3656b30003, likely client has closed socket
>        at org.apache.zookeeper.server.NIOServerCnxn.doIO(NIOServerCnxn.java:220)
>        at org.apache.zookeeper.server.NIOServerCnxnFactory.run(NIOServerCnxnFactory.java:224)
>        at java.lang.Thread.run(Thread.java:662)
> 2012-10-13 13:05:09,061 INFO org.apache.zookeeper.server.NIOServerCnxn: Closed socket connection for client / which had sessionid 0x139ff3656b30003
> 2012-10-13 13:05:09,082 INFO org.apache.zookeeper.ClientCnxn: EventThread shut down
> 2012-10-13 13:05:09,085 INFO org.apache.hadoop.hbase.master.handler.ServerShutdownHandler: Splitting logs for sflow-linux02.santanet.dell.com,47137,1348606516541