|
|
+
Jeff Zhang 2010-03-17, 07:14
+
William Kang 2010-03-17, 07:18
-
Re: Distributed hadoop setup 0 live datanode problem in clusterWilliam Kang 2010-03-17, 07:29
Hi Jeff,
I think I partly found out the reasons of this problem. The /etc/hosts 127.0.0.1 has the master's host name in it. And the namenode took 127.0.0.1 as the ip address of the namenode. I fixed it and I already found two nodes. There is one still missing. I will let you guys know what happened. Thanks. William On Wed, Mar 17, 2010 at 3:14 AM, Jeff Zhang <[EMAIL PROTECTED]> wrote: > Can you post your namenode's log ? It seems that your data node can not > connect to the name node. > > On Wed, Mar 17, 2010 at 2:43 PM, William Kang <[EMAIL PROTECTED] > >wrote: > > > Hi, > > I just moved from pseudo distributed hadoop to a four machine full > > distributed hadoop setup. > > > > But, after I start the dfs, there is no live node showing up. If I make > > master a slave too, then the datanode in master machine will show up. > > > > I looked up all logs and found no errors. The only thing > > looks suspicious is the log in the datanode: > > > > > > ************************************ > > 2010-03-17 02:39:04,003 INFO org.apache.hadoop.ipc.RPC: Server at > > /xx.xx.xx.xx:9000 not available yet, Zzzzz... > > 2010-03-17 02:39:06,064 INFO org.apache.hadoop.ipc.Client: Retrying > connect > > to server: /xx.xx.xx.xx:9000. Already tried 0 time(s). > > 2010-03-17 02:39:07,076 INFO org.apache.hadoop.ipc.Client: Retrying > connect > > to server: /xx.xx.xx.xx:9000. Already tried 1 time(s). > > 2010-03-17 02:39:08,081 INFO org.apache.hadoop.ipc.Client: Retrying > connect > > to server: /xx.xx.xx.xx:9000. Already tried 2 time(s). > > 2010-03-17 02:39:09,098 INFO org.apache.hadoop.ipc.Client: Retrying > connect > > to server: /xx.xx.xx.xx6:9000. Already tried 3 time(s). > > 2010-03-17 02:39:10,159 INFO org.apache.hadoop.ipc.Client: Retrying > connect > > to server: /xx.xx.xx.xx:9000. Already tried 4 time(s). > > 2010-03-17 02:39:11,179 INFO org.apache.hadoop.ipc.Client: Retrying > connect > > to server: /xx.xx.xx.xx:9000. Already tried 5 time(s). > > 2010-03-17 02:39:12,221 INFO org.apache.hadoop.ipc.Client: Retrying > connect > > to server: /xx.xx.xx.xx:9000. Already tried 6 time(s). > > 2010-03-17 02:39:13,372 INFO org.apache.hadoop.ipc.Client: Retrying > connect > > to server: /xx.xx.xx.xx:9000. Already tried 7 time(s). > > 2010-03-17 02:39:14,545 INFO org.apache.hadoop.ipc.Client: Retrying > connect > > to server: /xx.xx.xx.xx:9000. Already tried 8 time(s). > > 2010-03-17 02:39:15,558 INFO org.apache.hadoop.ipc.Client: Retrying > connect > > to server: /xx.xx.xx.xx:9000. Already tried 9 time(s). > > ************************************* > > > > Does anybody know what might cause this problem? > > > > ssh among these machines are fine without password. The owner of hadoop > > folder has been changed to the same hadoop user. > > > > Thanks! > > > > > > William > > > > > > -- > Best Regards > > Jeff Zhang > +
Steve Loughran 2010-03-22, 11:36
|