-
cluster startup problem
Gang Luo 2010-08-30, 02:49
Hi all,
I am trying to configure and start a Hadoop cluster on EC2, and I have run into some problems.

1. Can I share the Hadoop code and its configuration across nodes? Say I have a distributed file system running in the cluster and all the nodes can see the Hadoop code and conf there, so all the nodes use the same copy of the code and conf to run. Is that possible?

2. If all the nodes can share the Hadoop code and conf, does that mean I can launch Hadoop (bin/start-dfs.sh, bin/start-mapred.sh) from any node (even a slave node)?

3. I think I specified the master and slaves correctly. When I launch Hadoop from the master node, no tasktracker or datanode is launched on the slave nodes. The log on the slave nodes says:
ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: java.io.IOException: Incompatible namespaceIDs in /mnt/hadoop/dfs/data: namenode namespaceID = 1048149291; datanode namespaceID = 313740560
What is the problem?

Thanks,
-Gang
-
RE: cluster startup problem
xiujin yang 2010-08-30, 03:44
> Date: Mon, 30 Aug 2010 10:49:50 +0800
> From: [EMAIL PROTECTED]
> Subject: cluster startup problem
> To: [EMAIL PROTECTED]
>
> Hi all,
> I am trying to configure and start a hadoop cluster on EC2. I got some problems
> here.
>
> 1. Can I share hadoop code and its configuration across nodes? Say I have a
> distributed file system running in the cluster and all the nodes could see the
> hadoop code and conf there. So all the nodes will use the same copy of code and
> conf to run. Is it possible?

Use rsync.

> 2. if all the nodes could share hadoop and conf, does it mean I can launch
> hadoop (bin/start-dfs.sh, bin/start-mapred.sh) from any node (even slave node)?

Just give it a try and you will get your answer.

> 3. I think I specify and master and slave correctly. When I launch hadoop from
> master node, no tasktracker or datanode was launched on slave nodes. The log on
> slave nodes says:
>
> ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: java.io.IOException:
> Incompatible namespaceIDs in /mnt/hadoop/dfs/data: namenode namespaceID =
> 1048149291; datanode namespaceID = 313740560
>
> what is the problem?

If the HDFS data is not needed, just delete the HDFS data from the datanodes, one by one.

> Thanks,
> -Gang
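To make the error in question 3 concrete: the message reports two different namespace IDs, one held by the namenode and one stored in the datanode's data directory, and the datanode refuses to start while they differ. The following is only a minimal sketch for pulling those values out of a log line like the one quoted above. The function name `parse_namespace_mismatch` is hypothetical, and the regular expression assumes the exact message wording shown in this thread.

```python
import re

# Matches the DataNode error text quoted in this thread (wording assumed fixed).
_NAMESPACE_RE = re.compile(
    r"Incompatible namespaceIDs in (?P<dir>\S+): "
    r"namenode namespaceID = (?P<nn>\d+); "
    r"datanode namespaceID = (?P<dn>\d+)"
)


def parse_namespace_mismatch(log_line):
    """Return (data_dir, namenode_id, datanode_id), or None if the line
    does not contain an 'Incompatible namespaceIDs' error."""
    match = _NAMESPACE_RE.search(log_line)
    if match is None:
        return None
    return match.group("dir"), int(match.group("nn")), int(match.group("dn"))


if __name__ == "__main__":
    line = (
        "ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: "
        "java.io.IOException: Incompatible namespaceIDs in /mnt/hadoop/dfs/data: "
        "namenode namespaceID = 1048149291; datanode namespaceID = 313740560"
    )
    data_dir, nn_id, dn_id = parse_namespace_mismatch(line)
    if nn_id != dn_id:
        # This directory is the one whose contents the reply suggests deleting,
        # and only if the HDFS data stored there is not needed.
        print(f"{data_dir}: datanode ID {dn_id} does not match namenode ID {nn_id}")
```

Running this over each slave's datanode log would list the data directories affected, which is the set of directories the advice above applies to.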
-
Re: cluster startup problem
Hemanth Yamijala 2010-08-31, 02:17
Hi,
On Mon, Aug 30, 2010 at 8:19 AM, Gang Luo <[EMAIL PROTECTED]> wrote:
> Hi all,
> I am trying to configure and start a hadoop cluster on EC2. I got some problems
> here.
>
> 1. Can I share hadoop code and its configuration across nodes? Say I have a
> distributed file system running in the cluster and all the nodes could see the
> hadoop code and conf there. So all the nodes will use the same copy of code and
> conf to run. Is it possible?
>
If they are on the same path, technically it should be possible. However, I am not sure it is advisable at all. We've tried to do something like this using NFS, and it fails in ways that make debugging extremely hard. In short, having local copies on all nodes, installed at the same path, is the recommended option.
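As a rough illustration of keeping local copies at the same path on every node, the sketch below builds one rsync command per slave host. It assumes the slave hostnames are listed one per line in a file such as conf/slaves, that passwordless SSH from the master to each slave is already set up, and that the install path (here the hypothetical `/opt/hadoop`) is identical on every node. The helper names `read_slaves` and `rsync_commands` are made up for this example.

```python
import subprocess


def read_slaves(text):
    """Return hostnames from slaves-file text, skipping blanks and # comments."""
    hosts = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()
        if line:
            hosts.append(line)
    return hosts


def rsync_commands(hadoop_home, hosts):
    """Build one rsync argv per host, copying hadoop_home to the same path."""
    src = hadoop_home.rstrip("/") + "/"  # trailing slash: copy directory contents
    return [["rsync", "-az", src, f"{host}:{src}"] for host in hosts]


if __name__ == "__main__":
    hadoop_home = "/opt/hadoop"  # hypothetical install path, same on all nodes
    with open(f"{hadoop_home}/conf/slaves") as fh:
        hosts = read_slaves(fh.read())
    for cmd in rsync_commands(hadoop_home, hosts):
        print(" ".join(cmd))
        subprocess.run(cmd, check=True)  # stops at the first host that fails
```

The commands are built as argument lists rather than shell strings so that they can be inspected or logged before anything is copied.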
> 2. if all the nodes could share hadoop and conf, does it mean I can launch
> hadoop (bin/start-dfs.sh, bin/start-mapred.sh) from any node (even slave node)?
>
> 3. I think I specify and master and slave correctly. When I launch hadoop from
> master node, no tasktracker or datanode was launched on slave nodes. The log on
> slave nodes says:
>
> ERROR org.apache.hadoop.hdfs.server.datanode.DataNode: java.io.IOException:
> Incompatible namespaceIDs in /mnt/hadoop/dfs/data: namenode namespaceID =
> 1048149291; datanode namespaceID = 313740560
>
> what is the problem?
>
> Thanks,
> -Gang
-
Re: cluster startup problem
Greg Roelofs 2010-08-31, 19:48
Hemanth Yamijala wrote:
> On Mon, Aug 30, 2010 at 8:19 AM, Gang Luo <[EMAIL PROTECTED]> wrote:
> >
> > 1. Can I share hadoop code and its configuration across nodes? Say I have a
> > distributed file system running in the cluster and all the nodes could see the
> > hadoop code and conf there. So all the nodes will use the same copy of code and
> > conf to run. Is it possible?
>
> If they are on the same path, technically it should be possible.
> However, I am not sure it is advisable at all. We've tried to do
> something like this using NFS and it fails in ways that make debugging
> extremely hard.
Read-only NFS? I recently looked into an NFS-related unit-test bug (MR-2041), but those failures were due to directory creation and/or permissions-setting (i.e., writing), apparently timing-related.
Greg