|
|
-
Using NFS mounted volume for Hadoop installation/configuration
Mehmet Belgin 2013-02-18, 18:00
Hi Everyone,
Will it be any problem if I put the hadoop executables and configuration on a NFS volume, which is shared by all masters and slaves? This way the configuration changes will be available for all nodes, without need for synching any files. While this looks almost like a no-brainer, I am wondering if there are any pitfalls I need to be aware of.
On a related question, is there a best practices (do's and don'ts ) document that you can suggest other than the regular documentation by Apache?
Thanks! -Mehmet
+
Mehmet Belgin 2013-02-18, 18:00
-
Re: Using NFS mounted volume for Hadoop installation/configuration
Chris Embree 2013-02-18, 18:09
I'm doing that currently. No problems to report so far. The only pitfall I've found is around NFS stability. If your NAS is 100% solid no problems. I've seen mtab get messed up and refuse to remount if NFS has any hiccups. If you want to really crazy, consider NFS for your datanode root fs. See the oneSIS project for details. http://onesis.sourceforge.netEnjoy. On Mon, Feb 18, 2013 at 1:00 PM, Mehmet Belgin <[EMAIL PROTECTED] > wrote: > Hi Everyone, > > Will it be any problem if I put the hadoop executables and configuration > on a NFS volume, which is shared by all masters and slaves? This way the > configuration changes will be available for all nodes, without need for > synching any files. While this looks almost like a no-brainer, I am > wondering if there are any pitfalls I need to be aware of. > > On a related question, is there a best practices (do's and don'ts ) > document that you can suggest other than the regular documentation by > Apache? > > Thanks! > -Mehmet
+
Chris Embree 2013-02-18, 18:09
-
Re: Using NFS mounted volume for Hadoop installation/configuration
Paul Wilkinson 2013-02-18, 18:44
That requirement for 100% availability is the issue. If NFS goes down, you lose all sorts of things that are critical. This will work for a dev cluster, but strongly isn't recommended for production. As a first step, consider rsync - that way everything is local, so fewer external dependencies. After that, consider not managing boxes by hand :) Paul On 18 Feb 2013, at 18:09, Chris Embree <[EMAIL PROTECTED]> wrote: > I'm doing that currently. No problems to report so far. > > The only pitfall I've found is around NFS stability. If your NAS is 100% solid no problems. I've seen mtab get messed up and refuse to remount if NFS has any hiccups. > > If you want to really crazy, consider NFS for your datanode root fs. See the oneSIS project for details. http://onesis.sourceforge.net> > Enjoy. > > On Mon, Feb 18, 2013 at 1:00 PM, Mehmet Belgin <[EMAIL PROTECTED]> wrote: >> Hi Everyone, >> >> Will it be any problem if I put the hadoop executables and configuration on a NFS volume, which is shared by all masters and slaves? This way the configuration changes will be available for all nodes, without need for synching any files. While this looks almost like a no-brainer, I am wondering if there are any pitfalls I need to be aware of. >> >> On a related question, is there a best practices (do's and don'ts ) document that you can suggest other than the regular documentation by Apache? >> >> Thanks! >> -Mehmet >
+
Paul Wilkinson 2013-02-18, 18:44
-
Re: Using NFS mounted volume for Hadoop installation/configuration
Chris Embree 2013-02-18, 19:31
Just for clarification, we only use NFS for binaries and config files. HDFS and MarpRed write to local disk. We just don't install an OS there. :) On Mon, Feb 18, 2013 at 1:44 PM, Paul Wilkinson <[EMAIL PROTECTED]>wrote: > That requirement for 100% availability is the issue. If NFS goes down, you > lose all sorts of things that are critical. This will work for a dev > cluster, but strongly isn't recommended for production. > > As a first step, consider rsync - that way everything is local, so fewer > external dependencies. After that, consider not managing boxes by hand :) > > Paul > > > On 18 Feb 2013, at 18:09, Chris Embree <[EMAIL PROTECTED]> wrote: > > I'm doing that currently. No problems to report so far. > > The only pitfall I've found is around NFS stability. If your NAS is 100% > solid no problems. I've seen mtab get messed up and refuse to remount if > NFS has any hiccups. > > If you want to really crazy, consider NFS for your datanode root fs. See > the oneSIS project for details. http://onesis.sourceforge.net> > Enjoy. > > On Mon, Feb 18, 2013 at 1:00 PM, Mehmet Belgin < > [EMAIL PROTECTED]> wrote: > >> Hi Everyone, >> >> Will it be any problem if I put the hadoop executables and configuration >> on a NFS volume, which is shared by all masters and slaves? This way the >> configuration changes will be available for all nodes, without need for >> synching any files. While this looks almost like a no-brainer, I am >> wondering if there are any pitfalls I need to be aware of. >> >> On a related question, is there a best practices (do's and don'ts ) >> document that you can suggest other than the regular documentation by >> Apache? >> >> Thanks! >> -Mehmet > > >
+
Chris Embree 2013-02-18, 19:31
-
Re: Using NFS mounted volume for Hadoop installation/configuration
Chen He 2013-02-18, 18:17
To use NFS as datanode fs may bring performance problems. Millions of requests may block your NFS server. On Mon, Feb 18, 2013 at 12:09 PM, Chris Embree <[EMAIL PROTECTED]> wrote: > I'm doing that currently. No problems to report so far. > > The only pitfall I've found is around NFS stability. If your NAS is 100% > solid no problems. I've seen mtab get messed up and refuse to remount if > NFS has any hiccups. > > If you want to really crazy, consider NFS for your datanode root fs. See > the oneSIS project for details. http://onesis.sourceforge.net> > Enjoy. > > > On Mon, Feb 18, 2013 at 1:00 PM, Mehmet Belgin < > [EMAIL PROTECTED]> wrote: > >> Hi Everyone, >> >> Will it be any problem if I put the hadoop executables and configuration >> on a NFS volume, which is shared by all masters and slaves? This way the >> configuration changes will be available for all nodes, without need for >> synching any files. While this looks almost like a no-brainer, I am >> wondering if there are any pitfalls I need to be aware of. >> >> On a related question, is there a best practices (do's and don'ts ) >> document that you can suggest other than the regular documentation by >> Apache? >> >> Thanks! >> -Mehmet > > >
+
Chen He 2013-02-18, 18:17
-
Re: Using NFS mounted volume for Hadoop installation/configuration
Nan Zhu 2013-02-18, 18:14
I'm also maintaining an experimental Hadoop cluster, and I need to modify the Hadoop source code and test it, so just use NFS to deploy the latest version of code, no problem found yet Best, -- Nan Zhu School of Computer Science, McGill University On Monday, 18 February, 2013 at 1:09 PM, Chris Embree wrote: > I'm doing that currently. No problems to report so far. > > The only pitfall I've found is around NFS stability. If your NAS is 100% solid no problems. I've seen mtab get messed up and refuse to remount if NFS has any hiccups. > > If you want to really crazy, consider NFS for your datanode root fs. See the oneSIS project for details. http://onesis.sourceforge.net> > Enjoy. > > On Mon, Feb 18, 2013 at 1:00 PM, Mehmet Belgin <[EMAIL PROTECTED] (mailto:[EMAIL PROTECTED])> wrote: > > Hi Everyone, > > > > Will it be any problem if I put the hadoop executables and configuration on a NFS volume, which is shared by all masters and slaves? This way the configuration changes will be available for all nodes, without need for synching any files. While this looks almost like a no-brainer, I am wondering if there are any pitfalls I need to be aware of. > > > > On a related question, is there a best practices (do's and don'ts ) document that you can suggest other than the regular documentation by Apache? > > > > Thanks! > > -Mehmet
+
Nan Zhu 2013-02-18, 18:14
-
Re: Using NFS mounted volume for Hadoop installation/configuration
Mehmet Belgin 2013-02-18, 18:53
It looks like the NFS stability and performance are two main concerns. Since my cluster is still experimental, I will continue to use NFS for now. In the future, when we have a larger production cluster, I will consider local configurations.
Thank you all for your replies!
-Mehmet
On Feb 18, 2013, at 1:00 PM, Mehmet Belgin wrote:
> Hi Everyone, > > Will it be any problem if I put the hadoop executables and configuration on a NFS volume, which is shared by all masters and slaves? This way the configuration changes will be available for all nodes, without need for synching any files. While this looks almost like a no-brainer, I am wondering if there are any pitfalls I need to be aware of. > > On a related question, is there a best practices (do's and don'ts ) document that you can suggest other than the regular documentation by Apache? > > Thanks! > -Mehmet
+
Mehmet Belgin 2013-02-18, 18:53
-
Re: Using NFS mounted volume for Hadoop installation/configuration
Chen He 2013-02-18, 19:01
Cloudera manager or Zettaset can be a choice if you like easy configuration. This type of software will do the rysnc for you.
On Mon, Feb 18, 2013 at 12:53 PM, Mehmet Belgin < [EMAIL PROTECTED]> wrote:
> It looks like the NFS stability and performance are two main concerns. > Since my cluster is still experimental, I will continue to use NFS for now. > In the future, when we have a larger production cluster, I will consider > local configurations. > > Thank you all for your replies! > > -Mehmet > > > > On Feb 18, 2013, at 1:00 PM, Mehmet Belgin wrote: > > Hi Everyone, > > Will it be any problem if I put the hadoop executables and configuration > on a NFS volume, which is shared by all masters and slaves? This way the > configuration changes will be available for all nodes, without need for > synching any files. While this looks almost like a no-brainer, I am > wondering if there are any pitfalls I need to be aware of. > > On a related question, is there a best practices (do's and don'ts ) > document that you can suggest other than the regular documentation by > Apache? > > Thanks! > -Mehmet > > >
+
Chen He 2013-02-18, 19:01
|
|