|
Ralph Castain
2012-04-09, 16:10
Robert Evans
2012-04-09, 17:40
Ralph Castain
2012-04-09, 17:45
Kihwal Lee
2012-04-09, 19:27
Ralph Castain
2012-04-09, 20:13
Ralph Castain
2012-04-09, 20:26
Ralph Castain
2012-04-09, 20:30
Kihwal Lee
2012-04-09, 20:45
Ralph Castain
2012-04-09, 21:50
Ralph Castain
2012-04-09, 21:51
Daryn Sharp
2012-04-10, 13:03
Milind.Bhandarkar@...
2012-04-09, 18:01
|
-
Help with errorRalph Castain 2012-04-09, 16:10
Hi folks
I'm trying to develop an AM for the 0.23 branch and running into a problem that I'm having difficulty debugging. My client relocates some supporting files to HDFS, creates the application object for the AM, and submits it to the RM. The file relocation request doesn't generate an error, so I must assume it succeeded. It would be nice if there was some obvious way to verify that, but I haven't discovered it. Can anyone give me a hint? I tried asking hdfs to -ls, but all I get is that "." doesn't exist. I have no idea where the file would be placed, if it would persist once the job fails, etc. When the job is submitted, all I get is an "Error 500", which tells me nothing. Reminds me of the old days of 40 years ago when you'd get the dreaded "error 11", which meant anything from a divide by zero to a memory violation. Are there any debug flags I could set that might provide more info? Thanks Ralph +
Ralph Castain 2012-04-09, 16:10
-
Re: Help with errorRobert Evans 2012-04-09, 17:40
What do you mean by relocated some supporting files to HDFS? How do you relocate them? What API do you use?
--Bobby Evans On 4/9/12 11:10 AM, "Ralph Castain" <[EMAIL PROTECTED]> wrote: Hi folks I'm trying to develop an AM for the 0.23 branch and running into a problem that I'm having difficulty debugging. My client relocates some supporting files to HDFS, creates the application object for the AM, and submits it to the RM. The file relocation request doesn't generate an error, so I must assume it succeeded. It would be nice if there was some obvious way to verify that, but I haven't discovered it. Can anyone give me a hint? I tried asking hdfs to -ls, but all I get is that "." doesn't exist. I have no idea where the file would be placed, if it would persist once the job fails, etc. When the job is submitted, all I get is an "Error 500", which tells me nothing. Reminds me of the old days of 40 years ago when you'd get the dreaded "error 11", which meant anything from a divide by zero to a memory violation. Are there any debug flags I could set that might provide more info? Thanks Ralph +
Robert Evans 2012-04-09, 17:40
-
Re: Help with errorRalph Castain 2012-04-09, 17:45
Hi Bobby
On Apr 9, 2012, at 11:40 AM, Robert Evans wrote: > What do you mean by relocated some supporting files to HDFS? How do you relocate them? What API do you use? I use the LocalResource and FileSystem classes to do the relocation, per the Hadoop example: // set local resources for the application master // local files or archives as needed // In this scenario, the jar file for the application master is part of the local resources Map<String, LocalResource> localResources = new HashMap<String, LocalResource>(); LOG.info("Copy openmpi tarball from local filesystem and add to local environment"); // Copy the application master jar to the filesystem // Create a local resource to point to the destination jar path FileSystem fs; FileStatus destStatus; try { fs = FileSystem.get(conf); Path src = new Path(pathOMPItarball); String pathSuffix = appName + "/" + appId.getId(); Path dst = new Path(fs.getHomeDirectory(), pathSuffix); try { fs.copyFromLocalFile(false, true, src, dst); try { destStatus = fs.getFileStatus(dst); LocalResource amJarRsrc = Records.newRecord(LocalResource.class); // Set the type of resource - file or archive // archives are untarred at destination amJarRsrc.setType(LocalResourceType.ARCHIVE); // Set visibility of the resource // Setting to most private option amJarRsrc.setVisibility(LocalResourceVisibility.APPLICATION); // Set the resource to be copied over amJarRsrc.setResource(ConverterUtils.getYarnUrlFromPath(dst)); // Set timestamp and length of file so that the framework // can do basic sanity checks for the local resource // after it has been copied over to ensure it is the same // resource the client intended to use with the application amJarRsrc.setTimestamp(destStatus.getModificationTime()); amJarRsrc.setSize(destStatus.getLen()); localResources.put("openmpi", amJarRsrc); } catch (Throwable t) { LOG.fatal("Error on file status", t); System.exit(1); } } catch (Throwable t) { LOG.fatal("Error on copy from local file", t); System.exit(1); } } catch (Throwable t) { LOG.fatal("Error getting filesystem configuration", t); System.exit(1); } Note that this appears to work fine when the local resource type was "file" - at least, I was able to make a simple program work that way. Problem I'm having is when I move an archive, which is why I was hoping to look at the HDFS end to see what files are present, and in what locations so I can set the paths accordingly. Thanks Ralph > > --Bobby Evans > > > On 4/9/12 11:10 AM, "Ralph Castain" <[EMAIL PROTECTED]> wrote: > > Hi folks > > I'm trying to develop an AM for the 0.23 branch and running into a problem that I'm having difficulty debugging. My client relocates some supporting files to HDFS, creates the application object for the AM, and submits it to the RM. > > The file relocation request doesn't generate an error, so I must assume it succeeded. It would be nice if there was some obvious way to verify that, but I haven't discovered it. Can anyone give me a hint? I tried asking hdfs to -ls, but all I get is that "." doesn't exist. I have no idea where the file would be placed, if it would persist once the job fails, etc. > > When the job is submitted, all I get is an "Error 500", which tells me nothing. Reminds me of the old days of 40 years ago when you'd get the dreaded "error 11", which meant anything from a divide by zero to a memory violation. Are there any debug flags I could set that might provide more info? > > Thanks > Ralph > > +
Ralph Castain 2012-04-09, 17:45
-
Re: Help with errorKihwal Lee 2012-04-09, 19:27
It looks like the home directory does not exist but the copy went through.
Can you try to LOG the key fields in destStatus including path? It might be ending up in an unexpected place. Kihwal On 4/9/12 12:45 PM, "Ralph Castain" <[EMAIL PROTECTED]> wrote: Hi Bobby On Apr 9, 2012, at 11:40 AM, Robert Evans wrote: > What do you mean by relocated some supporting files to HDFS? How do you relocate them? What API do you use? I use the LocalResource and FileSystem classes to do the relocation, per the Hadoop example: // set local resources for the application master // local files or archives as needed // In this scenario, the jar file for the application master is part of the local resources Map<String, LocalResource> localResources = new HashMap<String, LocalResource>(); LOG.info("Copy openmpi tarball from local filesystem and add to local environment"); // Copy the application master jar to the filesystem // Create a local resource to point to the destination jar path FileSystem fs; FileStatus destStatus; try { fs = FileSystem.get(conf); Path src = new Path(pathOMPItarball); String pathSuffix = appName + "/" + appId.getId(); Path dst = new Path(fs.getHomeDirectory(), pathSuffix); try { fs.copyFromLocalFile(false, true, src, dst); try { destStatus = fs.getFileStatus(dst); LocalResource amJarRsrc = Records.newRecord(LocalResource.class); // Set the type of resource - file or archive // archives are untarred at destination amJarRsrc.setType(LocalResourceType.ARCHIVE); // Set visibility of the resource // Setting to most private option amJarRsrc.setVisibility(LocalResourceVisibility.APPLICATION); // Set the resource to be copied over amJarRsrc.setResource(ConverterUtils.getYarnUrlFromPath(dst)); // Set timestamp and length of file so that the framework // can do basic sanity checks for the local resource // after it has been copied over to ensure it is the same // resource the client intended to use with the application amJarRsrc.setTimestamp(destStatus.getModificationTime()); amJarRsrc.setSize(destStatus.getLen()); localResources.put("openmpi", amJarRsrc); } catch (Throwable t) { LOG.fatal("Error on file status", t); System.exit(1); } } catch (Throwable t) { LOG.fatal("Error on copy from local file", t); System.exit(1); } } catch (Throwable t) { LOG.fatal("Error getting filesystem configuration", t); System.exit(1); } Note that this appears to work fine when the local resource type was "file" - at least, I was able to make a simple program work that way. Problem I'm having is when I move an archive, which is why I was hoping to look at the HDFS end to see what files are present, and in what locations so I can set the paths accordingly. Thanks Ralph > > --Bobby Evans > > > On 4/9/12 11:10 AM, "Ralph Castain" <[EMAIL PROTECTED]> wrote: > > Hi folks > > I'm trying to develop an AM for the 0.23 branch and running into a problem that I'm having difficulty debugging. My client relocates some supporting files to HDFS, creates the application object for the AM, and submits it to the RM. > > The file relocation request doesn't generate an error, so I must assume it succeeded. It would be nice if there was some obvious way to verify that, but I haven't discovered it. Can anyone give me a hint? I tried asking hdfs to -ls, but all I get is that "." doesn't exist. I have no idea where the file would be placed, if it would persist once the job fails, etc. +
Kihwal Lee 2012-04-09, 19:27
-
Re: Help with errorRalph Castain 2012-04-09, 20:13
I'd be happy to do so - if I could only find the blasted definition of FileStatus!! Can you point me to it?
On Apr 9, 2012, at 1:27 PM, Kihwal Lee wrote: > It looks like the home directory does not exist but the copy went through. > Can you try to LOG the key fields in destStatus including path? It might be ending up in an unexpected place. > > Kihwal > > > > On 4/9/12 12:45 PM, "Ralph Castain" <[EMAIL PROTECTED]> wrote: > > Hi Bobby > > On Apr 9, 2012, at 11:40 AM, Robert Evans wrote: > >> What do you mean by relocated some supporting files to HDFS? How do you relocate them? What API do you use? > > I use the LocalResource and FileSystem classes to do the relocation, per the Hadoop example: > > // set local resources for the application master > // local files or archives as needed > // In this scenario, the jar file for the application master is part of the local resources > Map<String, LocalResource> localResources = new HashMap<String, LocalResource>(); > > LOG.info("Copy openmpi tarball from local filesystem and add to local environment"); > // Copy the application master jar to the filesystem > // Create a local resource to point to the destination jar path > FileSystem fs; > FileStatus destStatus; > try { > fs = FileSystem.get(conf); > Path src = new Path(pathOMPItarball); > String pathSuffix = appName + "/" + appId.getId(); > Path dst = new Path(fs.getHomeDirectory(), pathSuffix); > try { > fs.copyFromLocalFile(false, true, src, dst); > try { > destStatus = fs.getFileStatus(dst); > LocalResource amJarRsrc = Records.newRecord(LocalResource.class); > > // Set the type of resource - file or archive > // archives are untarred at destination > amJarRsrc.setType(LocalResourceType.ARCHIVE); > // Set visibility of the resource > // Setting to most private option > amJarRsrc.setVisibility(LocalResourceVisibility.APPLICATION); > // Set the resource to be copied over > amJarRsrc.setResource(ConverterUtils.getYarnUrlFromPath(dst)); > // Set timestamp and length of file so that the framework > // can do basic sanity checks for the local resource > // after it has been copied over to ensure it is the same > // resource the client intended to use with the application > amJarRsrc.setTimestamp(destStatus.getModificationTime()); > amJarRsrc.setSize(destStatus.getLen()); > localResources.put("openmpi", amJarRsrc); > } catch (Throwable t) { > LOG.fatal("Error on file status", t); > System.exit(1); > } > } catch (Throwable t) { > LOG.fatal("Error on copy from local file", t); > System.exit(1); > } > } catch (Throwable t) { > LOG.fatal("Error getting filesystem configuration", t); > System.exit(1); > } > > Note that this appears to work fine when the local resource type was "file" - at least, I was able to make a simple program work that way. Problem I'm having is when I move an archive, which is why I was hoping to look at the HDFS end to see what files are present, and in what locations so I can set the paths accordingly. > > Thanks > Ralph > > >> >> --Bobby Evans >> >> >> On 4/9/12 11:10 AM, "Ralph Castain" <[EMAIL PROTECTED]> wrote: >> >> Hi folks >> >> I'm trying to develop an AM for the 0.23 branch and running into a problem that I'm having difficulty debugging. My client relocates some supporting files to HDFS, creates the application object for the AM, and submits it to the RM. >> >> The file relocation request doesn't generate an error, so I must assume it succeeded. It would be nice if there was some obvious way to verify that, but I haven't discovered it. Can anyone give me a hint? I tried asking hdfs to -ls, but all I get is that "." doesn't exist. I have no idea where the file would be placed, if it would persist once the job fails, etc. +
Ralph Castain 2012-04-09, 20:13
-
Re: Help with errorRalph Castain 2012-04-09, 20:26
Finally managed to chase down the 0.23 API docs and get the FileStatus definition. No real joy here - I output the path and got:
code: LOG.info("destination path " + destStatus.getPath()); 2012-04-09 14:22:48,359 INFO Hamster (Hamster.java:getApplication(265)) - destination path file:/Users/rhc/yarnrun/13 However, when I attempt to list it: Ralphs-iMac:bin rhc$ ./hdfs dfs -ls /Users/rhc/yarnrun 2012-04-09 14:22:57.640 java[14292:1903] Unable to load realm info from SCDynamicStore 2012-04-09 14:22:57.686 java[14292:1903] Unable to load realm info from SCDynamicStore ls: `/Users/rhc/yarnrun': No such file or directory I've been unable to track down the "realm" warnings, so I don't know if that is pertinent or not. It appears the files are not getting copied across, though the location looks okay to my eyes. On Apr 9, 2012, at 1:27 PM, Kihwal Lee wrote: > It looks like the home directory does not exist but the copy went through. > Can you try to LOG the key fields in destStatus including path? It might be ending up in an unexpected place. > > Kihwal > > > > On 4/9/12 12:45 PM, "Ralph Castain" <[EMAIL PROTECTED]> wrote: > > Hi Bobby > > On Apr 9, 2012, at 11:40 AM, Robert Evans wrote: > >> What do you mean by relocated some supporting files to HDFS? How do you relocate them? What API do you use? > > I use the LocalResource and FileSystem classes to do the relocation, per the Hadoop example: > > // set local resources for the application master > // local files or archives as needed > // In this scenario, the jar file for the application master is part of the local resources > Map<String, LocalResource> localResources = new HashMap<String, LocalResource>(); > > LOG.info("Copy openmpi tarball from local filesystem and add to local environment"); > // Copy the application master jar to the filesystem > // Create a local resource to point to the destination jar path > FileSystem fs; > FileStatus destStatus; > try { > fs = FileSystem.get(conf); > Path src = new Path(pathOMPItarball); > String pathSuffix = appName + "/" + appId.getId(); > Path dst = new Path(fs.getHomeDirectory(), pathSuffix); > try { > fs.copyFromLocalFile(false, true, src, dst); > try { > destStatus = fs.getFileStatus(dst); > LocalResource amJarRsrc = Records.newRecord(LocalResource.class); > > // Set the type of resource - file or archive > // archives are untarred at destination > amJarRsrc.setType(LocalResourceType.ARCHIVE); > // Set visibility of the resource > // Setting to most private option > amJarRsrc.setVisibility(LocalResourceVisibility.APPLICATION); > // Set the resource to be copied over > amJarRsrc.setResource(ConverterUtils.getYarnUrlFromPath(dst)); > // Set timestamp and length of file so that the framework > // can do basic sanity checks for the local resource > // after it has been copied over to ensure it is the same > // resource the client intended to use with the application > amJarRsrc.setTimestamp(destStatus.getModificationTime()); > amJarRsrc.setSize(destStatus.getLen()); > localResources.put("openmpi", amJarRsrc); > } catch (Throwable t) { > LOG.fatal("Error on file status", t); > System.exit(1); > } > } catch (Throwable t) { > LOG.fatal("Error on copy from local file", t); > System.exit(1); > } > } catch (Throwable t) { > LOG.fatal("Error getting filesystem configuration", t); > System.exit(1); > } +
Ralph Castain 2012-04-09, 20:26
-
Re: Help with errorRalph Castain 2012-04-09, 20:30
Solved the "realm" warning courtesy of stackoverflow:
export HADOOP_OPTS="-Djava.security.krb5.realm=OX.AC.UK -Djava.security.krb5.kdc=kdc0.ox.ac.uk:kdc1.ox.ac.uk" solves it. Didn't help resolve the problem, as expected. On Apr 9, 2012, at 2:26 PM, Ralph Castain wrote: > Finally managed to chase down the 0.23 API docs and get the FileStatus definition. No real joy here - I output the path and got: > > code: LOG.info("destination path " + destStatus.getPath()); > > 2012-04-09 14:22:48,359 INFO Hamster (Hamster.java:getApplication(265)) - destination path file:/Users/rhc/yarnrun/13 > > However, when I attempt to list it: > > Ralphs-iMac:bin rhc$ ./hdfs dfs -ls /Users/rhc/yarnrun > 2012-04-09 14:22:57.640 java[14292:1903] Unable to load realm info from SCDynamicStore > 2012-04-09 14:22:57.686 java[14292:1903] Unable to load realm info from SCDynamicStore > ls: `/Users/rhc/yarnrun': No such file or directory > > I've been unable to track down the "realm" warnings, so I don't know if that is pertinent or not. It appears the files are not getting copied across, though the location looks okay to my eyes. > > > On Apr 9, 2012, at 1:27 PM, Kihwal Lee wrote: > >> It looks like the home directory does not exist but the copy went through. >> Can you try to LOG the key fields in destStatus including path? It might be ending up in an unexpected place. >> >> Kihwal >> >> >> >> On 4/9/12 12:45 PM, "Ralph Castain" <[EMAIL PROTECTED]> wrote: >> >> Hi Bobby >> >> On Apr 9, 2012, at 11:40 AM, Robert Evans wrote: >> >>> What do you mean by relocated some supporting files to HDFS? How do you relocate them? What API do you use? >> >> I use the LocalResource and FileSystem classes to do the relocation, per the Hadoop example: >> >> // set local resources for the application master >> // local files or archives as needed >> // In this scenario, the jar file for the application master is part of the local resources >> Map<String, LocalResource> localResources = new HashMap<String, LocalResource>(); >> >> LOG.info("Copy openmpi tarball from local filesystem and add to local environment"); >> // Copy the application master jar to the filesystem >> // Create a local resource to point to the destination jar path >> FileSystem fs; >> FileStatus destStatus; >> try { >> fs = FileSystem.get(conf); >> Path src = new Path(pathOMPItarball); >> String pathSuffix = appName + "/" + appId.getId(); >> Path dst = new Path(fs.getHomeDirectory(), pathSuffix); >> try { >> fs.copyFromLocalFile(false, true, src, dst); >> try { >> destStatus = fs.getFileStatus(dst); >> LocalResource amJarRsrc = Records.newRecord(LocalResource.class); >> >> // Set the type of resource - file or archive >> // archives are untarred at destination >> amJarRsrc.setType(LocalResourceType.ARCHIVE); >> // Set visibility of the resource >> // Setting to most private option >> amJarRsrc.setVisibility(LocalResourceVisibility.APPLICATION); >> // Set the resource to be copied over >> amJarRsrc.setResource(ConverterUtils.getYarnUrlFromPath(dst)); >> // Set timestamp and length of file so that the framework >> // can do basic sanity checks for the local resource >> // after it has been copied over to ensure it is the same >> // resource the client intended to use with the application >> amJarRsrc.setTimestamp(destStatus.getModificationTime()); >> amJarRsrc.setSize(destStatus.getLen()); >> localResources.put("openmpi", amJarRsrc); >> } catch (Throwable t) { >> LOG.fatal("Error on file status", t); >> System.exit(1); +
Ralph Castain 2012-04-09, 20:30
-
Re: Help with errorKihwal Lee 2012-04-09, 20:45
The path, "file:/Users/rhc/yarnrun/13", indicates that your copy operation's destination was the local file system, instead of hdfs. What is the value of "fs.default.name" set to in core-site.xml?
Kihwal On 4/9/12 3:26 PM, "Ralph Castain" <[EMAIL PROTECTED]> wrote: Finally managed to chase down the 0.23 API docs and get the FileStatus definition. No real joy here - I output the path and got: code: LOG.info("destination path " + destStatus.getPath()); 2012-04-09 14:22:48,359 INFO Hamster (Hamster.java:getApplication(265)) - destination path file:/Users/rhc/yarnrun/13 However, when I attempt to list it: Ralphs-iMac:bin rhc$ ./hdfs dfs -ls /Users/rhc/yarnrun 2012-04-09 14:22:57.640 java[14292:1903] Unable to load realm info from SCDynamicStore 2012-04-09 14:22:57.686 java[14292:1903] Unable to load realm info from SCDynamicStore ls: `/Users/rhc/yarnrun': No such file or directory I've been unable to track down the "realm" warnings, so I don't know if that is pertinent or not. It appears the files are not getting copied across, though the location looks okay to my eyes. On Apr 9, 2012, at 1:27 PM, Kihwal Lee wrote: > It looks like the home directory does not exist but the copy went through. > Can you try to LOG the key fields in destStatus including path? It might be ending up in an unexpected place. > > Kihwal > > > > On 4/9/12 12:45 PM, "Ralph Castain" <[EMAIL PROTECTED]> wrote: > > Hi Bobby > > On Apr 9, 2012, at 11:40 AM, Robert Evans wrote: > >> What do you mean by relocated some supporting files to HDFS? How do you relocate them? What API do you use? > > I use the LocalResource and FileSystem classes to do the relocation, per the Hadoop example: > > // set local resources for the application master > // local files or archives as needed > // In this scenario, the jar file for the application master is part of the local resources > Map<String, LocalResource> localResources = new HashMap<String, LocalResource>(); > > LOG.info("Copy openmpi tarball from local filesystem and add to local environment"); > // Copy the application master jar to the filesystem > // Create a local resource to point to the destination jar path > FileSystem fs; > FileStatus destStatus; > try { > fs = FileSystem.get(conf); > Path src = new Path(pathOMPItarball); > String pathSuffix = appName + "/" + appId.getId(); > Path dst = new Path(fs.getHomeDirectory(), pathSuffix); > try { > fs.copyFromLocalFile(false, true, src, dst); > try { > destStatus = fs.getFileStatus(dst); > LocalResource amJarRsrc = Records.newRecord(LocalResource.class); > > // Set the type of resource - file or archive > // archives are untarred at destination > amJarRsrc.setType(LocalResourceType.ARCHIVE); > // Set visibility of the resource > // Setting to most private option > amJarRsrc.setVisibility(LocalResourceVisibility.APPLICATION); > // Set the resource to be copied over > amJarRsrc.setResource(ConverterUtils.getYarnUrlFromPath(dst)); > // Set timestamp and length of file so that the framework > // can do basic sanity checks for the local resource > // after it has been copied over to ensure it is the same > // resource the client intended to use with the application > amJarRsrc.setTimestamp(destStatus.getModificationTime()); > amJarRsrc.setSize(destStatus.getLen()); > localResources.put("openmpi", amJarRsrc); > } catch (Throwable t) { > LOG.fatal("Error on file status", t); > System.exit(1); > } > } catch (Throwable t) { +
Kihwal Lee 2012-04-09, 20:45
-
Re: Help with errorRalph Castain 2012-04-09, 21:50
On Apr 9, 2012, at 2:45 PM, Kihwal Lee wrote: > The path, "file:/Users/rhc/yarnrun/13", indicates that your copy operation's destination was the local file system, instead of hdfs. Yeah, I realized that too after I sent the note. Sure enough - the files are there. > What is the value of "fs.default.name" set to in core-site.xml? <configuration> <property> <name>fs.default.name</name> <value>hdfs://localhost:9000</value> </property> </configuration> > > Kihwal > > > On 4/9/12 3:26 PM, "Ralph Castain" <[EMAIL PROTECTED]> wrote: > > Finally managed to chase down the 0.23 API docs and get the FileStatus definition. No real joy here - I output the path and got: > > code: LOG.info("destination path " + destStatus.getPath()); > > 2012-04-09 14:22:48,359 INFO Hamster (Hamster.java:getApplication(265)) - destination path file:/Users/rhc/yarnrun/13 > > However, when I attempt to list it: > > Ralphs-iMac:bin rhc$ ./hdfs dfs -ls /Users/rhc/yarnrun > 2012-04-09 14:22:57.640 java[14292:1903] Unable to load realm info from SCDynamicStore > 2012-04-09 14:22:57.686 java[14292:1903] Unable to load realm info from SCDynamicStore > ls: `/Users/rhc/yarnrun': No such file or directory > > I've been unable to track down the "realm" warnings, so I don't know if that is pertinent or not. It appears the files are not getting copied across, though the location looks okay to my eyes. > > > On Apr 9, 2012, at 1:27 PM, Kihwal Lee wrote: > >> It looks like the home directory does not exist but the copy went through. >> Can you try to LOG the key fields in destStatus including path? It might be ending up in an unexpected place. >> >> Kihwal >> >> >> >> On 4/9/12 12:45 PM, "Ralph Castain" <[EMAIL PROTECTED]> wrote: >> >> Hi Bobby >> >> On Apr 9, 2012, at 11:40 AM, Robert Evans wrote: >> >>> What do you mean by relocated some supporting files to HDFS? How do you relocate them? What API do you use? >> >> I use the LocalResource and FileSystem classes to do the relocation, per the Hadoop example: >> >> // set local resources for the application master >> // local files or archives as needed >> // In this scenario, the jar file for the application master is part of the local resources >> Map<String, LocalResource> localResources = new HashMap<String, LocalResource>(); >> >> LOG.info("Copy openmpi tarball from local filesystem and add to local environment"); >> // Copy the application master jar to the filesystem >> // Create a local resource to point to the destination jar path >> FileSystem fs; >> FileStatus destStatus; >> try { >> fs = FileSystem.get(conf); >> Path src = new Path(pathOMPItarball); >> String pathSuffix = appName + "/" + appId.getId(); >> Path dst = new Path(fs.getHomeDirectory(), pathSuffix); >> try { >> fs.copyFromLocalFile(false, true, src, dst); >> try { >> destStatus = fs.getFileStatus(dst); >> LocalResource amJarRsrc = Records.newRecord(LocalResource.class); >> >> // Set the type of resource - file or archive >> // archives are untarred at destination >> amJarRsrc.setType(LocalResourceType.ARCHIVE); >> // Set visibility of the resource >> // Setting to most private option >> amJarRsrc.setVisibility(LocalResourceVisibility.APPLICATION); >> // Set the resource to be copied over >> amJarRsrc.setResource(ConverterUtils.getYarnUrlFromPath(dst)); >> // Set timestamp and length of file so that the framework >> // can do basic sanity checks for the local resource >> // after it has been copied over to ensure it is the same >> // resource the client intended to use with the application >> amJarRsrc.setTimestamp(destStatus.getModificationTime()); +
Ralph Castain 2012-04-09, 21:50
-
Re: Help with errorRalph Castain 2012-04-09, 21:51
On Apr 9, 2012, at 3:50 PM, Ralph Castain wrote: > > On Apr 9, 2012, at 2:45 PM, Kihwal Lee wrote: > >> The path, "file:/Users/rhc/yarnrun/13", indicates that your copy operation's destination was the local file system, instead of hdfs. > > Yeah, I realized that too after I sent the note. Sure enough - the files are there. Quick correction: the path exists, but as a file instead of a directory, and therefore the files to be moved there don't exist. > >> What is the value of "fs.default.name" set to in core-site.xml? > > <configuration> > <property> > <name>fs.default.name</name> > <value>hdfs://localhost:9000</value> > </property> > </configuration> > > >> >> Kihwal >> >> >> On 4/9/12 3:26 PM, "Ralph Castain" <[EMAIL PROTECTED]> wrote: >> >> Finally managed to chase down the 0.23 API docs and get the FileStatus definition. No real joy here - I output the path and got: >> >> code: LOG.info("destination path " + destStatus.getPath()); >> >> 2012-04-09 14:22:48,359 INFO Hamster (Hamster.java:getApplication(265)) - destination path file:/Users/rhc/yarnrun/13 >> >> However, when I attempt to list it: >> >> Ralphs-iMac:bin rhc$ ./hdfs dfs -ls /Users/rhc/yarnrun >> 2012-04-09 14:22:57.640 java[14292:1903] Unable to load realm info from SCDynamicStore >> 2012-04-09 14:22:57.686 java[14292:1903] Unable to load realm info from SCDynamicStore >> ls: `/Users/rhc/yarnrun': No such file or directory >> >> I've been unable to track down the "realm" warnings, so I don't know if that is pertinent or not. It appears the files are not getting copied across, though the location looks okay to my eyes. >> >> >> On Apr 9, 2012, at 1:27 PM, Kihwal Lee wrote: >> >>> It looks like the home directory does not exist but the copy went through. >>> Can you try to LOG the key fields in destStatus including path? It might be ending up in an unexpected place. >>> >>> Kihwal >>> >>> >>> >>> On 4/9/12 12:45 PM, "Ralph Castain" <[EMAIL PROTECTED]> wrote: >>> >>> Hi Bobby >>> >>> On Apr 9, 2012, at 11:40 AM, Robert Evans wrote: >>> >>>> What do you mean by relocated some supporting files to HDFS? How do you relocate them? What API do you use? >>> >>> I use the LocalResource and FileSystem classes to do the relocation, per the Hadoop example: >>> >>> // set local resources for the application master >>> // local files or archives as needed >>> // In this scenario, the jar file for the application master is part of the local resources >>> Map<String, LocalResource> localResources = new HashMap<String, LocalResource>(); >>> >>> LOG.info("Copy openmpi tarball from local filesystem and add to local environment"); >>> // Copy the application master jar to the filesystem >>> // Create a local resource to point to the destination jar path >>> FileSystem fs; >>> FileStatus destStatus; >>> try { >>> fs = FileSystem.get(conf); >>> Path src = new Path(pathOMPItarball); >>> String pathSuffix = appName + "/" + appId.getId(); >>> Path dst = new Path(fs.getHomeDirectory(), pathSuffix); >>> try { >>> fs.copyFromLocalFile(false, true, src, dst); >>> try { >>> destStatus = fs.getFileStatus(dst); >>> LocalResource amJarRsrc = Records.newRecord(LocalResource.class); >>> >>> // Set the type of resource - file or archive >>> // archives are untarred at destination >>> amJarRsrc.setType(LocalResourceType.ARCHIVE); >>> // Set visibility of the resource >>> // Setting to most private option >>> amJarRsrc.setVisibility(LocalResourceVisibility.APPLICATION); >>> // Set the resource to be copied over >>> amJarRsrc.setResource(ConverterUtils.getYarnUrlFromPath(dst)); >>> // Set timestamp and length of file so that the framework >>> // can do basic sanity checks for the local resource +
Ralph Castain 2012-04-09, 21:51
-
Re: Help with errorDaryn Sharp 2012-04-10, 13:03
The copyFromLocalFile method has a void return, but internally is calling FileUtil methods that may return a boolean for success. False appears to be returned if the source file cannot be deleted, or if the dest directory cannot be created. The boolean result is ignored by copyFromLocalFile leading the caller to believe the copy was successful.
I'm not sure if this bug is aggravating your situation, so I'd try to manually create the dest dir and remove the src file. Daryn On Apr 9, 2012, at 4:51 PM, Ralph Castain wrote: > > On Apr 9, 2012, at 3:50 PM, Ralph Castain wrote: > >> >> On Apr 9, 2012, at 2:45 PM, Kihwal Lee wrote: >> >>> The path, "file:/Users/rhc/yarnrun/13", indicates that your copy operation's destination was the local file system, instead of hdfs. >> >> Yeah, I realized that too after I sent the note. Sure enough - the files are there. > > > Quick correction: the path exists, but as a file instead of a directory, and therefore the files to be moved there don't exist. > > >> >>> What is the value of "fs.default.name" set to in core-site.xml? >> >> <configuration> >> <property> >> <name>fs.default.name</name> >> <value>hdfs://localhost:9000</value> >> </property> >> </configuration> >> >> >>> >>> Kihwal >>> >>> >>> On 4/9/12 3:26 PM, "Ralph Castain" <[EMAIL PROTECTED]> wrote: >>> >>> Finally managed to chase down the 0.23 API docs and get the FileStatus definition. No real joy here - I output the path and got: >>> >>> code: LOG.info("destination path " + destStatus.getPath()); >>> >>> 2012-04-09 14:22:48,359 INFO Hamster (Hamster.java:getApplication(265)) - destination path file:/Users/rhc/yarnrun/13 >>> >>> However, when I attempt to list it: >>> >>> Ralphs-iMac:bin rhc$ ./hdfs dfs -ls /Users/rhc/yarnrun >>> 2012-04-09 14:22:57.640 java[14292:1903] Unable to load realm info from SCDynamicStore >>> 2012-04-09 14:22:57.686 java[14292:1903] Unable to load realm info from SCDynamicStore >>> ls: `/Users/rhc/yarnrun': No such file or directory >>> >>> I've been unable to track down the "realm" warnings, so I don't know if that is pertinent or not. It appears the files are not getting copied across, though the location looks okay to my eyes. >>> >>> >>> On Apr 9, 2012, at 1:27 PM, Kihwal Lee wrote: >>> >>>> It looks like the home directory does not exist but the copy went through. >>>> Can you try to LOG the key fields in destStatus including path? It might be ending up in an unexpected place. >>>> >>>> Kihwal >>>> >>>> >>>> >>>> On 4/9/12 12:45 PM, "Ralph Castain" <[EMAIL PROTECTED]> wrote: >>>> >>>> Hi Bobby >>>> >>>> On Apr 9, 2012, at 11:40 AM, Robert Evans wrote: >>>> >>>>> What do you mean by relocated some supporting files to HDFS? How do you relocate them? What API do you use? >>>> >>>> I use the LocalResource and FileSystem classes to do the relocation, per the Hadoop example: >>>> >>>> // set local resources for the application master >>>> // local files or archives as needed >>>> // In this scenario, the jar file for the application master is part of the local resources >>>> Map<String, LocalResource> localResources = new HashMap<String, LocalResource>(); >>>> >>>> LOG.info("Copy openmpi tarball from local filesystem and add to local environment"); >>>> // Copy the application master jar to the filesystem >>>> // Create a local resource to point to the destination jar path >>>> FileSystem fs; >>>> FileStatus destStatus; >>>> try { >>>> fs = FileSystem.get(conf); >>>> Path src = new Path(pathOMPItarball); >>>> String pathSuffix = appName + "/" + appId.getId(); >>>> Path dst = new Path(fs.getHomeDirectory(), pathSuffix); >>>> try { >>>> fs.copyFromLocalFile(false, true, src, dst); >>>> try { >>>> destStatus = fs.getFileStatus(dst); >>>> LocalResource amJarRsrc = Records.newRecord(LocalResource.class); >>>> >>>> // Set the type of resource - file or archive +
Daryn Sharp 2012-04-10, 13:03
-
Re: Help with errorMilind.Bhandarkar@... 2012-04-09, 18:01
Ralph,
Do you have a home directory on HDFS ? The job files are moved under your HDFS home directory, in a .staging subdir. In any case, not flagging it as an error, seems to be a genuine yarn bug. - milind --- Milind Bhandarkar Greenplum Labs, EMC (Disclaimer: Opinions expressed in this email are those of the author, and do not necessarily represent the views of any organization, past or present, the author might be affiliated with.) On 4/9/12 10:45 AM, "Ralph Castain" <[EMAIL PROTECTED]> wrote: >Hi Bobby > >On Apr 9, 2012, at 11:40 AM, Robert Evans wrote: > >> What do you mean by relocated some supporting files to HDFS? How do >>you relocate them? What API do you use? > >I use the LocalResource and FileSystem classes to do the relocation, per >the Hadoop example: > > // set local resources for the application master > // local files or archives as needed > // In this scenario, the jar file for the application master is part of >the local resources > Map<String, LocalResource> localResources = new HashMap<String, >LocalResource>(); > > LOG.info("Copy openmpi tarball from local filesystem and add to local >environment"); > // Copy the application master jar to the filesystem > // Create a local resource to point to the destination jar path > FileSystem fs; > FileStatus destStatus; > try { > fs = FileSystem.get(conf); > Path src = new Path(pathOMPItarball); > String pathSuffix = appName + "/" + appId.getId(); > Path dst = new Path(fs.getHomeDirectory(), pathSuffix); > try { > fs.copyFromLocalFile(false, true, src, dst); > try { > destStatus = fs.getFileStatus(dst); > LocalResource amJarRsrc = Records.newRecord(LocalResource.class); > > // Set the type of resource - file or archive > // archives are untarred at destination > amJarRsrc.setType(LocalResourceType.ARCHIVE); > // Set visibility of the resource > // Setting to most private option > amJarRsrc.setVisibility(LocalResourceVisibility.APPLICATION); > // Set the resource to be copied over > amJarRsrc.setResource(ConverterUtils.getYarnUrlFromPath(dst)); > // Set timestamp and length of file so that the framework > // can do basic sanity checks for the local resource > // after it has been copied over to ensure it is the same > // resource the client intended to use with the application > amJarRsrc.setTimestamp(destStatus.getModificationTime()); > amJarRsrc.setSize(destStatus.getLen()); > localResources.put("openmpi", amJarRsrc); > } catch (Throwable t) { > LOG.fatal("Error on file status", t); > System.exit(1); > } > } catch (Throwable t) { > LOG.fatal("Error on copy from local file", t); > System.exit(1); > } > } catch (Throwable t) { > LOG.fatal("Error getting filesystem configuration", t); > System.exit(1); > } > >Note that this appears to work fine when the local resource type was >"file" - at least, I was able to make a simple program work that way. >Problem I'm having is when I move an archive, which is why I was hoping >to look at the HDFS end to see what files are present, and in what >locations so I can set the paths accordingly. > >Thanks >Ralph > > >> >> --Bobby Evans >> >> >> On 4/9/12 11:10 AM, "Ralph Castain" <[EMAIL PROTECTED]> wrote: >> >> Hi folks >> >> I'm trying to develop an AM for the 0.23 branch and running into a >>problem that I'm having difficulty debugging. My client relocates some >>supporting files to HDFS, creates the application object for the AM, and >>submits it to the RM. >> >> The file relocation request doesn't generate an error, so I must assume >>it succeeded. It would be nice if there was some obvious way to verify >>that, but I haven't discovered it. Can anyone give me a hint? I tried >>asking hdfs to -ls, but all I get is that "." doesn't exist. I have no >>idea where the file would be placed, if it would persist once the job >>fails, etc. >> >> When the job is submitted, all I get is an "Error 500", which tells me +
Milind.Bhandarkar@... 2012-04-09, 18:01
|