Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Hadoop, mail # user - HDFS and distcp issue??


Copy link to this message
-
Re: HDFS and distcp issue??
Dmitriy Ryaboy 2010-12-07, 02:48
Do you have the failing task's log?

-Dmitriy

On Sat, Dec 4, 2010 at 12:47 PM, hadoopman <[EMAIL PROTECTED]> wrote:

> I've run into an interesting problem with syncing a couple of clusters
> using distcp.  We've validated that it works to a local installation from
> our remote cluster.  I suspect our firewalls 'may' be responsible for the
> problem we're experiencing.  We're using ports 9000, 9001 and 50010.I've
> verified all three ports are available to the namenodes and datanodes in
> both directions.  Is there something else we're missing?
>
> Looks like it get's to 80% before it fails.  Here's what we're seeing.
>
> # user@hnn1:~$ hadoop distcp hdfs://hnn1:9000/user/testing
> hdfs://hnn2:9000/user
>
> 10/12/03 15:58:10 INFO tools.DistCp:
> srcPaths=[hdfs://hnn1:9000/user/testing]
>
> 10/12/03 15:58:10 INFO tools.DistCp: destPath=hdfs://hnn2:9000/user
>
> 10/12/03 15:58:11 INFO tools.DistCp: srcCount=6
>
> 10/12/03 15:58:11 INFO mapred.JobClient: Running job: job_201011221457_0019
>
> 10/12/03 15:58:12 INFO mapred.JobClient:  map 0% reduce 0%
>
>  10/12/03 15:58:36 INFO mapred.JobClient:  map 19% reduce 0%
>
> 10/12/03 15:58:45 INFO mapred.JobClient:  map 39% reduce 0%
>
> 10/12/03 15:59:03 INFO mapred.JobClient:  map 60% reduce 0%
>
> 10/12/03 15:59:12 INFO mapred.JobClient:  map 80% reduce 0%
>
> 10/12/03 15:59:32 INFO mapred.JobClient: Task Id :
> attempt_201011221457_0019_m_000000_0, Status : FAILED
>
> java.io.IOException: Copied: 0 Skipped: 0 Failed: 5
>
>        at
> org.apache.hadoop.tools.DistCp$CopyFilesMapper.close(DistCp.java:572)
>
>        at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:57)
>
>        at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:358)
>
>        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307)
>
>        at org.apache.hadoop.mapred.Child.main(Child.java:170)
>
> 10/12/03 15:59:33 INFO mapred.JobClient:  map 0% reduce 0%
>
> 10/12/03 15:59:55 INFO mapred.JobClient:  map 19% reduce 0%
>
> 10/12/03 16:00:04 INFO mapred.JobClient:  map 39% reduce 0%
>
> 10/12/03 16:00:22 INFO mapred.JobClient:  map 60% reduce 0%
>
> 10/12/03 16:00:31 INFO mapred.JobClient:  map 80% reduce 0%
>
> 10/12/03 16:00:51 INFO mapred.JobClient: Task Id :
> attempt_201011221457_0019_m_000000_1, Status : FAILED
>
> java.io.IOException: Copied: 0 Skipped: 0 Failed: 5
>
>        at
> org.apache.hadoop.tools.DistCp$CopyFilesMapper.close(DistCp.java:572)
>
>        at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:57)
>
>        at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:358)
>
>        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307)
>
>        at org.apache.hadoop.mapred.Child.main(Child.java:170)
>
> Thanks!
>

--
Dmitriy V Ryaboy
Twitter Analytics
http://twitter.com/squarecog