Hadoop >> mail # user >> HDFS and distcp issue??


hadoopman 2010-12-04, 20:47
Re: HDFS and distcp issue??
Do you have the failing task's log?

-Dmitriy

On Sat, Dec 4, 2010 at 12:47 PM, hadoopman <[EMAIL PROTECTED]> wrote:

> I've run into an interesting problem with syncing a couple of clusters
> using distcp.  We've validated that it works to a local installation from
> our remote cluster.  I suspect our firewalls 'may' be responsible for the
> problem we're experiencing.  We're using ports 9000, 9001 and 50010.  I've
> verified all three ports are available to the namenodes and datanodes in
> both directions.  Is there something else we're missing?
>
> Looks like it gets to 80% before it fails.  Here's what we're seeing.
>
> # user@hnn1:~$ hadoop distcp hdfs://hnn1:9000/user/testing hdfs://hnn2:9000/user
>
> 10/12/03 15:58:10 INFO tools.DistCp: srcPaths=[hdfs://hnn1:9000/user/testing]
> 10/12/03 15:58:10 INFO tools.DistCp: destPath=hdfs://hnn2:9000/user
> 10/12/03 15:58:11 INFO tools.DistCp: srcCount=6
> 10/12/03 15:58:11 INFO mapred.JobClient: Running job: job_201011221457_0019
> 10/12/03 15:58:12 INFO mapred.JobClient:  map 0% reduce 0%
> 10/12/03 15:58:36 INFO mapred.JobClient:  map 19% reduce 0%
> 10/12/03 15:58:45 INFO mapred.JobClient:  map 39% reduce 0%
> 10/12/03 15:59:03 INFO mapred.JobClient:  map 60% reduce 0%
> 10/12/03 15:59:12 INFO mapred.JobClient:  map 80% reduce 0%
> 10/12/03 15:59:32 INFO mapred.JobClient: Task Id : attempt_201011221457_0019_m_000000_0, Status : FAILED
> java.io.IOException: Copied: 0 Skipped: 0 Failed: 5
>        at org.apache.hadoop.tools.DistCp$CopyFilesMapper.close(DistCp.java:572)
>        at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:57)
>        at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:358)
>        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307)
>        at org.apache.hadoop.mapred.Child.main(Child.java:170)
> 10/12/03 15:59:33 INFO mapred.JobClient:  map 0% reduce 0%
> 10/12/03 15:59:55 INFO mapred.JobClient:  map 19% reduce 0%
> 10/12/03 16:00:04 INFO mapred.JobClient:  map 39% reduce 0%
> 10/12/03 16:00:22 INFO mapred.JobClient:  map 60% reduce 0%
> 10/12/03 16:00:31 INFO mapred.JobClient:  map 80% reduce 0%
> 10/12/03 16:00:51 INFO mapred.JobClient: Task Id : attempt_201011221457_0019_m_000000_1, Status : FAILED
> java.io.IOException: Copied: 0 Skipped: 0 Failed: 5
>        at org.apache.hadoop.tools.DistCp$CopyFilesMapper.close(DistCp.java:572)
>        at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:57)
>        at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:358)
>        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307)
>        at org.apache.hadoop.mapred.Child.main(Child.java:170)
>
> Thanks!
>
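One way to double-check the firewall theory from the quoted message is a plain TCP connect test, run from the nodes that execute the map tasks, since the trace shows the copy failing inside the mapper (CopyFilesMapper) rather than on the submitting client. The sketch below is only an illustration under assumptions: check_ports and unreachable are hypothetical helper names, the ports are the three listed in the message, and a successful connect shows only that the port accepts connections, not that HDFS traffic actually gets through.

```python
import socket


def check_ports(hosts, ports, timeout=3.0):
    """Try a plain TCP connect to every (host, port) pair.

    Returns a dict mapping (host, port) to True if the connect succeeded
    and False on any socket error (refused, timed out, unresolvable name).
    """
    results = {}
    for host in hosts:
        for port in ports:
            try:
                with socket.create_connection((host, port), timeout=timeout):
                    results[(host, port)] = True
            except OSError:
                results[(host, port)] = False
    return results


def unreachable(results):
    """Return the sorted (host, port) pairs that could not be reached."""
    return sorted(pair for pair, ok in results.items() if not ok)
```

A call such as unreachable(check_ports(["hnn1", "hnn2"], [9000, 9001, 50010])) covers only the two namenode hosts named in the message; the datanode hostnames are not given in the thread and would have to be added to the host list for the 50010 check to be meaningful.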

--
Dmitriy V Ryaboy
Twitter Analytics
http://twitter.com/squarecog
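
For longer runs, the console output quoted above can be condensed into one record per failed attempt. The following is a minimal sketch, not part of Hadoop: summarize_attempts is a hypothetical helper, and the two patterns are written only against the "Task Id : ..., Status : ..." and "Copied: N Skipped: N Failed: N" lines as they appear in this message. It strips the leading ">" quote markers and rejoins wrapped lines before matching.

```python
import re

# Matches either a task-attempt status line or a DistCp counter line.
TOKEN_RE = re.compile(
    r"Task Id :\s*(?P<attempt>attempt_\w+),\s*Status :\s*(?P<status>\w+)"
    r"|Copied:\s*(?P<copied>\d+)\s+Skipped:\s*(?P<skipped>\d+)"
    r"\s+Failed:\s*(?P<failed>\d+)"
)


def summarize_attempts(log_text):
    """Return one dict per task attempt found in the JobClient output.

    Each dict has the attempt id and status; if a counter line follows
    the attempt, its copied/skipped/failed numbers are attached too.
    """
    # Drop email quote markers and blank lines, then flatten so that
    # entries wrapped across two lines match as a single string.
    lines = (re.sub(r"^\s*>\s?", "", ln).strip() for ln in log_text.splitlines())
    flat = " ".join(ln for ln in lines if ln)

    attempts = []
    for m in TOKEN_RE.finditer(flat):
        if m.group("attempt"):
            attempts.append(
                {"attempt": m.group("attempt"), "status": m.group("status")}
            )
        elif attempts and "copied" not in attempts[-1]:
            # Attach the first counter line seen after the attempt.
            attempts[-1].update(
                copied=int(m.group("copied")),
                skipped=int(m.group("skipped")),
                failed=int(m.group("failed")),
            )
    return attempts
```

The counters only say how many files failed, not why, so the per-file cause still has to come from the failing task's own log, which is what the reply asks for.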
hadoopman 2010-12-07, 03:19