Hadoop >> mail # user >> HDFS and distcp issue??


HDFS and distcp issue??
I've run into an interesting problem syncing a couple of clusters
using distcp.  We've validated that distcp works from our remote cluster
to a local installation.  I suspect our firewalls may be responsible
for the problem we're experiencing.  We're using ports 9000, 9001 and
50010.  I've verified that all three ports are open between the namenodes
and datanodes in both directions.  Is there something else we're missing?

It looks like the job gets to 80% before it fails.  Here's what we're seeing.
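One way to double-check the port claim is to test it from the nodes that actually run the copy. The distcp map tasks run on the worker nodes and open connections to the other cluster's namenode (9000 here) and directly to its datanodes (50010 here), so a check made only from the namenodes can miss a blocked path. The following is a minimal sketch, not part of Hadoop: the function names `check_port`, `check_targets`, and `parse_target` are made up for illustration, and the host list is whatever you pass in.

```python
import socket


def check_port(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and DNS lookup failures.
        return False


def check_targets(targets, timeout=3.0):
    """Map each (host, port) pair to True/False reachability."""
    return {(host, port): check_port(host, port, timeout)
            for host, port in targets}


def parse_target(text):
    """Split a 'host:port' string into (host, int(port))."""
    host, _, port = text.rpartition(":")
    return host, int(port)
```

Run on each worker node of the source cluster, something like `check_targets([parse_target("hnn2:9000"), parse_target("<remote-datanode>:50010")])` would show which paths are blocked; `<remote-datanode>` is a placeholder for each datanode hostname in the destination cluster. A successful TCP connect only shows that the firewall lets the connection through, not that the service behind it is healthy.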

# user@hnn1:~$ hadoop distcp hdfs://hnn1:9000/user/testing hdfs://hnn2:9000/user

10/12/03 15:58:10 INFO tools.DistCp: srcPaths=[hdfs://hnn1:9000/user/testing]
10/12/03 15:58:10 INFO tools.DistCp: destPath=hdfs://hnn2:9000/user
10/12/03 15:58:11 INFO tools.DistCp: srcCount=6
10/12/03 15:58:11 INFO mapred.JobClient: Running job: job_201011221457_0019
10/12/03 15:58:12 INFO mapred.JobClient:  map 0% reduce 0%
10/12/03 15:58:36 INFO mapred.JobClient:  map 19% reduce 0%
10/12/03 15:58:45 INFO mapred.JobClient:  map 39% reduce 0%
10/12/03 15:59:03 INFO mapred.JobClient:  map 60% reduce 0%
10/12/03 15:59:12 INFO mapred.JobClient:  map 80% reduce 0%
10/12/03 15:59:32 INFO mapred.JobClient: Task Id : attempt_201011221457_0019_m_000000_0, Status : FAILED
java.io.IOException: Copied: 0 Skipped: 0 Failed: 5
         at org.apache.hadoop.tools.DistCp$CopyFilesMapper.close(DistCp.java:572)
         at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:57)
         at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:358)
         at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307)
         at org.apache.hadoop.mapred.Child.main(Child.java:170)

10/12/03 15:59:33 INFO mapred.JobClient:  map 0% reduce 0%
10/12/03 15:59:55 INFO mapred.JobClient:  map 19% reduce 0%
10/12/03 16:00:04 INFO mapred.JobClient:  map 39% reduce 0%
10/12/03 16:00:22 INFO mapred.JobClient:  map 60% reduce 0%
10/12/03 16:00:31 INFO mapred.JobClient:  map 80% reduce 0%
10/12/03 16:00:51 INFO mapred.JobClient: Task Id : attempt_201011221457_0019_m_000000_1, Status : FAILED
java.io.IOException: Copied: 0 Skipped: 0 Failed: 5
         at org.apache.hadoop.tools.DistCp$CopyFilesMapper.close(DistCp.java:572)
         at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:57)
         at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:358)
         at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307)
         at org.apache.hadoop.mapred.Child.main(Child.java:170)

Thanks!