-Re: HDFS question
Bryan Beaudreault 2014-01-28, 16:52
Do you have a jobtracker? Without a jobtracker and tasktrackers, distcp is
running in LocalRunner mode. I.E. it is running a single-threaded process
on the local machine. The default behavior of the DFSClient is to write
data locally first, with replicas being placed off-rack then on-rack.
This would explain why everything seems to be going locally, it is also
probably much slower than it could be.
On Tue, Jan 28, 2014 at 11:42 AM, Ognen Duzlevski