Good evening Hadoopers!
On the JobTracker page, when I click on a job and then on a running
reduce task, I see:
task_201302271736_0638_r_000000 reduce > copy (136 of 261 at 0.44 MB/s)
I am really curious where the data is being copied.
If I click on the task, it shows the host that is running the task attempt.
My question is: does "reduce > copy" refer to data being copied outbound
from the host that is running the task attempt, or to data being copied
inbound to this host (the one running the task attempt) from other
machines?
And in either case, how do I know which machines that host is copying data from/to?