Each reduce task acts as a DFS client, writing to the specified output
folder. Where the reduce output ends up is decided by the NameNode's block
placement policy. One thing I can confirm: when the reduce task runs on a
slave node inside the cluster that is also a DataNode, the first replica of
each block it writes can be placed on that local DataNode.
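
To make the local-placement point concrete, here is a toy sketch in plain
Java. It is not Hadoop code: PlacementSketch and chooseFirstReplica are
hypothetical names, and it only models the default rule as I understand it
(first replica on the writer's own node when the writer is itself a
DataNode, otherwise some other DataNode). It ignores disk space, load, and
rack awareness, which the real policy takes into account.

```java
import java.util.List;
import java.util.Random;

// Toy model only: not Hadoop code and not the real block placement API.
final class PlacementSketch {

    /**
     * Picks the node that receives the first replica of a block.
     *
     * @param writerHost host where the DFS client (e.g. a reduce task) runs
     * @param dataNodes  hosts that currently run a DataNode
     * @param rng        randomness source for the non-local case
     */
    static String chooseFirstReplica(String writerHost,
                                     List<String> dataNodes,
                                     Random rng) {
        if (dataNodes.isEmpty()) {
            throw new IllegalArgumentException("no DataNodes available");
        }
        // Writer is co-located with a DataNode: keep the first replica local.
        if (dataNodes.contains(writerHost)) {
            return writerHost;
        }
        // Writer is outside the cluster: fall back to an arbitrary DataNode.
        return dataNodes.get(rng.nextInt(dataNodes.size()));
    }
}
```

With a replication factor of 1 the first replica is the only copy, so in
this model the output lands on whichever node ran the reduce task; a writer
outside the cluster gets an arbitrary DataNode instead.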
On Thu, Jun 16, 2011 at 3:21 AM, Pedro Costa <[EMAIL PROTECTED]> wrote:
> I'm running an MR application that produces an output that is saved in
> HDFS. My application has 5 slave nodes (so it also has 5 data nodes).
> The HDFS file replication factor is 1. I want to specify, from my
> application or from the Hadoop MR source code, which data node my result
> should be stored on. For example, if I have datanode1-5, I would like to
> tell the name node to save the reduce output on a specific datanode.
> Is it possible?