-Re: intermediate results files
Mohammad Tariq 2013-07-01, 23:02
IMHO, it doesn't matter. Your job will write the result just once.
Replica creation is handled at the HDFS layer so it has nothing to with
your job. Your job will still be writing at the same speed.
On Tue, Jul 2, 2013 at 4:16 AM, John Lilley <[EMAIL PROTECTED]>wrote:
> If my reducers are going to create results that are temporary in nature
> (consumed by the next processing stage) is it recommended to use a
> replication factor <3 to improve performance? ****
> ** **