We have a cluster of 32 machines and are running a C# version of the
wordcount program on it.
The Map phase is spread across different machines, but the Reduce phase
runs on only one machine. Our input is around 7 GB of text, and with a
single machine handling the Reduce phase the job runs very slowly.
Is there any way to increase the number of reducers?