Take a look at CombineFileInputFormat - this will create 'meta splits' which include multiple small spilts, thus reducing #maps which are run.
On Jul 11, 2012, at 5:29 AM, Manoj Babu wrote:
> The no of mappers is depends on the no of blocks. Is it possible to limit the no of mappers size without increasing the HDFS block size?
> Thanks in advance.
Arun C. Murthy