I've found that the solution to control the split size per mapper is to
modify the following configurations:
mapred.min.split.size and mapred.max.split.size, but when I set them both
to 14MB with dfs.block.size = 64MB, the splits are still = 64MB.
So, is there a relation between them that I should consider?
Bertrand Dechoux 2012-10-26, 21:23
Mark Olimpiati 2012-10-29, 04:25
Bertrand Dechoux 2012-10-29, 06:15