MapReduce, mail # dev - Calculations of the InputSplits


Calculations of the InputSplits
Praveen Sripati 2011-09-25, 16:42
Hi,

There was a question on Stack Overflow about high CPU usage on the client after
submitting jobs (up to 200 jobs in a batch, with a 150 MB jar file).
Calculating the InputSplits may be one of the reasons for the high CPU usage on
the client. Why does the InputSplit calculation have to happen on the
client? The JobTracker is a high-end machine; can't the calculation happen on
the JobTracker?

http://stackoverflow.com/questions/7546064/hadoop-high-cpu-load-on-client-side-after-committing-jobs

Thanks,
Praveen