Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
MapReduce >> mail # dev >> Calculations of the InputSplits

Copy link to this message
Calculations of the InputSplits

There was a query in StackOverflow regarding high CPU on the client after
submitting jobs (upto 200 jobs in batch and 150MB jar file size).
Calculation of the InputSplit may be one of the reason for the high CPU on
the client. Why should the calculation of the InputSplit happen on the
client? JobTracker is a high-end machine, can't the calculation happen on
the JobTracker?