I have a job with, let us say, 10 mappers running in parallel.
Some of them run fast, but a few take far too long.
For example, a few mappers take 5 to 10 minutes, while others take
around 12 hours or more.
Can a difference in the data handled by each mapper cause such a
variation, or is it a connectivity issue?
Note: The cluster we are using has multiple users running their jobs on it.
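One rough way to tell the two causes apart is to compare, per mapper, how much input it read against how long it ran. The sketch below is only an illustration, not anything Hadoop provides: the function name, the tuple layout, and the slow_factor threshold are all assumptions, and the numbers would have to be copied by hand from the per-task counters (bytes read, elapsed time) in the job UI. A straggler that processed much more data at a normal rate points to data skew; one that processed a normal amount at a very low rate points to a slow or overloaded node or network.

```python
from statistics import median


def classify_slow_mappers(tasks, slow_factor=3.0):
    """Label straggling mappers by likely cause.

    tasks: list of (task_id, input_bytes, runtime_seconds) tuples,
           with runtime_seconds > 0 (hypothetical layout).
    slow_factor: assumed threshold; a task counts as a straggler when it
           runs more than slow_factor times the median runtime.
    Returns a dict mapping each straggler's task_id to
    "data skew" or "slow throughput".
    """
    med_runtime = median(t[2] for t in tasks)
    med_rate = median(t[1] / t[2] for t in tasks)

    verdicts = {}
    for task_id, input_bytes, runtime in tasks:
        if runtime <= slow_factor * med_runtime:
            continue  # not a straggler, nothing to explain
        rate = input_bytes / runtime  # bytes processed per second
        if rate < med_rate / slow_factor:
            # Far slower per byte than its peers: node, disk, or network.
            verdicts[task_id] = "slow throughput"
        else:
            # Normal speed per byte, so it simply had more data to process.
            verdicts[task_id] = "data skew"
    return verdicts
```

Calling classify_slow_mappers with one tuple per mapper returns only the stragglers, each labeled with the more likely cause. On a shared cluster the "slow throughput" label cannot separate a bad node from contention with other users' jobs, so it is a starting point rather than a diagnosis.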
Thanks in advance.
Robert Evans 2012-07-06, 17:00
Phani 2012-07-07, 07:24
Manoj Babu 2012-07-09, 17:57
Karthik Kambatla 2012-07-09, 19:02
Manoj Babu 2012-07-10, 06:57
Karthik Kambatla 2012-07-10, 08:39