On May 13, 2011, at 10:48 AM, Todd Lipcon wrote:
>> 2) Any ideas on what is driving the growth in Non DFS Used space? I
>> looked for things like growing log files on the datanodes but didn't find
> Logs are one possible culprit. Another is to look for old files that might
> be orphaned in your mapred.local.dir - there have been bugs in the past
> where we've leaked files. If you shut down the TaskTrackers, you can safely
> delete everything from within mapred.local.dirs.
Part of our S.O.P. during Hadoop bounces is to wipe mapred.local out. The TT doesn't properly clean up after itself.