On Thu, Jan 30, 2014 at 9:29 AM, Umesh Telang <[EMAIL PROTECTED]>wrote:
The recommendation is that you have two data directories per distinct disk.
Right, it has nothing to do with size and everything todo with IO
bandwidth. We could optimize this area (and will) but for now specifying
two data directories per disk is a good workaround.
Doesn't relate to size.
Perfect, great to hear!
Apache MRUnit - Unit testing MapReduce - http://mrunit.apache.org