Re: Sequence.Sorter Performance
Owen O'Malley 2011-04-25, 18:43
The SequenceFile sorter is ok. It used to be the sort used in the shuffle.
Make sure to set io.sort.factor and io.sort.mb to appropriate values for
your hardware. I usually set io.sort.factor to 25 * the number of drives, and
io.sort.mb to the amount of memory (in MB) you can allocate to the sorting.
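
As a rough illustration of that rule of thumb, here is a minimal sketch in plain Java. The class name SortTuning and its methods are hypothetical helpers, not part of Hadoop; only the property names io.sort.factor and io.sort.mb come from the message above. The memory figure is whatever you decide you can spare, so it is passed in rather than computed.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical helper (not a Hadoop class) that applies the rule of thumb above. */
final class SortTuning {
    // Multiplier suggested in the post: io.sort.factor = 25 * drives.
    static final int FACTOR_PER_DRIVE = 25;

    /** Returns the suggested io.sort.factor for the given number of drives. */
    static int sortFactor(int drives) {
        if (drives < 1) {
            throw new IllegalArgumentException("drives must be >= 1");
        }
        return FACTOR_PER_DRIVE * drives;
    }

    /**
     * Builds the two settings as property name/value pairs.
     * sortMemoryMb is an assumption supplied by the caller: the memory,
     * in megabytes, that can be given to sorting on this machine.
     */
    static Map<String, String> settings(int drives, int sortMemoryMb) {
        if (sortMemoryMb < 1) {
            throw new IllegalArgumentException("sortMemoryMb must be >= 1");
        }
        Map<String, String> props = new LinkedHashMap<>();
        props.put("io.sort.factor", Integer.toString(sortFactor(drives)));
        props.put("io.sort.mb", Integer.toString(sortMemoryMb));
        return props;
    }
}
```

For a node with 4 drives and 200 MB set aside for sorting, SortTuning.settings(4, 200) yields io.sort.factor=100 and io.sort.mb=200. Each pair could then be copied into a Hadoop Configuration with conf.set(name, value) before the SequenceFile.Sorter is constructed, assuming the sorter picks these values up from the configuration it is given.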