Re: Sorting algorithm
On Fri, Mar 16, 2012 at 6:05 PM, indrani gorti <[EMAIL PROTECTED]> wrote:

> Hi
> Which sorting algorithm does map-reduce use to sort the data set in
> the shuffle stage, i.e. after the mapper has run on each split of the
> entire dataset?
Take a look at Chris Douglas' presentation on the sort.

Slides: http://www.slideshare.net/hadoopusergroup/ordered-record-collection
The initial in-memory sort is a quicksort. After that, the sorted runs are combined with a merge sort.
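
As a rough illustration of that two-phase idea (this is not Hadoop's actual code), here is a minimal Python sketch: quicksort each buffer-sized chunk in memory, then merge the sorted runs. The names quicksort, sort_records and buffer_size are hypothetical, and the in-memory lists only stand in for spill files on disk.

```python
import heapq


def quicksort(buf, lo=0, hi=None):
    """Sort buf[lo..hi] in place, using the middle element as the pivot."""
    if hi is None:
        hi = len(buf) - 1
    if lo >= hi:
        return
    pivot = buf[(lo + hi) // 2]
    i, j = lo, hi
    while i <= j:
        # Skip elements that are already on the correct side of the pivot.
        while buf[i] < pivot:
            i += 1
        while buf[j] > pivot:
            j -= 1
        if i <= j:
            buf[i], buf[j] = buf[j], buf[i]
            i += 1
            j -= 1
    quicksort(buf, lo, j)
    quicksort(buf, i, hi)


def sort_records(records, buffer_size=4):
    """Quicksort buffer-sized chunks, then merge the sorted runs."""
    runs = []
    for start in range(0, len(records), buffer_size):
        run = list(records[start:start + buffer_size])
        quicksort(run)       # phase 1: in-memory sort of one buffer
        runs.append(run)     # assumption: a list stands in for a spill file
    # Phase 2: k-way merge of the already sorted runs.
    return list(heapq.merge(*runs))


if __name__ == "__main__":
    print(sort_records([5, 3, 9, 1, 7, 2, 8]))
```

The merge phase only ever compares the head of each run, which is why it works on data that does not fit in memory at once.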

-- Owen