Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Hive >> mail # dev >> Sorting algorithm


Copy link to this message
-
Re: Sorting algorithm
On Fri, Mar 16, 2012 at 6:05 PM, indrani gorti <[EMAIL PROTECTED]>wrote:

> Hi
>
> Which is the sorting algorith used in map-reduce to sort the data set in
> the shuffling stage i.e after the mapped for each split up of the entire
> dataset.
Take a look at Chris Douglas' presentation on the sort.

Slides: http://www.slideshare.net/hadoopusergroup/ordered-record-collection
Video:
http://developer.yahoo.com/blogs/hadoop/posts/2010/01/hadoop_bay_area_january_2010_u/
The original in memory sort is a quicksort. After that it is a merge sort.

-- Owen