Hive >> mail # dev >> Sorting algorithm


Re: Sorting algorithm
On Fri, Mar 16, 2012 at 6:05 PM, indrani gorti <[EMAIL PROTECTED]> wrote:

> Hi
>
> Which sorting algorithm does map-reduce use to sort the data set in
> the shuffle stage, i.e. after the map phase for each split of the
> entire dataset?
Take a look at Chris Douglas' presentation on the sort.

Slides: http://www.slideshare.net/hadoopusergroup/ordered-record-collection
Video: http://developer.yahoo.com/blogs/hadoop/posts/2010/01/hadoop_bay_area_january_2010_u/
The original in-memory sort is a quicksort. After that, the sorted runs are
combined with a merge sort.
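
To illustrate the two phases, here is a minimal Python sketch. It is not
Hadoop's actual code, and the names quicksort, sort_in_runs and run_size are
made up for this example. It assumes the records are (key, value) pairs,
sorts fixed-size chunks in memory with a simple quicksort, then merges the
sorted chunks by key.

```python
import heapq


def quicksort(records):
    """Sort a list of (key, value) records by key, in memory."""
    if len(records) <= 1:
        return list(records)
    pivot = records[len(records) // 2][0]
    less = [r for r in records if r[0] < pivot]
    equal = [r for r in records if r[0] == pivot]
    greater = [r for r in records if r[0] > pivot]
    return quicksort(less) + equal + quicksort(greater)


def sort_in_runs(records, run_size):
    """Quicksort each chunk of run_size records, then merge the sorted chunks.

    run_size is a hypothetical stand-in for a bounded in-memory buffer;
    it must be a positive integer.
    """
    runs = [
        quicksort(records[i:i + run_size])
        for i in range(0, len(records), run_size)
    ]
    # heapq.merge performs a k-way merge over already-sorted inputs.
    return list(heapq.merge(*runs, key=lambda r: r[0]))
```

For example, sort_in_runs([("d", 1), ("a", 2), ("c", 3), ("b", 4)], 2) sorts
two runs of two records each and then merges them into key order.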

-- Owen