-Sorting/Partitioning of Pig output
Mark 2013-03-27, 18:46
I understand in the traditional map/reduce paradigm that each key will get sent to the same reducer sorted but in pig there is no such thing as a "key". I'm curious to know how pig knows to which reducer to send its output to?
So when creating a custom StoreFunc is there any guarentee on the ordering of Tuples that come into putNext?
And another even more basic question. Do StoreFuncs operate at the Map phase or Reduce phase?