Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Plain View
Pig >> mail # user >> DISTINCT and paritioner

Copy link to this message
DISTINCT and paritioner
The docs say DISTINCT can take a custom partitioner.  How does that work?
 What is "K" and "V"?
I'm having some doubts the docs are correct.  I wrote a test partitioner
that does a System.out of K and V.  I then wrote simple scripts to do JOIN,
GROUP and DISTINCT.  For JOIN and GROUP I see my system.outs(*).  For
DISTINCT, I see nothing....

Using 0.11.1.

William Oberman 2013-07-17, 18:30
Alan Gates 2013-07-18, 15:28
William Oberman 2013-07-19, 13:24