Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Plain View
Pig >> mail # user >> removing dupes from a bag while saving first occurrence


Copy link to this message
-
removing dupes from a bag while saving first occurrence
If I have a bag and would like to remove dupes, while saving the first
occurrence, is this possible?

For example, for the following bag:

(group_1,{(2012-12-15,a),(2012-12-17,a),(2012-12-23,c)})

I would like my result to be the following:

(group_1,{(2012-12-15,a),(2012-12-23,c)})
+
Norbert Burger 2013-03-08, 22:10
+
Chan, Tim 2013-03-08, 23:12
+
Panshul Whisper 2013-03-08, 23:18
+
Panshul Whisper 2013-03-08, 23:21