Pig >> mail # user >> Subtracting contents of two bags


James Newhaven 2013-01-22, 12:46
Re: Subtracting contents of two bags
You can do a left outer join of A and B and then keep only the rows where B's
user id is null. Those are the users that are in A but have no match in B.

http://pig.apache.org/docs/r0.10.0/basic.html#join-outer
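
Something along these lines should work. This is only a rough sketch: the
input paths ('a_users', 'b_users') and the field name user_id are made up for
illustration, so adjust them to your data. Note that outer joins need a
declared schema on both relations.

```pig
-- Hypothetical inputs: one user id per line (paths and field name are assumptions)
A = LOAD 'a_users' AS (user_id:chararray);
B = LOAD 'b_users' AS (user_id:chararray);

-- Left outer join keeps every row of A; rows with no match in B
-- get null for all of B's fields
joined = JOIN A BY user_id LEFT OUTER, B BY user_id;

-- Keep only the rows that had no match in B
only_in_a = FILTER joined BY B::user_id IS NULL;

-- Project back down to the single user id column from A
result = FOREACH only_in_a GENERATE A::user_id AS user_id;

DUMP result;
```

One caveat: rows in A whose user_id is itself null never match anything in
the join, so they will also come through the filter. Drop them first if that
matters for your data.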

On Tue, Jan 22, 2013 at 4:46 AM, James Newhaven <[EMAIL PROTECTED]> wrote:

> Hi,
>
> I have two relations - A and B.  Both just contain user ids.
>
> I want to get a list of users who are in A but not in B.
>
> I am running Pig 0.9.1 and think this might be possible with the DIFF
> function. I can see that DIFF requires one relation that contains the two
> bags.
>
> How can I create a relation that contains two bags so it can be supplied to
> the DIFF function?
>
> Any suggestions would be appreciated.
>
> Thanks,
> James
>

--
*Note that I'm no longer using my Yahoo! email address. Please email me at
[EMAIL PROTECTED] going forward.*
Timothy Potter 2013-01-22, 16:36