James Newhaven 2013-01-22, 12:46
Bill Graham 2013-01-22, 15:53
-Re: Subtracting contents of two bags
Timothy Potter 2013-01-22, 16:36
Bill's suggestion is good, but here is another approach that I think is
cleaner to read:
find_not_in_b = cogroup A by key OUTER, B by key;
not_in_b = foreach (filter find_not_in_b by IsEmpty(B)) generate flatten(A);
On Tue, Jan 22, 2013 at 8:53 AM, Bill Graham <[EMAIL PROTECTED]> wrote:
> You can do an left outer join of A and B and then filter by B is null.
> On Tue, Jan 22, 2013 at 4:46 AM, James Newhaven <[EMAIL PROTECTED]
> > Hi,
> > I have two relations - A and B. Both just contain user ids.
> > I want to get a list of users who are in A but not in B.
> > I am running Pig 0.9.1 and think this might be possible with the DIFF
> > function. I can see that DIFF requires one relation that contains the two
> > bags.
> > How can I create a relation that contains two bags so it can be supplied
> > the DIFF function?
> > Any suggestions would be appreciated.
> > Thanks,
> > James
> *Note that I'm no longer using my Yahoo! email address. Please email me at
> [EMAIL PROTECTED] going forward.*