RE: Hive QL - NOT IN, NOT EXIST
Peter Chu 2013-05-06, 03:46
It works, but it takes a very long time because the subquery in NOT IN contains 400 million rows (the message table in the example) and the feed table contains 3 million rows.
SELECT uuid FROM feed f WHERE f.uuid NOT IN (SELECT uuid FROM message);
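
For reference, here is a minimal sketch of the two forms discussed in this thread: the NOT IN query and the left outer join that keeps only the rows where the joined side is null. It runs against an in-memory SQLite database rather than Hive (an assumption made only so the example is self-contained), and the three-row tables are made up for illustration. It also assumes uuid is never NULL in either table. If message.uuid can be NULL, NOT IN returns no rows at all, while the join form still returns the unmatched feed rows.

```python
import sqlite3

# SQLite stands in for Hive here (illustration only); the tiny tables
# are hypothetical stand-ins for the feed and message tables.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE feed (uuid TEXT NOT NULL);
    CREATE TABLE message (uuid TEXT NOT NULL);
    INSERT INTO feed VALUES ('a'), ('b'), ('c');
    INSERT INTO message VALUES ('b'), ('b'), ('d');
""")

# The query from the message above.
NOT_IN = """
    SELECT f.uuid FROM feed f
    WHERE f.uuid NOT IN (SELECT uuid FROM message)
    ORDER BY f.uuid
"""

# The left outer join form: feed rows with no match in message
# come back with m.uuid set to NULL, and only those are kept.
ANTI_JOIN = """
    SELECT f.uuid FROM feed f
    LEFT OUTER JOIN message m ON f.uuid = m.uuid
    WHERE m.uuid IS NULL
    ORDER BY f.uuid
"""

not_in_rows = [row[0] for row in conn.execute(NOT_IN)]
anti_join_rows = [row[0] for row in conn.execute(ANTI_JOIN)]
print(not_in_rows, anti_join_rows)
```

Both queries return the feed uuids that never appear in message. Duplicate uuids in message do not produce duplicate output rows in the join form, because matched rows are filtered out by the IS NULL test.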
> Date: Sun, 5 May 2013 20:25:15 -0700
> From: [EMAIL PROTECTED]
> Subject: Re: Hive QL - NOT IN, NOT EXIST
> To: [EMAIL PROTECTED]
> --- On Sun, 5/5/13, Peter Chu <[EMAIL PROTECTED]> wrote:
> > I am wondering if there is any way to do this without resorting to
> > using left outer join and finding nulls.
> I have found this to be an acceptable substitute. Is it not working for you?