Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Hive >> mail # user >> Hive QL - NOT IN, NOT EXIST


Copy link to this message
-
RE: Hive QL - NOT IN, NOT EXIST
It works but it takes a very long time because the subqueries in NOT IN contains 400 million rows (the message table in the example) and the feed table contains 3 million rows.
SELECT uuid from feed f WHERE f.uuid NOT IN (SELECT uuid FROM message);
> Date: Sun, 5 May 2013 20:25:15 -0700
> From: [EMAIL PROTECTED]
> Subject: Re: Hive QL - NOT IN, NOT EXIST
> To: [EMAIL PROTECTED]
>
>
> --- On Sun, 5/5/13, Peter Chu <[EMAIL PROTECTED]> wrote:
>
> > I am wondering if there is any way to do this without resorting to
> > using left outer join and finding nulls.
>
> I have found this to be an acceptable substitute.  Is it not working for you?
>