Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig >> mail # user >> Internal error 2999 - misuse of CONCAT? misuse of GROUP?


Copy link to this message
-
RE: Internal error 2999 - misuse of CONCAT? misuse of GROUP?

>Do you need the group-key to be concatenated ? If not, you can just group on all the three columns -

>TCGroupedByFuid = group TCRaw by (SrcFuid.citingdocid,
                                         SrcFuid.col,
                                        SrcFuid.seq);

Hi Thejas,

I had tried that originally before introducing CONCAT(), but I got this error message:

ERROR 0: Scalar has more than one row in the output.
 1st : (14159274,BCI,6), 2nd :(45937168,BCI,17)

I don't understand that, since TCRaw is

(14159274,BCI,6,14159274,14159163,BCI,5,1999,BCI.BCI)
(14159274,BCI,6,14159274,14159163,WOS,11,1999,WOS.SCI)
(14159274,WOS,16,14159274,14159163,BCI,5,1999,BCI.BCI)
(14159274,WOS,16,14159274,14159163,WOS,11,1999,WOS.SCI)

and the 2nd tuple is not a (projection of any) member of TCRaw (though it is a member of SrcFuid). So I think my understanding of GROUP is incorrect.

Thanks for your help!

Will