Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig >> mail # user >> Internal error 2999 - misuse of CONCAT? misuse of GROUP?


Copy link to this message
-
Re: Internal error 2999 - misuse of CONCAT? misuse of GROUP?
In the relation TCRaw, there is no column called SrcFuid.
As a result, you end up using this feature -
http://pig.apache.org/docs/r0.8.0/piglatin_ref2.html#Casting+Relations+to+Sc
alars .
Change your statement to -
 TCGroupedByFuid = group TCRaw by (citingdocid,
                                          col,
                                         seq);

Thanks,
Thejas

On 4/6/11 11:09 AM, "[EMAIL PROTECTED]"
<[EMAIL PROTECTED]> wrote:

>
>
>> Do you need the group-key to be concatenated ? If not, you can just group on
>> all the three columns -
>
>> TCGroupedByFuid = group TCRaw by (SrcFuid.citingdocid,
>                                          SrcFuid.col,
>                                         SrcFuid.seq);
>
> Hi Thejas,
>
> I had tried that originally before introducing CONCAT(), but I got this error
> message:
>
> ERROR 0: Scalar has more than one row in the output.
>  1st : (14159274,BCI,6), 2nd :(45937168,BCI,17)
>
> I don't understand that, since TCRaw is
>
> (14159274,BCI,6,14159274,14159163,BCI,5,1999,BCI.BCI)
> (14159274,BCI,6,14159274,14159163,WOS,11,1999,WOS.SCI)
> (14159274,WOS,16,14159274,14159163,BCI,5,1999,BCI.BCI)
> (14159274,WOS,16,14159274,14159163,WOS,11,1999,WOS.SCI)
>
> and the 2nd tuple is not a (projection of any) member of TCRaw (though it is a
> member of SrcFuid). So I think my understanding of GROUP is incorrect.
>
> Thanks for your help!
>
> Will
>
>