Pig >> mail # user >> Internal error 2999 - misuse of CONCAT? misuse of GROUP?

RE: Internal error 2999 - misuse of CONCAT? misuse of GROUP?
Hi Thejas,

Thanks again for your help. When I omit the SrcFuid "qualifier" and use the form you suggest, I get the error below, which was actually the reason I tried SrcFuid.<field> to start with.

Pig Stack Trace
ERROR 1025: Found more than one match: SrcFuid::citingdocid, NewCitationRel::citingdocid

org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1000: Error during parsing. Found more than one match: SrcFuid::citingdocid, NewCitationRel::citingdocid
at org.apache.pig.PigServer$Graph.parseQuery(PigServer.java:1607)
at org.apache.pig.PigServer$Graph.registerQuery(PigServer.java:1551)
at org.apache.pig.PigServer.registerQuery(PigServer.java:523)
at org.apache.pig.tools.grunt.GruntParser.processPig(GruntParser.java:868)
at org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:388)
at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:165)
at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:141)
at org.apache.pig.tools.grunt.Grunt.exec(Grunt.java:90)
at org.apache.pig.Main.run(Main.java:510)
at org.apache.pig.Main.main(Main.java:107)
Caused by: org.apache.pig.impl.logicalLayer.parser.ParseException: Found more than one match: SrcFuid::citingdocid, NewCitationRel::citingdocid
at org.apache.pig.impl.logicalLayer.parser.QueryParser.AliasFieldOrSpec(QueryParser.java:7418)
at org.apache.pig.impl.logicalLayer.parser.QueryParser.ColOrSpec(QueryParser.java:7226)
at org.apache.pig.impl.logicalLayer.parser.QueryParser.BaseEvalSpec(QueryParser.java:5297)

But the good news is that I combined this suggestion with Shawn's and found that this works:

TCGroupedByFuid = group TCRaw by (SrcFuid::citingdocid, SrcFuid::col, SrcFuid::seq);
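
For readers following along, here is a minimal sketch of why the `::` form resolves. It assumes TCRaw is a join of SrcFuid and NewCitationRel on citingdocid, which is what the "more than one match" message suggests. The load paths, field types, and the citeddocid field are hypothetical; only the alias names and the fields citingdocid, col, and seq come from this thread.

```pig
-- Hypothetical paths and schemas, for illustration only.
SrcFuid        = LOAD 'src_fuid' AS (citingdocid:long, col:chararray, seq:int);
NewCitationRel = LOAD 'new_citation_rel' AS (citingdocid:long, citeddocid:long);

-- After a join, each field keeps its name prefixed with the alias it came from,
-- e.g. SrcFuid::citingdocid and NewCitationRel::citingdocid.
TCRaw = JOIN SrcFuid BY citingdocid, NewCitationRel BY citingdocid;
DESCRIBE TCRaw;

-- A bare citingdocid matches two fields of TCRaw, so it is ambiguous:
-- Bad = GROUP TCRaw BY citingdocid;

-- Qualifying every key with '::' names exactly one field each.
TCGroupedByFuid = GROUP TCRaw BY (SrcFuid::citingdocid, SrcFuid::col, SrcFuid::seq);
```

Fields such as col and seq that occur in only one input could probably be referenced without the prefix, but qualifying all three keys keeps the statement unambiguous if the schemas change.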

Thanks Thejas and Shawn!

William F Dowling
Sr Technical Specialist, Software Engineering
Thomson Reuters
0 +1 215 823 3853
-----Original Message-----
From: Thejas M Nair [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, April 06, 2011 3:31 PM
To: [EMAIL PROTECTED]; Dowling, William (Hlthcr&Science)
Subject: Re: Internal error 2999 - misuse of CONCAT? misuse of GROUP?

In the relation TCRaw, there is no column called SrcFuid.
As a result, you end up using this feature -
alars .
Change your statement to -
 TCGroupedByFuid = group TCRaw by (citingdocid,
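
The "Scalar has more than one row in the output" error quoted below is consistent with Pig reading `SrcFuid.citingdocid` as a projection from a relation cast to a scalar, which is only valid when that relation has exactly one row. A minimal sketch of the intended use of that feature, with hypothetical relation and field names:

```pig
-- Hypothetical input; names are for illustration only.
A = LOAD 'data' AS (x:int);

-- GROUP ... ALL followed by an aggregate yields a single-row relation.
G     = GROUP A ALL;
Total = FOREACH G GENERATE SUM(A.x) AS s;

-- Total.s is used as a scalar here; this works because Total has one row.
Frac = FOREACH A GENERATE x, (double)x / (double)Total.s;
```

If Total held more than one row, the same `Total.s` reference would fail at run time, which matches what happens when SrcFuid (a multi-row relation) is referenced with a dot inside a statement over TCRaw.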


On 4/6/11 11:09 AM, "[EMAIL PROTECTED]"

>> Do you need the group-key to be concatenated? If not, you can just group on
>> all the three columns -
>> TCGroupedByFuid = group TCRaw by (SrcFuid.citingdocid,
>>                                   SrcFuid.col,
>>                                   SrcFuid.seq);
> Hi Thejas,
> I had tried that originally before introducing CONCAT(), but I got this error
> message:
> ERROR 0: Scalar has more than one row in the output.
>  1st : (14159274,BCI,6), 2nd :(45937168,BCI,17)
> I don't understand that, since TCRaw is
> (14159274,BCI,6,14159274,14159163,BCI,5,1999,BCI.BCI)
> (14159274,BCI,6,14159274,14159163,WOS,11,1999,WOS.SCI)
> (14159274,WOS,16,14159274,14159163,BCI,5,1999,BCI.BCI)
> (14159274,WOS,16,14159274,14159163,WOS,11,1999,WOS.SCI)
> and the 2nd tuple is not a (projection of any) member of TCRaw (though it is a
> member of SrcFuid). So I think my understanding of GROUP is incorrect.
> Thanks for your help!
> Will