Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig >> mail # user >> Internal error 2999 - misuse of CONCAT? misuse of GROUP?


Copy link to this message
-
RE: Internal error 2999 - misuse of CONCAT? misuse of GROUP?

>Do you need the group-key to be concatenated ? If not, you can just group on all the three columns -

>TCGroupedByFuid = group TCRaw by (SrcFuid.citingdocid,
                                         SrcFuid.col,
                                        SrcFuid.seq);

Hi Thejas,

I had tried that originally before introducing CONCAT(), but I got this error message:

ERROR 0: Scalar has more than one row in the output.
 1st : (14159274,BCI,6), 2nd :(45937168,BCI,17)

I don't understand that, since TCRaw is

(14159274,BCI,6,14159274,14159163,BCI,5,1999,BCI.BCI)
(14159274,BCI,6,14159274,14159163,WOS,11,1999,WOS.SCI)
(14159274,WOS,16,14159274,14159163,BCI,5,1999,BCI.BCI)
(14159274,WOS,16,14159274,14159163,WOS,11,1999,WOS.SCI)

and the 2nd tuple is not a (projection of any) member of TCRaw (though it is a member of SrcFuid). So I think my understanding of GROUP is incorrect.

Thanks for your help!

Will

NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB