This feature/syntax seems to be causing confusion in many cases, so I have proposed deprecating it in the next release.
See https://issues.apache.org/jira/browse/PIG-1967.
On 4/6/11 12:30 PM, "Thejas M Nair" <[EMAIL PROTECTED]> wrote:
In the relation TCRaw, there is no column called SrcFuid.
As a result, you end up using this feature -
Change your statement to -
TCGroupedByFuid = group TCRaw by (citingdocid,
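To illustrate the difference, here is a minimal sketch. The relations Raw and Src, their column names, and the file paths are hypothetical and are not taken from your script. It assumes that a relation-qualified name in the group key is treated as a scalar projection of that other relation, which only works when that relation has exactly one row.

```pig
-- Hypothetical relations, columns, and paths, for illustration only.
Raw = LOAD 'raw.tsv' AS (id:long, db:chararray, n:int);
Src = LOAD 'src.tsv' AS (id:long, db:chararray, n:int);

-- Raw has no column named Src, so Src.id is read as a scalar projection
-- of the relation Src. This fails at run time if Src has more than one row.
Bad = GROUP Raw BY (Src.id, Src.db, Src.n);

-- Unqualified names refer to Raw's own columns, so this groups Raw
-- by its own three fields.
Good = GROUP Raw BY (id, db, n);
```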
On 4/6/11 11:09 AM, "[EMAIL PROTECTED]"
<[EMAIL PROTECTED]> wrote:
>> Do you need the group key to be concatenated? If not, you can just group on
>> all three columns -
>> TCGroupedByFuid = group TCRaw by (SrcFuid.citingdocid,
> Hi Thejas,
> I had tried that originally, before introducing CONCAT(), but I got this error:
> ERROR 0: Scalar has more than one row in the output.
> 1st : (14159274,BCI,6), 2nd :(45937168,BCI,17)
> I don't understand that, since TCRaw is
> and the 2nd tuple is not a (projection of any) member of TCRaw (though it is a
> member of SrcFuid). So I think my understanding of GROUP is incorrect.
> Thanks for your help!