jamal sasha 2012-11-06, 14:13
Mohammad Tariq 2012-11-06, 14:17
Have you tried Mohammad Tariq's steps?
Otherwise, you can do it like this:
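A = load '/filename' using PigStorage(',') as (id, val1, val2);
B = foreach A generate id, val2;
C = group B by id;
-- 'group' holds the id of each group, so this already yields one row per id
-- and no flatten/distinct step is needed
D = foreach C generate group as id, MAX(B.val2) as val2;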
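Note that MAX matches the expected output only because val2 is in ascending order and each id in the sample has exactly two rows. If an id can have more rows and the second one is really what is wanted, a nested foreach may be closer. This is only a sketch: it assumes Pig 0.10 or later (for nested limit), assumes "2nd element" means the second-smallest val2 per id, and uses a made-up output path '/output'.

```pig
-- Assumption: a typed schema, so val2 is ordered numerically rather than as bytes.
A = load '/filename' using PigStorage(',') as (id:int, val1:double, val2:double);
B = foreach A generate id, val2;
C = group B by id;
D = foreach C {
    by_val2 = order B by val2;    -- ascending val2 within each id
    top2 = limit by_val2 2;       -- keep the two smallest values
    -- the larger of the two smallest values is the second element
    generate group as id, MAX(top2.val2) as val2;
};
-- '/output' is a hypothetical path, not from the original thread
store D into '/output' using PigStorage(',');
```

For an id that has only one row, this returns that row's val2.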
Thanks & Regards
Yogesh Kumar Dhari
> From: [EMAIL PROTECTED]
> Date: Tue, 6 Nov 2012 19:47:55 +0530
> Subject: Re: accessing like array
> To: [EMAIL PROTECTED]
> Load the data into a relation, use 'generate' to take only the required
> fields from it into a second relation, then store the
> second relation in a file.
> Mohammad Tariq
> On Tue, Nov 6, 2012 at 7:43 PM, jamal sasha <[EMAIL PROTECTED]> wrote:
> > Hi,
> > I have data in the form
> > 1,0.2,0.3
> > 1,0.3,0.4
> > 2,0.8,0.2
> > 2,0.9,0.7
> > and so on.
> > So the format is id, val1, val2.
> > For each id, the rows are already sorted by val2.
> > I want to select the 2nd row for each id and return its val2 (ignoring val1).
> > For example, for the dataset above I want to return:
> > 1,0.4
> > 2,0.7
> > How do I go about this?
> > Thanks