|
|
jamal sasha 2012-11-06, 14:13
Hi, I have data in form 1,0.2,0.3 1,0.3,0.4 2,0.8,0.2 2,0.9,0.7 and so on.. so id, va1,val2 format..
This id is already sorted based on val 2 I want to select the 2nd element for each id with val2 (ignoring val1) for example in the above dataset what i want to return is 1,0.4 2,0.7
How to go about this?? Thanks
+
jamal sasha 2012-11-06, 14:13
-
Re: accessing like array
Mohammad Tariq 2012-11-06, 14:17
load the data into a relation and use 'generate' to take only the required fields from this relation and put into some other relation..then store the 2nd relation into some file.
Regards, Mohammad Tariq
On Tue, Nov 6, 2012 at 7:43 PM, jamal sasha <[EMAIL PROTECTED]> wrote:
> Hi, > I have data in form > 1,0.2,0.3 > 1,0.3,0.4 > 2,0.8,0.2 > 2,0.9,0.7 > and so on.. > so id, va1,val2 format.. > > This id is already sorted based on val 2 > I want to select the 2nd element for each id with val2 (ignoring val1) > for example in the above dataset what i want to return is > 1,0.4 > 2,0.7 > > How to go about this?? > Thanks >
+
Mohammad Tariq 2012-11-06, 14:17
-
RE: accessing like array
yogesh dhari 2012-11-07, 05:07
Hi Jamal,
Have you followed Mohammad Tariq's steps?
Or else you can go like this,
A = load '/filename' using PigStorage(',') as (id, val1,val2); B=foreach A generate id,val2; C = group B by id; D = foreach C generate flatten(B.id), MAX(B.val2); E = distinct D;
Dump E;
Thanks & Regards Yogesh Kumar Dhari > From: [EMAIL PROTECTED] > Date: Tue, 6 Nov 2012 19:47:55 +0530 > Subject: Re: accessing like array > To: [EMAIL PROTECTED] > > load the data into a relation and use 'generate' to take only the required > fields from this relation and put into some other relation..then store the > 2nd relation into some file. > > Regards, > Mohammad Tariq > > > > On Tue, Nov 6, 2012 at 7:43 PM, jamal sasha <[EMAIL PROTECTED]> wrote: > > > Hi, > > I have data in form > > 1,0.2,0.3 > > 1,0.3,0.4 > > 2,0.8,0.2 > > 2,0.9,0.7 > > and so on.. > > so id, va1,val2 format.. > > > > This id is already sorted based on val 2 > > I want to select the 2nd element for each id with val2 (ignoring val1) > > for example in the above dataset what i want to return is > > 1,0.4 > > 2,0.7 > > > > How to go about this?? > > Thanks > >
+
yogesh dhari 2012-11-07, 05:07
|
|