

jamal sasha 2013-04-27, 09:32
Re: pig question
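values = LOAD 'my_path' AS (id1:int, id2:chararray, value:int);
-- After GROUP, each row holds the bag 'values'; flatten it back out next to the group's maximum.
with_max = FOREACH (GROUP values BY id1) GENERATE FLATTEN(values), MAX(values.value) AS max_value;
-- Cast to double so the ratio is not truncated by integer division.
overall = FOREACH with_max GENERATE id1, id2, (double)value / max_value AS div_max;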

Russell Jurney http://datasyndrome.com

On Apr 27, 2013, at 2:32 AM, jamal sasha <[EMAIL PROTECTED]> wrote:

> Hi,
>  I have data in this format:
>
> id1,id2, value
> 1 , abc, 2993
> 1, dhu, 9284
> 1,dus,2389
> 2, acs,29392
>
> and so on
>
> For each id1, I want to find the maximum value and then divide value by
> max_value
> so in example above:
>
> 1,abc, 2993/9284
> 1,dhu ,9284/9284
> 1,dus, 2389/9284
> 2,acs, 29392/max_value_for_this id
>
> How do I do this in Pig?
> Thanks
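
As a cross-check on what the result should look like, here is a minimal plain-Python sketch (not Pig) of the same per-group computation, run on the sample rows from the question. The function name `divide_by_group_max` is hypothetical, and the ratio shown for id1 = 2 assumes that group contains only the single row listed; the question's "and so on" suggests the real data has more.

```python
from collections import defaultdict


def divide_by_group_max(rows):
    """Return (id1, id2, value / max value within the same id1) for each row."""
    # First pass: find the maximum value for each id1.
    max_by_id = defaultdict(lambda: float("-inf"))
    for id1, _id2, value in rows:
        max_by_id[id1] = max(max_by_id[id1], value)
    # Second pass: divide each value by its group's maximum (true division).
    return [(id1, id2, value / max_by_id[id1]) for id1, id2, value in rows]


# Sample rows copied from the question.
rows = [(1, "abc", 2993), (1, "dhu", 9284), (1, "dus", 2389), (2, "acs", 29392)]

for row in divide_by_group_max(rows):
    print(row)
```

The two passes mirror the Pig approach of computing a per-group maximum and then joining it back onto every row of that group.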
Later replies in this thread (not shown): abhishek (2013-05-07, 02:52), Jonathan Coveney (2013-05-07, 08:33), abhishek (2013-05-07, 13:48)