Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig >> mail # user >> pig question


Copy link to this message
-
Re: pig question
values = LOAD 'my_path' AS (id1:int, id2:chararray, value:int);
overall = FOREACH (GROUP values BY id1) GENERATE group AS id1,
value/MAX(value) as div_max;

Russell Jurney http://datasyndrome.com

On Apr 27, 2013, at 2:32 AM, jamal sasha <[EMAIL PROTECTED]> wrote:

> Hi,
>  I have data of format
>
> id1,id2, value
> 1 , abc, 2993
> 1, dhu, 9284
> 1,dus,2389
> 2, acs,29392
>
> and so on
>
> For each id1, I want to find the maximum value and then divide value by
> max_value
> so in example above:
>
> 1,abc, 2993/9284
> 1,dhu ,9284/9284
> 1,dus, 2389/9284
> 2,acs, 29392/max_value_for_this id
>
> How do i do this in pig?
> Thanks
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB