Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Plain View
HBase >> mail # user >> Re: Multiple Aggregate functions in map reduce program


Copy link to this message
-
Re: Multiple Aggregate functions in map reduce program
Hi,

ideally you want to "scan" through data once and the the (sum,count).

One simple solution is write your own map-reduce with key = department,
value = new VectorWritable(vector);

With vector is an array which array[0] = salary, array[1] = 1.

In the reduce phase all you need is to do the aggregation on array[0] and
array[1] properly.

The reduce value is also array[0] = sum of salary, array[1] = sum of
employees.

This is common problem, I think others might have better solutions.

-- Khang

On Fri, Oct 5, 2012 at 3:02 PM, iwannaplay games <[EMAIL PROTECTED]
> wrote:

> ant result like
>
> Department       TotalSalary      count(employees)
>
> testing                    7000                 3
>
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB