Re: Using Correlation and Covariance UDFs
Hi, Renato:
For CORRELATION, I guess you can do something like:
A = load 'random.txt' using PigStorage(':') as
B = group A all;
D = foreach B generate group, COR(A.$0, A.$1, A.$2, A.$3, ..., A.$499);

For COVARIANCE, I guess the UDF is COV.
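
The same pattern should carry over to COV. Here is a minimal sketch, assuming a two-column numeric input; the file name 'pairs.txt' and the field names x and y are made up for illustration and are not from the original question:

```pig
-- Hypothetical input: one "x:y" pair of numbers per line.
A = load 'pairs.txt' using PigStorage(':') as (x:double, y:double);
-- Put every row into a single group so the UDF sees whole columns.
B = group A all;
-- COV takes one bag per column, the same way COR does above.
C = foreach B generate group, COV(A.x, A.y);
dump C;
```

It is worth checking the output on a small file first to see how the result is laid out before running it on all 500 columns.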

On Tue, Mar 26, 2013 at 3:28 PM, Renato Marroquín Mogrovejo wrote:

> Hi all,
> Could anyone be kind enough to point me to some examples of using the
> COVARIANCE and CORRELATION UDFs described here? [1]
> Renato M.
> [1] https://issues.apache.org/jira/browse/PIG-277