Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Plain View
Pig, mail # user - optimization for data cube


Copy link to this message
-
optimization for data cube
Haitao Yao 2013-04-03, 03:34
Hi, all
I have a tuple like this:
(group_a,group_b,group_c,value)

and I want to calculate the values in a data cube way, which means I want to generate new tuples from the original one :

(all,all,all,value)
(group_a,all,all,value)
(all,group_b,all,value)
(group_a,group_b,all,value)
(all,all,group_c,value)
(group_a,all,group_c,value)
(all,group_b,group_c,value)

and then group by ($0, $1, $2) .
How can I do this? I've wrote a Eval function, but it can not generate more tuples from one tuple.
thanks.
Haitao Yao
[EMAIL PROTECTED]
weibo: @haitao_yao
Skype:  haitao.yao.final

+
Prasanth J 2013-04-03, 05:19
+
Haitao Yao 2013-04-03, 06:07