Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Plain View
Pig >> mail # user >> pig cookbook


+
Doug Daniels 2011-11-10, 02:32
+
Daniel Dai 2011-11-10, 08:34
+
Ashutosh Chauhan 2011-11-10, 18:59
+
Corinne Chandel 2011-11-10, 20:04
+
Dmitriy Ryaboy 2011-11-10, 23:25
Copy link to this message
-
Re: pig cookbook
Thanks Dmitriy, I didn't know you could do that!

I added an entry to the FAQ that Daniel pointed out: http://bit.ly/sZ8UOL

-Doug

On 11/10/11 6:25 PM, "Dmitriy Ryaboy" <[EMAIL PROTECTED]> wrote:

>Wait, a cross? No, don't do a cross.  You can just cast the result of
>group
>all as a scalar, this will be much more efficient.
>
>D
>
>On Wed, Nov 9, 2011 at 6:32 PM, Doug Daniels
><[EMAIL PROTECTED]>wrote:
>
>> I was just working on a pig script to group some data by a field and
>>then
>> generate percentages for each group.  Without windowing functions at my
>> disposal,  I wound up using a group by on the field for the numerator, a
>> group all for the denominator, and a cross to pull them together.
>>
>> Afterward, I found myself wanting to drop that snippet somewhere that
>> other folks could use it, or (even better) to have found a snippet in
>>the
>> first place and not had to write it!
>>
>> So I was thinking it'd be great to have a "pig cookbook" online with
>> common operations that people do with pig.  Does anything like that
>>exist
>> yet?  If not, where would be a good place for it?
>>
>> Best,
>> Doug
>>
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB