Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 11 to 20 from 55 (0.113s).
Loading phrases to help you
refine your search...
Re: UDF for generating top xx % of results? - Pig - [mail # user]
...ahh, I see, i seem to have misread the question, if it's top xx% entries, then certainly sorting and then limiting.  @Thejas  I had thought that Limit is distributed and does not g...
   Author: hc busy, 2010-06-30, 16:02
Re: Pig at LinkedIn - Pig - [mail # user]
...Russell, fire that neurologist who didn't care about what you had to think about your own problems!! ;-)  But honestly though, I use booleans in my pig scripts too. The trouble is that ...
   Author: hc busy, 2010-06-24, 20:26
Re: Scaling Pig Projects - The Hairy Pig - Pig - [mail # user]
...More great ideas, Scott!  The one thing about idempotency of IMPORT is that you may not necessarily want it. The scripts that I wrote will indeed take alias from a previously imported p...
   Author: hc busy, 2010-06-24, 17:24
Re: simple way to REPLACE on various columns - Pig - [mail # user]
...yeah, that'd be really cool. The other way that we can say this, (to make map reduce interface available in pig), is to allow FOREACH to be nested:   TRIMED_TABLE = FOREACH TABLE { &nbs...
   Author: hc busy, 2010-06-18, 18:20
Re: How to find all possible permutations from a bag - Pig - [mail # user]
...heh, I want n*(n-1)/2 too... Maybe someone out there has an UDF that does this after a group.   On Wed, Jun 16, 2010 at 8:30 AM, Christian  wrote:  ...
   Author: hc busy, 2010-06-16, 18:47
Re: Help with a tricky query - Pig - [mail # user]
...Yeah, that IS hard in pig. I'm not even sure how to do a self-join in Pig. Like you can't really say  T = join Table by id1, Table by id2, Table by id3;  I think PigLatin will comp...
   Author: hc busy, 2010-06-11, 17:59
Re: Behavior of JOIN - Pig - [mail # user]
...Oh, I see what my confusion is... It's the "null"s on which join behaves differently in pig than sql. Right? that's where things are different.   On Thu, Jun 10, 2010 at 12:48 PM, Alan ...
   Author: hc busy, 2010-06-11, 17:44
Re: does EvalFunc generate the entire bag always ? - Pig - [mail # dev]
...well, see that's the thing, the 'sort A by $0' is already nlg(n)  ahh, I see, my own example suffers from this problem.  I guess I'm wondering how 'limit' works in conjunction with...
   Author: hc busy, 2010-06-01, 20:44
Re: cogroup and flattening optionally empty bags - Pig - [mail # user]
...yeah, something like this should work:  o = foreach cg generate FLATTEN(A.a_column) as a_column, ((IsEmpty(B))?toBag(toTuple(null)):(B.b_column)) as B2: o2 = foreach o generate a_column...
   Author: hc busy, 2010-06-01, 20:38
Re: UDF question - Pig - [mail # user]
...oh, that's a good point, can't just return arbitrary types... Even if I derive from base class. Interesting.  Well, the combination of toTuple and toBag will accomplish many tasks. One ...
   Author: hc busy, 2010-06-01, 20:29
Pig (55)
Hadoop (2)
Hive (1)
mail # user (43)
mail # dev (10)
issue (2)
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (55)
Daniel Dai (358)
Dmitriy Ryaboy (346)
Alan Gates (333)
Cheolsoo Park (288)
Jonathan Coveney (237)
Russell Jurney (174)
Rohini Palaniswamy (168)
Bill Graham (131)
Olga Natkovich (130)
Prashant Kommireddi (106)
Aniket Mokashi (87)
Julien Le Dem (84)
Thejas Nair (69)
Thejas M Nair (63)
Mridul Muralidharan (61)
Ashutosh Chauhan (41)
pi song (41)
Gianmarco De Francisci Mo...(38)
"Cheolsoo Park (35)
Ruslan Al-Fakikh (35)
Dmitriy V. Ryaboy (34)
Koji Noguchi (33)
Pradeep Gollakota (33)
Jeff Zhang (32)
Santhosh Srinivasan (29)
hc busy