Search results 41 to 50 of 88 (0.102s).
Re: filter duplicates from a bag - Pig - [mail # user]
...I would say something along these lines:  B = group A by *; C = foreach B generate group, COUNT(A) as count; D = filter C by count > 1; E = foreach D generate group;  Disclaimer...
   Author: Gianmarco De Francisci Mo..., 2012-08-24, 10:19
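As a rough illustration of what the four Pig statements in the snippet above compute, here is a minimal Python sketch. It is not taken from the thread: the function name `duplicated_records` and the sample rows are hypothetical, and it assumes each record is a hashable tuple. Like the Pig version, it returns the records that occur more than once, each listed a single time.

```python
from collections import Counter


def duplicated_records(records):
    """Return each distinct record that occurs more than once."""
    # Mirrors: B = group A by *; C = foreach B generate group, COUNT(A) as count;
    counts = Counter(records)
    # Mirrors: D = filter C by count > 1; E = foreach D generate group;
    return [record for record, count in counts.items() if count > 1]


# Hypothetical sample data: whole tuples are compared, as with "group A by *".
rows = [("a", 1), ("b", 2), ("a", 1), ("c", 3), ("b", 2), ("a", 1)]
print(duplicated_records(rows))
```

Grouping on the whole tuple is what makes this a test for exact duplicates; grouping on a subset of fields would instead find records that share a key.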
Re: add a field, ordered - Pig - [mail # user]
...Hi,  We are finalizing a feature that would solve your problem, something like ROW_NUMBER in some SQL dialects; we call it RANK. This operator will add a unique consecutive row number t...
   Author: Gianmarco De Francisci Mo..., 2012-08-14, 10:05
Re: illustrate - Pig - [mail # dev]
...Hi Dmitriy, I think there are at least a couple things that would be more difficult to do with a UDF implementation, namely: 1) AFAIK, you don't have access to the MR task id within the UDF....
   Author: Gianmarco De Francisci Mo..., 2012-08-12, 09:39
Re: How to selectively ship a class and its dependencies? - Pig - [mail # dev]
...Not sure about the maturity of our code for this task, but Autojar [1] does exactly what you ask. It is GPL licensed, so there might be licensing issues.  [1] http://autojar.sourceforge...
   Author: Gianmarco De Francisci Mo..., 2012-07-21, 08:51
Re: NoClassDefFoundError after upgrading to pig 0.10.0 from 0.9.0 - Pig - [mail # user]
...We can simply generate the pom dynamically as we already do with the ivy.xml file.  Cheers, Gianmarco     On Mon, Jul 2, 2012 at 3:58 AM, Dmitriy Ryaboy  wrote:  ...
   Author: Gianmarco De Francisci Mo..., 2012-07-02, 13:36
Re: Does pig support in clause? - Pig - [mail # user]
...Bloom filters would help efficiency here. A bloom join or semi-join would be a nice addition to Pig.  Cheers, Gianmarco     On Mon, Jun 25, 2012 at 7:50 PM, Alan Gates  w...
   Author: Gianmarco De Francisci Mo..., 2012-06-26, 05:56
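To make the Bloom-filter idea in the snippet above concrete, here is a small Python sketch of a semi-join that uses a Bloom filter as a cheap pre-filter. It is an assumption-laden illustration, not Pig's implementation: `BloomFilter`, `bloom_semi_join`, the bit-array size, and the hash scheme (double hashing over a SHA-256 digest) are all choices made here for the example.

```python
import hashlib


class BloomFilter:
    """Fixed-size Bloom filter; sizes are illustrative, not tuned."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, key):
        # Derive several bit positions from one digest via double hashing.
        digest = hashlib.sha256(repr(key).encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, key):
        # False means "definitely absent"; True means "possibly present".
        return all(self.bits[pos // 8] & (1 << (pos % 8))
                   for pos in self._positions(key))


def bloom_semi_join(large_rows, small_keys, key_of):
    """Keep rows of large_rows whose key appears in small_keys."""
    exact_keys = set(small_keys)
    bloom = BloomFilter()
    for key in exact_keys:
        bloom.add(key)
    # Stage 1: probabilistic pre-filter (the part that would run early,
    # before data is shuffled, in a distributed setting).
    candidates = [row for row in large_rows if bloom.might_contain(key_of(row))]
    # Stage 2: exact check to discard Bloom-filter false positives.
    return [row for row in candidates if key_of(row) in exact_keys]


# Hypothetical sample data.
users = [(1, "ann"), (2, "bob"), (3, "cat"), (4, "dan")]
print(bloom_semi_join(users, [2, 4], key_of=lambda row: row[0]))
```

In a single process the exact set alone would suffice; the pre-filter only pays off when the filter is much smaller than the key set and can be shipped to where the large relation is read, so that most non-matching rows are dropped before any expensive join step.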
Re: A solution for the confusion around "as alias:type" ? - Pig - [mail # dev]
...We have already discussed it and come to a decision: See https://issues.apache.org/jira/browse/PIG-2315 If somebody feels like implementing it I would be happy :)  Cheers,  Gianmar...
   Author: Gianmarco De Francisci Mo..., 2012-06-20, 09:35
Re: Pig blog? - Pig - [mail # dev]
...+1  It would help build and strengthen the community, but we would need someone to look after it. It might take some time.  Cheers, Gianmarco     On Sat, Jun 9, 2012 ...
   Author: Gianmarco De Francisci Mo..., 2012-06-11, 11:31
Re: Some questions on intermediate serialization in Pig - Pig - [mail # dev]
...So, to recap.  InterSedes writes the R1/R2/R3 thing. I am quite sure it is done for splittability purposes. The RawComparators, as well as InterStorage, operate on binary data that does...
   Author: Gianmarco De Francisci Mo..., 2012-06-09, 17:11
Re: Create rdbms like sequence in Pig on Pig Relation - Pig - [mail # user]
...Hi, Pig will have this functionality as soon as we finish PIG-2353, which is part of this year's GSoC.  Cheers, Gianmarco     On Fri, May 18, 2012 at 8:34 PM, DIPESH KUMAR SIN...
   Author: Gianmarco De Francisci Mo..., 2012-05-24, 05:55