Results 11 to 20 of 90 (0.065s).
[PIG-3221] Bootstrap sampling - Pig - [issue]
...Implement a bootstrap sampling option ( http://en.wikipedia.org/wiki/Bootstrap_(statistics) ) in Pig's SAMPLE operator....
http://issues.apache.org/jira/browse/PIG-3221    Author: Gianmarco De Francisci Mo..., 2013-04-25, 21:06
[PIG-3225] Stratified sampling - Pig - [issue]
...Implement a stratified sampling option ( http://en.wikipedia.org/wiki/Stratified_sampling ) in Pig's SAMPLE operator....
http://issues.apache.org/jira/browse/PIG-3225    Author: Gianmarco De Francisci Mo..., 2013-04-22, 09:14
Re: Rank within a group - Pig - [mail # user]
...Hi,  nested RANK is not supported yet, however it is easy to implement as a UDF. Just sort the records and assign an increasing counter with the UDF. We will probably add support for ne...
   Author: Gianmarco De Francisci Mo..., 2013-04-16, 19:00
Re: GSoC 2013 - Pig - [mail # user]
...+1 to what Dmitriy says.  Cheers,  Gianmarco   On Mon, Apr 8, 2013 at 8:57 PM, Dmitriy Ryaboy  wrote:  ...
   Author: Gianmarco De Francisci Mo..., 2013-04-09, 07:10
[PIG-2138] Inline_op should use shared dynamic stack - Pig - [issue]
...The way inline_op saves and restores the statement state is brittle and can lead to bugs. To work around the issue one needs to be careful when writing rules not to include operations that ca...
http://issues.apache.org/jira/browse/PIG-2138    Author: Gianmarco De Francisci Mo..., 2013-03-07, 02:08
[PIG-2353] RANK function like in SQL - Pig - [issue]
...Implement a function that, given a (sorted) bag, adds to each tuple a unique, increasing identifier without gaps, like what RANK does for SQL. This is a candidate project for Google summer of c...
http://issues.apache.org/jira/browse/PIG-2353    Author: Gianmarco De Francisci Mo..., 2013-02-28, 16:17
Re: Limit vs Sample - Pig - [mail # user]
...Hi, LIMIT takes the first X records, so there are no statistical guarantees. SAMPLE takes X% of the records from the whole bag (uniformly), so you have statistical guarantees. No, SAMPLE doe...
   Author: Gianmarco De Francisci Mo..., 2013-02-28, 10:01
[PIG-2808] Add *.project to .gitignore - Pig - [issue]
http://issues.apache.org/jira/browse/PIG-2808    Author: Gianmarco De Francisci Mo..., 2013-02-22, 04:54
[PIG-2691] Duplicate TOKENIZE schema - Pig - [issue]
...TOKENIZE produces a fixed named schema that results in duplicates if used more than once in the same generate statement. We could parameterize the schema on the name of the field being token...
http://issues.apache.org/jira/browse/PIG-2691    Author: Gianmarco De Francisci Mo..., 2013-02-22, 04:54
Re: [ANNOUNCE] Welcome Bill Graham to join Pig PMC - Pig - [mail # user]
...Congrats Bill! :)  Gianmarco   On Wed, Feb 20, 2013 at 10:00 AM, Jonathan Coveney wrote:  ...
   Author: Gianmarco De Francisci Mo..., 2013-02-20, 14:45