Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 11 to 20 from 22 (0.073s).
Loading phrases to help you
refine your search...
Re: Total count of RandomSampleLoader is unpredicatable - Pig - [mail # dev]
...Not sure if it's the same issue, but I also see the counter of Map input records is greater than the actual number of input records in some cases.  Jie  On Thu, Jul 26, 2012 at 6:0...
   Author: Jie Li, 2012-07-27, 22:25
Breaking down big unit tests - Pig - [mail # dev]
...Hi all,  Apparently some unit test classes are so fat that retesting them is a pain. While reducing the full testing time is a long-term goal, shall we just break down those big units i...
   Author: Jie Li, 2012-07-19, 21:11
Hash aggregation experience - Pig - [mail # dev]
...Hi all,  Has anyone tried the hash aggregation feature in pig 0.10 and seen any performance improvement? Recently I'm benchmarking HashAgg and the combiner to see whether we should use ...
   Author: Jie Li, 2012-07-12, 02:43
Re: Are there any explanations of the implementation of illustrate? - Pig - [mail # dev]
...Some document here: http://wiki.apache.org/pig/PigIllustrate  I agree that more tests are needed for illustrate, otherwise it can be easily broken without notice.  Jie  On Tue...
   Author: Jie Li, 2012-07-03, 20:50
Re: suggestion - Pig - [mail # user]
...Pig does have a "-c" to check the syntax:  pig -x local -c -f x.pig  Jie  On Fri, Jun 29, 2012 at 5:02 AM, Ruslan Al-Fakikh  wrote:...
   Author: Jie Li, 2012-06-29, 17:11
[expand - 1 more] - Re: Some proposals for Pig performance optimization - Pig - [mail # dev]
...Thanks Thejas for the comments! See my answers inline.  l ng  in  Yes the environment set up uses only 1GB data, so there is only 1 reducer for the order-by.  I've also u...
   Author: Jie Li, 2012-06-21, 21:22
[expand - 1 more] - Re: Shall we get the machinations machinating for PIG 0.11? - Pig - [mail # dev]
...I'm working on a list of possible performance optimization and hope some of them will go into 0.11.  Will post it shortly.  Jie  On Wed, Jun 20, 2012 at 11:45 AM, Daniel Dai &...
   Author: Jie Li, 2012-06-20, 18:59
Re: [jira] [Updated] (PIG-2397) Running TPC-H on Pig - Pig - [mail # dev]
...Hi all,  We update the slides and the scripts we used. Any comment is appreciated. We will post our report later.  Jie  On Sat, Dec 10, 2011 at 1:05 AM, Jie Li (Updated) (JIRA...
   Author: Jie Li, 2011-12-10, 06:11
Re: [jira] [Commented] (PIG-1324) Logical Optimizer: Nested column pruning - Pig - [mail # dev]
...Hi Daniel,  Thanks for the example. Does the current pruning happen before each statement, or just after LOAD? Because I can only see one-shot pruning for each table from the output. &n...
   Author: Jie Li, 2011-12-04, 20:50
[expand - 2 more] - Re: Early projection and lazy casting - Pig - [mail # dev]
...Sure. The two lines in bold are just dropping out non-necessary fields. Without them Pig would not project, especially for the table lineitem.  lineitem = load '$input/lineitem' USING P...
   Author: Jie Li, 2011-12-03, 02:42
Pig (22)
Hadoop (10)
MapReduce (10)
Hive (8)
Kafka (2)
Sqoop (1)
mail # dev (11)
issue (10)
mail # user (1)
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (1)
last 9 months (22)
Daniel Dai (383)
Dmitriy Ryaboy (345)
Alan Gates (334)
Cheolsoo Park (267)
Jonathan Coveney (230)
Russell Jurney (174)
Rohini Palaniswamy (159)
Olga Natkovich (131)
Bill Graham (130)
Prashant Kommireddi (108)
Aniket Mokashi (82)
Julien Le Dem (82)
Thejas Nair (70)
Thejas M Nair (63)
Mridul Muralidharan (61)
Ashutosh Chauhan (42)
pi song (41)
Gianmarco De Francisci Mo...(39)
Koji Noguchi (38)
Pradeep Gollakota (36)
Cheolsoo Park (35)
Ruslan Al-Fakikh (35)
Dmitriy V. Ryaboy (34)
Jeff Zhang (32)
Serega Sheypak (29)
Jie Li