Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 21 to 30 from 30 (0.179s).
Loading phrases to help you
refine your search...
Re: [jira] [Commented] (PIG-1324) Logical Optimizer: Nested column pruning - Pig - [mail # dev]
...Hi Daniel,  Thanks for the example. Does the current pruning happen before each statement, or just after LOAD? Because I can only see one-shot pruning for each table from the output. &n...
   Author: Jie Li, 2011-12-04, 20:50
Re: Early projection and lazy casting - Pig - [mail # dev]
...Sure. The two lines in bold are just dropping out non-necessary fields. Without them Pig would not project, especially for the table lineitem.  lineitem = load '$input/lineitem' USING P...
   Author: Jie Li, 2011-12-03, 02:42
Re: Early projection and lazy casting - Pig - [mail # dev]
...Why do joins prevent the early projection? Actually join has the greatest need for it.  Jie  On Fri, Dec 2, 2011 at 7:33 PM, Jonathan Coveney  wrote:  ...
   Author: Jie Li, 2011-12-03, 00:45
Early projection and lazy casting - Pig - [mail # dev]
...Hi all,  We just figured out Pig 0.9.1 doesn't drop those non-necessary fields asap, which really affects the performance. Though http://ofps.oreilly.com/titles/9781449302641/load_and_s...
   Author: Jie Li, 2011-12-03, 00:05
Re: Running TPC-H on Pig - Pig - [mail # dev]
...TPC-E is for transaction, so why is it better for evaluating Hadoop related systems?  We are benchmarking the whole queries. We found that some simple heuristics work very well so far. ...
   Author: Jie Li, 2011-12-02, 21:34
Re: Running TPC-H on Pig - Pig - [mail # dev]
...Yeah sure. We are just about to post them.  Jie  On Tue, Nov 29, 2011 at 8:18 PM, Jonathan Coveney wrote:  ...
   Author: Jie Li, 2011-12-02, 21:27
Re: getting x records at position y? - Pig - [mail # dev]
...I think you can use LIMIT operator, for example,  tmp = FOREACH input GENERATE $y; output = LIMIT tmp x;  Jie  On Fri, Dec 2, 2011 at 8:12 AM, Geert Van Landeghem wrote:  ...
   Author: Jie Li, 2011-12-02, 16:05
Re: Running TPC-H on Pig - Pig - [mail # dev]
...Did you mean the two update functions of TPC-H? I think we can leave them out as Hive did, as usually Hadoop is not for update.  Jie  On Tue, Nov 29, 2011 at 2:42 PM, Santhosh Srin...
   Author: Jie Li, 2011-11-29, 23:18
Re: Running TPC-H on Pig - Pig - [mail # dev]
...Yeah we already have some results but not so good, so we are currently rewriting some of the scripts especially rewriting the joins. Once we can a good result we will publish it.  Jie &...
   Author: Jie Li, 2011-11-29, 23:11
Running TPC-H on Pig - Pig - [mail # dev]
...Hello everyone,  As people are usually more concerned about the performance, we need more benchmarks to identify the bottleneck of the Pig's performance. For a class project we develop ...
   Author: Jie Li, 2011-11-29, 19:38
Sort:
project
Pig (30)
MapReduce (14)
Hadoop (13)
Hive (10)
type
mail # dev (19)
issue (10)
mail # user (1)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (3)
last 9 months (30)
author
Dmitriy Ryaboy (1351)
Alan Gates (960)
Jonathan Coveney (737)
Daniel Dai (545)
Russell Jurney (493)
Olga Natkovich (453)
Prashant Kommireddi (370)
Bill Graham (333)
Cheolsoo Park (263)
Mridul Muralidharan (201)
Thejas Nair (195)
Ashutosh Chauhan (169)
Julien Le Dem (156)
Jeff Zhang (146)
Santhosh Srinivasan (142)
Jie Li