Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 161 to 170 from 367 (0.083s).
Loading phrases to help you
refine your search...
Re: Persisting Pig Scripts - Pig - [mail # dev]
...I think that would be more of a post-process vs having Pig write the same to a HDFS location. That would avoid having to parse it from job.xml.  On Wed, Jun 6, 2012 at 4:19 PM, Daniel D...
   Author: Prashant Kommireddi, 2012-06-07, 00:16
[PIG-2725] PushUpFilter does not work as expected - Pig - [issue]
...PushUpFilter fails to optimize to place FILTER above GROUP BY as shown in this example http://pig.apache.org/docs/r0.10.0/perf.html#PushUpFilter...
http://issues.apache.org/jira/browse/PIG-2725    Author: Prashant Kommireddi, 2012-06-07, 00:12
Persisting Pig Scripts - Pig - [mail # dev]
...Hi All,  What do you guys think about adding a feature to be able to persist the script (file or cache in case of grunt) on HDFS or locally based on an admin setting (pig.properties). T...
   Author: Prashant Kommireddi, 2012-06-06, 22:52
Re: testDefaultPigProperties in TestPigServer fails - Pig - [mail # user]
...Hi Johannes,  I replied on JIRA. Please take a look, seems like pig-default.properties is not on your classpath.  Thanks, Prashant  On Mon, Jun 4, 2012 at 6:02 AM, Johannes Sc...
   Author: Prashant Kommireddi, 2012-06-05, 05:27
Parsing issue within UDF arguments - Pig - [mail # user]
...I was trying to use REGEX_EXTRACT_ALL and seems like if the argument contains semi-colon, the script errors out.  ______________________________________________________________ Data Fil...
   Author: Prashant Kommireddi, 2012-06-02, 21:14
Re: Number of reduce tasks - Pig - [mail # user]
...Right. And the documentation provides a list of operations that can be parallelized.  On Jun 1, 2012, at 4:50 PM, Dmitriy Ryaboy  wrote:  ...
   Author: Prashant Kommireddi, 2012-06-02, 00:24
Re: Number of reduce tasks - Pig - [mail # user]
...Automatic Heuristic works the same in 0.9.1 http://pig.apache.org/docs/r0.9.1/perf.html#parallel, but you might be better off setting it manually looking at job tracker counters.  You s...
   Author: Prashant Kommireddi, 2012-06-01, 19:26
Re: Number of reduce tasks - Pig - [mail # user]
...Also, please note default number of reducers are based on input dataset. In the basic case, Pig will "automatically" spawn a reducer for each GB of input, so if your input dataset size is 50...
   Author: Prashant Kommireddi, 2012-06-01, 19:03
Re: Job setup for a pig run takes ages - Pig - [mail # user]
...Hi Markus,  Have you checked the JobTracker at the time of launching the job that Map slots were available?  Looks like the input dataset size is ~464 GB. Since you mentioned 10 GB...
   Author: Prashant Kommireddi, 2012-05-31, 09:57
[PIG-2727] PigStorage Source tagging does not need pig.splitCombination to be turned off - Pig - [issue]
...With https://issues.apache.org/jira/browse/PIG-2462, PigStorage does not need "pig.splitCombination to be turned off. I did not take the fix made in 2462 into account while implementing this...
http://issues.apache.org/jira/browse/PIG-2727    Author: Prashant Kommireddi, 2012-05-30, 02:11
Sort:
project
Pig (367)
Hadoop (41)
HBase (4)
MapReduce (3)
type
mail # user (276)
mail # dev (70)
issue (21)
date
last 7 days (0)
last 30 days (9)
last 90 days (62)
last 6 months (115)
last 9 months (367)
author
Dmitriy Ryaboy (1352)
Alan Gates (954)
Jonathan Coveney (731)
Daniel Dai (541)
Russell Jurney (485)
Olga Natkovich (453)
Prashant Kommireddi (367)
Bill Graham (334)
Cheolsoo Park (246)
Mridul Muralidharan (201)
Thejas Nair (195)
Ashutosh Chauhan (169)
Julien Le Dem (154)
Jeff Zhang (146)
Santhosh Srinivasan (142)