Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 1 to 10 from 251 (0.084s).
Loading phrases to help you
refine your search...
Re: Optimization opportunity for group by followed by join on the same key ? - Pig - [mail # dev]
...Jeff,   There is already a JIRA - https://issues.apache.org/jira/browse/PIG-3849.You can update it with the details/diagrams.Regards,RohiniOn Thu, Mar 5, 2015 at 9:41 AM, Daniel Da...
   Author: Rohini Palaniswamy, 2015-03-06, 00:51
[PIG-4449] Optimize the case of Order by + Limit in nested foreach - Pig - [issue]
...This is one of the very frequently used patternsgrouped_data_set = group data_set by id;capped_data_set = foreach grouped_data_set{  ordered = order joined_data_set by timestamp de...
http://issues.apache.org/jira/browse/PIG-4449    Author: Rohini Palaniswamy, 2015-03-05, 21:10
[PIG-4446] Support for vertex level commit for parallel outputs in Tez - Pig - [issue]
... By default, Tez does AM level commit controlled by the setting TEZ_AM_COMMIT_ALL_OUTPUTS_ON_DAG_SUCCESS. It has support for vertex level commit as well and that makes sense in some cas...
http://issues.apache.org/jira/browse/PIG-4446    Author: Rohini Palaniswamy, 2015-03-04, 23:01
[PIG-4443] Write inputsplits in Tez to disk if the size is huge - Pig - [issue]
...Pig sets the input split information in user payload and when running against a table with 10s of 1000s of partitions, DAG submission fails withjava.io.IOException: Requested data length 305...
http://issues.apache.org/jira/browse/PIG-4443    Author: Rohini Palaniswamy, 2015-03-03, 22:56
[PIG-4059] Pig on Spark - Pig - [issue]
...Setting up your development environment:1. Check out Pig Spark branch.2. Build Pig by running "ant jar".3. Configure these environmental variables:    export HADOOP_USER_CLASS...
http://issues.apache.org/jira/browse/PIG-4059    Author: Rohini Palaniswamy, 2015-02-27, 19:36
[expand - 3 more] - Re: Pig on Spark - Pig - [mail # dev]
...Thanks.project = Pig AND fixVersion = spark-branch and summary ~ "Enable unittest"  -> Needs to be moved under PIG-4266project = Pig AND fixVersion = spark-branch and parent not in (...
   Author: Rohini Palaniswamy, 2015-02-24, 16:11
[PIG-4429] Add Pig alias information to the DAG view in Tez UI - Pig - [issue]
... The DAG view displays vertex name, id, time duration, task and status information for each vertex. It would be good to add the pig alias and feature (join, group by, etc) information t...
http://issues.apache.org/jira/browse/PIG-4429    Author: Rohini Palaniswamy, 2015-02-20, 23:54
[PIG-4428] Support UDFContext style getProperties() for different UDFs in Tez ObjectCache - Pig - [issue]
...Maintain another level of map in the ObjectRegistry and return that when user specifies the UDF class and signature....
http://issues.apache.org/jira/browse/PIG-4428    Author: Rohini Palaniswamy, 2015-02-20, 23:22
[PIG-4427] Log a warning if the PARALLEL specified is not a prime number - Pig - [issue]
... Most of the time users specify default_parallel or PARALLEL in multiples of 10. This causes data skew and is not that effective. For eg: Had a user specify 1000 and all records went in...
http://issues.apache.org/jira/browse/PIG-4427    Author: Rohini Palaniswamy, 2015-02-20, 20:24
[PIG-4424] Different configurations for different stages of script - Pig - [issue]
...From a user:I have a pig script which runs multiple map reduce jobs. (Ex: 'group by' and 'order by' which will be executed as 2 different map reduce jobs)Is there a way to specify different ...
http://issues.apache.org/jira/browse/PIG-4424    Author: Rohini Palaniswamy, 2015-02-19, 00:51
Sort:
project
Pig (251)
Tez (48)
Hive (7)
Bigtop (1)
HBase (1)
HDFS (1)
type
issue (176)
mail # dev (50)
mail # user (25)
date
last 7 days (5)
last 30 days (19)
last 90 days (39)
last 6 months (72)
last 9 months (251)
author
Daniel Dai (440)
Dmitriy Ryaboy (345)
Alan Gates (335)
Cheolsoo Park (273)
Jonathan Coveney (230)
Rohini Palaniswamy (204)
Russell Jurney (175)
Olga Natkovich (131)
Bill Graham (129)
Prashant Kommireddi (110)
Julien Le Dem (81)
Aniket Mokashi (79)
Thejas Nair (70)
Thejas M Nair (65)
Mridul Muralidharan (61)
liyunzhang_intel (51)
Ashutosh Chauhan (42)
pi song (41)
Gianmarco De Francisci Mo...(39)
Koji Noguchi (39)
Jeff Zhang (37)
Pradeep Gollakota (36)
Cheolsoo Park (35)
Ruslan Al-Fakikh (35)
Dmitriy V. Ryaboy (34)
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB