Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 81 to 90 from 267 (0.072s).
Loading phrases to help you
refine your search...
[PIG-3849] Optimize group by followed by join on the same key - Pig - [issue]
... This can be done in one vertex with multiple inputs instead of having an extra vertex to do the join. i.e Currently Vertex 1 (load relation1) > Vertex 2 (group by) -> Vertex 4 (j...
http://issues.apache.org/jira/browse/PIG-3849    Author: Rohini Palaniswamy, 2014-05-30, 20:16
[PIG-3850] Optimize join followed by order by using same key - Pig - [issue]
...Possible optimizations:    1) If it is a skewed join, then we can combine ordering into it instead of doing a additional orderby as we skewed join already involves sampling.&n...
http://issues.apache.org/jira/browse/PIG-3850    Author: Rohini Palaniswamy, 2014-05-30, 20:16
[PIG-3852] Remove SecurityHelper class for Tez and use Tez helpers instead - Pig - [issue]
...TEZ jira have been created to support mapreduce.job.hdfs-servers and mapreduce.job.credentials.binary. So we can get rid of SecurityHelper.java written for Pig on Tez....
http://issues.apache.org/jira/browse/PIG-3852    Author: Rohini Palaniswamy, 2014-05-30, 20:15
[PIG-3856] UnionOptimizer in Tez should optimize the case of replicated join - Pig - [issue]
...Replicate join input that was broadcast to union vertex now needs to be broadcast to all the union predecessors. So we need to Create edges from the Replicate join input to all the union pre...
http://issues.apache.org/jira/browse/PIG-3856    Author: Rohini Palaniswamy, 2014-05-30, 20:15
[expand - 1 more] - Re: Review Request 22058: Merge Tez branch into trunk - Pig - [mail # dev]
...I also did verify the difference between tez branch and trunk after applying this patch using DeltaWalker. Only build.xml was different (Tez branch had tez exectype and hadoop23 as default w...
   Author: Rohini Palaniswamy, 2014-05-30, 17:56
[PIG-3891] FileBasedOutputSizeReader does not calculate size of files in sub-directories - Pig - [issue]
...FileBasedOutputSizeReader only includes files in the top level output directory. So if files are stored under subdirectories (For eg: MultiStorage), it does not have the bytes written correc...
http://issues.apache.org/jira/browse/PIG-3891    Author: Rohini Palaniswamy, 2014-05-30, 01:53
Re: Review Request 21979: PIG-3955 Remove url.openStream() file descriptor leak from JCC - Pig - [mail # dev]
...I don't see FileInputStream implement reset(). Does it work?- RohiniThis is an automatically generated e-mail. To reply, visit:https://reviews.apache.org/r/21979/#review44125On May 28, 2014,...
   Author: Rohini Palaniswamy, 2014-05-28, 16:55
[PIG-3456] Reduce threadlocal conf access in backend for each record - Pig - [issue]
...Noticed few things while browsing code1) DefaultTuple has a protected boolean isNull = false; which is never used. Removing this gives ~3-5% improvement for big jobs2) Config checking with T...
http://issues.apache.org/jira/browse/PIG-3456    Author: Rohini Palaniswamy, 2014-05-27, 23:18
Re: Sampling in operations like Order by - Pig - [mail # dev]
...If there is just one reducer there is no need for sampling (PIG-2784), butwhen there is more than one reducer in order by you need to sample the dataand determine the partition ranges so tha...
   Author: Rohini Palaniswamy, 2014-05-22, 18:33
[PIG-2672] Optimize the use of DistributedCache - Pig - [issue]
...Pig currently copies jar files to a temporary location in hdfs and then adds them to DistributedCache for each job launched. This is inefficient in terms of Space - The jars are distributed...
http://issues.apache.org/jira/browse/PIG-2672    Author: Rohini Palaniswamy, 2014-05-20, 19:14
Pig (267)
Tez (38)
Hive (6)
Bigtop (1)
HBase (1)
HDFS (1)
issue (136)
mail # dev (108)
mail # user (23)
last 7 days (4)
last 30 days (11)
last 90 days (53)
last 6 months (122)
last 9 months (267)
Daniel Dai (400)
Dmitriy Ryaboy (346)
Alan Gates (334)
Cheolsoo Park (310)
Jonathan Coveney (237)
Rohini Palaniswamy (187)
Russell Jurney (176)
Bill Graham (131)
Olga Natkovich (131)
Prashant Kommireddi (107)
Aniket Mokashi (87)
Julien Le Dem (84)
Thejas Nair (71)
Thejas M Nair (63)
Mridul Muralidharan (61)
Ashutosh Chauhan (42)
pi song (41)
Gianmarco De Francisci Mo...(39)
Koji Noguchi (38)
"Cheolsoo Park (35)
Ruslan Al-Fakikh (35)
Dmitriy V. Ryaboy (34)
Pradeep Gollakota (34)
Jeff Zhang (32)
Santhosh Srinivasan (29)