Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 191 to 200 from 7483 (0.178s).
Loading phrases to help you
refine your search...
distinct at field level - Pig - [mail # user]
...Hi All,I have use DISTINCT operator for record level filter. I was wondering if there is anything for a field-level? I have data in the following format:Column1 Column2A      ...
   Author: Sameer Tilak, 2014-03-27, 23:10
[PIG-3848] Dynamically switch to replicate join - Pig - [issue]
...   If data sizes are found to be small after filtering then switch to replicate join dynamically if user has not specified "using" clause explicitly. But this required support from...
http://issues.apache.org/jira/browse/PIG-3848    Author: Rohini Palaniswamy, 2014-03-27, 23:09
[PIG-3847] Sort avoidance for group by and join - Pig - [issue]
...Group by and join only require that the records be grouped together by key. It is not necessary for the keys to be sorted. If we can have a Tez Input/Output implementation that does the grou...
http://issues.apache.org/jira/browse/PIG-3847    Author: Rohini Palaniswamy, 2014-03-27, 23:04
[PIG-3631] Improve performance of replicate-join - Pig - [issue]
...Replicated join is implemented in Tez as follows: POFRJoinTez extends POFRJoin. The difference between two is that replication hash table is constructed out of broadcasting edges in Tez inst...
http://issues.apache.org/jira/browse/PIG-3631    Author: Rohini Palaniswamy, 2014-03-27, 22:59
[PIG-3846] Implement automatic reducer parallelism - Pig - [issue]
...Tez has it built-in. We can start with reusing it and then look at customization for better performance....
http://issues.apache.org/jira/browse/PIG-3846    Author: Rohini Palaniswamy, 2014-03-27, 22:55
[PIG-3775] Use unsorted shuffle in Union, Orderby, Skewed Join to improve performance in Tez - Pig - [issue]
...When implementing Pig union, we need to gather data from two or more upstream vertexes without sorting. The vertex itself might consists of several tasks. Same can be done for the partitione...
http://issues.apache.org/jira/browse/PIG-3775    Author: Rohini Palaniswamy, 2014-03-27, 19:47
[PIG-3750] Fix unit tests for MultiQuery and add new ones for Tez - Pig - [issue]
...  Add for both multiquery on and off cases....
http://issues.apache.org/jira/browse/PIG-3750    Author: Rohini Palaniswamy, 2014-03-27, 19:47
Re: Recordings from Pig user meetup at Linkedin, Mar 14 - Pig - [mail # user]
...Thank you Mark, greatly appreciated!JarcecOn Thu, Mar 27, 2014 at 12:41:41PM -0700, Mark Wagner wrote: -----BEGIN PGP SIGNATURE-----Version: GnuPG v1.4.11 (GNU/Linux)iQIcBAEBAgAGBQJTNIA...
   Author: Jarek Jarcec Cecho, 2014-03-27, 19:47
[PIG-3840] Umbrella jira for Pig on Tez Unit Test porting - Pig - [issue]
...Separating out unit test porting jiras from PIG-3446 which is the main jira for Pig on Tez....
http://issues.apache.org/jira/browse/PIG-3840    Author: Rohini Palaniswamy, 2014-03-27, 19:41
[PIG-3839] Umbrella jira for Pig on Tez Performance Improvements - Pig - [issue]
...Separating out performance improvements from PIG-3446 which is the main jira for Pig on Tez....
http://issues.apache.org/jira/browse/PIG-3839    Author: Rohini Palaniswamy, 2014-03-27, 19:41
HBase (16053)
Hive (15035)
Hadoop (14372)
MapReduce (10606)
Ambari (8475)
Pig (7386)
HDFS (6377)
Cassandra (5544)
Kafka (5341)
Bigtop (5196)
Accumulo (4330)
Flume (3743)
Avro (3086)
Zookeeper (2846)
Sqoop (2784)
mail # user (3383)
issue (1560)
source code (1229)
mail # dev (1153)
web site (107)
wiki (45)
javadoc (6)
last 7 days (115)
last 30 days (240)
last 90 days (676)
last 6 months (1020)
last 9 months (6243)
Dmitriy Ryaboy (346)
Alan Gates (333)
Daniel Dai (315)
Cheolsoo Park (242)
Jonathan Coveney (237)
Russell Jurney (173)
Rohini Palaniswamy (136)
Bill Graham (132)
Olga Natkovich (129)
Prashant Kommireddi (107)
Julien Le Dem (84)
Aniket Mokashi (76)
Thejas Nair (69)
Thejas M Nair (63)
Mridul Muralidharan (61)