Search Hadoop -
[PIG-4716] Add support for global PIG_OPTS configuration for Pig e2e
...It helps if you want to run the whole e2e suite with different parameters, e.g. a smaller heap size to run more tests in parallel, or turning on non-default settings like pig.exec.mapPartAgg, etc....
, 2015-10-27, 06:32
[PIG-4708] Upgrade joda-time to 2.8
... Folks writing UDFs are using more recent versions of joda-time and that conflicts with the one bundled with Pig. Would be good to upgrade to 2.8.2 which is the latest one. I als...
, 2015-10-26, 21:29
[PIG-4710] Consider moving from joda-time to JSR-310 implemented in Java 8
... Once we officially drop support for JDK 7, we should give thought to getting rid of the joda-time dependency in Pig and switching to JSR-310 implemented in Java 8. This might not...
, 2015-10-22, 20:12
[PIG-4702] Load once for sampling and partitioning in order by for certain LoadFuncs
... For HBase and Accumulo, it will be more efficient on IO to have the data written to disk instead of reading from them again....
, 2015-10-15, 18:47
[PIG-4701] Set alias and feature on all vertices
... While working on PIG-4699, saw that alias, alias location and feature are not set on some vertices. Need to track those down (e2e test log is an easy place to find them all) and have those...
, 2015-10-14, 17:48
Re: Dependency version on Kryo
- [mail # dev]
...That should be fine. We wanted to get rid of the kryo dependency in ORC and use the shaded one that Hive uses. But that is in hive-exec jar which is huge and has too many other jars packed in ...
, 2015-10-14, 17:16
[PIG-4670] Embedded Python scripts still parse line by line
...PIG-3204 fixed pig script parsing to parse in batches instead of line by line. But the fix in BoundScript is not right and it is still parsing line by line. That makes parsing take a long time...
, 2015-10-13, 03:34
[PIG-4554] Compress pig.script before encoding
... Currently we truncate the pig script (maxScriptSize = 10240), base64 encode it and store it in the config. We should remove the truncation and store the full script by compressing and then...
, 2015-10-09, 11:39
[PIG-4420] Support for map side cross similar to replicate join
... Our CROSS implementation is very costly. Recently had a case where a user was doing a CROSS of 30 million records against 3K records and it caused a lot of disk error exceptions du...
, 2015-10-08, 19:47
[PIG-4649] [Pig on Tez] Union followed by HCatStorer misses some data
...Script to reproduce:
A = LOAD 'data01.txt' USING PigStorage() as (id:chararray, message:chararray);
B = LOAD 'data02.txt' USING PigStorage() as (id:chararray, message:chararray);
C = UNION A, B...
, 2015-10-06, 21:11