Welcome to our new Pig PMC member Xuefu Zhang
- [mail # dev]
...It is my pleasure to announce that Xuefu Zhang is our newest addition to the Pig PMC. Xuefu is a long time committer of Pig and has been actively involved in driving the Pig on Spark effort fo...
2016-02-24, 21:30
Re: Set configuration properties for a Pig script within the script.
- [mail # user]
...Pig's HBaseStorage will automatically pick the values from hbase-site.xml if it is in the classpath and store to that HBase instance. On Mon, Feb 22, 2016 at 10:42 AM, Parth Sawant wrote: ...
2016-02-23, 23:27
[PIG-4806] UDFContext can be reset in the middle during Tez input and output initialization
...We reinitialize UDFContext ThreadLocal itself in PigProcessor.initialize(). PigProcessor.initialize() is run in parallel with threads that do MRInput.initialize() and MROutput.initialize(). ...
2016-02-16, 18:19
[PIG-4759] Fix Classresolution_1 e2e failure
... We had left it as a known issue to be fixed later as that was a very odd and uncommon usage put in just for the particular testcase - store into a file with one StoreFunc, but read bac...
2016-02-12, 15:38
[PIG-4801] Provide backward compatibility with mapreduce mapred.task settings
... Some users use settings like mapred.task.id in their UDFs. It is not available in Tez and the job breaks....
2016-02-10, 21:37
[PIG-4802] Autoparallelism should estimate less when there is combiner
... When there is a combiner, it reduces records by a lot. Auto-parallelism should take that into account. Also currently we multiply by a factor of 10 if there is any FLATTEN. User...
2016-02-10, 21:23
[PIG-4805] Auto-parallelism estimation should use split sub plan specific to the successor
...In PIG-4802, we determine different parallelism factors for different successors (edges). For example, if we have two successors, one with a combine plan and the other without, we want to compute lesser p...
2016-02-10, 20:21
[PIG-4800] EvalFunc.getCacheFiles() fails for different namenode
...Caused by: java.io.FileNotFoundException: File does not exist: /tmp/input.txt at org.apache.hadoop.hdfs.DistributedFileSystem$22.doCall(DistributedFileSystem.java:1309) at org.apache.hadoop....
2016-02-10, 18:18
[PIG-4692] MultiQuery and Union optimizer in Tez should terminate more than one output in scalar
...PIG-3957 makes POValueOutputTez terminate early if it is Scalar output and has more than one row in TezCompiler. Same should be done in MultiQuery and Union optimizer after the plan change....
2016-02-04, 04:17
[PIG-4782] OutOfMemoryError: GC overhead limit exceeded with POPartialAgg
... In some cases, even though a spill is triggered, the main thread is still executing some user UDF that constructs a DataBag requiring a lot of memory. Since we block on spill in POPa...
2016-02-01, 16:26