Search Hadoop -
Jarek Jarcec Cecho
Jaimin D Jetly
Edward J. Yoon
Joseph K. Bradley
Vinod Kumar Vavilapalli
mail # dev
mail # user
last 7 days (0)
last 30 days (0)
last 90 days (1)
last 6 months (6)
last 9 months (678)
Solr & Elasticsearch trainings in New York & San Francisco
San Francisco - Oct 4-6
New York - Oct 10-12
San Francisco - Oct 4-7
New York - Oct 10-12
and all its subprojects:
newest on top
oldest on top
. Results from
Loading phrases to help you
refine your search...
[SPARK-3937] Unsafe memory access inside of Snappy library
...This was observed on master between Spark 1.1 and 1.2. Unfortunately I don't have much information about this other than the stack trace. However, it was concerning enough I figured I should...
, 2016-07-20, 14:42
[SPARK-4820] Spark build encounters "File name too long" on some encrypted filesystems
...This was reported by Luchesar Cekov on github along with a proposed fix. The fix has some potential downstream issues (it will modify the classnames) so until we understand better how many u...
, 2016-06-26, 12:21
[SPARK-1272] Don't fail job if some local directories are buggy
...If Spark cannot create shuffle directories inside of a local directory it might make sense to just log an error and continue, provided that at least one valid shuffle directory exists. Other...
, 2016-05-23, 04:06
[SPARK-1239] Improve fetching of map output statuses
...Instead we should modify the way we fetch map output statuses to take both a mapper and a reducer - or we should just piggyback the statuses on each task....
, 2016-05-07, 02:31
[SPARK-10620] Look into whether accumulator mechanism can replace TaskMetrics
...This task is simply to explore whether the internal representation used by TaskMetrics could be performed by using accumulators rather than having two separate mechanisms. Note that we need ...
, 2016-05-01, 23:40
[SPARK-1844] Support maven-style dependency resolution in sbt build
...[Currently this is a brainstorm/wish - not sure it's possible]Ivy/sbt and maven use fundamentally different strategies when transitive dependencies conflict (i.e. when we have tw...
, 2016-04-11, 23:39
[SPARK-2208] local metrics tests can fail on fast machines
...I'm temporarily disabling this check. I think the issue is that on fast machines the fetch wait time can actually be zero, even across all tasks.We should see if we can write this in a diffe...
, 2016-03-26, 13:12
[SPARK-5158] Allow for keytab-based HDFS security in Standalone mode
...There have been a handful of patches for allowing access to Kerberized HDFS clusters in standalone mode. The main reason we haven't accepted these patches have been that they rely on insecur...
, 2016-02-16, 21:14
[PARQUET-118] Provide option to use on-heap buffers for Snappy compression/decompression
...The current code uses direct off-heap buffers for decompression. If many decompressors are instantiated across multiple threads, and/or the objects being decompressed are large, this can lea...
, 2016-02-05, 00:45
[SPARK-1079] EC2 scripts should allow mounting as XFS or EXT4
...These offer much better performance when running benchmarks: I've done a hacked together implementation here, but it would be better if you could officially give a filesystem as an argument ...
, 2016-01-27, 10:10
Apache Lucene, Apache Solr and all other Apache Software Foundation projects and their respective logos are trademarks of the Apache Software Foundation.
Elasticsearch, Kibana, Logstash, and Beats are trademarks of Elasticsearch BV, registered in the U.S. and in other countries. This site and Sematext Group is in no way affiliated with Elasticsearch BV.
Service operated by