Edward J. Yoon
Arun C Murthy
Vinod Kumar Vavilapalli
Jarek Jarcec Cecho
Matthias J. Sax
last 7 days (1)
last 30 days (1)
last 90 days (4)
last 6 months (5)
last 9 months (23)
newest on top
oldest on top
. Results from
Loading phrases to help you
refine your search...
[SPARK-19635] Feature parity for Chi-square hypothesis testing in MLlib
...This ticket tracks porting the functionality of spark.mllib.Statistics.chiSqTest over to spark.ml.Here is a design doc:https://docs.google.com/document/d/1ELVpGV3EBjc2KQPLN9_9_Ge9gWchPZ6SGtD...
, 2018-03-15, 01:55
[SPARK-21866] SPIP: Image support in Spark
...Background and motivationAs Apache Spark is being used more and more in the industry, some new use cases are emerging for different data formats beyond the traditional SQL types or the numer...
, 2018-02-12, 08:24
[SPARK-22666] Spark reader source for image format
...The current API for the new image format is implemented as a standalone feature, in order to make it reside within the mllib package. As discussed in SPARK-21866, users should be able to loa...
, 2018-02-12, 07:54
[SPARK-19634] Feature parity for descriptive statistics in MLlib
...This ticket tracks porting the functionality of spark.mllib.MultivariateOnlineSummarizer over to spark.ml.A design has been discussed in SPARK-19208 . Here is a design doc:https://docs.googl...
, 2018-01-29, 11:50
[SPARK-20077] Documentation for ml.stats.Correlation
...Now that (Pearson) correlations are available in spark.ml, we need to write some documentation to go along with this feature. It can simply be looking at the unit tests for example right now...
, 2017-11-06, 08:25
[SPARK-12210] Small example that shows how to integrate spark.mllib with spark.ml
...Since we are missing a number of algorithms in spark.ml such as clustering or LDA, we should have a small example that shows the recommended way to go back and forth between spark.ml and spa...
, 2017-04-08, 09:57
[SPARK-20076] Python interface for ml.stats.Correlation
...The (Pearson) statistics have been exposed with a Dataframe interface as part of SPARK-19636 in the Scala interface. We should now make these available in Python....
, 2017-04-07, 09:00
[SPARK-19636] Feature parity for correlation statistics in MLlib
...This ticket tracks porting the functionality of spark.mllib.Statistics.corr() over to spark.ml.Here is a design doc:https://docs.google.com/document/d/1ELVpGV3EBjc2KQPLN9_9_Ge9gWchPZ6SGtDW5t...
, 2017-03-24, 01:43
[SPARK-14567] Add instrumentation logs to MLlib training algorithms
...In order to debug performance issues when training mllib algorithms,it is useful to log some metrics about the training dataset, the training parameters, etc.This ticket is an umbrella to ad...
, 2017-01-17, 23:40
[SPARK-16258] Automatically append the grouping keys in SparkR's gapply
...While working on the group apply function for python , we found it easier to depart from SparkR's gapply function in the following way: the keys are appended by default to the spa...
, 2016-08-01, 01:05
Apache Lucene, Apache Solr and all other Apache Software Foundation project and their respective logos are trademarks of the Apache Software Foundation.
Elasticsearch, Kibana, Logstash, and Beats are trademarks of Elasticsearch BV, registered in the U.S. and in other countries. This site and Sematext Group is in no way affiliated with Elasticsearch BV.
Service operated by