Home | About | Sematext search-lucene.com search-hadoop.com
Results 1 to 10 of 45 (0.709s).
[SPARK-6710] Wrong initial bias in GraphX SVDPlusPlus - Spark - [issue]
...In the initialization portion of GraphX SVDPlusPlus, the initialization of biases appears to be incorrect. Specifically, in line https://github.com/apache/spark/blob/master/graphx/src/main/s...
http://issues.apache.org/jira/browse/SPARK-6710    Author: Michael Malak, 2015-04-11, 08:03
Re: How to restrict foreach on a streaming RDD only once upon receiver completion - Spark - [mail # user]
...You could have your receiver send a "magic value" when it is done. I discuss this Spark Streaming pattern in my presentation "Spark Gotchas and Anti-Patterns". In the PDF version, it's slide...
   Author: Michael Malak, 2015-04-06, 20:15
Wrong initial bias in GraphX SVDPlusPlus? - Spark - [mail # dev]
...I believe that in the initialization portion of GraphX SVDPlusPlus, the initialization of biases is incorrect. Specifically, in line https://github.com/apache/spark/blob/master/graphx/src/ma...
   Author: Michael Malak, 2015-04-03, 15:43
Spark GraphX In Action on documentation page? - Spark - [mail # user]
...Can my new book, Spark GraphX In Action, which is currently in MEAP http://manning.com/malak/, be added to https://spark.apache.org/documentation.html and, if appropriate, to https://spark.a...
   Author: Michael Malak, 2015-03-24, 18:52
textFile() ordering and header rows - Spark - [mail # dev]
...Since RDDs are generally unordered, aren't things like textFile().first() not guaranteed to return the first row (such as looking for a header row)? If so, doesn't that make the example in h...
   Author: Michael Malak, 2015-02-23, 02:09
[SPARK-5343] ShortestPaths traverses backwards - Spark - [issue]
...GraphX ShortestPaths seems to be following edges backwards instead of forwards: import org.apache.spark.graphx._ val g = Graph(sc.makeRDD(Array((1L,""), (2L,""), (3L,""))), sc.makeRDD(Array(Ed...
http://issues.apache.org/jira/browse/SPARK-5343    Author: Michael Malak, 2015-02-10, 23:02
Word2Vec IndexedRDD - Spark - [mail # dev]
...1. Is IndexedRDD planned for 1.3? https://issues.apache.org/jira/browse/SPARK-2365 2. Once IndexedRDD is in, is it planned to convert Word2VecModel to it from its current Map[String,Array[Flo...
   Author: Michael Malak, 2015-02-02, 02:05
Re: spark challenge: zip with next??? - Spark - [mail # user]
...But isn't foldLeft() overkill for the originally stated use case of max diff of adjacent pairs? Isn't foldLeft() for recursive non-commutative non-associative accumulation as opposed to an e...
   Author: Michael Malak, 2015-01-30, 16:07
Re: renaming SchemaRDD -> DataFrame - Spark - [mail # dev]
...I personally have no preference DataFrame vs. DataTable, but only wish to lay out the history and etymology simply because I'm into that sort of thing. "Frame" comes from Marvin Minsky's 1970...
   Author: Michael Malak, 2015-01-27, 18:01
Re: GraphX ShortestPaths backwards? - Spark - [mail # dev]
...I created https://issues.apache.org/jira/browse/SPARK-5343 for this. From: Michael Malak To: "[EMAIL PROTECTED]" Cc: Sent: Monday, January 19, 2015 5:09 PM Subject: GraphX ShortestPaths backwa...
   Author: Michael Malak, 2015-01-21, 04:22
project
Spark (30)
Hive (13)
Avro (1)
Pig (1)
type
mail # user (24)
mail # dev (13)
issue (8)
date
last 7 days (0)
last 30 days (3)
last 90 days (9)
last 6 months (16)
last 9 months (45)
author
Michael Malak