Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 1 to 10 from 104 (0.21s).
Loading phrases to help you
refine your search...
[HIVE-3108] SELECT count(DISTINCT col) ... returns 0 if "col" is a partition column - Hive - [issue]
...Suppose "stocks" is a managed OR external table, partitioned by "exchange" and "symbol". "count(DISTINCT x)" returns 0 for either "exchange", "symbol", or both:hive> SELECT count(DISTINCT...
http://issues.apache.org/jira/browse/HIVE-3108    Author: Dean Wampler, 2014-12-05, 01:16
[SPARK-4564] SchemaRDD.groupBy(groupingExprs)(aggregateExprs) doesn't return the groupingExprs as part of the output schema - Spark - [issue]
...In the following example, I would expect the "grouped" schema to contain two fields, the String name and the Long count, but it only contains the Long count.// Assumes val sc = new SparkCont...
http://issues.apache.org/jira/browse/SPARK-4564    Author: Dean Wampler, 2014-11-23, 16:52
Re: How spark and hive integrate in long term? - Spark - [mail # dev]
...I can't comment on plans for Spark SQL's support for Hive, but severalcompanies are porting Hive itself onto Spark:http://blog.cloudera.com/blog/2014/11/apache-hive-on-apache-spark-the-first...
   Author: Dean Wampler, 2014-11-21, 23:15
Re: best IDE for scala + spark development? - Spark - [mail # dev]
...For what it's worth, I use Sublime Text + the SBT console for everything. Ican live without the extra IDE features.However, if you like an IDE, the Eclipse "Scala IDE" 4.0 RC1 is a bigimprov...
   Author: Dean Wampler, 2014-10-27, 13:16
[expand - 1 more] - Re: scala Vector vs mllib Vector - Spark - [mail # user]
...Spark isolates each task, so I would use the MLlib vector. I didn't mentionthis, but it also integrates with Breeze, a Scala mathematics library thatyou might find useful.deanDean Wampler, P...
   Author: Dean Wampler, 2014-10-04, 13:54
Re: SparkSQL Thriftserver in Mesos - Spark - [mail # user]
...The Mesos install guide says this:"To use Mesos from Spark, you need a Spark binary package available in aplace accessible by Mesos, and a Spark driver program configured to connectto Mesos....
   Author: Dean Wampler, 2014-09-22, 19:41
[expand - 1 more] - Re: Dependency Problem with Spark / ScalaTest / SBT - Spark - [mail # user]
...Sorry, I meant any *other* SBT files.However, what happens if you remove the line:        exclude("org.eclipse.jetty.orbit", "javax.servlet")deanDean Wampler, Ph.D.A...
   Author: Dean Wampler, 2014-09-14, 16:57
[expand - 2 more] - Re: Issue with Spark on EC2 using spark-ec2 script - Spark - [mail # user]
...It looked like you were running in standalone mode (master set tolocal[4]). That's how I ran it.Dean Wampler, Ph.D.Author: Programming Scala, 2nd Edition (O'Reilly)Typesafe @deanwampler...
   Author: Dean Wampler, 2014-08-01, 13:33
Re: Recommended pipeline automation tool? Oozie? - Spark - [mail # user]
...If you're already using Scala for Spark programming and you hate Oozie XMLas much as I do ;), you might check out Scoozie, a Scala DSL for Oozie:https://github.com/klout/scoozieOn Thu, Jul 1...
   Author: Dean Wampler, 2014-07-15, 21:34
Re: Spark vs Google cloud dataflow - Spark - [mail # user]
...... and to be clear on the point, Summingbird is not limited to MapReduce.It abstracts over Scalding (which abstracts over Cascading, which is beingmoved from MR to Spark) and over Storm for...
   Author: Dean Wampler, 2014-06-27, 12:41
Sort:
project
Hive (94)
Spark (10)
type
mail # user (98)
issue (3)
mail # dev (3)
date
last 7 days (0)
last 30 days (3)
last 90 days (6)
last 6 months (10)
last 9 months (104)
author
Ted Yu (1817)
Harsh J (1301)
Jun Rao (1013)
Todd Lipcon (994)
Stack (986)
Andrew Purtell (871)
Jonathan Ellis (853)
stack (756)
Jean-Daniel Cryans (750)
Jarek Jarcec Cecho (747)
Yusaku Sako (741)
Eric Newton (705)
Jonathan Hsieh (681)
Roman Shaposhnik (677)
Hitesh Shah (674)
Josh Elser (664)
Steve Loughran (653)
Namit Jain (648)
Siddharth Seth (640)
Brock Noland (629)
Owen O'Malley (623)
Hyunsik Choi (578)
Neha Narkhede (564)
Arun C Murthy (548)
Eli Collins (545)
Dean Wampler
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB