Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 1 to 10 from 72 (0.295s).
Loading phrases to help you
refine your search...
Re: Reading from CSV file with spark-csv_2.10 - Spark - [mail # user]
...Hi Florin,I might be wrong but timestamp looks like a keyword in SQL that the enginegets confused with. If it is a column name of your table, you might want tochange it. (https://cwiki.apach...
   Author: Jerry Lam, 2015-02-05, 15:43
[expand - 1 more] - Re: Union in Spark - Spark - [mail # user]
...Hi Deep,How do you know the cluster is not responsive because of "Union"?Did you check the spark web console?Best Regards,JerryOn Mon, Feb 2, 2015 at 1:21 AM, Deep Pradhan wrote: ...
   Author: Jerry Lam, 2015-02-02, 06:24
Re: Spark Team - Paco Nathan said that your team can help - Spark - [mail # user]
...Hi Sudipta,I would also like to suggest to ask this question in Cloudera mailing listsince you have HDFS, MAPREDUCE and Yarn requirements. Spark can work withHDFS and YARN but it is more lik...
   Author: Jerry Lam, 2015-01-22, 15:23
Re: Why Parquet Predicate Pushdown doesn't work? - Spark - [mail # user]
...Hi guys,Does this issue affect 1.2.0 only or all previous releases as well?Best Regards,JerryOn Thu, Jan 8, 2015 at 1:40 AM, Xuelin Cao  wrote: ...
   Author: Jerry Lam, 2015-01-19, 19:43
Re: IndexedRDD - Spark - [mail # user]
...Hi guys,I'm interested in the IndexedRDD too.How many rows in the big table that matches the small table in every run?If the number of rows stay constant, then I think Jem wants the runtime ...
   Author: Jerry Lam, 2015-01-13, 17:06
[expand - 1 more] - Re: Spark SQL: Storing AVRO Schema in Parquet - Spark - [mail # user]
...Hi Raghavendra,This makes a lot of sense. Thank you.The problem is that I'm using Spark SQL right now to generate the parquetfile.What I think I need to do is to use Spark directly and trans...
   Author: Jerry Lam, 2015-01-09, 17:03
Spark or Tachyon: capture data lineage - Spark - [mail # user]
...Hi spark developers,I was thinking it would be nice to extract the data lineage informationfrom a data processing pipeline. I assume that spark/tachyon keeps thisinformation somewhere. For i...
   Author: Jerry Lam, 2015-01-02, 20:26
SparkSQL: CREATE EXTERNAL TABLE with a SchemaRDD - Spark - [mail # user]
...Hi spark users,I'm trying to create external table using HiveContext after creating aschemaRDD and saving the RDD into a parquet file on hdfs.I would like to use the schema in the schemaRDD ...
   Author: Jerry Lam, 2014-12-23, 20:26
[expand - 1 more] - Re: UNION two RDDs - Spark - [mail # user]
...Hi Sean and Madhu,Thank you for the explanation. I really appreciate it.Best Regards,JerryOn Fri, Dec 19, 2014 at 4:50 AM, Sean Owen  wrote: ...
   Author: Jerry Lam, 2014-12-22, 21:48
[expand - 2 more] - Re: Spark SQL 1.1.1 reading LZO compressed json files - Spark - [mail # user]
...Hi Michael,This is what I did. I was thinking if there is a more efficient way toaccomplish this.I was doing a very simple benchmark: Convert lzo compressed json files toparquet files using ...
   Author: Jerry Lam, 2014-12-17, 19:02
Sort:
project
HBase (35)
Spark (23)
Pig (11)
MapReduce (3)
type
mail # user (70)
issue (2)
date
last 7 days (0)
last 30 days (0)
last 90 days (2)
last 6 months (12)
last 9 months (72)
author
Ted Yu (2019)
Harsh J (1318)
Jun Rao (1098)
Todd Lipcon (1014)
Andrew Purtell (1010)
Stack (1001)
GitHub Import (895)
Jonathan Ellis (862)
Josh Elser (861)
stack (827)
Jarek Jarcec Cecho (814)
Yusaku Sako (793)
Hitesh Shah (788)
Siddharth Seth (773)
Jean-Daniel Cryans (754)
Eric Newton (736)
Brock Noland (724)
Steve Loughran (717)
Jonathan Hsieh (702)
James Taylor (688)
Roman Shaposhnik (687)
Namit Jain (648)
Hyunsik Choi (646)
Owen O'Malley (618)
Bikas Saha (583)
Jerry Lam
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB