Home | About | Sematext search-lucene.com search-hadoop.com
Results 1 to 10 of 70 (0.145s).
Re: Spark Team - Paco Nathan said that your team can help - Spark - [mail # user]
...Hi Sudipta, I would also like to suggest to ask this question in Cloudera mailing list since you have HDFS, MAPREDUCE and Yarn requirements. Spark can work with HDFS and YARN but it is more lik...
   Author: Jerry Lam, 2015-01-22, 15:23
Re: Why Parquet Predicate Pushdown doesn't work? - Spark - [mail # user]
...Hi guys, Does this issue affect 1.2.0 only or all previous releases as well? Best Regards, Jerry. On Thu, Jan 8, 2015 at 1:40 AM, Xuelin Cao wrote: ...
   Author: Jerry Lam, 2015-01-19, 19:43
Re: IndexedRDD - Spark - [mail # user]
...Hi guys, I'm interested in the IndexedRDD too. How many rows in the big table that matches the small table in every run? If the number of rows stay constant, then I think Jem wants the runtime ...
   Author: Jerry Lam, 2015-01-13, 17:06
Re: Spark SQL: Storing AVRO Schema in Parquet - Spark - [mail # user]
...Hi Raghavendra, This makes a lot of sense. Thank you. The problem is that I'm using Spark SQL right now to generate the parquet file. What I think I need to do is to use Spark directly and trans...
   Author: Jerry Lam, 2015-01-09, 17:03
Spark or Tachyon: capture data lineage - Spark - [mail # user]
...Hi spark developers, I was thinking it would be nice to extract the data lineage information from a data processing pipeline. I assume that spark/tachyon keeps this information somewhere. For i...
   Author: Jerry Lam, 2015-01-02, 20:26
SparkSQL: CREATE EXTERNAL TABLE with a SchemaRDD - Spark - [mail # user]
...Hi spark users, I'm trying to create external table using HiveContext after creating a schemaRDD and saving the RDD into a parquet file on hdfs. I would like to use the schema in the schemaRDD ...
   Author: Jerry Lam, 2014-12-23, 20:26
Re: UNION two RDDs - Spark - [mail # user]
...Hi Sean and Madhu, Thank you for the explanation. I really appreciate it. Best Regards, Jerry. On Fri, Dec 19, 2014 at 4:50 AM, Sean Owen wrote: ...
   Author: Jerry Lam, 2014-12-22, 21:48
Re: Spark SQL 1.1.1 reading LZO compressed json files - Spark - [mail # user]
...Hi Michael, This is what I did. I was thinking if there is a more efficient way to accomplish this. I was doing a very simple benchmark: Convert lzo compressed json files to parquet files using ...
   Author: Jerry Lam, 2014-12-17, 19:02
Re: Accessing rows of a row in Spark - Spark - [mail # user]
...Hi Mark, Thank you for helping out. The items I got back from Spark SQL has the type information as follows: scala> items res16: org.apache.spark.sql.Row = [WrappedArray([1,orange],[2,apple])...
   Author: Jerry Lam, 2014-12-15, 19:49
Filtering nested data using Spark SQL - Spark - [mail # user]
...Hi spark users, I'm trying to filter a json file that has the following schema using SparkSQL: root |-- user_id: string (nullable = true) |-- item: array (nullable = true) | ...
   Author: Jerry Lam, 2014-12-10, 23:12