Results 1 to 10 of 30 (0.236s).
Is it possible to flush memtable in one virtual center? - Cassandra - [mail # user]
...We have one ring and two virtual data centers in our Cassandra cluster; one is for Real-Time and the other is for analytics. My questions are: 1. Are there memtables in Analytics ...
   Author: Benyi Wang, 2014-12-15, 23:22
HTTP 500 Error for SparkUI in YARN Cluster mode - Spark - [mail # user]
...I got this error when I click "Track URL: ApplicationMaster" when I run a Spark job in YARN cluster mode. I found this JIRA, https://issues.apache.org/jira/browse/YARN-800, but I could not get ...
   Author: Benyi Wang, 2014-12-14, 19:29
What happens at server side when using SSTableLoader - Cassandra - [mail # user]
...Is there a page explaining what happens at server side when using SSTableLoader? I'm seeking the answers to the following questions: 1. What about the existing data in the table?...
   Author: Benyi Wang, 2014-12-01, 21:59
[SQOOP-1714] DateSplitter makes wrong splits - Sqoop - [issue]
...If the split-by column is a Date type, Sqoop will send a query to read Min(Date) and Max(Date), those two values are passed to DateSplitter. DateSplitter converts those values into long, and...
http://issues.apache.org/jira/browse/SQOOP-1714    Author: Benyi Wang, 2014-11-12, 05:20
Custom persist or cache of RDD? - Spark - [mail # user]
...When I have a multi-step process flow like this: A -> B -> C -> D -> E -> F. I need to store B and D's results into parquet files: B.saveAsParquetFile, D.saveAsParquetFile. If I don't ...
   Author: Benyi Wang, 2014-11-10, 19:35
Re: Best practice for join - Spark - [mail # user]
...I'm using spark-1.0.0 in CDH 5.1.0. The big problem is SparkSQL doesn't support Hash join in this version. On Tue, Nov 4, 2014 at 10:54 PM, Akhil Das wrote: ...
   Author: Benyi Wang, 2014-11-05, 07:43
What's wrong with my settings about shuffle/storage.memoryFraction - Spark - [mail # user]
...I don't need to cache RDDs in my Spark application, but there is a big shuffle in the data processing. I can always find Shuffle spill (memory) and Shuffle spill (disk). I'm wondering if I can...
   Author: Benyi Wang, 2014-11-04, 19:24
[IMPALA-402] Dynamic partition expr involving rand() is evaluated only once - Impala - [issue]
...I found two problems: "Insert overwrite table" doesn't clean up the directory (external table). $ hadoop fs -ls -R /user/benyiw/tmp_abc; drwxr-xr-x - impala supergroup ...
http://issues.cloudera.org/browse/IMPALA-402    Author: Benyi Wang, 2014-10-08, 18:59
How to make Spark-sql join using HashJoin - Spark - [mail # user]
...I'm using CDH 5.1.0 with Spark-1.0.0. There is spark-sql-1.0.0 in Cloudera's Maven repository. After putting it into the classpath, I can use spark-sql in my application. One issue is that I coul...
   Author: Benyi Wang, 2014-10-06, 23:21
Spark Language Integrated SQL for join on expression - Spark - [mail # user]
...scala> user res19: org.apache.spark.sql.SchemaRDD = SchemaRDD[0] at RDD at SchemaRDD.scala:98 == Query Plan == ParquetTableScan [id#0,name#1], (ParquetRelation /user/hive/warehouse/user), None...
   Author: Benyi Wang, 2014-09-29, 22:40