Results 1 to 10 of 32 (0.134s).
Re: hive auto join conversion - Hive - [mail # user]
...Yeah, I was trying the same thing, though a little bit ugly. My query needs to LJ/J with multiple tables. When there are 1 or 2 LJ/Js, rewriting works but when there are > 3 tables, the got...
   Author: Chen Song, 2014-08-12, 15:38
Re: saveAsTextFiles file not found exception - Spark - [mail # user]
...Thanks for putting this together, Andrew. On Tue, Aug 12, 2014 at 2:11 AM, Andrew Ash wrote: Chen Song ...
   Author: Chen Song, 2014-08-12, 15:26
Re: increase parallelism of reading from hdfs - Spark - [mail # user]
...Thanks Paul. I will give it a try. On Mon, Aug 11, 2014 at 1:11 PM, Paul Hamilton wrote: Chen Song ...
   Author: Chen Song, 2014-08-11, 19:55
spark streaming multiple file output paths - Spark - [mail # user]
...In Spark Streaming, is there a way to write output to different paths based on the partition key? The saveAsTextFiles method will write output in the same directory. For example, if the partiti...
   Author: Chen Song, 2014-08-07, 15:39
question on HIVE-5891 - Hive - [mail # user]
...I am using the cdh5 distribution and it doesn't look like this jira https://issues.apache.org/jira/browse/HIVE-5891 is backported into cdh 5.1.0. Is there a workaround to modify the query that is s...
   Author: Chen Song, 2014-08-04, 15:00
yarn container memory setting - Hadoop - [mail # user]
...I read a bit of the documentation on yarn memory tuning and found that it is suggested to set mapreduce.map.java.opts = 0.8 * mapreduce.map.memory.mb. I am wondering why it is 0.8, but not 0.9 or high...
   Author: Chen Song, 2014-07-21, 20:45
analyze job link for mr job in yarn - Hadoop - [mail # user]
...In MRv1, there is a link "Analyze this job" in the job history page, which I find very useful. In Yarn/MRv2, I don't find such a link in the resource manager or history server. I found there is a tick...
   Author: Chen Song, 2014-07-18, 15:41
Re: Distribute data from Kafka evenly on cluster - Spark - [mail # user]
...Speaking of this, I have another related question. In my spark streaming job, I set up multiple consumers to receive data from Kafka, with each worker from one partition. Initially, Spark is in...
   Author: Chen Song, 2014-07-18, 14:43
Re: spark streaming rate limiting from kafka - Spark - [mail # user]
...Thanks Tathagata, That would be awesome if Spark streaming can support receiving rate in general. I tried to explore the link you provided but could not find any specific JIRA related to this? ...
   Author: Chen Song, 2014-07-18, 14:20
Re: how to pass extra Java opts to workers for spark streaming jobs - Spark - [mail # user]
...Thanks Andrew, I tried and it works. On Fri, Jul 18, 2014 at 12:53 AM, Andrew Or wrote: Chen Song ...
   Author: Chen Song, 2014-07-18, 14:07