Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 1 to 10 from 96 (0.276s).
Loading phrases to help you
refine your search...
spark stream + cassandra (execution on event) - Spark - [mail # user]
...Hi .   I want to use spark streaming to read data from cassandra.But in my case I need process data based on event. (not retrieving the dataconstantly from Cassandra).Question:&nbs...
   Author: Oleg Ruchovets, 2014-12-31, 09:46
[expand - 1 more] - Re: spark streaming python + kafka - Spark - [mail # user]
...Wow , that would be great.When do you think it will be GA?ThanksOleg.On Tue, Dec 23, 2014 at 9:47 AM, Davies Liu  wrote: ...
   Author: Oleg Ruchovets, 2014-12-23, 04:56
[expand - 1 more] - Re: pyspark and hdfs file name - Spark - [mail # user]
...Hi Devies.Thank you for the quick answer.I have a code like this:....sc = SparkContext(appName="TAD")lines = sc.textFile(sys.argv[1], 1)result = lines.map(doSplit).groupByKey().map(lambda (k...
   Author: Oleg Ruchovets, 2014-11-14, 08:14
[expand - 2 more] - Re: High Availability hadoop cluster. - Hadoop - [mail # user]
...Great.Thank you for the link.Just to be sure - JN can be installed on data nodes like zookeeper?If we have 2 Name Nodes and 15 Data Nodes - is it correct  to install ZKand JN on datanod...
   Author: Oleg Ruchovets, 2014-11-06, 11:03
[expand - 2 more] - Re: pyspark on yarn - lost executor - Spark - [mail # user]
...Great.  Upgrade helped.Still need some inputs:1) Is there any log files of spark job execution?2) Where can I read about tuning / parameter configuration:For example:what is the me...
   Author: Oleg Ruchovets, 2014-09-18, 09:47
[expand - 1 more] - Re: PySpark on Yarn - how group by data properly - Spark - [mail # user]
...I am expand my data set and executing pyspark on yarn:   I payed attention that only 2 processes processed the data:14210 yarn      20   0 2463m 2.0g 9708 R 100...
   Author: Oleg Ruchovets, 2014-09-16, 12:29
[expand - 6 more] - Re: cassandra + spark / pyspark - Cassandra - [mail # user]
...Thank you Rohit.   I sent the email to you.ThanksOleg.On Thu, Sep 11, 2014 at 10:51 PM, Rohit Rai  wrote: ...
   Author: Oleg Ruchovets, 2014-09-11, 16:12
[expand - 1 more] - Re: multi datacenter replication - Cassandra - [mail # user]
...Thank you very much for the links.  Just to be sure: is this capability available for COMMUNITY ADDITION?ThanksOleg.On Wed, Sep 10, 2014 at 11:49 PM, Alain RODRIGUEZ wrote: ...
   Author: Oleg Ruchovets, 2014-09-10, 16:22
[expand - 1 more] - Re: pyspark and cassandra - Spark - [mail # user]
...Hi ,  I try to evaluate different option of spark + cassandra and I have coupleof additional questions.  My aim is to use cassandra only without hadoop:  1) Is ...
   Author: Oleg Ruchovets, 2014-09-10, 15:32
hardware sizing for cassandra - Cassandra - [mail # user]
...Hi ,   Where can I find the document with best practices about sizing forcassandra deployment?   We have 1000 writes / reads per second. record size 1k.Questions: &n...
   Author: Oleg Ruchovets, 2014-09-09, 18:03
Sort:
project
HBase (39)
Hadoop (19)
Kafka (16)
Spark (13)
Cassandra (3)
HDFS (2)
MapReduce (2)
Zookeeper (2)
type
mail # user (95)
mail # dev (1)
date
last 7 days (0)
last 30 days (0)
last 90 days (1)
last 6 months (4)
last 9 months (96)
author
Ted Yu (1984)
Harsh J (1315)
Jun Rao (1088)
Todd Lipcon (1012)
Stack (999)
Andrew Purtell (973)
GitHub Import (895)
Jonathan Ellis (858)
Josh Elser (823)
stack (818)
Jarek Jarcec Cecho (807)
Yusaku Sako (786)
Hitesh Shah (765)
Jean-Daniel Cryans (753)
Siddharth Seth (742)
Eric Newton (733)
Brock Noland (726)
Jonathan Hsieh (700)
Steve Loughran (693)
Roman Shaposhnik (686)
Namit Jain (648)
Hyunsik Choi (640)
James Taylor (637)
Owen O'Malley (619)
Neha Narkhede (579)
Oleg Ruchovets
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB