Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 1 to 10 from 94 (0.507s).
Loading phrases to help you
refine your search...
[expand - 1 more] - Re: pyspark and hdfs file name - Spark - [mail # user]
...Hi Devies.Thank you for the quick answer.I have a code like this:....sc = SparkContext(appName="TAD")lines = sc.textFile(sys.argv[1], 1)result = lines.map(doSplit).groupByKey().map(lambda (k...
   Author: Oleg Ruchovets, 2014-11-14, 08:14
[expand - 2 more] - Re: High Availability hadoop cluster. - Hadoop - [mail # user]
...Great.Thank you for the link.Just to be sure - JN can be installed on data nodes like zookeeper?If we have 2 Name Nodes and 15 Data Nodes - is it correct  to install ZKand JN on datanod...
   Author: Oleg Ruchovets, 2014-11-06, 11:03
[expand - 2 more] - Re: pyspark on yarn - lost executor - Spark - [mail # user]
...Great.  Upgrade helped.Still need some inputs:1) Is there any log files of spark job execution?2) Where can I read about tuning / parameter configuration:For example:what is the me...
   Author: Oleg Ruchovets, 2014-09-18, 09:47
[expand - 1 more] - Re: PySpark on Yarn - how group by data properly - Spark - [mail # user]
...I am expand my data set and executing pyspark on yarn:   I payed attention that only 2 processes processed the data:14210 yarn      20   0 2463m 2.0g 9708 R 100...
   Author: Oleg Ruchovets, 2014-09-16, 12:29
[expand - 6 more] - Re: cassandra + spark / pyspark - Cassandra - [mail # user]
...Thank you Rohit.   I sent the email to you.ThanksOleg.On Thu, Sep 11, 2014 at 10:51 PM, Rohit Rai  wrote: ...
   Author: Oleg Ruchovets, 2014-09-11, 16:12
[expand - 1 more] - Re: multi datacenter replication - Cassandra - [mail # user]
...Thank you very much for the links.  Just to be sure: is this capability available for COMMUNITY ADDITION?ThanksOleg.On Wed, Sep 10, 2014 at 11:49 PM, Alain RODRIGUEZ wrote: ...
   Author: Oleg Ruchovets, 2014-09-10, 16:22
[expand - 1 more] - Re: pyspark and cassandra - Spark - [mail # user]
...Hi ,  I try to evaluate different option of spark + cassandra and I have coupleof additional questions.  My aim is to use cassandra only without hadoop:  1) Is ...
   Author: Oleg Ruchovets, 2014-09-10, 15:32
hardware sizing for cassandra - Cassandra - [mail # user]
...Hi ,   Where can I find the document with best practices about sizing forcassandra deployment?   We have 1000 writes / reads per second. record size 1k.Questions: &n...
   Author: Oleg Ruchovets, 2014-09-09, 18:03
[expand - 1 more] - Re: PySpark on Yarn a lot of python scripts project - Spark - [mail # user]
...Ok , I  didn't explain my self correct:   In case of java having a lot of classes jar should be used.   All examples for PySpark I found is one py script( Pi , wordc...
   Author: Oleg Ruchovets, 2014-09-05, 17:21
[expand - 11 more] - Re: pyspark yarn got exception - Spark - [mail # user]
...Great,It is working now!!!!ThanksOleg.On Fri, Sep 5, 2014 at 12:50 PM, Oleg Ruchovets wrote:...
   Author: Oleg Ruchovets, 2014-09-05, 08:51
Sort:
project
HBase (39)
Hadoop (19)
Kafka (16)
Spark (11)
Cassandra (3)
HDFS (2)
MapReduce (2)
Zookeeper (2)
type
mail # user (93)
mail # dev (1)
date
last 7 days (0)
last 30 days (0)
last 90 days (2)
last 6 months (15)
last 9 months (94)
author
Ted Yu (1822)
Harsh J (1304)
Jun Rao (1015)
Todd Lipcon (994)
Stack (986)
Andrew Purtell (871)
Jonathan Ellis (853)
stack (756)
Jean-Daniel Cryans (752)
Jarek Jarcec Cecho (747)
Yusaku Sako (742)
Eric Newton (706)
Jonathan Hsieh (682)
Roman Shaposhnik (677)
Hitesh Shah (674)
Josh Elser (666)
Steve Loughran (653)
Namit Jain (648)
Siddharth Seth (642)
Brock Noland (633)
Owen O'Malley (623)
Hyunsik Choi (579)
Neha Narkhede (565)
Arun C Murthy (548)
Eli Collins (545)
Oleg Ruchovets
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB