Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
clear query|facets|time Search criteria: .   Results from 1 to 9 from 9 (0.181s).
Loading phrases to help you
refine your search...
Efficient way to split an input data set into different output files - Spark - [mail # user]
...I'm trying to set up a PySpark ETL job that takes in JSON log files andspits out fact table files for upload to Redshift.  Is there an efficientway to send different event types to diff...
   Author: Tom Seddon, 2014-11-19, 11:41
Re: ERROR ConnectionManager: Corresponding SendingConnection to ConnectionManagerId - Spark - [mail # user]
...Yes please can you share.  I am getting this error after expanding myapplication to include a large broadcast variable. Would be good to know ifit can be fixed with configuration.On 23 ...
   Author: Tom Seddon, 2014-11-11, 15:19
Re: Broadcast failure with variable size of ~ 500mb with "key already cancelled ?" - Spark - [mail # user]
...Hi,Just wondering if anyone has any advice about this issue, as I amexperiencing the same thing.  I'm working with multiple broadcast variablesin PySpark, most of which are small, but o...
   Author: Tom Seddon, 2014-11-11, 14:41
[expand - 1 more] - Re: How to load data in Drill - Drill - [mail # user]
...Hi,  Jinfeng, do you want a copy of my parquet file too?  If so, can send later tonight.  Cheers,  Tom    On 3 December 2013 07:38, Madhu Borkar  wrote: &n...
   Author: Tom Seddon, 2013-12-04, 15:02
[expand - 1 more] - Re: What storages does drill support? - Drill - [mail # user]
...The storage-engines.json under /conf I think but I'll have to check that later as I don't have my laptop with me.   On 4 December 2013 14:50, He, Yunlong  wrote:  ...
   Author: Tom Seddon, 2013-12-04, 14:58
Re: Presto -> SQL engines for HDFS ? - Drill - [mail # dev]
...There's a comparison of a few of these in this document.  Perhaps Presto needs to be added.  http://online.liebertpub.com/doi/pdfplus/10.1089/big.2013.0011   On 7 November 201...
   Author: Tom Seddon, 2013-11-07, 10:33
[expand - 1 more] - Re: Distributed Drill question - Drill - [mail # user]
...Thanks Jacques, yes that answers it.  I'm researching as much as I can about Drill for my masters project.  Perhaps I'll be in a position to contribute documentation at a later sta...
   Author: Tom Seddon, 2013-11-02, 15:24
Re: Query HDFS - Drill - [mail # dev]
...Hi,  I'm also interested in querying data residing in HDFS.  Grateful for any advice on how to achieve this.  Thanks,  Tom    On 18 October 2013 00:10, Timothy ...
   Author: Tom Seddon, 2013-10-19, 15:20
[expand - 1 more] - Re: Drill Masters Project - Drill - [mail # user]
...Thanks Jacques.  I'm very happy to get involved and share my experiences.  I'm looking for the best way to set up a cluster now.  In terms of evaluating Drill's performance, d...
   Author: Tom Seddon, 2013-08-29, 09:25
Sort:
project
Drill (6)
Spark (3)
type
mail # user (7)
mail # dev (2)
date
last 7 days (0)
last 30 days (3)
last 90 days (3)
last 6 months (3)
last 9 months (9)
author
Ted Yu (1779)
Harsh J (1298)
Jun Rao (1001)
Todd Lipcon (993)
Stack (981)
Andrew Purtell (852)
Jonathan Ellis (846)
Jean-Daniel Cryans (750)
stack (747)
Yusaku Sako (737)
Jarek Jarcec Cecho (725)
Eric Newton (695)
Jonathan Hsieh (674)
Roman Shaposhnik (673)
Namit Jain (649)
Hitesh Shah (645)
Steve Loughran (635)
Josh Elser (630)
Siddharth Seth (627)
Owen O'Malley (624)
Brock Noland (605)
Neha Narkhede (556)
Arun C Murthy (546)
Eli Collins (545)
Hyunsik Choi (544)
Tom Seddon
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB