Results 1 to 10 of 15 (0.16s).
DistributedCache.addArchiveToClassPath doesn't seem to work - Hadoop - [mail # user]
...I've got a tar.gz file that has many third-party jars in it that my MR job requires. This tar.gz file is located on HDFS. When configuring my MR job, I call DistributedCache.addArc...
   Author: John Conwell, 2013-12-17, 21:09
Map/Reduce/Driver jar(s) organization - Hadoop - [mail # user]
...I'm curious what are some best practices for structuring jars for a business framework that uses Map/Reduce?  Note: This is assuming you aren't invoking MR manually via the cmd line, bu...
   Author: John Conwell, 2013-11-25, 18:06
Re: AWS MapReduce - Hadoop - [mail # user]
...AWS MapReduce (EMR) does not use S3 for its HDFS persistence. If it did, your S3 billing would be massive :) EMR reads all input jar files and input data from S3, but it copies th...
   Author: John Conwell, 2012-03-05, 15:40
Re: HADOOP PIPES with CUDA - Hadoop - [mail # user]
...Do you mean porting existing CUDA code away from CUDA to some language like Python using pipes? Or creating a solution that uses pipes to chain mappers / reducers together, where ...
   Author: John Conwell, 2012-02-13, 17:49
Re: Sorting text data - Hadoop - [mail # user]
...If you use TextInputFormat as your MapReduce job's input format, then Hadoop doesn't need your input data to be in a sequence file. It will read your text file, and call the mapper...
   Author: John Conwell, 2012-01-30, 16:40
Re: Running a job continuously - Hadoop - [mail # user]
...You might also want to take a look at Storm, as that's what it's designed to do: https://github.com/nathanmarz/storm/wiki  On Mon, Dec 5, 2011 at 1:34 PM, Mike Spreitzer wrote: ...
   Author: John Conwell, 2011-12-05, 21:58
Re: choices for deploying a small hadoop cluster on EC2 - Hadoop - [mail # user]
...I'm a big fan of Whirr, though I don't think it supports EBS persistence. My Hadoop deployment strategy has always been to store input and output data on S3, spin up my Hadoop cluster with ...
   Author: John Conwell, 2011-11-29, 20:33
Re: Matrix multiplication in Hadoop - Hadoop - [mail # user]
...I'm not sure, but I would suspect that Mahout has some low-level map/reduce jobs for this. You might start there.  On Fri, Nov 18, 2011 at 8:59 AM, Mike Spreitzer wrote: ...
   Author: John Conwell, 2011-11-18, 17:02
Re: How to iterate over a hdfs folder with hadoop - Hadoop - [mail # user]
...FileStatus[] files = fs.listStatus(new Path(path)); for (FileStatus fileStatus : files) { // ...do stuff here }  On Mon, Oct 10, 2011 at 8:03 AM, Raimon Bosch wro...
   Author: John Conwell, 2011-10-10, 15:09
What should be in the hosts file on a hadoop cluster? - Hadoop - [mail # user]
...In troubleshooting some issues on our Hadoop cluster on EC2, I keep getting pointed back to properly configuring the /etc/hosts file. But the problem is I've found about 5 different c...
   Author: John Conwell, 2011-10-07, 21:46