Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 71 to 80 from 225 (0.166s).
Loading phrases to help you
refine your search...
Re: Can I have multiple reducers? - Hadoop - [mail # user]
...If you need another shuffle after your first reduce pass, then you need a second MapReduce job to run after the first one. Just use an IdentityMapper.  This is a reasonably common situa...
   Author: Aaron Kimball, 2009-10-23, 02:54
Re: openssh - can't achieve passphraseless ssh - Hadoop - [mail # user]
...Another sneaky permissions requirement is that ~/.ssh/ itself must be mode 0750 or more strict.  - Aaron  On Wed, Oct 21, 2009 at 2:47 PM, Edward Capriolo wrote:  ...
   Author: Aaron Kimball, 2009-10-22, 04:27
Re: streaming data from HDFS outside of hadoop - Hadoop - [mail # user]
...You shouldn't directly instantiate and intialize FileSystem implementations; there's a factory method you should use.  Do instead:  private void initHadoop(String ip, int port) thr...
   Author: Aaron Kimball, 2009-10-21, 05:02
Re: How can I run such a mapreduce program? - Hadoop - [mail # user]
...If you're working with the Cloudera distribution, you can install CDH1 (0.18.3) and CDH2 (0.20.1) side-by-side on your development machine.  They'll install to /usr/lib/hadoop-0.18 and ...
   Author: Aaron Kimball, 2009-10-17, 06:55
Re: Error in FileSystem.get() - Hadoop - [mail # user]
...Bhupesh: If you use FileSystem.newInstance(), does that return the correct object type? This sidesteps CACHE. - A  On Thu, Oct 15, 2009 at 3:07 PM, Bhupesh Bansal wrote:  ...
   Author: Aaron Kimball, 2009-10-15, 22:51
Re: Locality when placing Map tasks - Hadoop - [mail # user]
...Map tasks are generated based on InputSplits. An InputSplit is a logical description of the work that a task should use. The array of InputSplit objects is created on the client by the Input...
   Author: Aaron Kimball, 2009-10-06, 22:20
Re: FileSystem Caching in Hadoop - Hadoop - [mail # user]
...Edward,  Interesting concept. I imagine that implementing "CachedInputFormat" over something like memcached would make for the most straightforward implementation. You could store 64MB ...
   Author: Aaron Kimball, 2009-10-06, 22:12
Re: Easiest way to pass dynamic variable to Map Class - Hadoop - [mail # user]
...You can set these in the JobConf when you're creating the MapReduce job, and then read them back in the configure() method of the Mapper class.  - Aaron  On Mon, Oct 5, 2009 at 4:5...
   Author: Aaron Kimball, 2009-10-05, 23:52
[expand - 1 more] - Re: Is it OK to run with no secondary namenode? - Hadoop - [mail # user]
...Quite possible. :\ - A  On Thu, Oct 1, 2009 at 5:17 PM, Mayuran Yogarajah  wrote:  ...
   Author: Aaron Kimball, 2009-10-05, 19:23
Re: Prepare input data for Hadoop - Hadoop - [mail # user]
...Use an external database (e.g., mysql) or some other transactional bookkeeping system to record the state of all your datasets (STAGING, UPLOADED, PROCESSED)  - Aaron   On Thu, Sep...
   Author: Aaron Kimball, 2009-09-22, 22:53
Sort:
project
Hadoop (223)
MapReduce (122)
Hive (18)
HDFS (8)
Avro (6)
HBase (5)
Sqoop (3)
Pig (2)
Accumulo (1)
Flume (1)
Spark (1)
type
mail # user (182)
mail # general (29)
mail # dev (11)
issue (3)
date
last 7 days (0)
last 30 days (0)
last 90 days (2)
last 6 months (2)
last 9 months (225)
author
Harsh J (559)
Owen O'Malley (394)
Steve Loughran (391)
Todd Lipcon (238)
Eli Collins (182)
Alejandro Abdelnur (178)
Arun C Murthy (163)
Allen Wittenauer (148)
Chris Nauroth (146)
Ted Yu (126)
Tom White (121)
Daryn Sharp (115)
Nigel Daley (115)
Konstantin Shvachko (107)
Doug Cutting (96)
Aaron Kimball (94)
Colin Patrick McCabe (93)
Edward Capriolo (87)
Mark Kerzner (87)
jason hadoop (82)
Hairong Kuang (74)
Konstantin Boudnik (72)
Runping Qi (72)
Benoy Antony (70)
Suresh Srinivas (64)