Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 61 to 70 from 225 (0.107s).
Loading phrases to help you
refine your search...
[expand - 1 more] - Re: Processing 10MB files in Hadoop - Hadoop - [mail # user]
...By default you get at least one task per file; if any file is bigger than a block, then that file is broken up into N tasks where each is one block long. Not sure what you mean by "properly ...
   Author: Aaron Kimball, 2009-11-28, 02:21
Re: conf.Configuration: java.io.IOException: config(config) - Hadoop - [mail # dev]
...When running Hadoop with DEBUG logging on, this IOException was actually responsible for well-over 90% of the lines of text in my logs, making them unreadable.  We actually removed this...
   Author: Aaron Kimball, 2009-11-27, 20:09
Re: Good idea to run NameNode and JobTracker on same machine? - Hadoop - [mail # user]
...The real kicker is going to be memory consumption of one or both of these services. The NN in particular uses a large amount of RAM to store the filesystem image. I think that those who are ...
   Author: Aaron Kimball, 2009-11-27, 18:50
Re: part-00000.deflate as output - Hadoop - [mail # user]
...You are always free to run with compression disabled. But in many production situations, space or performance concerns dictate that all data sets are stored compressed, so I think Tim was as...
   Author: Aaron Kimball, 2009-11-27, 18:44
Re: RE: please help in setting hadoop - Hadoop - [mail # user]
...You've set hadoop.tmp.dir to /home/hadoop/hadoop-${user.name}.  This means that on every node, you're going to need a directory named (e.g.) /home/hadoop/hadoop-root/, since it seems as...
   Author: Aaron Kimball, 2009-11-27, 18:40
Re: error setting up hdfs? - Hadoop - [mail # user]
...You don't "need" to specify a path. If you don't specify a path argument for ls, then it uses your home directory in HDFS ("/user/"). When you first started HDFS, /user/hadoop didn't exist, ...
   Author: Aaron Kimball, 2009-11-10, 21:47
Re: How to build and deploy Hadoop 0.21 ? - Hadoop - [mail # dev]
...On Thu, Nov 5, 2009 at 2:34 AM, Andrei Dragomir  wrote:   Those are build-time dependencies. Ideally you'll ignore them post-build.    Yup.    I have created a ...
   Author: Aaron Kimball, 2009-11-09, 04:13
Re: Can I install two different version of hadoop in the same cluster ? - Hadoop - [mail # user]
...Also hadoop.tmp.dir and mapred.local.dir in your xml configuration, and the environment variables HADOOP_LOG_DIR and HADOOP_PID_DIR in hadoop-env.sh.  - Aaron  On Thu, Oct 29, 2009...
   Author: Aaron Kimball, 2009-10-30, 06:00
[expand - 1 more] - Re: Which FileInputFormat to use for fixed length records? - Hadoop - [mail # user]
...I think these would be good to add to mapreduce in the {{org.apache.hadoop.mapreduce.lib.input}} package. Please file a JIRA and apply a patch! - Aaron  On Wed, Oct 28, 2009 at 11:15 AM...
   Author: Aaron Kimball, 2009-10-28, 19:58
Re: How to give consecutive numbers to output records? - Hadoop - [mail # user]
...There is no in-MapReduce mechanism for cross-task synchronization. You'll need to use something like Zookeeper for this, or another external database. Note that this will greatly complicate ...
   Author: Aaron Kimball, 2009-10-28, 04:27
Hadoop (223)
MapReduce (122)
Hive (18)
HDFS (8)
Avro (6)
HBase (5)
Sqoop (3)
Pig (2)
Accumulo (1)
Flume (1)
Spark (1)
mail # user (182)
mail # general (29)
mail # dev (11)
issue (3)
last 7 days (0)
last 30 days (0)
last 90 days (2)
last 6 months (2)
last 9 months (225)
Harsh J (559)
Owen O'Malley (394)
Steve Loughran (390)
Todd Lipcon (238)
Eli Collins (182)
Alejandro Abdelnur (178)
Arun C Murthy (163)
Allen Wittenauer (148)
Chris Nauroth (146)
Ted Yu (125)
Tom White (121)
Daryn Sharp (115)
Nigel Daley (115)
Konstantin Shvachko (107)
Doug Cutting (96)
Aaron Kimball (94)
Colin Patrick McCabe (92)
Edward Capriolo (88)
Mark Kerzner (87)
jason hadoop (82)
Hairong Kuang (74)
Konstantin Boudnik (72)
Runping Qi (72)
Benoy Antony (69)
Suresh Srinivas (64)