Results 41 to 50 of 275 (0.127s).
Re: How many records will be passed to a map function?? - Hadoop - [mail # general]
...Short answer: FileInputFormat & friends generate splits based on byte ranges.  Assuming your records are all equally sized, you'll get half your records in each mapper. If your records ...
   Author: Aaron Kimball, 2010-06-19, 00:13
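The byte-range idea in the snippet above can be illustrated with a small sketch. This is not Hadoop's actual split logic (which also depends on block size and configured split sizes); the helper names `byte_range_splits` and `records_per_split` are hypothetical, and the rule that a record belongs to the split containing its first byte is an assumption made for illustration.

```python
def byte_range_splits(file_size, num_splits):
    """Divide [0, file_size) into num_splits contiguous byte ranges."""
    base, extra = divmod(file_size, num_splits)
    splits = []
    start = 0
    for i in range(num_splits):
        # Spread any remainder bytes over the first `extra` splits.
        length = base + (1 if i < extra else 0)
        splits.append((start, start + length))
        start += length
    return splits


def records_per_split(splits, record_size):
    """Count fixed-size records whose first byte falls inside each split.

    Assumption: a record is read by the split that contains its first byte.
    """
    counts = []
    for start, end in splits:
        first = -(-start // record_size)  # ceil(start / record_size)
        last = -(-end // record_size)     # ceil(end / record_size)
        counts.append(last - first)
    return counts


# 1000 records of 100 bytes each, divided between two mappers.
splits = byte_range_splits(1000 * 100, 2)
counts = records_per_split(splits, 100)
print(splits, counts)
```

With equally sized records and two splits, each range covers the same number of record starts, which matches the "half your records in each mapper" observation.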
Re: Is it possible ....!!! - Hadoop - [mail # user]
...Hadoop has some classes for controlling how sockets are used. See org.apache.hadoop.net.StandardSocketFactory, SocksSocketFactory.  The socket factory implementation chosen is controlle...
   Author: Aaron Kimball, 2010-06-10, 15:09
Re: Mapper Reducer : Unit Test and mocking with static variables - Hadoop - [mail # general]
...Varene,  You might want to check out MRUnit. It's a unit test harness that contains mock objects for the context & other associated classes, and works with JUnit.  It's included in...
   Author: Aaron Kimball, 2010-05-28, 00:12
Re: Hadoop Data Sharing - Hadoop - [mail # general]
...Perhaps this is guidance in the area you were hoping for: If your data is in objects that implement the interface 'Writable', then you can use the SequenceFileOutputFormat and SequenceFileI...
   Author: Aaron Kimball, 2010-05-11, 17:34
Re: Hadoop Data Sharing - Hadoop - [mail # general]
...What objects are you referring to? I'm not sure I understand your question. - Aaron  On Tue, May 11, 2010 at 6:38 AM, Renato Marroquín Mogrovejo  wrote:  ? to I...
   Author: Aaron Kimball, 2010-05-11, 17:31
Re: help on CombineFileInputFormat - Hadoop - [mail # user]
...Zhenyu,  It's a bit complicated and involves some layers of indirection. CombineFileRecordReader is a sort of shell RecordReader that passes the actual work of reading records to anothe...
   Author: Aaron Kimball, 2010-05-10, 09:12
Re: Hadoop Data Sharing - Hadoop - [mail # general]
...Renato,  In general if you need to perform a multi-pass MapReduce workflow, each pass materializes its output to files. The subsequent pass then reads those same files back in as input...
   Author: Aaron Kimball, 2010-05-06, 01:04
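The multi-pass pattern described in the snippet above, where each pass writes its output to files and the next pass reads them back, can be sketched with plain local files. This is only an analogy and does not use Hadoop; the functions `pass_one`, `pass_two`, and `run_workflow` and the file names are hypothetical.

```python
import os
import tempfile
from collections import Counter


def pass_one(input_path, output_path):
    """First pass: count words and materialize the counts to a file."""
    counts = Counter()
    with open(input_path) as src:
        for line in src:
            counts.update(line.split())
    with open(output_path, "w") as dst:
        for word, n in sorted(counts.items()):
            dst.write(f"{word}\t{n}\n")


def pass_two(input_path, output_path):
    """Second pass: read the first pass's file and group words by count."""
    by_count = {}
    with open(input_path) as src:
        for line in src:
            word, n = line.rstrip("\n").split("\t")
            by_count.setdefault(int(n), []).append(word)
    with open(output_path, "w") as dst:
        for n in sorted(by_count):
            dst.write(f"{n}\t{','.join(by_count[n])}\n")


def run_workflow(text):
    """Chain the passes; the only link between them is the file on disk."""
    with tempfile.TemporaryDirectory() as tmp:
        raw = os.path.join(tmp, "input.txt")
        stage1 = os.path.join(tmp, "pass1.tsv")
        stage2 = os.path.join(tmp, "pass2.tsv")
        with open(raw, "w") as f:
            f.write(text)
        pass_one(raw, stage1)
        pass_two(stage1, stage2)
        with open(stage2) as f:
            return f.read()


result = run_workflow("a b a\nc b a\n")
print(result)
```

The second pass never sees in-memory objects from the first; it depends only on the file format the first pass wrote.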
Re: Different exception handling on corrupt GZip file reading - Hadoop - [mail # general]
...If you ever wonder "why doesn't Hadoop do _REASONABLE_THING_X_", the answer is usually one of:  * Somebody made a mistake the first time it got written * Nobody needed quite that corner...
   Author: Aaron Kimball, 2010-04-15, 16:28
Re: DBInputFormat number of mappers - Hadoop - [mail # general]
...Hi Dan,  It's also worth pointing out that DBInputFormat's queries are written in such a way as to make parallelism more likely to hurt than to help. Each mapper submits a query to the ...
   Author: Aaron Kimball, 2010-04-15, 16:20
Re: Sqoop is moving to github! - Hadoop - [mail # general]
...On Tue, Mar 30, 2010 at 7:54 AM, Bernd Fondermann  wrote:  I believe that technically the discussion (such as it is) on MAPREDUCE-1644 will stand as the record of vote. It's still ...
   Author: Aaron Kimball, 2010-03-30, 17:58