Re: How many records will be passed to a map function?? - Hadoop - [mail # general]
|
|
...Short answer: FileInputFormat & friends generate splits based on byte ranges. Assuming your records are all equally sized, you'll get half your records in each mapper. If your records ...
|
|
|
Author: Aaron Kimball,
2010-06-19, 00:13
|
|
|
Re: Is it possible ....!!! - Hadoop - [mail # user]
|
|
...Hadoop has some classes for controlling how sockets are used. See org.apache.hadoop.net.StandardSocketFactory, SocksSocketFactory. The socket factory implementation chosen is controlle...
|
|
|
Author: Aaron Kimball,
2010-06-10, 15:09
|
|
|
Re: Mapper Reducer : Unit Test and mocking with static variables - Hadoop - [mail # general]
|
|
...Varene, You might want to check out MRUnit. It's a unit test harness that contains mock objects for the context & other associated classes, and works with JUnit. It's included in...
|
|
|
Author: Aaron Kimball,
2010-05-28, 00:12
|
|
|
Re: Hadoop Data Sharing - Hadoop - [mail # general]
|
|
...Perhaps this is guidance in the area you were hoping for: If your data is in objects that implement the interface 'Writable', then you can use the SequenceFileOutputFormat and SequenceFileI...
|
|
|
Author: Aaron Kimball,
2010-05-11, 17:34
|
|
|
Re: Hadoop Data Sharing - Hadoop - [mail # general]
|
|
...What objects are you referring to? I'm not sure I understand your question. - Aaron On Tue, May 11, 2010 at 6:38 AM, Renato Marroquín Mogrovejo wrote: ? to I...
|
|
|
Author: Aaron Kimball,
2010-05-11, 17:31
|
|
|
Re: help on CombineFileInputFormat - Hadoop - [mail # user]
|
|
...Zhenyu, It's a bit complicated and involves some layers of indirection. CombineFileRecordReader is a sort of shell RecordReader that passes the actual work of reading records to anothe...
|
|
|
Author: Aaron Kimball,
2010-05-10, 09:12
|
|
|
Re: Hadoop Data Sharing - Hadoop - [mail # general]
|
|
...Renato, In general if you need to perform a multi-pass MapReduce workflow, each pass materializes its output to files. The subsequent pass then reads those same files back in as input...
|
|
|
Author: Aaron Kimball,
2010-05-06, 01:04
|
|
|
Re: Different exception handling on corrupt GZip file reading - Hadoop - [mail # general]
|
|
...If you ever wonder "why doesn't Hadoop do _REASONABLE_THING_X_", the answer is usually one of: * Somebody made a mistake the first time it got written * Nobody needed quite that corner...
|
|
|
Author: Aaron Kimball,
2010-04-15, 16:28
|
|
|
Re: DBInputFormat number of mappers - Hadoop - [mail # general]
|
|
...Hi Dan, It's also worth pointing out that DBInputFormat's queries are written in such a way as to make parallelism more likely to hurt than to help. Each mapper submits a query to the ...
|
|
|
Author: Aaron Kimball,
2010-04-15, 16:20
|
|
|
Re: Sqoop is moving to github! - Hadoop - [mail # general]
|
|
...On Tue, Mar 30, 2010 at 7:54 AM, Bernd Fondermann wrote: I believe that technically the discussion (such as it is) on MAPREDUCE-1644 will stand as the record of vote. It's still ...
|
|
|
Author: Aaron Kimball,
2010-03-30, 17:58
|
|
|
|