Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 21 to 30 from 36 (0.21s).
Loading phrases to help you
refine your search...
Re: AW: How to split a big file in HDFS by size - MapReduce - [mail # user]
...Hi,  On Tue, Jun 21, 2011 at 16:14, Mapred Learn  wrote: kes FS.  Have a look at this:  http://stackoverflow.com/questions/3960651/splitting-gzipped-logfiles-witho ut-sto...
   Author: Niels Basjes, 2011-06-21, 20:03
Re: AW: How to split a big file in HDFS by size - MapReduce - [mail # user]
...Hi,  On Mon, Jun 20, 2011 at 16:13, Mapred Learn  wrote:  Yes, that is very true.  Best regards / Met vriendelijke groeten,  Niels Basjes...
   Author: Niels Basjes, 2011-06-20, 14:38
Re: How to merge several SequenceFile into one? - MapReduce - [mail # user]
...Hi,   The simplest way to do that is to create a job that - input format = sequence file - map = identity mapper - reduce = identity reduce - output = sequence file and  job.setNum...
   Author: Niels Basjes, 2011-05-25, 19:25
Including external libraries in my job. - MapReduce - [mail # user]
...Hi,  I've written my first very simple job that does something with hbase.  Now when I try to submit my jar in my cluster I get this:  [nbasjes@master ~/src/catalogloader/run]...
   Author: Niels Basjes, 2011-05-03, 13:42
Re: hadoop mr cluster mode on my laptop? - MapReduce - [mail # user]
...Hi,  You should be doing the setup for what is called "Pseudo-distributed" mode. Have a look at this: http://hadoop.apache.org/common/docs/r0.20.2/quickstart.html#PseudoDistribu ted &nb...
   Author: Niels Basjes, 2011-04-18, 13:20
Re: Small linux distros to run hadoop ? - MapReduce - [mail # user]
...Hi,  2011/4/15 web service : want to  I usually use a fully stripped CentOS 5 to run cluster nodes. Works perfectly and can be fully automated using the kickstart scripting for ana...
   Author: Niels Basjes, 2011-04-15, 14:49
Re: When use hadoop mapreduce? - MapReduce - [mail # user]
...Hi,  2011/2/17 Pedro Costa :  The summary I usually give goes something like this: IF your computation takes too long on a single system AND you can split the work up into a lot of...
   Author: Niels Basjes, 2011-02-18, 21:48
Re: Is a Block compressed (GZIP) SequenceFile splittable in MR operation? - MapReduce - [mail # user]
...Hi,  2011/1/31 Sean Bigdatafun :  Correct, gzip is a stream compression system which effectively means you can only start at the beginning of the data with decompressing.   AF...
   Author: Niels Basjes, 2011-01-31, 08:36
Re: FILE_BYTES_WRITTEN and HDFS_BYTES_WRITTEN - MapReduce - [mail # user]
...For some parts of a task the system stores information on the local (non-HDFS) file system of the node that is actually running the job. That is the FILE_.. Stuff written to HDFS is the HDFS...
   Author: Niels Basjes, 2010-11-30, 20:43
Re: Control the number of Mappers - MapReduce - [mail # user]
...Ah,  In that case this should answer your question: http://wiki.apache.org/hadoop/HowManyMapsAndReduces   2010/11/25 Shai Erera :    Met vriendelijke groeten,  Niels...
   Author: Niels Basjes, 2010-11-25, 20:18
Sort:
project
Hadoop (58)
MapReduce (36)
Pig (10)
HBase (4)
HDFS (1)
type
mail # user (35)
issue (1)
date
last 7 days (1)
last 30 days (3)
last 90 days (3)
last 6 months (5)
last 9 months (36)
author
Harsh J (1036)
Arun C Murthy (500)
Vinod Kumar Vavilapalli (351)
Todd Lipcon (282)
Amar Kamat (184)
Mohammad Tariq (174)
Thomas Graves (174)
Owen O'Malley (162)
Hemanth Yamijala (155)
Amareshwari Sriramadasu (153)
Pedro Costa (153)
Ted Yu (148)
Robert Evans (146)
Tom White (138)
Aaron Kimball (131)
Niels Basjes