Results 11 to 20 of 35 (2.986s).
Re: Merge sorting reduce output files - MapReduce - [mail # user]
...Robert,  On Tue, Feb 28, 2012 at 23:28, Robert Evans wrote:  The thing is that this application is an "Auto Complete" feature that has a key that is "the letters that ...
   Author: Niels Basjes, 2012-02-29, 10:59
Re: Merge sorting reduce output files - MapReduce - [mail # user]
...Hi Robert,  On Tue, Feb 28, 2012 at 21:41, Robert Evans wrote:  I know about that partitioner. The trouble I have is coming up with a partitioning that "evenly" balances th...
   Author: Niels Basjes, 2012-02-28, 21:46
Merge sorting reduce output files - MapReduce - [mail # user]
...Hi,  We have a job that outputs a set of files that are several hundred MB of text each. Using the comparators and such we can produce output files that are each sorted by themselves. ...
   Author: Niels Basjes, 2012-02-28, 20:10
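The snippet above is truncated, so how the thread resolved the question is not shown here. As a minimal sketch of the general idea only (combining several individually sorted files into one globally sorted stream), a k-way merge can be written with Python's standard `heapq.merge`. The tab-separated record layout and the names `merge_sorted_runs`, `part_0`, `part_1`, and `part_2` are assumptions made for illustration; they are not taken from the thread, and this is not Hadoop code.

```python
import heapq


def record_key(line):
    """Return the sort key of a tab-separated record (assumed layout: key<TAB>value)."""
    return line.split("\t", 1)[0]


def merge_sorted_runs(runs, key=record_key):
    """Lazily merge several iterables that are each already sorted by `key`.

    heapq.merge keeps one pending item per run, so memory use stays small
    even when the individual runs are large files.
    """
    return heapq.merge(*runs, key=key)


# Hypothetical stand-ins for three reducer output files, each sorted by key.
part_0 = ["apple\t3", "cherry\t1"]
part_1 = ["banana\t2", "date\t5"]
part_2 = ["apricot\t4"]

merged = list(merge_sorted_runs([part_0, part_1, part_2]))
for line in merged:
    print(line)
```

With real files, each run could be an open file object, since `heapq.merge` accepts any iterable and reads the inputs incrementally.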
Should splittable Gzip be a "core" hadoop feature? - MapReduce - [mail # user]
...Hi,  Some time ago I had an idea and implemented it.  Normally you can only run a single gzipped input file through a single mapper and thus only on a single CPU core. What I creat...
   Author: Niels Basjes, 2012-02-28, 15:50
Re: unsort algorithmus in map/reduce - MapReduce - [mail # user]
...Why not do something very simple: Use the MD5 of the URL as the key you do the sorting by. This scales very easily and gives a highly randomized order. Maybe not the optimal maximum distance, but cert...
   Author: Niels Basjes, 2011-10-25, 12:21
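The suggestion in the snippet above (sort by the MD5 of the URL to get a scrambled order) can be sketched in a few lines. This is only an illustration outside Hadoop, using Python's standard `hashlib`; the function names `md5_key` and `pseudo_shuffle` and the sample URLs are hypothetical. In a MapReduce job the same digest would presumably be emitted as the map output key so the framework's sort does the reordering.

```python
import hashlib


def md5_key(url):
    """Return the hex MD5 digest of a URL, used only as a sort key."""
    return hashlib.md5(url.encode("utf-8")).hexdigest()


def pseudo_shuffle(urls):
    """Order URLs by their MD5 digest.

    The result looks random with respect to the original URL text, but it is
    deterministic: the same set of URLs always yields the same order.
    """
    return sorted(urls, key=md5_key)


# Hypothetical sample input: URLs grouped by host, as a crawler might list them.
urls = [
    "http://example.com/a",
    "http://example.com/b",
    "http://example.org/a",
    "http://example.org/b",
]

shuffled = pseudo_shuffle(urls)
for url in shuffled:
    print(md5_key(url)[:8], url)
```

Because the order is derived from a hash, it does not try to maximize the distance between URLs of the same host; it only breaks up the original grouping, which matches the caveat in the snippet.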
Re: output from one map reduce job as the input to another map reduce job? - MapReduce - [mail # user]
...To me it sounds like the asker should check out tools like storm and s4 instead of hadoop.  http://www.infoq.com/news/2011/09/twitter-storm-real-time-hadoop  Kind regards,...
   Author: Niels Basjes, 2011-09-28, 07:21
Re: How to Create an effective chained MapReduce program. - MapReduce - [mail # user]
...Hi,  In the past I've had the same situation where I needed the data for debugging. Back then I chose to create a second job with simply SequenceFileInputFormat, IdentityMapper, Identit...
   Author: Niels Basjes, 2011-09-06, 05:57
Re: Excuting a shell script inside the HDFS - MapReduce - [mail # user]
...Yes, that way it could work. I'm just wondering ... Why would you want to have a script like this in HDFS?  Kind regards,  Niels Basjes  On 16 Aug 2011 06:49, "Fris... wrote
   Author: Niels Basjes, 2011-08-16, 19:00
Re: How to select random n records using mapreduce ? - MapReduce - [mail # user]
...The only solution I can think of is by creating a counter in Hadoop that is incremented each time a mapper lets a record through. As soon as the value reaches a preselected value the mappers...
   Author: Niels Basjes, 2011-06-27, 19:28
Re: AW: How to split a big file in HDFS by size - MapReduce - [mail # user]
...Hi,  On Tue, Jun 21, 2011 at 16:14, Mapred Learn wrote:  Have a look at this:  http://stackoverflow.com/questions/3960651/splitting-gzipped-logfiles-without-sto...
   Author: Niels Basjes, 2011-06-21, 20:03