Results 11 to 20 of 35 (2.986s).
Re: Merge sorting reduce output files - MapReduce - [mail # user]
...Robert, On Tue, Feb 28, 2012 at 23:28, Robert Evans wrote: The thing is that this application is an "Auto Complete" feature that has a key that is "the letters that ...
Author: Niels Basjes, 2012-02-29, 10:59

Re: Merge sorting reduce output files - MapReduce - [mail # user]
...Hi Robert, On Tue, Feb 28, 2012 at 21:41, Robert Evans wrote: I know about that partitioner. The trouble I have is coming up with a partitioning that "evenly" balances th...
Author: Niels Basjes, 2012-02-28, 21:46

Merge sorting reduce output files - MapReduce - [mail # user]
...Hi, We have a job that outputs a set of files that are several hundred MB of text each. Using the comparators and such we can produce output files that are each sorted by themselves. ...
Author: Niels Basjes, 2012-02-28, 20:10

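As a rough illustration of the merge step this thread asks about (not the poster's actual solution, which the truncated snippet does not show), individually sorted text files can be combined with a streaming k-way merge. The sketch below uses Python's `heapq.merge`; the function name `merge_sorted_files` and the file paths are hypothetical, and it assumes every input is sorted in plain string order with one newline-terminated record per line.

```python
import heapq
from contextlib import ExitStack


def merge_sorted_files(input_paths, output_path):
    """Merge individually sorted text files into one sorted file.

    Assumption: each input is sorted in plain string order, one record
    per line, and every line ends with a newline.
    """
    with ExitStack() as stack:
        # Open every input; ExitStack closes them all on exit.
        inputs = [
            stack.enter_context(open(path, "r", encoding="utf-8"))
            for path in input_paths
        ]
        out = stack.enter_context(open(output_path, "w", encoding="utf-8"))
        # heapq.merge reads lazily, holding only one pending line per file.
        for line in heapq.merge(*inputs):
            out.write(line)
```

Because the merge is lazy, memory use grows with the number of files rather than their size, which matters when each file is several hundred MB.
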
Should splittable Gzip be a "core" hadoop feature? - MapReduce - [mail # user]
...Hi, Some time ago I had an idea and implemented it. Normally you can only run a single gzipped input file through a single mapper and thus only on a single CPU core. What I creat...
Author: Niels Basjes, 2012-02-28, 15:50

Re: unsort algorithmus in map/reduce - MapReduce - [mail # user]
...Why not do something very simple: Use the MD5 of the URL as the key you do the sorting by. This scales very easily and gives a highly randomized order. Maybe not the optimal maximum distance, but cert...
Author: Niels Basjes, 2011-10-25, 12:21

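The MD5-as-sort-key idea from this reply can be sketched outside Hadoop in a few lines. This is only an illustration under assumptions: the helper names `md5_key` and `unsort` are hypothetical, and in an actual MapReduce job the digest would presumably be emitted as the map output key so that the framework's shuffle performs the sort.

```python
import hashlib


def md5_key(url):
    """Return the hex MD5 digest of a URL, used here only as a sort key."""
    return hashlib.md5(url.encode("utf-8")).hexdigest()


def unsort(urls):
    """Order URLs by MD5 digest so that similar URLs end up scattered."""
    return sorted(urls, key=md5_key)
```

The resulting order is deterministic for a given set of URLs, yet unrelated to their lexical order, which is what makes it usable as a cheap "unsort".
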
Re: output from one map reduce job as the input to another map reduce job? - MapReduce - [mail # user]
...To me it sounds like the asker should check out tools like Storm and S4 instead of Hadoop. http://www.infoq.com/news/2011/09/twitter-storm-real-time-hadoop Kind regards,...
Author: Niels Basjes, 2011-09-28, 07:21

Re: How to Create an effective chained MapReduce program. - MapReduce - [mail # user]
...Hi, In the past I've had the same situation where I needed the data for debugging. Back then I chose to create a second job with simply SequenceFileInputFormat, IdentityMapper, Identit...
Author: Niels Basjes, 2011-09-06, 05:57

Re: Excuting a shell script inside the HDFS - MapReduce - [mail # user]
...Yes, that way it could work. I'm just wondering ... Why would you want to have a script like this in HDFS? Kind regards, Niels Basjes On 16 Aug 2011 06:49, "Fris...
Author: Niels Basjes, 2011-08-16, 19:00

Re: How to select random n records using mapreduce ? - MapReduce - [mail # user]
...The only solution I can think of is by creating a counter in Hadoop that is incremented each time a mapper lets a record through. As soon as the value reaches a preselected value the mappers...
Author: Niels Basjes, 2011-06-27, 19:28

Re: AW: How to split a big file in HDFS by size - MapReduce - [mail # user]
...Hi, On Tue, Jun 21, 2011 at 16:14, Mapred Learn wrote: kes FS. Have a look at this: http://stackoverflow.com/questions/3960651/splitting-gzipped-logfiles-without-sto...
Author: Niels Basjes, 2011-06-21, 20:03