Results 21 to 30 of 38 (0.126s).
Re: unsort algorithmus in map/reduce - MapReduce - [mail # user]
...Why not do something very simple: use the MD5 of the URL as the key you sort by. This scales very easily and gives a highly randomized order. Maybe not the optimal maximum distance, but cert...
   Author: Niels Basjes, 2011-10-25, 12:21
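
A minimal sketch of the idea in the snippet above, written as plain Python rather than as a MapReduce job: each URL is keyed by its MD5 digest and the records are sorted by that key, which gives a repeatable pseudo-random order. The function names `md5_key` and `unsort` and the example URLs are assumptions made for illustration; they do not come from the original thread. In an actual job the digest would presumably be emitted as the map output key so that the framework's sort does the shuffling.

```python
import hashlib


def md5_key(url: str) -> str:
    """Return the hex MD5 digest of a URL (hypothetical helper name)."""
    return hashlib.md5(url.encode("utf-8")).hexdigest()


def unsort(urls):
    """Order URLs by their MD5 digest instead of by the URL text.

    The result is deterministic for a given set of URLs, but URLs that
    are neighbours alphabetically usually end up far apart.
    """
    return sorted(urls, key=md5_key)


# Example input: made-up URLs, sorted alphabetically to begin with.
urls = [f"http://example.com/page/{i}" for i in range(10)]
shuffled = unsort(urls)

for url in shuffled:
    print(md5_key(url)[:8], url)
```

As the snippet itself notes, this does not guarantee the maximum possible distance between related records; it only makes their order effectively random.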
Re: output from one map reduce job as the input to another map reduce job? - MapReduce - [mail # user]
...To me it sounds like the asker should check out tools like Storm and S4 instead of Hadoop.  http://www.infoq.com/news/2011/09/twitter-storm-real-time-hadoop  Kind regards,...
   Author: Niels Basjes, 2011-09-28, 07:21
Re: How to Create an effective chained MapReduce program. - MapReduce - [mail # user]
...Hi,  In the past I've had the same situation where I needed the data for debugging. Back then I chose to create a second job with simply SequenceFileInputFormat, IdentityMapper, Identit...
   Author: Niels Basjes, 2011-09-06, 05:57
Re: Excuting a shell script inside the HDFS - MapReduce - [mail # user]
...Yes, that way it could work. I'm just wondering ... Why would you want to have a script like this in HDFS?  Kind regards,  Niels Basjes On 16 Aug 2011, 06:49, "Fris...
   Author: Niels Basjes, 2011-08-16, 19:00
Re: How to select random n records using mapreduce ? - MapReduce - [mail # user]
...The only solution I can think of is by creating a counter in Hadoop that is incremented each time a mapper lets a record through. As soon as the value reaches a preselected value the mappers...
   Author: Niels Basjes, 2011-06-27, 19:28
Re: AW: How to split a big file in HDFS by size - MapReduce - [mail # user]
...Hi,  On Tue, Jun 21, 2011 at 16:14, Mapred Learn wrote: kes FS.  Have a look at this:  http://stackoverflow.com/questions/3960651/splitting-gzipped-logfiles-without-sto...
   Author: Niels Basjes, 2011-06-21, 20:03
Re: How to merge several SequenceFile into one? - MapReduce - [mail # user]
...Hi,  The simplest way to do that is to create a job with:
  - input format = sequence file
  - map = identity mapper
  - reduce = identity reducer
  - output = sequence file
and  job.setNum...
   Author: Niels Basjes, 2011-05-25, 19:25
Including external libraries in my job. - MapReduce - [mail # user]
...Hi,  I've written my first very simple job that does something with HBase.  Now when I try to submit my jar to my cluster I get this:  [nbasjes@master ~/src/catalogloader/run]...
   Author: Niels Basjes, 2011-05-03, 13:42
Re: hadoop mr cluster mode on my laptop? - MapReduce - [mail # user]
...Hi,  You should be doing the setup for what is called "Pseudo-distributed" mode. Have a look at this: http://hadoop.apache.org/common/docs/r0.20.2/quickstart.html#PseudoDistributed ...
   Author: Niels Basjes, 2011-04-18, 13:20
Re: Small linux distros to run hadoop ? - MapReduce - [mail # user]
...Hi,  2011/4/15 web service : want to  I usually use a fully stripped CentOS 5 to run cluster nodes. Works perfectly and can be fully automated using the kickstart scripting for ana...
   Author: Niels Basjes, 2011-04-15, 14:49