Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 19 (0.138s).
Loading phrases to help you
refine your search...
Re: How do I perform a scalable cartesian product - MapReduce - [mail # user]
...Hi Steve,  if the only problem is that the size of your zipcode squared (more  accurately, n * (n-1) if you order the pairs of persons, assuming that  distance is symmetric) i...
   Author: Christoph Schmitz, 2013-08-15, 11:01
AW: mortbay, huge files and the ulimit - MapReduce - [mail # user]
...Hi Elmar,  I don't know about the technicalities of your problem, but why don't you us e a reduce-side join in the first place? (I.e., a sort-merge join instead o f a hash join.)  ...
   Author: Christoph Schmitz, 2012-08-29, 14:23
Re: Distributing Keys across Reducers - MapReduce - [mail # user]
...Hi Dave,  I haven't actually done this in practice, so take this with a grain of  salt ;-)  One way to circumvent your problem might be to add entropy to the keys,  i.e.,...
   Author: Christoph Schmitz, 2012-07-20, 14:21
AW: Understanding job completion in other nodes - MapReduce - [mail # user]
...Hi Hamid,  I'm not sure if I understand your question correctly, but I think this is e xactly what the standard workflow in a Hadoop application looks like:  Job job1 = new Job(......
   Author: Christoph Schmitz, 2012-06-26, 09:19
[expand - 1 more] - AW: how to overwrite output in HDFS? - MapReduce - [mail # user]
...Hi Xin,  when you're running your MapReduce job, at some point you'll have to wire it together, i.e., say what the mapper class is, what the reducer class is, etc. There you can also co...
   Author: Christoph Schmitz, 2012-04-03, 12:40
AW: Performance improvement-Cluster vs Pseudo - MapReduce - [mail # user]
...Hi Ashish,  IMHO your numbers (2 machines, 10 URLs) are way too small to outweigh the n atural overhead that occurs with a distributed computation (distributing th e program code, coord...
   Author: Christoph Schmitz, 2012-03-30, 08:46
AW: Other than hadoop - MapReduce - [mail # user]
...How about GridGain? Not sure abouts its liveliness, though.  Regards, Christoph  Von: real great.. [mailto:[EMAIL PROTECTED]]  Gesendet: Montag, 30. Januar 2012 14:48 An: [EMA...
   Author: Christoph Schmitz, 2012-01-30, 14:13
AW: Output of MAP Class only - MapReduce - [mail # user]
...Hi Rajen,  you can write stuff to the task attempt directory and it will be included i n the output of your MapReduce job.  You can get the directory from the Mapper context:  ...
   Author: Christoph Schmitz, 2011-09-30, 12:16
[MAPREDUCE-2845] Default replication level mapred.submit.replication=10 causes warnings on small clusters - MapReduce - [issue]
...By default, the replication level for job jars, libjars and the distributed cache in general is mapred.submit.replication=10. This yields under-replication warnings for these files on small ...
http://issues.apache.org/jira/browse/MAPREDUCE-2845    Author: Christoph Schmitz, 2011-09-27, 12:25
AW: Under-replication warnings for Distributed Cache? - MapReduce - [mail # user]
...apred.submit.replication to the number > of data nodes? And more generally,  should I worry about this warning?  Ok, will do! Thank for the help,  Christoph...
   Author: Christoph Schmitz, 2011-08-16, 06:39
MapReduce (19)
mail # user (18)
issue (1)
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (19)
Harsh J (454)
Arun C Murthy (326)
Vinod Kumar Vavilapalli (309)
Todd Lipcon (223)
Amar Kamat (181)
Thomas Graves (166)
Jason Lowe (163)
Amareshwari Sriramadasu (152)
Sandy Ryza (124)
Tom White (111)
Siddharth Seth (109)
Aaron Kimball (107)
Owen O'Malley (105)
Alejandro Abdelnur (103)
Devaraj K (103)
Ramya Sunil (103)
Robert Joseph Evans (101)
Hemanth Yamijala (97)
Steve Loughran (90)
Ted Yu (80)
Eli Collins (77)
Ravi Gummadi (76)
Karthik Kambatla (71)
Mahadev konar (67)
Ravi Prakash (66)
Christoph Schmitz