Re: MapReduce - FileInputFormat and Locality - MapReduce - [mail # user]
...I think that you just said what the OP said.  Your two cases reduce to the same single case that they had.  Whether this matters is another question, but it seems like it could in ...
   Author: Ted Dunning, 2013-05-09, 02:10
Re: Hardware Selection for Hadoop - MapReduce - [mail # user]
...On Tue, May 7, 2013 at 5:53 AM, Michael Segel wrote: This is an important comment. Some controllers fold when you start pushing too much data. Testing nodes independently ...
   Author: Ted Dunning, 2013-05-07, 15:51
Re: Cartesian product in hadoop - MapReduce - [mail # user]
...It is rarely practical to do exhaustive comparisons on datasets of this size.  The method used is to heuristically prune the cartesian product set and only examine pairs that have a hig...
   Author: Ted Dunning, 2013-04-18, 18:52
Re: Physically moving HDFS cluster to new - MapReduce - [mail # user]
...It may or may not help you in your current distress, but MapR's distribution could handle this pretty easily.  One method is direct distcp between clusters, but you could also use MapR'...
   Author: Ted Dunning, 2013-04-18, 02:23
Re: Copy Vs DistCP - MapReduce - [mail # user]
...On Sun, Apr 14, 2013 at 10:33 AM, Mathias Herberts wrote: Uh... hmm... This is a good suggestion. Obvious in fact. In retrospect. I would also suggest th...
   Author: Ted Dunning, 2013-04-14, 18:01
Re: Copy Vs DistCP - MapReduce - [mail # user]
...Inline. On Sun, Apr 14, 2013 at 1:13 AM, Mathias Herberts wrote: Well, I will admit it was a shameless correction to Lance's absolute and incorrect claim. Thi...
   Author: Ted Dunning, 2013-04-14, 17:00
Re: Copy Vs DistCP - MapReduce - [mail # user]
...Lance, Never say never. Linux programs can read from the right kind of Hadoop cluster without using FUSE. On Fri, Apr 12, 2013 at 10:15 AM, Lance Norskog wrot...
   Author: Ted Dunning, 2013-04-14, 04:14
Re: Million docs and word count scenario - MapReduce - [mail # user]
...Putting each document into a separate file is not likely to be a great thing to do.  On the other hand, putting them all into one file may not be what you want either.  It is proba...
   Author: Ted Dunning, 2013-03-29, 13:05
Re: - MapReduce - [mail # user]
...I would agree with David that this is not normally a good idea.  There are situations, however, where you do need to control location of data and where the computation occurs.  The...
   Author: Ted Dunning, 2013-03-25, 07:26
Re: Accumulo and Mapreduce - MapReduce - [mail # user]
...Chaining the jobs is a fantastically inefficient solution.  If you use Pig or Cascading, the optimizer will glue all of your map functions into a single mapper.  The result is some...
   Author: Ted Dunning, 2013-03-04, 19:43