Re: Development Branch for the Hadoop Next Generation - Hadoop - [mail # general]
...0.23 is trunk right now. 0.22 was branched for stabilization before release. On 6/13/11 9:14 AM, "Marcos Ortiz" wrote: On 06/13/2011 09:01 AM, Harsh J wrote: /), G...
   Author: Robert Evans, 2011-06-13, 13:58
Re: Automatic line number in reducer output - Hadoop - [mail # user]
...In this case you probably want two different classes. You can have the base Reducer class that adds in the line count, and then subclass it for the combiner, that sets a flag to not ...
   Author: Robert Evans, 2011-06-10, 14:31
Re: Linear scalability question - Hadoop - [mail # user]
...Shantian, You are correct. The other big factor in this is the cost of connections between the Mappers and the Reducers. With N mappers and M reducers you will make M*N c...
   Author: Robert Evans, 2011-06-09, 14:23
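
The connection-count argument in the snippet above can be made concrete with a little arithmetic. This is a minimal sketch, assuming every reducer fetches one map-output segment from every mapper, so the truncated "M*N c..." is read here as M*N connections, which the snippet itself does not spell out. The function name `shuffle_connections` is hypothetical and is not part of Hadoop.

```python
def shuffle_connections(num_mappers: int, num_reducers: int) -> int:
    """Number of mapper-to-reducer fetches, assuming each reducer
    pulls one output segment from every mapper (an assumption)."""
    if num_mappers < 0 or num_reducers < 0:
        raise ValueError("task counts must be non-negative")
    return num_mappers * num_reducers


if __name__ == "__main__":
    # Doubling both task counts quadruples the number of fetches,
    # which is one reason scaling is not perfectly linear.
    for n, m in [(100, 10), (200, 20), (400, 40)]:
        print(f"{n} mappers x {m} reducers -> {shuffle_connections(n, m)} fetches")
```

Under this assumption the fetch count grows with the product of the two task counts, so adding tasks on both sides raises connection overhead faster than it raises parallelism.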
Re: DistributedCache - Hadoop - [mail # user]
...I think the issue you are seeing is because the distributed cache is not set up by default to create symlinks to the files it pulls over. If you want to access them through syml...
   Author: Robert Evans, 2011-06-09, 13:49
Re: Hadoop project - help needed - Hadoop - [mail # user]
...Parismav, So you are more or less trying to scrape some data in a distributed way. Well there are several things that you could do, just be careful I am not sure the terms of s...
   Author: Robert Evans, 2011-05-31, 15:54
Re: Sorting ... - Hadoop - [mail # user]
...Also if you want something that is fairly fast and a lot less dev work to get going you might want to look at pig. They can do a distributed order by that is fairly good. ...
   Author: Robert Evans, 2011-05-26, 15:34
Re: Applications creates bigger output than input? - Hadoop - [mail # user]
...I'm not sure if this has been mentioned or not but in Machine Learning with text based documents, the first stage is often a glorified word count action. Except much of the time...
   Author: Robert Evans, 2011-05-19, 14:57
Re: current line number as key? - Hadoop - [mail # user]
...You are correct, that there is no easy and efficient way to do this. You could create a new InputFormat that derives from FileInputFormat that makes it so the files do not split, and ...
   Author: Robert Evans, 2011-05-18, 19:18
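
Why an unsplit file makes line numbering straightforward can be illustrated outside Hadoop. The sketch below is plain Python, not Hadoop code. It assumes a simplified ownership rule (a line belongs to the byte range that contains its first byte), and the function names are hypothetical. It shows that a reader handed only a byte range knows each line's byte offset but not how many lines precede it, while a reader of the whole file can simply count.

```python
def lines_with_offsets(data: bytes):
    """Yield (byte_offset, line) for every line in the data."""
    offset = 0
    for raw in data.splitlines(keepends=True):
        yield offset, raw.rstrip(b"\n")
        offset += len(raw)


def number_lines_unsplit(data: bytes):
    """Whole file read as one unit: a running counter is the line number."""
    return [(i, line) for i, (_, line) in enumerate(lines_with_offsets(data), start=1)]


def split_records(data: bytes, start: int, end: int):
    """Lines owned by the byte range [start, end), under the simplified
    rule that a line belongs to the range holding its first byte."""
    return [(off, line) for off, line in lines_with_offsets(data) if start <= off < end]


if __name__ == "__main__":
    data = b"alpha\nbeta\ngamma\ndelta\n"
    print(number_lines_unsplit(data))           # global line numbers
    print(split_records(data, 12, len(data)))   # second range sees only an offset
```

In the demo, the reader of the second byte range receives `delta` with its byte offset only. Recovering its true line number would require counting the newlines in the earlier range, which is the extra work that keeping the file unsplit avoids, at the cost of processing each file in a single task.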
Re: FileSystem API - Moving files in HDFS - Hadoop - [mail # user]
...If they are lots of large files, and you need to copy them quickly, i.e. not have all the data go through a single machine, you can use hadoop distcp too. On 5/14/11 12:49 AM, ...
   Author: Robert Evans, 2011-05-16, 17:06
Re: Exception in thread "AWT-EventQueue-0" java.lang.NullPointerException - Hadoop - [mail # user]
...What version of hadoop are you using? On 5/14/11 9:37 AM, "Lạc Trung" wrote: Hello everybody! This exception was thrown when I tried to copy a file from loca...
   Author: Robert Evans, 2011-05-16, 17:03