Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 171 to 180 from 198 (0.077s).
Loading phrases to help you
refine your search...
Re: how to use hadoop in real life? - Hadoop - [mail # user]
...In general hadoop is simpler than you might imagine.  Yes, you need to create directories to store data.  This is much lighter weight than creating a table in SQL.  But the ke...
   Author: Ted Dunning, 2009-07-08, 17:17
[expand - 2 more] - Re: Sorting data sets - Hadoop - [mail # user]
...I know that this is probably old news, but sorting on time has the wonderful property of being unique up to ties.  If you have bounds on timing errors, having input that is sorted makes...
   Author: Ted Dunning, 2009-07-08, 16:46
Re: HDFS, client caches and transfer speeds - Hadoop - [mail # user]
...Is the client doing the writing part of the Hadoop system?  If so, it is definitely writing locally, but also to the cluster as well.  The simplest test is to just write from a mac...
   Author: Ted Dunning, 2009-07-07, 18:09
Re: Copy files https -> HDFS - Hadoop - [mail # user]
...To clarify, this just copies the file to HDFS.  In addition, it is good practice to use counters to provide progress updates (number of files started, bytes copied, total seconds of cop...
   Author: Ted Dunning, 2009-07-07, 16:39
Re: Need help understanding the source - Hadoop - [mail # dev]
...I would consider this to be a very delicate optimization with little utility in the real world.  It is very, very rare to reliably know how many records the reducer will see.  Gett...
   Author: Ted Dunning, 2009-07-06, 18:42
Re: end of input event to a mapper - Hadoop - [mail # user]
...That does not quite mean that this is the last map() call in a global sense.  In fact, the entire map task could be run a second time by the framework.  Close does mean that this p...
   Author: Ted Dunning, 2009-07-06, 18:13
[expand - 8 more] - Re: Parallell maps - Hadoop - [mail # user]
...On Fri, Jul 3, 2009 at 4:36 PM, Marcus Herou wrote:   yes.  exactly.  By reading data sequentially, things move vastly faster.    Several of the posters in this thre...
   Author: Ted Dunning, 2009-07-04, 01:48
[expand - 1 more] - Re: Using addCacheArchive - Hadoop - [mail # user]
...This code assumes that the files are in the working directory of the mapper.  You should ask the cache where they are instead nad use the the paths that it gives you.  See the code...
   Author: Ted Dunning, 2009-07-02, 19:25
Re: RDF Data Store on Hadoop - Hadoop - [mail # user]
...Heart looks related to Hama and I would expect a similar evolution.  On Thu, Jul 2, 2009 at 5:32 AM, Alex McLintock wrote:  ...
   Author: Ted Dunning, 2009-07-02, 18:26
[expand - 1 more] - Re: FYI, Large-scale graph computing at Google - Hadoop - [mail # user]
...Michal,  Can you say why it is difficult?  Is it because you have to run many map-reduce iterations?  If you allow many iterations, it seems like a fairly simple map reduce pr...
   Author: Ted Dunning, 2009-07-02, 17:45
Drill (270)
Zookeeper (250)
Hadoop (193)
HBase (134)
Pig (38)
HDFS (37)
MapReduce (35)
Chukwa (1)
Impala (1)
mail # user (136)
mail # general (34)
mail # dev (27)
issue (1)
last 7 days (0)
last 30 days (0)
last 90 days (1)
last 6 months (1)
last 9 months (198)
Harsh J (558)
Owen O'Malley (394)
Steve Loughran (388)
Todd Lipcon (239)
Eli Collins (182)
Alejandro Abdelnur (178)
Arun C Murthy (163)
Allen Wittenauer (148)
Chris Nauroth (146)
Ted Yu (121)
Tom White (121)
Daryn Sharp (115)
Nigel Daley (115)
Konstantin Shvachko (107)
Doug Cutting (96)
Aaron Kimball (94)
Colin Patrick McCabe (92)
Edward Capriolo (87)
Mark Kerzner (87)
jason hadoop (82)
Hairong Kuang (74)
Konstantin Boudnik (72)
Runping Qi (72)
Benoy Antony (70)
Suresh Srinivas (65)
Ted Dunning