Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 161 to 170 from 197 (0.079s).
Loading phrases to help you
refine your search...
Re: map side Vs. Reduce side join - Hadoop - [mail # user]
...One not-so-obvious instance of a map-side join is in term cooccurrence analysis for documents.  This is essentially a join of the document to term relation to itself.  Of course, t...
   Author: Ted Dunning, 2009-07-17, 07:03
Re: Looking for counterpart of Configure Method - Hadoop - [mail # user]
...I don't know what you mean by that.  The guarantee is that each mapper object will have close called and that map will never be called after close is called.  On Tue, Jul 14, 2009 ...
   Author: Ted Dunning, 2009-07-14, 18:43
Re: Disk configuration. - Hadoop - [mail # user]
...Be very cautious with spaces (i.e. don't use them)  On Mon, Jul 13, 2009 at 12:38 PM, Scott Carey wrote:     Ted Dunning, CTO DeepDyve  111 West Evelyn Ave. Ste. 202 Sunn...
   Author: Ted Dunning, 2009-07-13, 19:55
Re: Accessing static variables in map function - Hadoop - [mail # user]
...And NEVER expect updates to these variables to work like you think.  On Thu, Jul 9, 2009 at 8:24 PM, jason hadoop  wrote:  ...
   Author: Ted Dunning, 2009-07-10, 18:37
Re: Limit the number of open files in MultipleTextOutputFormat - Hadoop - [mail # user]
...On Fri, Jul 10, 2009 at 1:16 AM, Marcus Herou wrote:    Generally having lots of small files is very bad for performance.  It sounds like you are headed that direction.  ...
   Author: Ted Dunning, 2009-07-10, 17:06
Re: How to make data available in 10 minutes. - Hadoop - [mail # user]
...You are basically re-inventing lots of capabilities that others have solved before.  The idea of building an index that refers to files which are constructed by progressive merging is v...
   Author: Ted Dunning, 2009-07-09, 20:05
Re: Lucene index creation using Hadoop - Hadoop - [mail # user]
...Exactly as we do.  Also, I find that with a large enough collection to care about speed that we have many more shards than we have reducers so parallelism in indexing is nearly perfect....
   Author: Ted Dunning, 2009-07-09, 16:57
Re: Extracting data from HDFS and displaying stats to a webpage - Hadoop - [mail # user]
...On Wed, Jul 8, 2009 at 7:46 PM, Christophe Bisciglia  wrote:   This definitely used to be true, but look at the recent news: http://www.docstoc.com/docs/7493304/HBase-Goes-Realtime...
   Author: Ted Dunning, 2009-07-09, 05:15
Re: Merging many output files from reducer - Hadoop - [mail # user]
...On Wed, Jul 8, 2009 at 3:38 PM, Owen O'Malley  wrote:    Also, the need to merge often arises from a need to import the data into an external database.  That doesn't soun...
   Author: Ted Dunning, 2009-07-08, 23:55
Re: how to use hadoop in real life? - Hadoop - [mail # user]
...In general hadoop is simpler than you might imagine.  Yes, you need to create directories to store data.  This is much lighter weight than creating a table in SQL.  But the ke...
   Author: Ted Dunning, 2009-07-08, 17:17
Drill (259)
Zookeeper (250)
Hadoop (192)
HBase (134)
Pig (38)
HDFS (37)
MapReduce (35)
Chukwa (1)
mail # user (136)
mail # general (33)
mail # dev (27)
issue (1)
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (197)
Harsh J (554)
Owen O'Malley (396)
Steve Loughran (378)
Todd Lipcon (238)
Eli Collins (181)
Alejandro Abdelnur (162)
Arun C Murthy (161)
Chris Nauroth (141)
Allen Wittenauer (124)
Tom White (118)
Nigel Daley (115)
Ted Yu (114)
Daryn Sharp (110)
Konstantin Shvachko (106)
Aaron Kimball (93)
Doug Cutting (93)
Edward Capriolo (87)
Colin Patrick McCabe (86)
Mark Kerzner (86)
jason hadoop (82)
Hairong Kuang (74)
Runping Qi (72)
Konstantin Boudnik (70)
Benoy Antony (69)
Suresh Srinivas (63)
Ted Dunning