Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 161 to 170 from 198 (0.074s).
Loading phrases to help you
refine your search...
[expand - 1 more] - Re: Data-local map tasks lower than Launched map tasks even with full replication - Hadoop - [mail # user]
...This is a very non-typical node for hadoop clusters.  8 cores is not that uncommon, but normally nodes have only 2 disks.  The rationale for a small number of spindles per machine ...
   Author: Ted Dunning, 2009-07-18, 00:09
Re: map side Vs. Reduce side join - Hadoop - [mail # user]
...One not-so-obvious instance of a map-side join is in term cooccurrence analysis for documents.  This is essentially a join of the document to term relation to itself.  Of course, t...
   Author: Ted Dunning, 2009-07-17, 07:03
Re: Looking for counterpart of Configure Method - Hadoop - [mail # user]
...I don't know what you mean by that.  The guarantee is that each mapper object will have close called and that map will never be called after close is called.  On Tue, Jul 14, 2009 ...
   Author: Ted Dunning, 2009-07-14, 18:43
Re: Disk configuration. - Hadoop - [mail # user]
...Be very cautious with spaces (i.e. don't use them)  On Mon, Jul 13, 2009 at 12:38 PM, Scott Carey wrote:     Ted Dunning, CTO DeepDyve  111 West Evelyn Ave. Ste. 202 Sunn...
   Author: Ted Dunning, 2009-07-13, 19:55
[expand - 1 more] - Re: Accessing static variables in map function - Hadoop - [mail # user]
...And NEVER expect updates to these variables to work like you think.  On Thu, Jul 9, 2009 at 8:24 PM, jason hadoop  wrote:  ...
   Author: Ted Dunning, 2009-07-10, 18:37
Re: Limit the number of open files in MultipleTextOutputFormat - Hadoop - [mail # user]
...On Fri, Jul 10, 2009 at 1:16 AM, Marcus Herou wrote:    Generally having lots of small files is very bad for performance.  It sounds like you are headed that direction.  ...
   Author: Ted Dunning, 2009-07-10, 17:06
Re: How to make data available in 10 minutes. - Hadoop - [mail # user]
...You are basically re-inventing lots of capabilities that others have solved before.  The idea of building an index that refers to files which are constructed by progressive merging is v...
   Author: Ted Dunning, 2009-07-09, 20:05
[expand - 1 more] - Re: Lucene index creation using Hadoop - Hadoop - [mail # user]
...Exactly as we do.  Also, I find that with a large enough collection to care about speed that we have many more shards than we have reducers so parallelism in indexing is nearly perfect....
   Author: Ted Dunning, 2009-07-09, 16:57
Re: Extracting data from HDFS and displaying stats to a webpage - Hadoop - [mail # user]
...On Wed, Jul 8, 2009 at 7:46 PM, Christophe Bisciglia  wrote:   This definitely used to be true, but look at the recent news: http://www.docstoc.com/docs/7493304/HBase-Goes-Realtime...
   Author: Ted Dunning, 2009-07-09, 05:15
Re: Merging many output files from reducer - Hadoop - [mail # user]
...On Wed, Jul 8, 2009 at 3:38 PM, Owen O'Malley  wrote:    Also, the need to merge often arises from a need to import the data into an external database.  That doesn't soun...
   Author: Ted Dunning, 2009-07-08, 23:55
Sort:
project
Drill (268)
Zookeeper (250)
Hadoop (193)
HBase (134)
Pig (38)
HDFS (37)
MapReduce (35)
Chukwa (1)
Impala (1)
type
mail # user (136)
mail # general (34)
mail # dev (27)
issue (1)
date
last 7 days (0)
last 30 days (0)
last 90 days (1)
last 6 months (1)
last 9 months (198)
author
Harsh J (558)
Owen O'Malley (394)
Steve Loughran (388)
Todd Lipcon (239)
Eli Collins (182)
Alejandro Abdelnur (178)
Arun C Murthy (163)
Allen Wittenauer (148)
Chris Nauroth (146)
Ted Yu (121)
Tom White (121)
Daryn Sharp (115)
Nigel Daley (115)
Konstantin Shvachko (107)
Doug Cutting (96)
Aaron Kimball (94)
Colin Patrick McCabe (92)
Edward Capriolo (88)
Mark Kerzner (87)
jason hadoop (82)
Hairong Kuang (74)
Konstantin Boudnik (72)
Runping Qi (72)
Benoy Antony (70)
Suresh Srinivas (64)
Ted Dunning