Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 197 (0.055s).
Loading phrases to help you
refine your search...
Re: Unclear Hadoop 2.1X documentation - Hadoop - [mail # general]
...This is a very small amount of memory for running Hadoop + user programs.  You might consider running your tests on a cloud provider like Amazon.  That will give you access to dece...
   Author: Ted Dunning, 2013-09-16, 00:57
Re: Hardware Selection for Hadoop - Hadoop - [mail # user]
...Data nodes normally are also task nodes.  With 8 physical cores it isn't that unreasonable to have 64GB whereas 24GB really is going to pinch.  Achieving highest performance requir...
   Author: Ted Dunning, 2013-05-05, 18:47
Re: Bloom Filter analogy in SQL - Hadoop - [mail # user]
...This isn't a very Hadoop question.  A Bloom filter is a very low level data structure that doesn't really any correlate in SQL.  It allows you to find duplicates quickly and probab...
   Author: Ted Dunning, 2013-03-30, 06:31
Re: Naïve k-means using hadoop - Hadoop - [mail # user]
...Spark would be an excellent choice for the iterative sort of k-means.  It could be good for sketch-based algorithms as well, but the difference would be much less pronounced.   &nb...
   Author: Ted Dunning, 2013-03-27, 16:49
[HADOOP-2781] Hadoop/Groovy integration - Hadoop - [issue]
...This is a place-holder issue to hold initial release of the groovy integration for hadoop.The goal is to be able to write very simple map-reduce programs in just a few lines of code in a fun...
http://issues.apache.org/jira/browse/HADOOP-2781    Author: Ted Dunning, 2013-03-22, 19:34
Re: Question related to Decompressor interface - Hadoop - [mail # user]
...All of these suggestions tend to founder on the problem of key management.  What you need to do is  1) define your threats.  2) define your architecture including key manageme...
   Author: Ted Dunning, 2013-02-11, 06:08
Re: Mutiple dfs.data.dir vs RAID0 - Hadoop - [mail # user]
...Typical best practice is to have a separate file system per spindle.  If you have a RAID only controller (many are), then you just create one RAID per spindle.  The effect is the s...
   Author: Ted Dunning, 2013-02-11, 06:04
Re: How can I limit reducers to one-per-node? - Hadoop - [mail # user]
...For crawler type apps, typically you direct all of the URL's to crawl from a single domain to a single reducer.  Typically, you also have many reducers so that you can get decent bandwi...
   Author: Ted Dunning, 2013-02-11, 05:55
Re: How to Backup HDFS data ? - Hadoop - [mail # user]
...Incremental backups are nice to avoid copying all your data again.  You can code these at the application layer if you have nice partitioning and keep track correctly.  You can als...
   Author: Ted Dunning, 2013-01-25, 07:42
Re: Estimating disk space requirements - Hadoop - [mail # user]
...Jeff makes some good points here.  On Fri, Jan 18, 2013 at 5:01 PM, Jeffrey Buell  wrote:   This is definitely a good point with respect to untainted Hadoop, but with a system...
   Author: Ted Dunning, 2013-01-19, 03:39
Drill (246)
Zookeeper (242)
Hadoop (192)
HBase (134)
Pig (38)
HDFS (37)
MapReduce (35)
Chukwa (1)
mail # user (136)
mail # general (33)
mail # dev (27)
issue (1)
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (197)
Harsh J (537)
Owen O'Malley (402)
Steve Loughran (358)
Todd Lipcon (234)
Eli Collins (181)
Arun C Murthy (157)
Chris Nauroth (129)
Alejandro Abdelnur (121)
Allen Wittenauer (115)
Nigel Daley (112)
Tom White (111)
Daryn Sharp (108)
Konstantin Shvachko (102)
Ted Yu (95)
Aaron Kimball (93)
Ted Dunning