Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 198 (0.163s).
Loading phrases to help you
refine your search...
Re: Code guidelines and bash - Hadoop - [mail # general]
...Indeed. But I, for one, have had more than 80 characters per line ever since I got a vt-100 terminal. I don't know of any dev environments in common use today that can't display >100 char...
   Author: Ted Dunning, 2014-07-28, 04:28
Re: Unclear Hadoop 2.1X documentation - Hadoop - [mail # general]
...This is a very small amount of memory for running Hadoop + user programs.  You might consider running your tests on a cloud provider like Amazon.  That will give you access to dece...
   Author: Ted Dunning, 2013-09-16, 00:57
[expand - 1 more] - Re: Hardware Selection for Hadoop - Hadoop - [mail # user]
...Data nodes normally are also task nodes.  With 8 physical cores it isn't that unreasonable to have 64GB whereas 24GB really is going to pinch.  Achieving highest performance requir...
   Author: Ted Dunning, 2013-05-05, 18:47
Re: Bloom Filter analogy in SQL - Hadoop - [mail # user]
...This isn't a very Hadoop question.  A Bloom filter is a very low level data structure that doesn't really any correlate in SQL.  It allows you to find duplicates quickly and probab...
   Author: Ted Dunning, 2013-03-30, 06:31
Re: Naïve k-means using hadoop - Hadoop - [mail # user]
...Spark would be an excellent choice for the iterative sort of k-means.  It could be good for sketch-based algorithms as well, but the difference would be much less pronounced.   &nb...
   Author: Ted Dunning, 2013-03-27, 16:49
[HADOOP-2781] Hadoop/Groovy integration - Hadoop - [issue]
...This is a place-holder issue to hold initial release of the groovy integration for hadoop.The goal is to be able to write very simple map-reduce programs in just a few lines of code in a fun...
http://issues.apache.org/jira/browse/HADOOP-2781    Author: Ted Dunning, 2013-03-22, 19:34
Re: Question related to Decompressor interface - Hadoop - [mail # user]
...All of these suggestions tend to founder on the problem of key management.  What you need to do is  1) define your threats.  2) define your architecture including key manageme...
   Author: Ted Dunning, 2013-02-11, 06:08
Re: Mutiple dfs.data.dir vs RAID0 - Hadoop - [mail # user]
...Typical best practice is to have a separate file system per spindle.  If you have a RAID only controller (many are), then you just create one RAID per spindle.  The effect is the s...
   Author: Ted Dunning, 2013-02-11, 06:04
Re: How can I limit reducers to one-per-node? - Hadoop - [mail # user]
...For crawler type apps, typically you direct all of the URL's to crawl from a single domain to a single reducer.  Typically, you also have many reducers so that you can get decent bandwi...
   Author: Ted Dunning, 2013-02-11, 05:55
Re: How to Backup HDFS data ? - Hadoop - [mail # user]
...Incremental backups are nice to avoid copying all your data again.  You can code these at the application layer if you have nice partitioning and keep track correctly.  You can als...
   Author: Ted Dunning, 2013-01-25, 07:42
Drill (282)
Zookeeper (250)
Hadoop (193)
HBase (134)
Pig (38)
HDFS (37)
MapReduce (35)
Chukwa (1)
Impala (1)
mail # user (136)
mail # general (34)
mail # dev (27)
issue (1)
last 7 days (0)
last 30 days (0)
last 90 days (1)
last 6 months (1)
last 9 months (198)
Harsh J (559)
Owen O'Malley (394)
Steve Loughran (390)
Todd Lipcon (238)
Eli Collins (182)
Alejandro Abdelnur (178)
Arun C Murthy (163)
Allen Wittenauer (148)
Chris Nauroth (146)
Ted Yu (126)
Tom White (121)
Daryn Sharp (115)
Nigel Daley (115)
Konstantin Shvachko (107)
Doug Cutting (96)
Aaron Kimball (94)
Colin Patrick McCabe (93)
Edward Capriolo (88)
Mark Kerzner (87)
jason hadoop (82)
Hairong Kuang (74)
Konstantin Boudnik (72)
Runping Qi (72)
Benoy Antony (70)
Suresh Srinivas (64)
Ted Dunning