| clear query|facets|time |
Search criteria: .
Results from 1 to 10 from
321 (0.406s).
|
|
|
Loading phrases to help you refine your search...
|
|
Re: Hardware Selection for Hadoop - Hadoop - [mail # user]
|
|
...Data nodes normally are also task nodes. With 8 physical cores it isn't that unreasonable to have 64GB whereas 24GB really is going to pinch. Achieving highest performance requir...
|
|
|
Author: Ted Dunning,
2013-05-05, 18:47
|
|
|
Re: Hardware Selection for Hadoop - Hadoop - [mail # user]
|
|
...I think that having more than 6 drives is better. More memory never hurts. If you have too little, you may have to run with fewer slots than optimal. 10GB networking is goo...
|
|
|
Author: Ted Dunning,
2013-04-29, 18:52
|
|
|
Re: Bloom Filter analogy in SQL - Hadoop - [mail # user]
|
|
...This isn't a very Hadoop question. A Bloom filter is a very low level data structure that doesn't really any correlate in SQL. It allows you to find duplicates quickly and probab...
|
|
|
Author: Ted Dunning,
2013-03-30, 06:31
|
|
|
Re: Naïve k-means using hadoop - Hadoop - [mail # user]
|
|
...Spark would be an excellent choice for the iterative sort of k-means. It could be good for sketch-based algorithms as well, but the difference would be much less pronounced. &nb...
|
|
|
Author: Ted Dunning,
2013-03-27, 16:49
|
|
|
[HADOOP-2781] Hadoop/Groovy integration - Hadoop - [issue]
|
|
...This is a place-holder issue to hold initial release of the groovy integration for hadoop.The goal is to be able to write very simple map-reduce programs in just a few lines of code in a fun...
|
|
|
http://issues.apache.org/jira/browse/HADOOP-2781
Author: Ted Dunning,
2013-03-22, 19:34
|
|
|
Re: Question related to Decompressor interface - Hadoop - [mail # user]
|
|
...All of these suggestions tend to founder on the problem of key management. What you need to do is 1) define your threats. 2) define your architecture including key manageme...
|
|
|
Author: Ted Dunning,
2013-02-11, 06:08
|
|
|
Re: Mutiple dfs.data.dir vs RAID0 - Hadoop - [mail # user]
|
|
...Typical best practice is to have a separate file system per spindle. If you have a RAID only controller (many are), then you just create one RAID per spindle. The effect is the s...
|
|
|
Author: Ted Dunning,
2013-02-11, 06:04
|
|
|
Re: How can I limit reducers to one-per-node? - Hadoop - [mail # user]
|
|
...For crawler type apps, typically you direct all of the URL's to crawl from a single domain to a single reducer. Typically, you also have many reducers so that you can get decent bandwi...
|
|
|
Author: Ted Dunning,
2013-02-11, 05:55
|
|
|
Re: How to Backup HDFS data ? - Hadoop - [mail # user]
|
|
...Incremental backups are nice to avoid copying all your data again. You can code these at the application layer if you have nice partitioning and keep track correctly. You can als...
|
|
|
Author: Ted Dunning,
2013-01-25, 07:42
|
|
|
Re: Estimating disk space requirements - Hadoop - [mail # user]
|
|
...Jeff makes some good points here. On Fri, Jan 18, 2013 at 5:01 PM, Jeffrey Buell wrote: This is definitely a good point with respect to untainted Hadoop, but with a system...
|
|
|
Author: Ted Dunning,
2013-01-19, 03:39
|
|
|
|