Re: Wrapping around BitSet with the Writable interface - HDFS - [mail # user]
...Another interesting alternative is the EWAH implementation of Java bitsets, which allows efficient compressed bitsets with very fast OR operations.  https://github.com/lemire/javaewah ...
   Author: Ted Dunning, 2013-05-12, 20:27
Re: issues with decrease the default.block.size - HDFS - [mail # user]
...The block size controls lots of things in Hadoop.  It affects read parallelism, scalability, block allocation and other aspects of operations either directly or indirectly.   On Su...
   Author: Ted Dunning, 2013-05-12, 20:11
Re: What's the best disk configuration for hadoop? SSD's Raid levels, etc? - HDFS - [mail # user]
...This sounds (with no real evidence) like you are a bit light on memory for that number of cores.  That could cause you to be spilling map outputs early and very much slowing things down...
   Author: Ted Dunning, 2013-05-11, 22:23
Re: M/R job optimization - HDFS - [mail # user]
...Have you checked the logs?  Is there a task that is taking a long time?  What is that task doing?  There are two basic possibilities:  a) you have a skewed join like the ...
   Author: Ted Dunning, 2013-04-26, 18:00
Re: Which hadoop installation should I use on ubuntu server? - HDFS - [mail # user]
...Also, Canonical just announced that MapR is available in the Partner repos.   On Thu, Mar 28, 2013 at 7:22 AM, Nitin Pawar wrote:  ...
   Author: Ted Dunning, 2013-03-28, 07:55
Re: Static class vs Normal Class when to use - HDFS - [mail # user]
...Another Ted piping in.  For Hadoop use, it is dangerous to use anything but a static class for your mapper and reducer functions since you may accidentally think that you can access a c...
   Author: Ted Dunning, 2013-03-28, 07:54
Re: Hadoop distcp from CDH4 to Amazon S3 - Improve Throughput - HDFS - [mail # user]
...The EMR distributions have special versions of the s3 file system.  They might be helpful here.  Of course, you likely aren't running those if you are seeing 5MB/s.  An extrem...
   Author: Ted Dunning, 2013-03-28, 07:45
Re: Naïve k-means using hadoop - HDFS - [mail # user]
...And, of course, due credit should be given here.  The advanced clustering algorithms in Crunch were lifted from the new stuff in Mahout pretty much step for step.  The Mahout group...
   Author: Ted Dunning, 2013-03-27, 16:47
Re: copytolocal vs distcp - HDFS - [mail # user]
...Try file:///fs4/outdir  Symbolic links can also help.  Note that this file system has to be visible with the same path on all hosts.  You may also be bandwidth limited by what...
   Author: Ted Dunning, 2013-03-09, 19:00
Re: product recommendations engine - HDFS - [mail # user]
...Yeah... you can make this work.  First, if your setup is relatively small, then you won't need Hadoop.  Second, having lots of kinds of actions is a very reasonable thing to have. ...
   Author: Ted Dunning, 2013-02-18, 03:13