Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 11 to 16 from 16 (0.068s).
Loading phrases to help you
refine your search...
Shuffle's getMapOutput() fails with EofException, followed by IllegalStateException - HDFS - [mail # user]
...I'm having exactly this problem, and it's causing my job to fail when I try to process a larger amount of data (I'm attempting to process 30GB of compressed CSVs and the entire job fails eve...
   Author: David Parks, 2012-12-13, 04:22
RE: Hadoop 101 - HDFS - [mail # user]
...Nothing that I'm aware of for text files, I'd just use standard unix utils to process it outside of Hadoop.  As to getting a reader from any of the Input Formats, here's the typical exa...
   Author: David Parks, 2012-12-13, 04:16
Can we declare some HDFS nodes "primary" - HDFS - [mail # user]
...Assume for a moment that you have a large cluster of 500 AWS spot instance servers running. And you want to keep the bid price low, so at some point it's likely that the whole cluster will g...
   Author: David Parks, 2012-12-11, 11:39
RE: When reduce function is used as combiner? - HDFS - [mail # user]
...The map task may use a combiner 0+ times. Basically that means (as far as I understand), if the map output data is below some internal hadoop threshold, it'll just send it to the reducer, if...
   Author: David Parks, 2012-12-11, 11:32
RE: [Bulk] Re: Failed To Start  SecondaryNameNode in Secure Mode - HDFS - [mail # user]
...I'm curious about profiling, I see some documentation about it (1.0.3 on AWS), but the references to JobConf seem to be for the "old api" and I've got everything running on the "new api". &n...
   Author: David Parks, 2012-12-04, 08:00
How do map tasks get assigned efficiently? - HDFS - [mail # user]
...Even after reading O'reillys book on hadoop I don't feel like I have a clear vision of how the map tasks get assigned.     They depend on splits right?     But I have 3 j...
   Author: David Parks, 2012-10-24, 06:10
MapReduce (21)
Hadoop (14)
HDFS (11)
Pig (3)
HBase (1)
mail # user (16)
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (16)
Todd Lipcon (326)
Eli Collins (263)
Harsh J (261)
Colin Patrick McCabe (240)
Tsz Wo (203)
Jing Zhao (174)
Chris Nauroth (166)
Arpit Agarwal (152)
Aaron T. Myers (141)
Andrew Wang (141)
Suresh Srinivas (138)
Brandon Li (137)
Haohui Mai (136)
Kihwal Lee (114)
Daryn Sharp (105)
Ted Yu (82)
Uma Maheswara Rao G (82)
Alejandro Abdelnur (73)
Tsz Wo Nicholas Sze (63)
Konstantin Shvachko (62)
Akira AJISAKA (61)
Stephen Chu (58)
Yongjun Zhang (56)
Steve Loughran (52)
Allen Wittenauer (48)
David Parks