Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 41 to 50 from 142 (0.246s).
Loading phrases to help you
refine your search...
Re: Slow shuffle stage? - Hadoop - [mail # user]
...892 nodes, 4 tasks each, 3:1 mapper/reducer ratio.  Each map task  outputs four records, ~18MB each.  They are fairly evenly distributed to  the 17 reducers.  As to ...
   Author: Keith Wiley, 2011-11-11, 15:52
Slow shuffle stage? - Hadoop - [mail # user]
...What sorts of causes might be responsible for a long or slow shuffle  stage?  For example, I have a job of 266 maps (each emitting 4 records)  and 17 reduces (each ingesting a...
   Author: Keith Wiley, 2011-11-11, 01:20
Streaming input, data locality - Hadoop - [mail # user]
...I would like my streaming job to receive the names of files stored on  HDFS, but not the actual contents of the files, and I would like data  locality to be honored (I want mappers...
   Author: Keith Wiley, 2011-08-19, 21:46
cmdenv LD_LIBRARY_PATH - Hadoop - [mail # user]
...I know you can do something like this:  -cmdenv LD_LIBRARY_PATH=./my_libs  if you have shared libraries in a subdirectory under the cwd (such as  occurs when using -cacheArchi...
   Author: Keith Wiley, 2011-08-05, 18:37
Re: streaming cacheArchive shared libraries - Hadoop - [mail # user]
...Right, so it was pushed down a level into the "testlink" directory.   That's why my shared libraries were not linking properly to my mapper  executable.  I can fix that by usi...
   Author: Keith Wiley, 2011-08-05, 17:49
Re: streaming cacheArchive shared libraries - Hadoop - [mail # user]
...Okay, I think I understand.  The symlink name that follows the pound  sign in the -cacheArchive directive isn't the name of the transferred  jar file -- it is the name of a di...
   Author: Keith Wiley, 2011-08-05, 17:48
Re: streaming cacheArchive shared libraries - Hadoop - [mail # user]
...Quick followup.  I substituted the true mapper for a little python  script that just lists the cwd's contents and dumps them to the  streaming output (stderr).  Oddly, I ...
   Author: Keith Wiley, 2011-08-05, 17:27
streaming cacheArchive shared libraries - Hadoop - [mail # user]
...I can use cacheFile to load .so files into the distributed cache and it  works fine (the streaming executable links against the .so and runs),  but I can't get it to work with -cac...
   Author: Keith Wiley, 2011-08-05, 17:10
Re: Upload, then decompress archive on HDFS? - Hadoop - [mail # user]
...I can envision an M/R job for the purpose of manipulating hdfs, such as  (de)compressing files and resaving them back to HDFS.  I just didn't  think it should be necessary to ...
   Author: Keith Wiley, 2011-08-05, 15:14
Upload, then decompress archive on HDFS? - Hadoop - [mail # user]
...Instead of "hd fs -put" hundreds of files of X megs, I want to do it  once on a gzipped (or zipped) archive, one file, much smaller total  megs.  Then I want to decompress the...
   Author: Keith Wiley, 2011-08-04, 23:28
Sort:
project
Hadoop (142)
Hive (55)
MapReduce (17)
HDFS (9)
type
mail # user (135)
mail # general (5)
mail # dev (2)
date
last 7 days (0)
last 30 days (1)
last 90 days (1)
last 6 months (4)
last 9 months (142)
author
Harsh J (1375)
Steve Loughran (928)
Owen O'Malley (816)
Todd Lipcon (755)
Arun C Murthy (573)
Eli Collins (513)
Allen Wittenauer (461)
Doug Cutting (344)
Konstantin Boudnik (335)
Mark Kerzner (334)
Edward Capriolo (328)
Ted Dunning (321)
Brian Bockelman (305)
Tom White (302)
jason hadoop (279)
Keith Wiley