Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 15 (0.273s).
Loading phrases to help you
refine your search...
Re: how to differentiate which input directory current record comes from? - MapReduce - [mail # user]
...If you use the new API, you can access the MapContext object in the setup method of the mapper. Then, you can get the input split with MapContext#getInputSplit(), cast it to FileSplit and ob...
   Author: Sigurd Spieckermann, 2012-12-15, 10:17
Re: Spill file compression - MapReduce - [mail # user]
...When I log the calls of the combiner function and print the number of elements iterated over, it is all 1 during the spill-writing phase and the combiner is called very often. Is this normal...
   Author: Sigurd Spieckermann, 2012-11-07, 15:14
Re: Spill file compression - MapReduce - [mail # user]
...Hm, maybe I need some clarification on what the combiner exactly does. From what I understand from "Hadoop - The Definitive Guide", there are a few occasions when a combiner may be called be...
   Author: Sigurd Spieckermann, 2012-11-07, 14:29
Re: Spill file compression - MapReduce - [mail # user]
...OK, I found the answer to one of my questions just now -- the location of the spill files and their sizes. So, there's a discrepancy between what I see and what you said about the compressio...
   Author: Sigurd Spieckermann, 2012-11-07, 13:18
Re: Spill file compression - MapReduce - [mail # user]
...OK, just wanted to confirm. Maybe there is another problem then. I just looked at the task logs and there were ~200 spills recorded for a single task, only afterwards there was a merge phase...
   Author: Sigurd Spieckermann, 2012-11-07, 13:12
Spill file compression - MapReduce - [mail # user]
...Hi guys,  I've encountered a situation where the ratio between "Map output bytes" and "Map output materialized bytes" is quite huge and during the map-phase data is spilled to disk quit...
   Author: Sigurd Spieckermann, 2012-11-07, 12:32
Re: Data locality of map-side join - MapReduce - [mail # user]
...Interesting to know that Hive and Pig are doing something in this direction. I'm dealing with the Hadoop join-package which doesn't use DistributedCache though but it rather pulls the other ...
   Author: Sigurd Spieckermann, 2012-10-23, 09:22
Data locality of map-side join - MapReduce - [mail # user]
...Hi guys,  I've been trying to figure out whether a map-side join using the  join-package does anything clever regarding data locality with respect  to at least one of the part...
   Author: Sigurd Spieckermann, 2012-10-22, 20:29
Join-package in new API? - MapReduce - [mail # user]
...Hi,  I've just noticed that the join-package only exists in the old map-reduce API. Is there a particular reason why it's not in the new API? (deprecated maybe?) If so, what is the appr...
   Author: Sigurd Spieckermann, 2012-10-10, 14:47
Re: Reading from HDFS from inside the mapper - MapReduce - [mail # user]
...OK, I see... Is there any way to change this? I need guaranteed order in order for the map-side join to work correctly and I need the standalone mode for debugging code that is executed on t...
   Author: Sigurd Spieckermann, 2012-09-17, 13:50
Sort:
project
MapReduce (15)
Hadoop (3)
HDFS (1)
type
mail # user (15)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (1)
last 9 months (15)
author
Harsh J (1046)
Arun C Murthy (501)
Vinod Kumar Vavilapalli (351)
Todd Lipcon (283)
Amar Kamat (184)
Mohammad Tariq (174)
Thomas Graves (173)
Owen O'Malley (162)
Hemanth Yamijala (155)
Amareshwari Sriramadasu (153)
Pedro Costa (153)
Ted Yu (148)
Robert Evans (147)
Tom White (138)
Aaron Kimball (131)
Sigurd Spieckermann