Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 7 from 7 (0.069s).
Loading phrases to help you
refine your search...
Re: How to acquire Input Filename - MapReduce - [mail # user]
...As I said, getInputSplit() Returns a TaggedInputSplit which is private.  On Thu, Dec 6, 2012 at 7:50 AM, Olivier Varene - echo wrote:  ...
   Author: Hans Uhlig, 2012-12-08, 18:31
How to acquire Input Filename - MapReduce - [mail # dev]
...I am currently using multiple inputs to merge quite a few different but related filetypes and I am attempting to track down some bad data. However Multiple inputs shields the FileSplit behin...
   Author: Hans Uhlig, 2012-12-06, 02:40
How to acquire Input Filename - MapReduce - [mail # user]
...I am currently using multiple inputs to merge quite a few different but related filetypes and I am attempting to track down some bad data. However Multiple inputs shields the FileSplit behin...
   Author: Hans Uhlig, 2012-12-05, 23:24
Re: Mapper Record Spillage - MapReduce - [mail # user]
...If that is the case then these two lines should make more than enough memory. On a virtually unused cluster.  job.getConfiguration().setInt("io.sort.mb", 2048); job.getConfiguration().s...
   Author: Hans Uhlig, 2012-03-11, 08:06
Re: Mapper Record Spillage - MapReduce - [mail # user]
...That was a typo in my email not in the configuration. Is the memory reserved for the tasks when the task tracker starts? You seem to be suggesting that I need to set the memory to be the sam...
   Author: Hans Uhlig, 2012-03-11, 05:54
Re: Mapper Record Spillage - MapReduce - [mail # user]
...I am attempting to specify this for a single job during its creation/submission. Not via the general construct. I am using the new api so I am adding the values to the conf passed into new J...
   Author: Hans Uhlig, 2012-03-11, 04:08
Mapper Record Spillage - MapReduce - [mail # user]
...I am attempting to speed up a mapping process whose input is GZIP compressed CSV files. The files range from 1-2GB, I am running on a Cluster where each node has a total of 32GB memory avail...
   Author: Hans Uhlig, 2012-03-11, 04:00
Sort:
project
Hive (7)
MapReduce (7)
HDFS (4)
Flume (1)
HBase (1)
Hadoop (1)
Pig (1)
type
mail # user (6)
mail # dev (1)
date
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (3)
last 9 months (7)
author
Harsh J (1035)
Arun C Murthy (500)
Vinod Kumar Vavilapalli (346)
Todd Lipcon (282)
Amar Kamat (184)
Mohammad Tariq (174)
Thomas Graves (173)
Owen O'Malley (162)
Hemanth Yamijala (155)
Amareshwari Sriramadasu (153)
Pedro Costa (153)
Ted Yu (148)
Robert Evans (146)
Tom White (138)
Aaron Kimball (131)
Hans Uhlig