Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 53 (0.105s).
Loading phrases to help you
refine your search...
Re: VM reuse! - Hadoop - [mail # user]
...Hi Rahul  If you look at larger cluster and jobs that involve larger input data sets. The data would be spread across the whole cluster, and a single node might have  various block...
   Author: Bejoy Ks, 2013-04-16, 09:10
Re: Unexpected Hadoop behavior: map task re-running after reducer has been running - Hadoop - [mail # user]
...Hi David  The issue with the maps getting re triggered is because one of the node where map outputs are stored are getting lost during reduce phase. As a result of this the map outputs ...
   Author: Bejoy Ks, 2013-03-11, 13:16
Re: One file per mapper? - Hadoop - [mail # user]
...Hi Terry  If you are having files smaller than hdfs block size and if you are using Default TextInputFormat with the default properties for split sizes there would be just one file per ...
   Author: Bejoy Ks, 2012-10-08, 14:28
Re: How to lower the total number of map tasks - Hadoop - [mail # user]
...Sorry for the typo, the property name is mapred.max.split.size  Also just for changing the number of map tasks you don't need to modify the hdfs block size.  On Tue, Oct 2, 2012 at...
   Author: Bejoy Ks, 2012-10-02, 17:03
Re: How to lower the total number of map tasks - Hadoop - [mail # user]
...Hi  You need to alter the value of mapred.max.split size to a value larger than your block size to have less number of map tasks than the default.  On Tue, Oct 2, 2012 at 10:04 PM,...
   Author: Bejoy Ks, 2012-10-02, 17:01
Re: Understanding of the hadoop distribution system (tuning) - Hadoop - [mail # user]
...Hi Elaine  Slots (mapred.tasktracker.[map/reduce].tasks.maximum) are configured on a cluster/node/TaskTracker level and not on a job level. You configure this based on the available res...
   Author: Bejoy Ks, 2012-09-11, 06:42
Re: Replication Factor Modification - Hadoop - [mail # user]
...Hi  Uddipan  As Harsh mentioned, replication factor is a client side property . So you need to update the value for 'dfs.replication' in hdfs-site.xml as per your requirement in yo...
   Author: Bejoy Ks, 2012-09-05, 18:38
Re: Replication Factor Modification - Hadoop - [mail # user]
...Hi  You can change the replication factor of an existing directory using '-setrep'  http://hadoop.apache.org/common/docs/r0.20.0/hdfs_shell.html#setrep  The below command will...
   Author: Bejoy Ks, 2012-09-05, 18:16
Re: Using hadoop for analytics - Hadoop - [mail # user]
...Hi Prashant  Welcome to Hadoop Community. :)  Hadoop is meant for processing large data volumes. Saying that, for your custom requirements you should write your own mapper and redu...
   Author: Bejoy Ks, 2012-09-05, 08:57
Re: knowing the nodes on which reduce tasks will run - Hadoop - [mail # user]
...Hi Abhay  You need this value to be changed before you submit your job and restart TT. Modifying this value in  mid time won't affect the running jobs.  On Mon, Sep 3, 2012 at...
   Author: Bejoy Ks, 2012-09-03, 15:46
Sort:
project
Hive (132)
MapReduce (67)
Hadoop (53)
HDFS (12)
Pig (1)
type
mail # user (53)
date
last 7 days (0)
last 30 days (0)
last 90 days (2)
last 6 months (2)
last 9 months (53)
author
Harsh J (1373)
Steve Loughran (928)
Owen O'Malley (816)
Todd Lipcon (755)
Arun C Murthy (573)
Eli Collins (512)
Allen Wittenauer (461)
Doug Cutting (344)
Konstantin Boudnik (335)
Mark Kerzner (334)
Edward Capriolo (328)
Ted Dunning (321)
Brian Bockelman (305)
Tom White (302)
jason hadoop (279)
Bejoy Ks