Home | About | Sematext search-lucene.com search-hadoop.com
clear query|facets|time Search criteria: .   Results from 1 to 10 from 41 (0.089s).
Loading phrases to help you
refine your search...
Re: VM reuse! - Hadoop - [mail # user]
...Hi Rahul  If you look at larger cluster and jobs that involve larger input data sets. The data would be spread across the whole cluster, and a single node might have  various block...
   Author: Bejoy Ks, 2013-04-16, 09:10
Re: Unexpected Hadoop behavior: map task re-running after reducer has been running - Hadoop - [mail # user]
...Hi David  The issue with the maps getting re triggered is because one of the node where map outputs are stored are getting lost during reduce phase. As a result of this the map outputs ...
   Author: Bejoy Ks, 2013-03-11, 13:16
Re: One file per mapper? - Hadoop - [mail # user]
...Hi Terry  If you are having files smaller than hdfs block size and if you are using Default TextInputFormat with the default properties for split sizes there would be just one file per ...
   Author: Bejoy Ks, 2012-10-08, 14:28
[expand - 1 more] - Re: How to lower the total number of map tasks - Hadoop - [mail # user]
...Sorry for the typo, the property name is mapred.max.split.size  Also just for changing the number of map tasks you don't need to modify the hdfs block size.  On Tue, Oct 2, 2012 at...
   Author: Bejoy Ks, 2012-10-02, 17:03
Re: Understanding of the hadoop distribution system (tuning) - Hadoop - [mail # user]
...Hi Elaine  Slots (mapred.tasktracker.[map/reduce].tasks.maximum) are configured on a cluster/node/TaskTracker level and not on a job level. You configure this based on the available res...
   Author: Bejoy Ks, 2012-09-11, 06:42
[expand - 1 more] - Re: Replication Factor Modification - Hadoop - [mail # user]
...Hi  Uddipan  As Harsh mentioned, replication factor is a client side property . So you need to update the value for 'dfs.replication' in hdfs-site.xml as per your requirement in yo...
   Author: Bejoy Ks, 2012-09-05, 18:38
Re: Using hadoop for analytics - Hadoop - [mail # user]
...Hi Prashant  Welcome to Hadoop Community. :)  Hadoop is meant for processing large data volumes. Saying that, for your custom requirements you should write your own mapper and redu...
   Author: Bejoy Ks, 2012-09-05, 08:57
Re: knowing the nodes on which reduce tasks will run - Hadoop - [mail # user]
...Hi Abhay  You need this value to be changed before you submit your job and restart TT. Modifying this value in  mid time won't affect the running jobs.  On Mon, Sep 3, 2012 at...
   Author: Bejoy Ks, 2012-09-03, 15:46
Re: help in distribution of a task with hadoop - Hadoop - [mail # user]
...Hi Bertrand  -libjars option works well with the 'hadoop jar' command. Instead of executing your runnable with the plain java 'jar' command use 'hadoop jar' . When you use hadoop jar yo...
   Author: Bejoy Ks, 2012-08-13, 18:29
Re: Problem with hadoop filesystem after restart cluster - Hadoop - [mail # user]
...Hi Andy  Is your hadoop.tmp.dir or dfs.name.dir configured to /tmp? If so it can happen as /tmp dir gets wiped out on OS restarts  Regards Bejoy KS  ...
   Author: Bejoy Ks, 2012-08-08, 11:27
Hive (94)
MapReduce (51)
Hadoop (40)
HDFS (12)
Pig (1)
mail # user (41)
last 7 days (0)
last 30 days (0)
last 90 days (0)
last 6 months (0)
last 9 months (41)
Harsh J (559)
Owen O'Malley (394)
Steve Loughran (391)
Todd Lipcon (238)
Eli Collins (182)
Alejandro Abdelnur (178)
Arun C Murthy (163)
Allen Wittenauer (148)
Chris Nauroth (146)
Ted Yu (125)
Tom White (121)
Daryn Sharp (115)
Nigel Daley (115)
Konstantin Shvachko (107)
Doug Cutting (96)
Aaron Kimball (94)
Colin Patrick McCabe (92)
Edward Capriolo (87)
Mark Kerzner (87)
jason hadoop (82)
Hairong Kuang (74)
Konstantin Boudnik (72)
Runping Qi (72)
Benoy Antony (69)
Suresh Srinivas (64)
Bejoy Ks