Results 1 to 10 of 53 (0.386s).

Re: VM reuse! - Hadoop - [mail # user]
...Hi Rahul, if you look at a larger cluster and jobs that involve larger input data sets, the data would be spread across the whole cluster, and a single node might have various block...
Author: Bejoy Ks, 2013-04-16, 09:10

Re: Unexpected Hadoop behavior: map task re-running after reducer has been running - Hadoop - [mail # user]
...Hi David, the maps are getting re-triggered because one of the nodes where map outputs are stored is getting lost during the reduce phase. As a result of this the map outputs ...
Author: Bejoy Ks, 2013-03-11, 13:16

Re: One file per mapper? - Hadoop - [mail # user]
...Hi Terry, if you have files smaller than the HDFS block size and you are using the default TextInputFormat with the default properties for split sizes, there would be just one file per ...
Author: Bejoy Ks, 2012-10-08, 14:28

Re: How to lower the total number of map tasks - Hadoop - [mail # user]
...Sorry for the typo, the property name is mapred.max.split.size. Also, just to change the number of map tasks you don't need to modify the HDFS block size. On Tue, Oct 2, 2012 at...
Author: Bejoy Ks, 2012-10-02, 17:03

Re: How to lower the total number of map tasks - Hadoop - [mail # user]
...Hi, you need to alter the value of mapred.max.split size to a value larger than your block size to have fewer map tasks than the default. On Tue, Oct 2, 2012 at 10:04 PM,...
Author: Bejoy Ks, 2012-10-02, 17:01

Re: Understanding of the hadoop distribution system (tuning) - Hadoop - [mail # user]
...Hi Elaine, slots (mapred.tasktracker.[map/reduce].tasks.maximum) are configured at the cluster/node/TaskTracker level and not at the job level. You configure this based on the available res...
Author: Bejoy Ks, 2012-09-11, 06:42

Re: Replication Factor Modification - Hadoop - [mail # user]
...Hi Uddipan, as Harsh mentioned, replication factor is a client-side property. So you need to update the value for 'dfs.replication' in hdfs-site.xml as per your requirement in yo...
Author: Bejoy Ks, 2012-09-05, 18:38

Re: Replication Factor Modification - Hadoop - [mail # user]
...Hi, you can change the replication factor of an existing directory using '-setrep' http://hadoop.apache.org/common/docs/r0.20.0/hdfs_shell.html#setrep The below command will...
Author: Bejoy Ks, 2012-09-05, 18:16

Re: Using hadoop for analytics - Hadoop - [mail # user]
...Hi Prashant, welcome to the Hadoop community. :) Hadoop is meant for processing large data volumes. That said, for your custom requirements you should write your own mapper and redu...
Author: Bejoy Ks, 2012-09-05, 08:57

Re: knowing the nodes on which reduce tasks will run - Hadoop - [mail # user]
...Hi Abhay, you need to change this value and restart the TT before you submit your job. Modifying this value midway won't affect the running jobs. On Mon, Sep 3, 2012 at...
Author: Bejoy Ks, 2012-09-03, 15:46