
Jason Yang 2012-09-14, 06:03
Re: What's the basic idea of pseudo-distributed Hadoop ?

On 14.09.2012 at 08:03, Jason Yang <[EMAIL PROTECTED]> wrote:

> I have a question about how a pseudo-distributed Hadoop cluster works:
> When many map tasks are submitted to a pseudo-distributed Hadoop cluster, does Hadoop run the mappers in sequence, or does it run them in parallel, for example in different threads?

Pseudo-distributed mode is a one-node cluster. You have a namenode, a jobtracker, and a single datanode and tasktracker running, each as its own Java process on the same machine. You can verify this with the "jps" command.
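
If you want to script that check, here is a minimal sketch. It assumes a Hadoop 1.x setup where jps lists the daemons as NameNode, DataNode, JobTracker and TaskTracker, and that jps from the JDK is on your PATH. The function names are made up for this example and are not part of Hadoop.

```python
import subprocess

# Daemon class names that jps is assumed to report for a Hadoop 1.x
# pseudo-distributed setup (one of each on the single node).
EXPECTED_DAEMONS = {"NameNode", "DataNode", "JobTracker", "TaskTracker"}


def running_java_classes(jps_output):
    """Parse jps output lines of the form '<pid> <ClassName>' into a set of names."""
    names = set()
    for line in jps_output.splitlines():
        parts = line.split()
        if len(parts) >= 2:  # skip lines where jps shows only a pid
            names.add(parts[1])
    return names


def missing_daemons(jps_output):
    """Return the expected daemons that do not appear in the jps output."""
    return EXPECTED_DAEMONS - running_java_classes(jps_output)


def check_local_cluster():
    """Run jps on this machine and return the set of missing daemons."""
    result = subprocess.run(["jps"], capture_output=True, text=True, check=True)
    return missing_daemons(result.stdout)
```

Calling check_local_cluster() on the node should give you an empty set when all four daemons are up. Anything it returns is a daemon that did not start.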

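By default, a tasktracker can run up to two map tasks and up to two reduce tasks in parallel (mapred.tasktracker.map.tasks.maximum and mapred.tasktracker.reduce.tasks.maximum), so you will actually see some concurrency on your one machine. Each task runs in its own child JVM rather than in a thread, so the answer to your question is: in parallel, but only as many at a time as there are slots.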
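
To get a feeling for what the slot limit means for a job with many mappers, here is a toy model. It is not how the tasktracker is implemented: it assumes all tasks take the same time, so they run in neat "waves" of at most `slots` tasks, whereas the real tasktracker starts the next task as soon as any slot becomes free. The function names are invented for this sketch.

```python
import math


def estimate_waves(num_tasks, slots=2):
    """Number of waves needed if at most `slots` tasks run at the same time."""
    if slots < 1:
        raise ValueError("slots must be at least 1")
    return math.ceil(num_tasks / slots)


def assign_waves(num_tasks, slots=2):
    """Group task ids 0..num_tasks-1 into consecutive waves of at most `slots` tasks."""
    if slots < 1:
        raise ValueError("slots must be at least 1")
    return [
        list(range(start, min(start + slots, num_tasks)))
        for start in range(0, num_tasks, slots)
    ]
```

With the default of two map slots, five map tasks would be grouped as assign_waves(5), i.e. two tasks, then two more, then the last one. Raising the slot count in the model shortens the list of waves, which mirrors what raising mapred.tasktracker.map.tasks.maximum does, as long as the machine has the cores and memory for it.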


Kai Voigt