MapReduce >> mail # user >> What's the basic idea of pseudo-distributed Hadoop ?


Re: What's the basic idea of pseudo-distributed Hadoop ?
Hello.

On 14.09.2012 at 08:03, Jason Yang <[EMAIL PROTECTED]> wrote:

> I have a question about how a pseudo-distributed Hadoop cluster works:
>
> When many map tasks are submitted to a pseudo-distributed Hadoop cluster, does Hadoop run the mappers in sequence, or does it run them in separate threads or otherwise in parallel?

Pseudo-distributed mode is a one-node cluster: a NameNode, a JobTracker, and a single DataNode and TaskTracker all run on the same machine, each as its own Java process. You can verify this with the "jps" command.
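
As a rough sketch of that check, the snippet below scans "jps" output for the four daemons named above. The helper name missing_daemons and the SAMPLE text are made up for illustration (the PIDs are not from a real run); the only assumption about jps is that it prints one "<pid> <name>" pair per line.

```python
import shutil
import subprocess

# Daemons the message lists for a pseudo-distributed node.
EXPECTED_DAEMONS = ("NameNode", "DataNode", "JobTracker", "TaskTracker")


def missing_daemons(jps_output, expected=EXPECTED_DAEMONS):
    """Return the expected daemon names that do not appear in jps output."""
    running = set()
    for line in jps_output.splitlines():
        parts = line.split()
        # Each jps line is "<pid> <name>"; skip blank or pid-only lines.
        if len(parts) >= 2:
            running.add(parts[1])
    return [name for name in expected if name not in running]


# Illustrative output only; the PIDs are made up.
SAMPLE = """\
4021 NameNode
4188 DataNode
4467 JobTracker
4630 TaskTracker
4712 Jps
"""

if __name__ == "__main__":
    if shutil.which("jps"):
        # Use the real tool when a JDK is on the PATH.
        output = subprocess.run(["jps"], capture_output=True, text=True).stdout
    else:
        output = SAMPLE  # fall back to the made-up sample
    print("missing daemons:", missing_daemons(output) or "none")
```

An empty result means all four daemons showed up; anything listed points at a daemon that did not start.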

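By default, a TaskTracker can run up to two map tasks and two reduce tasks in parallel (mapred.tasktracker.map.tasks.maximum and mapred.tasktracker.reduce.tasks.maximum), so you will actually see some concurrency on your one machine. Each running task gets its own child JVM rather than a thread, so the mappers are neither strictly sequential nor threads inside one process.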
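
To get a feel for what a limit of two slots means, here is a small simulation, not Hadoop code. It is only an analogy: it uses Python threads as stand-ins for task slots, whereas a real TaskTracker launches separate processes. The names run_tasks and MAP_SLOTS are hypothetical.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor

MAP_SLOTS = 2  # mirrors the default of mapred.tasktracker.map.tasks.maximum


def run_tasks(num_tasks, slots=MAP_SLOTS, task_seconds=0.05):
    """Run num_tasks dummy tasks on `slots` workers; return (results, peak)."""
    lock = threading.Lock()
    state = {"running": 0, "peak": 0}

    def task(task_id):
        with lock:
            state["running"] += 1
            state["peak"] = max(state["peak"], state["running"])
        time.sleep(task_seconds)  # stand-in for real map work
        with lock:
            state["running"] -= 1
        return task_id

    # The pool size plays the role of the slot limit: extra tasks wait
    # in a queue until one of the workers becomes free.
    with ThreadPoolExecutor(max_workers=slots) as pool:
        results = list(pool.map(task, range(num_tasks)))
    return results, state["peak"]


if __name__ == "__main__":
    results, peak = run_tasks(6)
    print("tasks finished:", len(results), "peak concurrency:", peak)
```

However many tasks are submitted, the observed peak never exceeds the number of slots; the remaining tasks simply queue, which is the behaviour to expect from a single TaskTracker with the default settings.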

Kai

--
Kai Voigt
[EMAIL PROTECTED]