Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HBase >> mail # user >> Configuring Hadoop, HBase and Hive Cluster


Copy link to this message
-
Re: Configuring Hadoop, HBase and Hive Cluster
I do advise you to use Cloudera Manager its a very simple and opensource cluster configuration software..

A good design is to run zookeeper on node1, node2, another node alone

Sent from my iPhone

On 2012-11-13, at 2:04 AM, "Hakan Bogay" <[EMAIL PROTECTED]> wrote:

> Hi,
>
> I am a newbie to Hadoop, HBase and Hive. I installed Hadoop, HBase and Hive
> in pseudodistributed mode and everything works fine. Now I am planning to
> set up an simple Hadoop Cluster (5 nodes) with Hive, HBase and ZooKeeper.
> I´ve read several documentations and instructions before but i could not
> find a good explanation for my question. I´m not sure, where to run all the
> daemons. This is my consideration:
>
> *Node_1* (Master)
>
>   - NameNode
>   - JobTrakcer
>   - HBase Master
>   -
>
>   ZooKeeper (Standalone node; managed by HBase)
>
>
>
> *Node_2* (Backup_Master)
>
>   -
>
>   SecondaryNameNode
>
>
>
> *Node_3* (Slave1)
>
>   - DataNode1
>   - TaskTracker1
>   -
>
>   RegionServer1
>
>
>
> *Node_4* (Slave2)
>
>   - DataNode2
>   - TaskTracker2
>   -
>
>   RegionServer2
>
>
>
> *Node_5* (Slave3)
>
>   - DataNode3
>   - TaskTracker3
>   - RegionServer3
>
>
> I know, in production it is recommended to run ZooKeeper ensemble at an odd
> number of nodes (seperate Cluster). But for a simple cluster, is it OK to
> set up a standalone ZooKeeper node which runs on the master node?
> Another question is regarding Hive: I know that Hive is a Hadoop client.
> Should I also install Hive on the master node? Does it make sense?
>
> Thanks for all tips and comments!
>
> Hakan
>
> Note: I have just 5 machines to simulate a cluster.