Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HDFS, mail # user - Re: Who splits the file into blocks


Copy link to this message
-
Re: Who splits the file into blocks
Jens Scheidtmann 2013-03-31, 16:30
Dear Sai Sai,

"Hadoop, the definitive guide" says regarding default replica placement:

- first replica is placed on the same node as the client (lowest bandwidth
penalty).
- second replica is placed off-rack, at a random node of the other rack
(avoiding busy racks).
- third replicate is placed on random node on rack where second replica is
stored.
- other replicas are placed on random nodes of the cluster (avoiding busy
racks).

If client is not on the cluster, first replica is placed on a random node
(avoiding busy racks).

Best regards,
Jens