Hadoop performs best when each node has exclusive use of its disks. So if you have a choice, provision individual spindles on your SAN for exclusive use and map each one to a separate mount point on your Hadoop nodes. Any other layout will tend to perform poorly because of disk contention.
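
As a rough illustration of the "one spindle, one mount" layout, here is a small sketch that builds the hdfs-site.xml entry listing one DataNode data directory per mount. The mount points (/data/1, /data/2, /data/3), the hdfs/data subdirectory, and the helper name are made up for the example. dfs.datanode.data.dir is the Hadoop 2.x property name (older releases call it dfs.data.dir), so check it against your version.

```python
from xml.sax.saxutils import escape


def datanode_data_dir_property(mount_points, subdir="hdfs/data"):
    """Return an hdfs-site.xml <property> with one data directory per mount.

    mount_points and subdir are hypothetical; substitute the mounts that
    your SAN LUNs are actually mapped to on each node.
    """
    # One directory per dedicated mount, joined into the comma-separated
    # list that the DataNode expects.
    dirs = [f"{m.rstrip('/')}/{subdir}" for m in mount_points]
    value = ",".join(dirs)
    return (
        "<property>\n"
        "  <name>dfs.datanode.data.dir</name>\n"
        f"  <value>{escape(value)}</value>\n"
        "</property>"
    )


if __name__ == "__main__":
    print(datanode_data_dir_property(["/data/1", "/data/2", "/data/3"]))
```

The DataNode spreads its blocks across the listed directories, so this only helps if each mount really is backed by its own spindle and not by a slice of a shared volume.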


From: Natarajan, Prabakaran 1. (NSN - IN/Bangalore) [mailto:[EMAIL PROTECTED]]
Sent: Thursday, June 12, 2014 12:00 AM
Subject: Hadoop SAN Storage reuse

I know SAN storage is not recommended for Hadoop, but we don't want to waste our existing SAN storage.

How can we make use of SAN storage for Hadoop? What are the best methods, and would an Ethernet upgrade help?
