I had a super ugly experience with Amazon EC2 virtual nodes, and I even found bug reports related to Ubuntu: unpredictable "wall time" pauses where everything stops and ZooKeeper sessions expire.
I don't have any such problems with dedicated servers and CentOS.
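For what it's worth, here is a rough sketch of how stalls like the ones I saw on EC2 could be spotted: sample a clock at a fixed interval and flag any gap much longer than expected. The names find_stalls and sample_clock are hypothetical, the interval and tolerance are assumptions to tune, and a gap only matters for ZooKeeper when it approaches the configured session timeout (zookeeper.session.timeout in HBase). Depending on the hypervisor, a guest clock may not reflect every pause, so treat this as a starting point, not a definitive diagnostic.

```python
import time


def find_stalls(timestamps, interval, tolerance):
    """Return (index, gap) for each consecutive pair of samples whose
    gap exceeds interval + tolerance (all values in seconds)."""
    stalls = []
    for i in range(1, len(timestamps)):
        gap = timestamps[i] - timestamps[i - 1]
        if gap > interval + tolerance:
            stalls.append((i, gap))
    return stalls


def sample_clock(count, interval):
    """Read the monotonic clock `count` times, sleeping `interval`
    seconds between reads, and return the list of readings."""
    samples = [time.monotonic()]
    for _ in range(count - 1):
        time.sleep(interval)
        samples.append(time.monotonic())
    return samples


if __name__ == "__main__":
    # Hypothetical settings: sample once per second for a minute and
    # report any gap more than half a second longer than expected.
    for index, gap in find_stalls(sample_clock(60, 1.0), 1.0, 0.5):
        print(f"sample {index}: process was stalled, gap of {gap:.2f}s")
```

Running something like this side by side on a virtual node and on a dedicated server would show whether the pauses come from the environment and not from HBase itself.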
It is super risky to run any kind of business without (sorry for the cliche) corporate standards, i.e. a knowledge base of known problems and their workarounds, lol, because these are very specific to each environment. I suggest sticking with standards; it will lower TCO ;)
On 2011-11-11, at 4:42 PM, Otis Gospodnetic <[EMAIL PROTECTED]> wrote:
> I was wondering if anyone has done an experiment with HBase or HDFS/MR where machines in the cluster have heterogeneous underlying file systems?
> * 10 nodes with xfs
> * 10 nodes with ext3
> * 10 nodes with ext4
> The goal being comparing performance of MapReduce jobs reading from and writing to HBase (or just HDFS).
> And does anyone have any reason to believe doing the above would be super risky and cause data loss?
> Sematext :: http://sematext.com/ :: Solr - Lucene - Nutch
> Lucene ecosystem search :: http://search-lucene.com/