I am curious to know what is the biggest problem of extremely large hadoop
cluster. What I can imagine now is the memory cost of meta data of hdfs in
name node. One solution I can think about now is to use other storage
implementation such as database to store the metadata, although it has
performance cost. Is there any other solutions or any problems of extremely
large hadoop cluster ?