I was looking at the web interface and found that some of my nodes have
an enormous amount of "Non DFS Used".
One node even has 800 GB of "Non DFS Used", which is just ridiculous.
I tried to remove them by doing:
"hadoop namenode -format"
and I also tried deleting "hadoop.tmp.dir" (in my case, which is
But when I start my cluster again, it is back, with thousands of
gigabytes of "Non DFS Used".
Can anyone tell me what "Non DFS Used" is and how to get rid of it for good?
Thanks in advance.