Simple question: when I run "hadoop fs -du", or view HDFS disk utilization in the namenode web UI (which reports it both in bytes and as a percentage), should I expect the reported usage to be the "true data size" or the replicated size? For example, with 3x replication, should I expect the reported values to be three times the size of the underlying data itself?
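To make the two possibilities concrete, here is a minimal sketch of the arithmetic I have in mind. The function name and the 10 GiB figure are hypothetical, it assumes every block is replicated uniformly, and it says nothing about which of the two numbers either tool actually reports.

```python
def replicated_size(logical_bytes: int, replication: int = 3) -> int:
    """Raw bytes on disk if every block is stored `replication` times.

    Assumes a uniform replication factor across all files, which is a
    simplification (HDFS lets the factor be set per file).
    """
    return logical_bytes * replication


# Hypothetical example: 10 GiB of underlying data, 3x replication.
logical = 10 * 1024**3
raw = replicated_size(logical, replication=3)

print(f"true data size:  {logical} bytes")
print(f"replicated size: {raw} bytes")
```

If the reported value is the replicated size, I would expect to see something like the second number rather than the first.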
Keith Wiley [EMAIL PROTECTED] keithwiley.com music.keithwiley.com
"I used to be with it, but then they changed what it was. Now, what I'm with
isn't it, and what's it seems weird and scary to me."
-- Abe (Grandpa) Simpson