Hadoop >> mail # user >> Is disk use reported with replication?

Re: Is disk use reported with replication?
Hi Keith,

The "fs -du" command sums the lengths of the files, so it does not report
the replicated on-disk size. The HDFS disk utilization figure, on the other
hand, is a simple report of the currently used/free disk space, which
certainly includes replicated data.
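
To make the difference concrete, here is a rough sketch of the arithmetic in Python. It is only an illustration, not how HDFS computes these numbers internally: the HdfsFile record, the paths, and the sizes are made up, and it ignores checksum files, under-replicated blocks, and non-DFS disk use, so the real utilization figure will only approximately match the replicated total.

```python
from dataclasses import dataclass


@dataclass
class HdfsFile:
    """Hypothetical record for one file; not a Hadoop API type."""
    path: str
    length: int       # logical file length in bytes
    replication: int  # replication factor of the file


def logical_size(files):
    """Sum of file lengths: the kind of figure "fs -du" adds up."""
    return sum(f.length for f in files)


def replicated_size(files):
    """Approximate raw bytes across datanodes: length times replication."""
    return sum(f.length * f.replication for f in files)


# Made-up example data with 3X replication.
files = [
    HdfsFile("/data/a.log", 1_000_000, 3),
    HdfsFile("/data/b.log", 500_000, 3),
]

print(logical_size(files))     # 1500000
print(replicated_size(files))  # 4500000
```

So with 3X replication, expect the utilization figure to be roughly three times the "fs -du" total.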

On Mon, Apr 22, 2013 at 10:49 PM, Keith Wiley <[EMAIL PROTECTED]> wrote:
> Simple question: When I issue a "hadoop fs -du" command and/or when I view the namenode web UI to see HDFS disk utilization (which the namenode reports both as bytes and percentage), should I expect to see disk use reported as "true data size" or as replicated size (e.g. with 3X replication, should I expect reported values to be three times higher than the actual underlying data itself)?
> Thanks.
> ________________________________________________________________________________
> Keith Wiley     [EMAIL PROTECTED]     keithwiley.com    music.keithwiley.com
> "I used to be with it, but then they changed what it was.  Now, what I'm with
> isn't it, and what's it seems weird and scary to me."
>                                            --  Abe (Grandpa) Simpson
> ________________________________________________________________________________

Harsh J