Hadoop, mail # general - one or more file system


Xiang Hua 2012-10-08, 15:30
Re: one or more file system
Andy Isaacson 2012-10-08, 17:45
On Mon, Oct 8, 2012 at 8:30 AM, Xiang Hua <[EMAIL PROTECTED]> wrote:
> Hi,
>    We have a 4T disk from a disk array.
>    I want to split 2T*1 into 1T*2, then add them to HDFS, which leads to more
> local storage directories.
>    This time we have 12 local directories (1T each). Is it harmful to HDFS
> performance?

Assuming you're running a modern Hadoop on a recent Linux (2.6.38 or
later, or RHEL6):

For best performance you should configure your disk array as JBOD
rather than RAID, then put one ext4 filesystem on each spindle. Do not
put multiple storage directories on a single spindle; that results in
very bad performance and gives no benefit over a single storage
directory per spindle. And do not put multiple spindles under a single
storage directory; that results in poor utilization and bad performance
with no significant benefit.
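
As a rough illustration of the one-directory-per-spindle rule, here is a
minimal Python sketch that builds the comma-separated list of storage
directories for the DataNode (the dfs.data.dir property in Hadoop 1.x,
renamed dfs.datanode.data.dir in 2.x) and refuses a layout where two
directories share a device. The mount points, device names, and both
function names are hypothetical, made up for the example; the
directory-to-device mapping is passed in by hand rather than discovered
from the system.

```python
from collections import defaultdict


def shared_spindles(dir_to_device):
    """Return {device: [dirs]} for every device backing more than one storage directory."""
    by_device = defaultdict(list)
    for directory, device in dir_to_device.items():
        by_device[device].append(directory)
    return {dev: sorted(dirs) for dev, dirs in by_device.items() if len(dirs) > 1}


def data_dir_value(dir_to_device):
    """Build the comma-separated value for dfs.data.dir / dfs.datanode.data.dir."""
    conflicts = shared_spindles(dir_to_device)
    if conflicts:
        raise ValueError(f"multiple storage directories on one spindle: {conflicts}")
    return ",".join(sorted(dir_to_device))


# Hypothetical layout: 12 spindles (/dev/sdb .. /dev/sdm), each carrying one
# ext4 filesystem mounted at /data/NN, with one storage directory per mount.
layout = {f"/data/{i:02d}/dfs/dn": f"/dev/sd{chr(ord('b') + i)}1" for i in range(12)}
print(data_dir_value(layout))
```

The printed string is what would go into the property value in
hdfs-site.xml. Splitting one spindle into two partitions with a storage
directory on each would pass this check (the partitions are distinct
devices) while still violating the advice above, so treat it as a sanity
check on the configuration list and not as a substitute for knowing the
physical layout.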

12 local storage directories will perform just fine assuming you have
enough CPU power to use them.

-andy
Xiang Hua 2012-10-09, 09:13
Andy Isaacson 2012-10-16, 23:45
Arun C Murthy 2012-10-16, 23:55