Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
HDFS, mail # user - Managing space in Master Node


Copy link to this message
-
RE: Managing space in Master Node
Charles Baker 2013-02-15, 02:16
Hey Arko. Glad to hear it worked out for you. Just make sure that your NN's
image and edit files aren't also (perhaps inadvertently) stored in that
directory structure. The NameNode image/edit dir contains the following:

drwxr-xr-x 2 hdfs hdfs 4096 Feb 14 17:48 current
drwxr-xr-x 2 hdfs hdfs 4096 Jul 17  2012 image
-rw-r--r-- 1 hdfs hdfs    0 Sep 27 12:28 in_use.lock
drwxr-xr-x 2 hdfs hdfs 4096 Sep 17 23:25 previous.checkpoint

whereas, the DN data dir should contain:

drwxr-xr-x  2 hdfs hdfs 57344 Feb 14 14:04 blocksBeingWritten
drwxr-xr-x 66 hdfs hdfs  8192 Jan 22 02:44 current
drwxr-xr-x  2 hdfs hdfs     6 Aug 10  2012 detach
-rw-r--r--  1 hdfs hdfs     0 Sep 27 11:19 in_use.lock
-rw-r--r--  1 hdfs hdfs   157 Aug 10  2012 storage
drwxr-xr-x  2 hdfs hdfs     6 Feb  4 17:34 tmp
It's a good practice to locate the JobTracker and SecondaryNameNode services
on a different machine than the NN. If you have an opportunity to do so, I
recommend getting a separate machine for those two services as well.  

-Chuck
-----Original Message-----
From: Arko Provo Mukherjee [mailto:[EMAIL PROTECTED]]
Sent: Thursday, February 14, 2013 4:21 PM
To: [EMAIL PROTECTED]
Subject: Re: Managing space in Master Node

Hello Everyone - I have decommissioned my NN so that only the JobTracker,
NameNode and SecondaryNameNodes are running on it now.
I would like to thank everyone who helped me to perform this task!

I have just one more issue: The data files created in the Master Node seem to
be still there. in /hdfs/data/current Now that this server is not being used
as a DataNode, can I safely delete the files under this folder to clear up
space?

Thanks & regards
Arko

On Thu, Feb 14, 2013 at 11:48 AM, Arko Provo Mukherjee
<[EMAIL PROTECTED]> wrote:
> Thanks! I have already requested for downtime. Will do the changes soon!
> Warm regards
> Arko
>
> On Thu, Feb 14, 2013 at 3:31 AM,  <[EMAIL PROTECTED]> wrote:
>> Hi Arko.
>>
>>   Only thing you need to do is not running the TaskTracker and DataNode
demons on your master machine. Ensure you do not have this on your slaves
file con hadoop's config directory when you start the system. I'm supposing
you are on Open Source release. For other distributions look at its
documentation about how to remove those demons from your master node machine.
>>
>>
>> -----Original Message-----
>> From: Arko Provo Mukherjee [mailto:[EMAIL PROTECTED]]
>> Sent: miércoles, 13 de febrero de 2013 20:32
>> To: [EMAIL PROTECTED]
>> Subject: Managing space in Master Node
>>
>> Hello Gurus,
>>
>> I am managing a Hadoop Cluster to run some experiments.
>>
>> The issue I am continuously facing is that the Master Node runs out
>> of disk space due to logs and data files.
>>
>> I can monitor and delete log files. However, I cannot delete the HDFS
data.
>>
>> Thus, is there a way to force Hadoop not to save any HDFS data in the
>> Master Node?
>>
>> Then I can use my master to handle the metadata only and store the logs.
>>
>> Thanks & regards
>> Arko
>>
>>
>> This message is for the designated recipient only and may contain
privileged, proprietary, or otherwise private information. If you have
received it in error, please notify the sender immediately and delete the
original. Any other use of the e-mail by you is prohibited.
>>
>> Where allowed by local law, electronic communications with Accenture and
its affiliates, including e-mail and instant messaging (including content),
may be scanned by our systems for the purposes of information security and
assessment of internal compliance with Accenture policy.
>>
>> _____________________________________________________________________
>> _________________
>>
>> www.accenture.com
>>
SDL Enterprise Technologies, Inc. - all rights reserved.  The information contained in this email may be confidential and/or legally privileged. It has been sent for the sole use of the intended recipient(s). If you are not the intended recipient of this mail, you are hereby notified that any unauthorized review, use, disclosure, dissemination, distribution, or copying of this communication, or any of its contents, is strictly prohibited. If you have received this communication in error, please reply to the sender and destroy all copies of the message.
Registered address: 201 Edgewater Drive, Suite 225, Wakefield, MA 01880, USA