Hadoop, mail # user - YARN NM containers were killed


YouPeng Yang 2013-01-30, 07:11
Re: YARN NM containers were killed
YouPeng Yang 2013-02-02, 14:43
Hi all,
I am sorry to bother you, but I have to raise this problem again.
I want to understand why some of the containers were killed.

The details of the situation are described in the mail I posted a few
days ago.
My questions:
1. Why were 2 containers created on Hadoop02, while Hadoop04 got none?
   Is that normal?
2. What principle determines where containers are created?
3. Why were two containers (container_*_000003 and container_*_000002)
   killed, while container_*_000001 succeeded? Is that normal?
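
For reference on question 3: the exit codes in the NodeManager logs quoted below (137 and 143) are consistent with the common shell convention of reporting 128 plus the signal number, which would make them SIGKILL (9) and SIGTERM (15). A minimal sketch of that decoding follows. It assumes the logged value is a shell-style exit status; `describe_exit_code` and its small signal table are illustrative names, not part of Hadoop.

```python
# Hypothetical helper: decode a shell-style exit status.
# Assumption: values above 128 mean "128 + signal number".
COMMON_SIGNALS = {
    1: "SIGHUP",
    2: "SIGINT",
    9: "SIGKILL",
    15: "SIGTERM",
}


def describe_exit_code(code: int) -> str:
    """Return a short human-readable description of an exit status."""
    if code == 0:
        return "exited normally"
    if code > 128:
        signum = code - 128
        name = COMMON_SIGNALS.get(signum, "unknown signal")
        return f"terminated by {name} (signal {signum})"
    return f"exited with error status {code}"


if __name__ == "__main__":
    # The two exit codes reported by DefaultContainerExecutor in the logs.
    for code in (137, 143):
        print(code, "->", describe_exit_code(code))
```

This only shows how the task process ended; it does not explain why the ApplicationMaster asked for the containers to be killed.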

Any suggestions will be appreciated.
Regards,
YouPeng Yang
2013/1/31 YouPeng Yang <[EMAIL PROTECTED]>

> Hi
>
>    I posted my question a day ago. Could somebody please help me
> figure out what the problem is?
>    Thank you.
> regards
> YouPeng Yang
>
>
> ---------- Forwarded message ----------
> From: YouPeng Yang <[EMAIL PROTECTED]>
> Date: 2013/1/30
> Subject: YARN NM containers were killed
> To: [EMAIL PROTECTED]
>
>
> I tested hadoop-mapreduce-examples-2.0.0-cdh4.1.2.jar on my Hadoop
> environment
> (1 RM: Hadoop01, and 3 NMs: Hadoop02, Hadoop03, Hadoop04;
>   CDH4.1.2 on RHEL 5.5):
> ./bin/hadoop jar
> share/hadoop/mapreduce/hadoop-mapreduce-examples-2.0.0-cdh4.1.2.jar
>  wordcount 1/input output
>
> When I checked the logs, I was confused by the following:
> Hadoop created 2 containers on Hadoop02 and 1 container on Hadoop03,
> but no container on Hadoop04.
>
> The results of the container processing:
>
> Hadoop02:
> * container_1359422495723_0001_01_000001
> (its state changes as follows:NEW --> LOCALIZING --> LOCALIZED --> RUNNING
> --> KILLING --> EXITED_WITH_SUCCESS)
>
>       The log indicates that:
> NodeStatusUpdaterImpl: Sending out status for container: container_id {,
> app_attempt_id {, application_id {, id: 1, cluster_timestamp:
> 1359422495723, }, attemptId: 1, }, id: 1, }, state: C_RUNNING, diagnostics:
> "", exit_status: -1000,
>  ContainerLaunch: Container container_1359422495723_0001_01_000001
> succeeded
> Container: Container container_1359422495723_0001_01_000001 transitioned
> from RUNNING to EXITED_WITH_SUCCESS
>  ContainerLaunch: Cleaning up container
> container_1359422495723_0001_01_000001
> NMAuditLogger: USER=hadoop OPERATION=Container Finished - Succeeded
> TARGET=ContainerImpl RESULT=SUCCESS APPID=application_1359422495723_0001
> CONTAINERID=container_1359422495723_0001_01_000001
>  * container_1359422495723_0001_01_000003
> (its state changes as follows:NEW --> LOCALIZING --> LOCALIZED --> RUNNING
> --> KILLING --> CONTAINER_CLEANEDUP_AFTER_KILL--> DONE)
>  The log indicates that:
> NodeStatusUpdaterImpl: Sending out status for container: container_id {,
> app_attempt_id {, application_id {, id: 1, cluster_timestamp:
> 1359422495723, }, attemptId: 1, }, id: 3, }, state: C_RUNNING, diagnostics:
> "Container killed by the ApplicationMaster.\n", exit_status: -1000,
>  DefaultContainerExecutor: Exit code from task is : 137
> NMAuditLogger: USER=hadoop OPERATION=Container Finished - Killed
> TARGET=ContainerImpl RESULT=SUCCESS APPID=application_1359422495723_0001
> CONTAINERID=container_1359422495723_0001_01_000003
>
> Hadoop03:
>         * container_1359422495723_0001_01_000002
> (its state changes as follows:NEW --> LOCALIZING --> LOCALIZED --> RUNNING
> --> KILLING --> CONTAINER_CLEANEDUP_AFTER_KILL--> DONE)
>  NodeStatusUpdaterImpl: Sending out status for container: container_id {,
> app_attempt_id {, application_id {, id: 1, cluster_timestamp:
> 1359422495723, }, attemptId: 1, }, id: 2, }, state: C_RUNNING, diagnostics:
> "Container killed by the ApplicationMaster.\n", exit_status: -1000,
>         DefaultContainerExecutor: Exit code from task is : 143
> NMAuditLogger: USER=hadoop OPERATION=Container Finished - Killed
> TARGET=ContainerImpl RESULT=SUCCESS APPID=application_1359422495723_0001
> CONTAINERID=container_1359422495723_0001_01_000002
>
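
As a side note, the container names in the log lines above appear to encode the same fields that NodeStatusUpdaterImpl prints (cluster_timestamp, application id, attemptId, container id). The sketch below splits a name into those fields so the entries from different NodeManagers can be matched up. The field layout is inferred only from these logs, and `ContainerName` and `parse_container_name` are hypothetical names, not Hadoop APIs.

```python
import re
from typing import NamedTuple


class ContainerName(NamedTuple):
    """Fields inferred from the log lines; not an official Hadoop type."""
    cluster_timestamp: int
    app_id: int
    attempt_id: int
    container_id: int


# Assumed layout: container_<timestamp>_<app>_<attempt>_<container>
_NAME_RE = re.compile(r"^container_(\d+)_(\d+)_(\d+)_(\d+)$")


def parse_container_name(name: str) -> ContainerName:
    """Split a container name into its numeric fields."""
    match = _NAME_RE.match(name)
    if match is None:
        raise ValueError(f"unrecognized container name: {name!r}")
    return ContainerName(*(int(part) for part in match.groups()))


if __name__ == "__main__":
    for name in (
        "container_1359422495723_0001_01_000001",
        "container_1359422495723_0001_01_000002",
        "container_1359422495723_0001_01_000003",
    ):
        print(parse_container_name(name))
```

All three names share the same timestamp, application, and attempt, so they belong to one application attempt and differ only in the container number.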
> My questions:
>         1. Why  were 2 containers created in Hadoop02,however Hadoop04 got