Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Hadoop >> mail # user >> Re: Task Tracker going down on hive cluster


Copy link to this message
-
Re: Task Tracker going down on hive cluster
Is /mnt/app/ an NFS?

On Wed, Jun 12, 2013 at 6:20 PM, Shahab Yunus <[EMAIL PROTECTED]> wrote:
> Broken Pipe is a network related issue usually. Have you verified no change
> in network connectivity?
>
> Regards,
> Shahab
>
>
> On Wed, Jun 12, 2013 at 3:17 AM, Ravi Shetye <[EMAIL PROTECTED]> wrote:
>>
>> In last 4-5 of day the task tracker on one of my slave machines has gone
>> down couple of time. It has been working fine from the past 4-5 months
>>
>> The cluster configuration is
>> 4 machine cluster on AWS
>> 1 m2.xlarge master
>> 3 m2.xlarge slaves
>>
>> The cluster is dedicated to run hive queries, with the data residing on
>> s3.
>>
>> the slave on which the task tracker went down had the following log
>>
>> *******************************************************************
>> 2013-06-11 00:26:30,968 INFO
>> org.apache.hadoop.mapred.TaskTracker.clienttrace: src: 10.191.**.***:50060,
>> dest: 10.190.***.***:60659, bytes: 38, op: MAPRED_SHUFFLE, cliID:
>> attempt_201306071409_0151_m_005693_0, duration: 279198
>> 2013-06-11 00:26:30,971 INFO
>> org.apache.hadoop.mapred.TaskTracker.clienttrace: src: 10.191.**.***:50060,
>> dest: 10.191.**.***:37605, bytes: 38, op: MAPRED_SHUFFLE, cliID:
>> attempt_201306071409_0151_m_005700_0, duration: 193135
>> 2013-06-11 00:26:30,971 INFO
>> org.apache.hadoop.mapred.TaskTracker.clienttrace: src: 10.191.**.***:50060,
>> dest: 10.190.***.***:60630, bytes: 6, op: MAPRED_SHUFFLE, cliID:
>> attempt_201306071409_0151_m_005700_0, duration: 192011
>> 2013-06-11 00:26:30,972 INFO
>> org.apache.hadoop.mapred.TaskTracker.clienttrace: src: 10.191.**.***:50060,
>> dest: 10.190.***.***:60656, bytes: 6, op: MAPRED_SHUFFLE, cliID:
>> attempt_201306071409_0151_m_005693_0, duration: 178209
>> 2013-06-11 00:26:30,973 INFO
>> org.apache.hadoop.mapred.TaskTracker.clienttrace: src: 10.191.**.***:50060,
>> dest: 10.8.***.**:45321, bytes: 6, op: MAPRED_SHUFFLE, cliID:
>> attempt_201306071409_0151_m_005694_0, duration: 186452
>> 2013-06-11 00:26:30,973 INFO
>> org.apache.hadoop.mapred.TaskTracker.clienttrace: src: 10.191.**.***:50060,
>> dest: 10.190.***.***:60659, bytes: 6, op: MAPRED_SHUFFLE, cliID:
>> attempt_201306071409_0151_m_005694_0, duration: 157360
>> 2013-06-11 00:26:30,974 INFO
>> org.apache.hadoop.mapred.TaskTracker.clienttrace: src: 10.191.**.***:50060,
>> dest: 10.8.***.**:45321, bytes: 38, op: MAPRED_SHUFFLE, cliID:
>> attempt_201306071409_0151_m_005700_0, duration: 157555
>> 2013-06-11 00:26:30,991 INFO org.apache.hadoop.mapred.JvmManager: JVM Not
>> killed jvm_201306071409_0151_m_-435659475 but just removed
>> 2013-06-11 00:26:30,991 INFO org.apache.hadoop.mapred.JvmManager: JVM :
>> jvm_201306071409_0151_m_-435659475 exited with exit code 0. Number of tasks
>> it ran: 0
>> 2013-06-11 00:26:30,991 ERROR org.apache.hadoop.mapred.JvmManager: Caught
>> Throwable in JVMRunner. Aborting TaskTracker.
>> org.apache.hadoop.fs.FSError: java.io.IOException: Broken pipe
>> at
>> org.apache.hadoop.fs.RawLocalFileSystem$LocalFSFileOutputStream.write(RawLocalFileSystem.java:200)
>> at java.io.BufferedOutputStream.write(BufferedOutputStream.java:122)
>> at
>> org.apache.hadoop.fs.FSDataOutputStream$PositionCache.write(FSDataOutputStream.java:49)
>> at java.io.DataOutputStream.write(DataOutputStream.java:107)
>> at sun.nio.cs.StreamEncoder.writeBytes(StreamEncoder.java:220)
>> at sun.nio.cs.StreamEncoder.implClose(StreamEncoder.java:315)
>> at sun.nio.cs.StreamEncoder.close(StreamEncoder.java:148)
>> at java.io.OutputStreamWriter.close(OutputStreamWriter.java:233)
>> at java.io.BufferedWriter.close(BufferedWriter.java:265)
>> at java.io.PrintWriter.close(PrintWriter.java:312)
>> at
>> org.apache.hadoop.mapred.TaskController.writeCommand(TaskController.java:231)
>> at
>> org.apache.hadoop.mapred.DefaultTaskController.launchTask(DefaultTaskController.java:126)
>> at
>> org.apache.hadoop.mapred.JvmManager$JvmManagerForType$JvmRunner.runChild(JvmManager.java:497)
>> at
>> org.apache.hadoop.mapred.JvmManager$JvmManagerForType$JvmRunner.run(JvmManager.java:471)

Harsh J