Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
HDFS >> mail # user >> Skipping entire task


Copy link to this message
-
RE: Skipping entire task
Thinking here... if you submitted the task programmatically you should be
able to capture the failure of the task and gracefully move past it to your
next tasks.

To say it in a long-winded way:  Let's say you submit a job to Hadoop, a
java jar, and your main class implements Tool. That code has the
responsibility to submit a series of jobs to hadoop, something like this:

try{
  Job myJob = new MyJob(getConf());
  myJob.submitAndWait();
}catch(Exception uhhohh){
  //Deal with the issue and move on
}
Job myNextJob = new MyNextJob(getConf());
myNextJob.submit();

Just pseudo code there to demonstrate my thought.

David

-----Original Message-----
From: Håvard Wahl Kongsgård [mailto:[EMAIL PROTECTED]]
Sent: Saturday, January 05, 2013 4:54 PM
To: user
Subject: Skipping entire task

Hi, hadoop can skip bad records
http://devblog.factual.com/practical-hadoop-streaming-dealing-with-brittle-c
ode.
But it is also possible to skip entire tasks?

-Håvard

--
Håvard Wahl Kongsgård
Faculty of Medicine &
Department of Mathematical Sciences
NTNU

http://havard.security-review.net/
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB