

Preferred way to submit a job?
What's the preferred way to submit a job these days?
org.apache.hadoop.mapreduce.Job.submit() ?  Or
org.apache.hadoop.mapred.JobClient.runJob()?  Or does it even matter?
(i.e., is there any difference between them?)

I've been trying to run a job using
org.apache.hadoop.mapreduce.Job.submit() (since I assumed that the
org.apache.hadoop.mapred.* classes were deprecated).  However, I'm
seeing some weirdness: the "mapred.job.tracker" setting that I set on my
job's Configuration is being ignored, so the job runs locally.  I was
wondering whether the way I'm submitting the job might have something
to do with it.
On a related note, if there's actually no difference between the two
methods, would anybody have any idea what could cause the
"mapred.job.tracker" setting on a job Configuration to be ignored?  (I
currently have it set to "hdfs://<hadoop_job_tracker_host_name>:9001".)

TIA,

DR