|
imen Megdiche
2012-12-12, 10:41
Mohammad Tariq
2012-12-12, 11:04
imen Megdiche
2012-12-12, 11:41
Mohammad Tariq
2012-12-12, 12:06
imen Megdiche
2012-12-12, 12:11
imen Megdiche
2012-12-12, 12:12
Mohammad Tariq
2012-12-12, 12:19
imen Megdiche
2012-12-12, 12:23
Mohammad Tariq
2012-12-12, 12:25
imen Megdiche
2012-12-12, 12:30
Mohammad Tariq
2012-12-12, 12:36
imen Megdiche
2012-12-12, 12:44
Mohammad Tariq
2012-12-12, 12:53
imen Megdiche
2012-12-12, 13:01
Mohammad Tariq
2012-12-12, 13:07
imen Megdiche
2012-12-12, 13:16
Mohammad Tariq
2012-12-12, 13:22
imen Megdiche
2012-12-12, 13:38
Mohammad Tariq
2012-12-12, 13:48
imen Megdiche
2012-12-12, 14:01
Mohammad Tariq
2012-12-12, 14:07
imen Megdiche
2012-12-12, 14:15
Mohammad Tariq
2012-12-12, 14:48
imen Megdiche
2012-12-12, 15:25
Mohammad Tariq
2012-12-12, 15:37
|
-
Modify the number of map tasksimen Megdiche 2012-12-12, 10:41
Hi,
I try to force the number of map for the mapreduce job with the command : public static void main(String[] args) throws Exception { JobConf conf = new JobConf(WordCount.class); conf.set("mapred.job.tracker", "local"); conf.set("fs.default.name", "local"); conf.setJobName("wordcount"); conf.setOutputKeyClass(Text.class); conf.setOutputValueClass(IntWritable.class); conf.setNumMapTask(6); conf.setMapperClass(Map.class); conf.setCombinerClass(Reduce.class); conf.setReducerClass(Reduce.class); ... } But it doesn t work. What can i do to modify the number of map and reduce tasks. Thank you +
imen Megdiche 2012-12-12, 10:41
-
Re: Modify the number of map tasksMohammad Tariq 2012-12-12, 11:04
Hi Imen,
You can add "mapred.map.tasks" property in your mapred-site.xml file. But, it is just a hint for the InputFormat. Actually no. of maps is actually determined by the no of InputSplits created by the InputFormat. HTH Regards, Mohammad Tariq On Wed, Dec 12, 2012 at 4:11 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > Hi, > > I try to force the number of map for the mapreduce job with the command : > public static void main(String[] args) throws Exception { > > JobConf conf = new JobConf(WordCount.class); > conf.set("mapred.job.tracker", "local"); > conf.set("fs.default.name", "local"); > conf.setJobName("wordcount"); > > conf.setOutputKeyClass(Text.class); > conf.setOutputValueClass(IntWritable.class); > > conf.setNumMapTask(6); > conf.setMapperClass(Map.class); > conf.setCombinerClass(Reduce.class); > conf.setReducerClass(Reduce.class); > ... > } > > But it doesn t work. > What can i do to modify the number of map and reduce tasks. > > Thank you > +
Mohammad Tariq 2012-12-12, 11:04
-
Re: Modify the number of map tasksimen Megdiche 2012-12-12, 11:41
Thank you Mohammad but the number of map tasks still the same in the
execution. Do you know how to capture the time spent on execution. 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > Hi Imen, > > You can add "mapred.map.tasks" property in your mapred-site.xml file. > > But, it is just a hint for the InputFormat. Actually no. of maps is > actually determined by the no of InputSplits created by the InputFormat. > > HTH > > Regards, > Mohammad Tariq > > > > On Wed, Dec 12, 2012 at 4:11 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > >> Hi, >> >> I try to force the number of map for the mapreduce job with the command : >> public static void main(String[] args) throws Exception { >> >> JobConf conf = new JobConf(WordCount.class); >> conf.set("mapred.job.tracker", "local"); >> conf.set("fs.default.name", "local"); >> conf.setJobName("wordcount"); >> >> conf.setOutputKeyClass(Text.class); >> conf.setOutputValueClass(IntWritable.class); >> >> conf.setNumMapTask(6); >> conf.setMapperClass(Map.class); >> conf.setCombinerClass(Reduce.class); >> conf.setReducerClass(Reduce.class); >> ... >> } >> >> But it doesn t work. >> What can i do to modify the number of map and reduce tasks. >> >> Thank you >> > > +
imen Megdiche 2012-12-12, 11:41
-
Re: Modify the number of map tasksMohammad Tariq 2012-12-12, 12:06
Hi Imen,
You can visit the MR web UI at "JobTrackerHost:50030" and see all the useful information like no. of mappers, no of reducers, time taken for the execution etc. One quick question for you, what is the size of your data and what is the no of maps which you are getting right now? Regards, Mohammad Tariq On Wed, Dec 12, 2012 at 5:11 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > Thank you Mohammad but the number of map tasks still the same in the > execution. Do you know how to capture the time spent on execution. > > > 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > >> Hi Imen, >> >> You can add "mapred.map.tasks" property in your mapred-site.xml file. >> >> But, it is just a hint for the InputFormat. Actually no. of maps is >> actually determined by the no of InputSplits created by the InputFormat. >> >> HTH >> >> Regards, >> Mohammad Tariq >> >> >> >> On Wed, Dec 12, 2012 at 4:11 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: >> >>> Hi, >>> >>> I try to force the number of map for the mapreduce job with the command >>> : >>> public static void main(String[] args) throws Exception { >>> >>> JobConf conf = new JobConf(WordCount.class); >>> conf.set("mapred.job.tracker", "local"); >>> conf.set("fs.default.name", "local"); >>> conf.setJobName("wordcount"); >>> >>> conf.setOutputKeyClass(Text.class); >>> conf.setOutputValueClass(IntWritable.class); >>> >>> conf.setNumMapTask(6); >>> conf.setMapperClass(Map.class); >>> conf.setCombinerClass(Reduce.class); >>> conf.setReducerClass(Reduce.class); >>> ... >>> } >>> >>> But it doesn t work. >>> What can i do to modify the number of map and reduce tasks. >>> >>> Thank you >>> >> >> > +
Mohammad Tariq 2012-12-12, 12:06
-
Re: Modify the number of map tasksimen Megdiche 2012-12-12, 12:11
the size of data 49 MB and n of map 4
the web UI JobTrackerHost:50030 does not wok, what should i do to make this appear , i work on ubuntu 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > Hi Imen, > > You can visit the MR web UI at "JobTrackerHost:50030" and see all the > useful information like no. of mappers, no of reducers, time taken for the > execution etc. > > One quick question for you, what is the size of your data and what is the > no of maps which you are getting right now? > > Regards, > Mohammad Tariq > > > > On Wed, Dec 12, 2012 at 5:11 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > >> Thank you Mohammad but the number of map tasks still the same in the >> execution. Do you know how to capture the time spent on execution. >> >> >> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >> >>> Hi Imen, >>> >>> You can add "mapred.map.tasks" property in your mapred-site.xml >>> file. >>> >>> But, it is just a hint for the InputFormat. Actually no. of maps is >>> actually determined by the no of InputSplits created by the InputFormat. >>> >>> HTH >>> >>> Regards, >>> Mohammad Tariq >>> >>> >>> >>> On Wed, Dec 12, 2012 at 4:11 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: >>> >>>> Hi, >>>> >>>> I try to force the number of map for the mapreduce job with the command >>>> : >>>> public static void main(String[] args) throws Exception { >>>> >>>> JobConf conf = new JobConf(WordCount.class); >>>> conf.set("mapred.job.tracker", "local"); >>>> conf.set("fs.default.name", "local"); >>>> conf.setJobName("wordcount"); >>>> >>>> conf.setOutputKeyClass(Text.class); >>>> conf.setOutputValueClass(IntWritable.class); >>>> >>>> conf.setNumMapTask(6); >>>> conf.setMapperClass(Map.class); >>>> conf.setCombinerClass(Reduce.class); >>>> conf.setReducerClass(Reduce.class); >>>> ... >>>> } >>>> >>>> But it doesn t work. >>>> What can i do to modify the number of map and reduce tasks. >>>> >>>> Thank you >>>> >>> >>> >> > +
imen Megdiche 2012-12-12, 12:11
-
Re: Modify the number of map tasksimen Megdiche 2012-12-12, 12:12
excuse me the data size is 98 MB
2012/12/12 imen Megdiche <[EMAIL PROTECTED]> > the size of data 49 MB and n of map 4 > the web UI JobTrackerHost:50030 does not wok, what should i do to make > this appear , i work on ubuntu > > > 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > >> Hi Imen, >> >> You can visit the MR web UI at "JobTrackerHost:50030" and see all >> the useful information like no. of mappers, no of reducers, time taken for >> the execution etc. >> >> One quick question for you, what is the size of your data and what is the >> no of maps which you are getting right now? >> >> Regards, >> Mohammad Tariq >> >> >> >> On Wed, Dec 12, 2012 at 5:11 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: >> >>> Thank you Mohammad but the number of map tasks still the same in the >>> execution. Do you know how to capture the time spent on execution. >>> >>> >>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>> >>>> Hi Imen, >>>> >>>> You can add "mapred.map.tasks" property in your mapred-site.xml >>>> file. >>>> >>>> But, it is just a hint for the InputFormat. Actually no. of maps is >>>> actually determined by the no of InputSplits created by the InputFormat. >>>> >>>> HTH >>>> >>>> Regards, >>>> Mohammad Tariq >>>> >>>> >>>> >>>> On Wed, Dec 12, 2012 at 4:11 PM, imen Megdiche <[EMAIL PROTECTED] >>>> > wrote: >>>> >>>>> Hi, >>>>> >>>>> I try to force the number of map for the mapreduce job with the >>>>> command : >>>>> public static void main(String[] args) throws Exception { >>>>> >>>>> JobConf conf = new JobConf(WordCount.class); >>>>> conf.set("mapred.job.tracker", "local"); >>>>> conf.set("fs.default.name", "local"); >>>>> conf.setJobName("wordcount"); >>>>> >>>>> conf.setOutputKeyClass(Text.class); >>>>> conf.setOutputValueClass(IntWritable.class); >>>>> >>>>> conf.setNumMapTask(6); >>>>> conf.setMapperClass(Map.class); >>>>> conf.setCombinerClass(Reduce.class); >>>>> conf.setReducerClass(Reduce.class); >>>>> ... >>>>> } >>>>> >>>>> But it doesn t work. >>>>> What can i do to modify the number of map and reduce tasks. >>>>> >>>>> Thank you >>>>> >>>> >>>> >>> >> > +
imen Megdiche 2012-12-12, 12:12
-
Re: Modify the number of map tasksMohammad Tariq 2012-12-12, 12:19
You have to replace "JobTrackerHost" in "JobTrackerHost:50030" with the
actual name of the machine where JobTracker is running. For example, If you are working on a local cluster, you have to use "localhost:50030". Are you running your job through the command line or some IDE? Regards, Mohammad Tariq On Wed, Dec 12, 2012 at 5:42 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > excuse me the data size is 98 MB > > > 2012/12/12 imen Megdiche <[EMAIL PROTECTED]> > >> the size of data 49 MB and n of map 4 >> the web UI JobTrackerHost:50030 does not wok, what should i do to make >> this appear , i work on ubuntu >> >> >> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >> >>> Hi Imen, >>> >>> You can visit the MR web UI at "JobTrackerHost:50030" and see all >>> the useful information like no. of mappers, no of reducers, time taken for >>> the execution etc. >>> >>> One quick question for you, what is the size of your data and what is >>> the no of maps which you are getting right now? >>> >>> Regards, >>> Mohammad Tariq >>> >>> >>> >>> On Wed, Dec 12, 2012 at 5:11 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: >>> >>>> Thank you Mohammad but the number of map tasks still the same in the >>>> execution. Do you know how to capture the time spent on execution. >>>> >>>> >>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>> >>>>> Hi Imen, >>>>> >>>>> You can add "mapred.map.tasks" property in your mapred-site.xml >>>>> file. >>>>> >>>>> But, it is just a hint for the InputFormat. Actually no. of maps is >>>>> actually determined by the no of InputSplits created by the InputFormat. >>>>> >>>>> HTH >>>>> >>>>> Regards, >>>>> Mohammad Tariq >>>>> >>>>> >>>>> >>>>> On Wed, Dec 12, 2012 at 4:11 PM, imen Megdiche < >>>>> [EMAIL PROTECTED]> wrote: >>>>> >>>>>> Hi, >>>>>> >>>>>> I try to force the number of map for the mapreduce job with the >>>>>> command : >>>>>> public static void main(String[] args) throws Exception { >>>>>> >>>>>> JobConf conf = new JobConf(WordCount.class); >>>>>> conf.set("mapred.job.tracker", "local"); >>>>>> conf.set("fs.default.name", "local"); >>>>>> conf.setJobName("wordcount"); >>>>>> >>>>>> conf.setOutputKeyClass(Text.class); >>>>>> conf.setOutputValueClass(IntWritable.class); >>>>>> >>>>>> conf.setNumMapTask(6); >>>>>> conf.setMapperClass(Map.class); >>>>>> conf.setCombinerClass(Reduce.class); >>>>>> conf.setReducerClass(Reduce.class); >>>>>> ... >>>>>> } >>>>>> >>>>>> But it doesn t work. >>>>>> What can i do to modify the number of map and reduce tasks. >>>>>> >>>>>> Thank you >>>>>> >>>>> >>>>> >>>> >>> >> > +
Mohammad Tariq 2012-12-12, 12:19
-
Re: Modify the number of map tasksimen Megdiche 2012-12-12, 12:23
i run the job through the command line
2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > You have to replace "JobTrackerHost" in "JobTrackerHost:50030" with the > actual name of the machine where JobTracker is running. For example, If > you are working on a local cluster, you have to use "localhost:50030". > > Are you running your job through the command line or some IDE? > > Regards, > Mohammad Tariq > > > > On Wed, Dec 12, 2012 at 5:42 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > >> excuse me the data size is 98 MB >> >> >> 2012/12/12 imen Megdiche <[EMAIL PROTECTED]> >> >>> the size of data 49 MB and n of map 4 >>> the web UI JobTrackerHost:50030 does not wok, what should i do to make >>> this appear , i work on ubuntu >>> >>> >>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>> >>>> Hi Imen, >>>> >>>> You can visit the MR web UI at "JobTrackerHost:50030" and see all >>>> the useful information like no. of mappers, no of reducers, time taken for >>>> the execution etc. >>>> >>>> One quick question for you, what is the size of your data and what is >>>> the no of maps which you are getting right now? >>>> >>>> Regards, >>>> Mohammad Tariq >>>> >>>> >>>> >>>> On Wed, Dec 12, 2012 at 5:11 PM, imen Megdiche <[EMAIL PROTECTED] >>>> > wrote: >>>> >>>>> Thank you Mohammad but the number of map tasks still the same in the >>>>> execution. Do you know how to capture the time spent on execution. >>>>> >>>>> >>>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>>> >>>>>> Hi Imen, >>>>>> >>>>>> You can add "mapred.map.tasks" property in your mapred-site.xml >>>>>> file. >>>>>> >>>>>> But, it is just a hint for the InputFormat. Actually no. of maps is >>>>>> actually determined by the no of InputSplits created by the InputFormat. >>>>>> >>>>>> HTH >>>>>> >>>>>> Regards, >>>>>> Mohammad Tariq >>>>>> >>>>>> >>>>>> >>>>>> On Wed, Dec 12, 2012 at 4:11 PM, imen Megdiche < >>>>>> [EMAIL PROTECTED]> wrote: >>>>>> >>>>>>> Hi, >>>>>>> >>>>>>> I try to force the number of map for the mapreduce job with the >>>>>>> command : >>>>>>> public static void main(String[] args) throws Exception { >>>>>>> >>>>>>> JobConf conf = new JobConf(WordCount.class); >>>>>>> conf.set("mapred.job.tracker", "local"); >>>>>>> conf.set("fs.default.name", "local"); >>>>>>> conf.setJobName("wordcount"); >>>>>>> >>>>>>> conf.setOutputKeyClass(Text.class); >>>>>>> conf.setOutputValueClass(IntWritable.class); >>>>>>> >>>>>>> conf.setNumMapTask(6); >>>>>>> conf.setMapperClass(Map.class); >>>>>>> conf.setCombinerClass(Reduce.class); >>>>>>> conf.setReducerClass(Reduce.class); >>>>>>> ... >>>>>>> } >>>>>>> >>>>>>> But it doesn t work. >>>>>>> What can i do to modify the number of map and reduce tasks. >>>>>>> >>>>>>> Thank you >>>>>>> >>>>>> >>>>>> >>>>> >>>> >>> >> > +
imen Megdiche 2012-12-12, 12:23
-
Re: Modify the number of map tasksMohammad Tariq 2012-12-12, 12:25
Any luck with "localhost:50030"??
Regards, Mohammad Tariq On Wed, Dec 12, 2012 at 5:53 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > i run the job through the command line > > > 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > >> You have to replace "JobTrackerHost" in "JobTrackerHost:50030" with the >> actual name of the machine where JobTracker is running. For example, If >> you are working on a local cluster, you have to use "localhost:50030". >> >> Are you running your job through the command line or some IDE? >> >> Regards, >> Mohammad Tariq >> >> >> >> On Wed, Dec 12, 2012 at 5:42 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: >> >>> excuse me the data size is 98 MB >>> >>> >>> 2012/12/12 imen Megdiche <[EMAIL PROTECTED]> >>> >>>> the size of data 49 MB and n of map 4 >>>> the web UI JobTrackerHost:50030 does not wok, what should i do to make >>>> this appear , i work on ubuntu >>>> >>>> >>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>> >>>>> Hi Imen, >>>>> >>>>> You can visit the MR web UI at "JobTrackerHost:50030" and see all >>>>> the useful information like no. of mappers, no of reducers, time taken for >>>>> the execution etc. >>>>> >>>>> One quick question for you, what is the size of your data and what is >>>>> the no of maps which you are getting right now? >>>>> >>>>> Regards, >>>>> Mohammad Tariq >>>>> >>>>> >>>>> >>>>> On Wed, Dec 12, 2012 at 5:11 PM, imen Megdiche < >>>>> [EMAIL PROTECTED]> wrote: >>>>> >>>>>> Thank you Mohammad but the number of map tasks still the same in the >>>>>> execution. Do you know how to capture the time spent on execution. >>>>>> >>>>>> >>>>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>>>> >>>>>>> Hi Imen, >>>>>>> >>>>>>> You can add "mapred.map.tasks" property in your mapred-site.xml >>>>>>> file. >>>>>>> >>>>>>> But, it is just a hint for the InputFormat. Actually no. of maps is >>>>>>> actually determined by the no of InputSplits created by the InputFormat. >>>>>>> >>>>>>> HTH >>>>>>> >>>>>>> Regards, >>>>>>> Mohammad Tariq >>>>>>> >>>>>>> >>>>>>> >>>>>>> On Wed, Dec 12, 2012 at 4:11 PM, imen Megdiche < >>>>>>> [EMAIL PROTECTED]> wrote: >>>>>>> >>>>>>>> Hi, >>>>>>>> >>>>>>>> I try to force the number of map for the mapreduce job with the >>>>>>>> command : >>>>>>>> public static void main(String[] args) throws Exception { >>>>>>>> >>>>>>>> JobConf conf = new JobConf(WordCount.class); >>>>>>>> conf.set("mapred.job.tracker", "local"); >>>>>>>> conf.set("fs.default.name", "local"); >>>>>>>> conf.setJobName("wordcount"); >>>>>>>> >>>>>>>> conf.setOutputKeyClass(Text.class); >>>>>>>> conf.setOutputValueClass(IntWritable.class); >>>>>>>> >>>>>>>> conf.setNumMapTask(6); >>>>>>>> conf.setMapperClass(Map.class); >>>>>>>> conf.setCombinerClass(Reduce.class); >>>>>>>> conf.setReducerClass(Reduce.class); >>>>>>>> ... >>>>>>>> } >>>>>>>> >>>>>>>> But it doesn t work. >>>>>>>> What can i do to modify the number of map and reduce tasks. >>>>>>>> >>>>>>>> Thank you >>>>>>>> >>>>>>> >>>>>>> >>>>>> >>>>> >>>> >>> >> > +
Mohammad Tariq 2012-12-12, 12:25
-
Re: Modify the number of map tasksimen Megdiche 2012-12-12, 12:30
no
2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > Any luck with "localhost:50030"?? > > Regards, > Mohammad Tariq > > > > On Wed, Dec 12, 2012 at 5:53 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > >> i run the job through the command line >> >> >> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >> >>> You have to replace "JobTrackerHost" in "JobTrackerHost:50030" with the >>> actual name of the machine where JobTracker is running. For example, If >>> you are working on a local cluster, you have to use "localhost:50030". >>> >>> Are you running your job through the command line or some IDE? >>> >>> Regards, >>> Mohammad Tariq >>> >>> >>> >>> On Wed, Dec 12, 2012 at 5:42 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: >>> >>>> excuse me the data size is 98 MB >>>> >>>> >>>> 2012/12/12 imen Megdiche <[EMAIL PROTECTED]> >>>> >>>>> the size of data 49 MB and n of map 4 >>>>> the web UI JobTrackerHost:50030 does not wok, what should i do to make >>>>> this appear , i work on ubuntu >>>>> >>>>> >>>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>>> >>>>>> Hi Imen, >>>>>> >>>>>> You can visit the MR web UI at "JobTrackerHost:50030" and see >>>>>> all the useful information like no. of mappers, no of reducers, time taken >>>>>> for the execution etc. >>>>>> >>>>>> One quick question for you, what is the size of your data and what is >>>>>> the no of maps which you are getting right now? >>>>>> >>>>>> Regards, >>>>>> Mohammad Tariq >>>>>> >>>>>> >>>>>> >>>>>> On Wed, Dec 12, 2012 at 5:11 PM, imen Megdiche < >>>>>> [EMAIL PROTECTED]> wrote: >>>>>> >>>>>>> Thank you Mohammad but the number of map tasks still the same in the >>>>>>> execution. Do you know how to capture the time spent on execution. >>>>>>> >>>>>>> >>>>>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>>>>> >>>>>>>> Hi Imen, >>>>>>>> >>>>>>>> You can add "mapred.map.tasks" property in your mapred-site.xml >>>>>>>> file. >>>>>>>> >>>>>>>> But, it is just a hint for the InputFormat. Actually no. of maps is >>>>>>>> actually determined by the no of InputSplits created by the InputFormat. >>>>>>>> >>>>>>>> HTH >>>>>>>> >>>>>>>> Regards, >>>>>>>> Mohammad Tariq >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> On Wed, Dec 12, 2012 at 4:11 PM, imen Megdiche < >>>>>>>> [EMAIL PROTECTED]> wrote: >>>>>>>> >>>>>>>>> Hi, >>>>>>>>> >>>>>>>>> I try to force the number of map for the mapreduce job with the >>>>>>>>> command : >>>>>>>>> public static void main(String[] args) throws Exception { >>>>>>>>> >>>>>>>>> JobConf conf = new JobConf(WordCount.class); >>>>>>>>> conf.set("mapred.job.tracker", "local"); >>>>>>>>> conf.set("fs.default.name", "local"); >>>>>>>>> conf.setJobName("wordcount"); >>>>>>>>> >>>>>>>>> conf.setOutputKeyClass(Text.class); >>>>>>>>> conf.setOutputValueClass(IntWritable.class); >>>>>>>>> >>>>>>>>> conf.setNumMapTask(6); >>>>>>>>> conf.setMapperClass(Map.class); >>>>>>>>> conf.setCombinerClass(Reduce.class); >>>>>>>>> conf.setReducerClass(Reduce.class); >>>>>>>>> ... >>>>>>>>> } >>>>>>>>> >>>>>>>>> But it doesn t work. >>>>>>>>> What can i do to modify the number of map and reduce tasks. >>>>>>>>> >>>>>>>>> Thank you >>>>>>>>> >>>>>>>> >>>>>>>> >>>>>>> >>>>>> >>>>> >>>> >>> >> > +
imen Megdiche 2012-12-12, 12:30
-
Re: Modify the number of map tasksMohammad Tariq 2012-12-12, 12:36
Are you working locally?What exactly is the issue?
Regards, Mohammad Tariq On Wed, Dec 12, 2012 at 6:00 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > no > > > 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > >> Any luck with "localhost:50030"?? >> >> Regards, >> Mohammad Tariq >> >> >> >> On Wed, Dec 12, 2012 at 5:53 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: >> >>> i run the job through the command line >>> >>> >>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>> >>>> You have to replace "JobTrackerHost" in "JobTrackerHost:50030" with >>>> the actual name of the machine where JobTracker is running. For >>>> example, If you are working on a local cluster, you have to use >>>> "localhost:50030". >>>> >>>> Are you running your job through the command line or some IDE? >>>> >>>> Regards, >>>> Mohammad Tariq >>>> >>>> >>>> >>>> On Wed, Dec 12, 2012 at 5:42 PM, imen Megdiche <[EMAIL PROTECTED] >>>> > wrote: >>>> >>>>> excuse me the data size is 98 MB >>>>> >>>>> >>>>> 2012/12/12 imen Megdiche <[EMAIL PROTECTED]> >>>>> >>>>>> the size of data 49 MB and n of map 4 >>>>>> the web UI JobTrackerHost:50030 does not wok, what should i do to >>>>>> make this appear , i work on ubuntu >>>>>> >>>>>> >>>>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>>>> >>>>>>> Hi Imen, >>>>>>> >>>>>>> You can visit the MR web UI at "JobTrackerHost:50030" and see >>>>>>> all the useful information like no. of mappers, no of reducers, time taken >>>>>>> for the execution etc. >>>>>>> >>>>>>> One quick question for you, what is the size of your data and what >>>>>>> is the no of maps which you are getting right now? >>>>>>> >>>>>>> Regards, >>>>>>> Mohammad Tariq >>>>>>> >>>>>>> >>>>>>> >>>>>>> On Wed, Dec 12, 2012 at 5:11 PM, imen Megdiche < >>>>>>> [EMAIL PROTECTED]> wrote: >>>>>>> >>>>>>>> Thank you Mohammad but the number of map tasks still the same in >>>>>>>> the execution. Do you know how to capture the time spent on execution. >>>>>>>> >>>>>>>> >>>>>>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>>>>>> >>>>>>>>> Hi Imen, >>>>>>>>> >>>>>>>>> You can add "mapred.map.tasks" property in your >>>>>>>>> mapred-site.xml file. >>>>>>>>> >>>>>>>>> But, it is just a hint for the InputFormat. Actually no. of maps >>>>>>>>> is actually determined by the no of InputSplits created by the InputFormat. >>>>>>>>> >>>>>>>>> HTH >>>>>>>>> >>>>>>>>> Regards, >>>>>>>>> Mohammad Tariq >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> On Wed, Dec 12, 2012 at 4:11 PM, imen Megdiche < >>>>>>>>> [EMAIL PROTECTED]> wrote: >>>>>>>>> >>>>>>>>>> Hi, >>>>>>>>>> >>>>>>>>>> I try to force the number of map for the mapreduce job with the >>>>>>>>>> command : >>>>>>>>>> public static void main(String[] args) throws Exception { >>>>>>>>>> >>>>>>>>>> JobConf conf = new JobConf(WordCount.class); >>>>>>>>>> conf.set("mapred.job.tracker", "local"); >>>>>>>>>> conf.set("fs.default.name", "local"); >>>>>>>>>> conf.setJobName("wordcount"); >>>>>>>>>> >>>>>>>>>> conf.setOutputKeyClass(Text.class); >>>>>>>>>> conf.setOutputValueClass(IntWritable.class); >>>>>>>>>> >>>>>>>>>> conf.setNumMapTask(6); >>>>>>>>>> conf.setMapperClass(Map.class); >>>>>>>>>> conf.setCombinerClass(Reduce.class); >>>>>>>>>> conf.setReducerClass(Reduce.class); >>>>>>>>>> ... >>>>>>>>>> } >>>>>>>>>> >>>>>>>>>> But it doesn t work. >>>>>>>>>> What can i do to modify the number of map and reduce tasks. >>>>>>>>>> >>>>>>>>>> Thank you >>>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>> >>>>>>> >>>>>> >>>>> >>>> >>> >> > +
Mohammad Tariq 2012-12-12, 12:36
-
Re: Modify the number of map tasksimen Megdiche 2012-12-12, 12:44
My goal is to analyze the response time of MapReduce depending on the size
of the input files. I need to change the number of map and / or Reduce tasks and recover the execution time. S it turns out that nothing works locally on my pc : neither hadoop job-status command job_local_0001 (which return no job found ) nor localhost: 50030 I will be very grateful if you can help m better understand these problem 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > Are you working locally?What exactly is the issue? > > Regards, > Mohammad Tariq > > > > On Wed, Dec 12, 2012 at 6:00 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > >> no >> >> >> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >> >>> Any luck with "localhost:50030"?? >>> >>> Regards, >>> Mohammad Tariq >>> >>> >>> >>> On Wed, Dec 12, 2012 at 5:53 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: >>> >>>> i run the job through the command line >>>> >>>> >>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>> >>>>> You have to replace "JobTrackerHost" in "JobTrackerHost:50030" with >>>>> the actual name of the machine where JobTracker is running. For >>>>> example, If you are working on a local cluster, you have to use >>>>> "localhost:50030". >>>>> >>>>> Are you running your job through the command line or some IDE? >>>>> >>>>> Regards, >>>>> Mohammad Tariq >>>>> >>>>> >>>>> >>>>> On Wed, Dec 12, 2012 at 5:42 PM, imen Megdiche < >>>>> [EMAIL PROTECTED]> wrote: >>>>> >>>>>> excuse me the data size is 98 MB >>>>>> >>>>>> >>>>>> 2012/12/12 imen Megdiche <[EMAIL PROTECTED]> >>>>>> >>>>>>> the size of data 49 MB and n of map 4 >>>>>>> the web UI JobTrackerHost:50030 does not wok, what should i do to >>>>>>> make this appear , i work on ubuntu >>>>>>> >>>>>>> >>>>>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>>>>> >>>>>>>> Hi Imen, >>>>>>>> >>>>>>>> You can visit the MR web UI at "JobTrackerHost:50030" and see >>>>>>>> all the useful information like no. of mappers, no of reducers, time taken >>>>>>>> for the execution etc. >>>>>>>> >>>>>>>> One quick question for you, what is the size of your data and what >>>>>>>> is the no of maps which you are getting right now? >>>>>>>> >>>>>>>> Regards, >>>>>>>> Mohammad Tariq >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> On Wed, Dec 12, 2012 at 5:11 PM, imen Megdiche < >>>>>>>> [EMAIL PROTECTED]> wrote: >>>>>>>> >>>>>>>>> Thank you Mohammad but the number of map tasks still the same in >>>>>>>>> the execution. Do you know how to capture the time spent on execution. >>>>>>>>> >>>>>>>>> >>>>>>>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>>>>>>> >>>>>>>>>> Hi Imen, >>>>>>>>>> >>>>>>>>>> You can add "mapred.map.tasks" property in your >>>>>>>>>> mapred-site.xml file. >>>>>>>>>> >>>>>>>>>> But, it is just a hint for the InputFormat. Actually no. of maps >>>>>>>>>> is actually determined by the no of InputSplits created by the InputFormat. >>>>>>>>>> >>>>>>>>>> HTH >>>>>>>>>> >>>>>>>>>> Regards, >>>>>>>>>> Mohammad Tariq >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> On Wed, Dec 12, 2012 at 4:11 PM, imen Megdiche < >>>>>>>>>> [EMAIL PROTECTED]> wrote: >>>>>>>>>> >>>>>>>>>>> Hi, >>>>>>>>>>> >>>>>>>>>>> I try to force the number of map for the mapreduce job with the >>>>>>>>>>> command : >>>>>>>>>>> public static void main(String[] args) throws Exception { >>>>>>>>>>> >>>>>>>>>>> JobConf conf = new JobConf(WordCount.class); >>>>>>>>>>> conf.set("mapred.job.tracker", "local"); >>>>>>>>>>> conf.set("fs.default.name", "local"); >>>>>>>>>>> conf.setJobName("wordcount"); >>>>>>>>>>> >>>>>>>>>>> conf.setOutputKeyClass(Text.class); >>>>>>>>>>> conf.setOutputValueClass(IntWritable.class); >>>>>>>>>>> >>>>>>>>>>> conf.setNumMapTask(6); >>>>>>>>>>> conf.setMapperClass(Map.class); >>>>>>>>>>> conf.setCombinerClass(Reduce.class); >>>>>>>>>>> conf.setReducerClass(Reduce.class); >>>>>>>>>>> ... >>>> +
imen Megdiche 2012-12-12, 12:44
-
Re: Modify the number of map tasksMohammad Tariq 2012-12-12, 12:53
I would check if all the daemons are running properly or not, before
anything else. If some problem is found, next place to track is the log of each daemon. The correct command to check the status of a job from command line is : hadoop job -status jobID. (Mind the 'space' after job and remove 'command' from the statement) HTH Regards, Mohammad Tariq On Wed, Dec 12, 2012 at 6:14 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > My goal is to analyze the response time of MapReduce depending on the size > of the input files. I need to change the number of map and / or Reduce > tasks and recover the execution time. S it turns out that nothing works locally > on my pc : > neither hadoop job-status command job_local_0001 (which return no job > found ) > nor localhost: 50030 > I will be very grateful if you can help m better understand these problem > > > 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > >> Are you working locally?What exactly is the issue? >> >> Regards, >> Mohammad Tariq >> >> >> >> On Wed, Dec 12, 2012 at 6:00 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: >> >>> no >>> >>> >>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>> >>>> Any luck with "localhost:50030"?? >>>> >>>> Regards, >>>> Mohammad Tariq >>>> >>>> >>>> >>>> On Wed, Dec 12, 2012 at 5:53 PM, imen Megdiche <[EMAIL PROTECTED] >>>> > wrote: >>>> >>>>> i run the job through the command line >>>>> >>>>> >>>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>>> >>>>>> You have to replace "JobTrackerHost" in "JobTrackerHost:50030" with >>>>>> the actual name of the machine where JobTracker is running. For >>>>>> example, If you are working on a local cluster, you have to use >>>>>> "localhost:50030". >>>>>> >>>>>> Are you running your job through the command line or some IDE? >>>>>> >>>>>> Regards, >>>>>> Mohammad Tariq >>>>>> >>>>>> >>>>>> >>>>>> On Wed, Dec 12, 2012 at 5:42 PM, imen Megdiche < >>>>>> [EMAIL PROTECTED]> wrote: >>>>>> >>>>>>> excuse me the data size is 98 MB >>>>>>> >>>>>>> >>>>>>> 2012/12/12 imen Megdiche <[EMAIL PROTECTED]> >>>>>>> >>>>>>>> the size of data 49 MB and n of map 4 >>>>>>>> the web UI JobTrackerHost:50030 does not wok, what should i do to >>>>>>>> make this appear , i work on ubuntu >>>>>>>> >>>>>>>> >>>>>>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>>>>>> >>>>>>>>> Hi Imen, >>>>>>>>> >>>>>>>>> You can visit the MR web UI at "JobTrackerHost:50030" and see >>>>>>>>> all the useful information like no. of mappers, no of reducers, time taken >>>>>>>>> for the execution etc. >>>>>>>>> >>>>>>>>> One quick question for you, what is the size of your data and what >>>>>>>>> is the no of maps which you are getting right now? >>>>>>>>> >>>>>>>>> Regards, >>>>>>>>> Mohammad Tariq >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> On Wed, Dec 12, 2012 at 5:11 PM, imen Megdiche < >>>>>>>>> [EMAIL PROTECTED]> wrote: >>>>>>>>> >>>>>>>>>> Thank you Mohammad but the number of map tasks still the same in >>>>>>>>>> the execution. Do you know how to capture the time spent on execution. >>>>>>>>>> >>>>>>>>>> >>>>>>>>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>>>>>>>> >>>>>>>>>>> Hi Imen, >>>>>>>>>>> >>>>>>>>>>> You can add "mapred.map.tasks" property in your >>>>>>>>>>> mapred-site.xml file. >>>>>>>>>>> >>>>>>>>>>> But, it is just a hint for the InputFormat. Actually no. of maps >>>>>>>>>>> is actually determined by the no of InputSplits created by the InputFormat. >>>>>>>>>>> >>>>>>>>>>> HTH >>>>>>>>>>> >>>>>>>>>>> Regards, >>>>>>>>>>> Mohammad Tariq >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> On Wed, Dec 12, 2012 at 4:11 PM, imen Megdiche < >>>>>>>>>>> [EMAIL PROTECTED]> wrote: >>>>>>>>>>> >>>>>>>>>>>> Hi, >>>>>>>>>>>> >>>>>>>>>>>> I try to force the number of map for the mapreduce job with the >>>>>>>>>>>> command : >>>>>>>>>>>> public static void main(String[] args) throws Exception { >>>>>>>>>>>> >>>>>>>>>>>> JobConf conf = new JobConf(WordCount.class); +
Mohammad Tariq 2012-12-12, 12:53
-
Re: Modify the number of map tasksimen Megdiche 2012-12-12, 13:01
i run the start-all.sh and all daemons starts without problems. But i the
log of the tasktracker look like this : 2012-12-12 13:53:45,495 INFO org.apache.hadoop.mapred.TaskTracker: STARTUP_MSG: /************************************************************ STARTUP_MSG: Starting TaskTracker STARTUP_MSG: host = megdiche-OptiPlex-GX280/127.0.1.1 STARTUP_MSG: args = [] STARTUP_MSG: version = 1.0.4 STARTUP_MSG: build https://svn.apache.org/repos/asf/hadoop/common/branches/branch-1.0 -r 1393290; compiled by 'hortonfo' on Wed Oct 3 05:13:58 UTC 2012 ************************************************************/ 2012-12-12 13:53:47,009 INFO org.apache.hadoop.metrics2.impl.MetricsConfig: loaded properties from hadoop-metrics2.properties 2012-12-12 13:53:47,331 INFO org.apache.hadoop.metrics2.impl.MetricsSourceAdapter: MBean for source MetricsSystem,sub=Stats registered. 2012-12-12 13:53:47,336 INFO org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Scheduled snapshot period at 10 second(s). 2012-12-12 13:53:47,336 INFO org.apache.hadoop.metrics2.impl.MetricsSystemImpl: TaskTracker metrics system started 2012-12-12 13:53:48,165 INFO org.apache.hadoop.metrics2.impl.MetricsSourceAdapter: MBean for source ugi registered. 2012-12-12 13:53:48,192 WARN org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Source name ugi already exists! 2012-12-12 13:53:48,513 ERROR org.apache.hadoop.mapred.TaskTracker: Can not start task tracker because java.lang.IllegalArgumentException: Does not contain a valid host:port authority: local at org.apache.hadoop.net.NetUtils.createSocketAddr(NetUtils.java:162) at org.apache.hadoop.net.NetUtils.createSocketAddr(NetUtils.java:128) at org.apache.hadoop.mapred.JobTracker.getAddress(JobTracker.java:2560) at org.apache.hadoop.mapred.TaskTracker.<init>(TaskTracker.java:1426) at org.apache.hadoop.mapred.TaskTracker.main(TaskTracker.java:3742) 2012-12-12 13:53:48,519 INFO org.apache.hadoop.mapred.TaskTracker: SHUTDOWN_MSG: /************************************************************ SHUTDOWN_MSG: Shutting down TaskTracker at megdiche-OptiPlex-GX280/127.0.1.1 ************************************************************/ 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > I would check if all the daemons are running properly or not, before > anything else. If some problem is found, next place to track is the log of > each daemon. > > The correct command to check the status of a job from command line is : > hadoop job -status jobID. > (Mind the 'space' after job and remove 'command' from the statement) > > HTH > > Regards, > Mohammad Tariq > > > > On Wed, Dec 12, 2012 at 6:14 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > >> My goal is to analyze the response time of MapReduce depending on the size >> of the input files. I need to change the number of map and / or Reduce >> tasks and recover the execution time. S it turns out that nothing works locally >> on my pc : >> neither hadoop job-status command job_local_0001 (which return no job >> found ) >> nor localhost: 50030 >> I will be very grateful if you can help m better understand these problem >> >> >> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >> >>> Are you working locally?What exactly is the issue? >>> >>> Regards, >>> Mohammad Tariq >>> >>> >>> >>> On Wed, Dec 12, 2012 at 6:00 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: >>> >>>> no >>>> >>>> >>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>> >>>>> Any luck with "localhost:50030"?? >>>>> >>>>> Regards, >>>>> Mohammad Tariq >>>>> >>>>> >>>>> >>>>> On Wed, Dec 12, 2012 at 5:53 PM, imen Megdiche < >>>>> [EMAIL PROTECTED]> wrote: >>>>> >>>>>> i run the job through the command line >>>>>> >>>>>> >>>>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>>>> >>>>>>> You have to replace "JobTrackerHost" in "JobTrackerHost:50030" with >>>>>>> the actual name of the machine where JobTracker is running. For >>>>>>> example, If you are working on a local cluster, you have to use +
imen Megdiche 2012-12-12, 13:01
-
Re: Modify the number of map tasksMohammad Tariq 2012-12-12, 13:07
Can I have a look at your config files?
Regards, Mohammad Tariq On Wed, Dec 12, 2012 at 6:31 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > i run the start-all.sh and all daemons starts without problems. But i the > log of the tasktracker look like this : > > > 2012-12-12 13:53:45,495 INFO org.apache.hadoop.mapred.TaskTracker: > STARTUP_MSG: > /************************************************************ > STARTUP_MSG: Starting TaskTracker > STARTUP_MSG: host = megdiche-OptiPlex-GX280/127.0.1.1 > STARTUP_MSG: args = [] > STARTUP_MSG: version = 1.0.4 > STARTUP_MSG: build > https://svn.apache.org/repos/asf/hadoop/common/branches/branch-1.0 -r > 1393290; compiled by 'hortonfo' on Wed Oct 3 05:13:58 UTC 2012 > ************************************************************/ > 2012-12-12 13:53:47,009 INFO > org.apache.hadoop.metrics2.impl.MetricsConfig: loaded properties from > hadoop-metrics2.properties > 2012-12-12 13:53:47,331 INFO > org.apache.hadoop.metrics2.impl.MetricsSourceAdapter: MBean for source > MetricsSystem,sub=Stats registered. > 2012-12-12 13:53:47,336 INFO > org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Scheduled snapshot > period at 10 second(s). > 2012-12-12 13:53:47,336 INFO > org.apache.hadoop.metrics2.impl.MetricsSystemImpl: TaskTracker metrics > system started > 2012-12-12 13:53:48,165 INFO > org.apache.hadoop.metrics2.impl.MetricsSourceAdapter: MBean for source ugi > registered. > 2012-12-12 13:53:48,192 WARN > org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Source name ugi already > exists! > 2012-12-12 13:53:48,513 ERROR org.apache.hadoop.mapred.TaskTracker: Can > not start task tracker because java.lang.IllegalArgumentException: Does not > contain a valid host:port authority: local > at org.apache.hadoop.net.NetUtils.createSocketAddr(NetUtils.java:162) > at org.apache.hadoop.net.NetUtils.createSocketAddr(NetUtils.java:128) > at org.apache.hadoop.mapred.JobTracker.getAddress(JobTracker.java:2560) > at org.apache.hadoop.mapred.TaskTracker.<init>(TaskTracker.java:1426) > at org.apache.hadoop.mapred.TaskTracker.main(TaskTracker.java:3742) > > 2012-12-12 13:53:48,519 INFO org.apache.hadoop.mapred.TaskTracker: > SHUTDOWN_MSG: > /************************************************************ > SHUTDOWN_MSG: Shutting down TaskTracker at megdiche-OptiPlex-GX280/ > 127.0.1.1 > ************************************************************/ > > > > > 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > >> I would check if all the daemons are running properly or not, before >> anything else. If some problem is found, next place to track is the log of >> each daemon. >> >> The correct command to check the status of a job from command line is : >> hadoop job -status jobID. >> (Mind the 'space' after job and remove 'command' from the statement) >> >> HTH >> >> Regards, >> Mohammad Tariq >> >> >> >> On Wed, Dec 12, 2012 at 6:14 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: >> >>> My goal is to analyze the response time of MapReduce depending on the size >>> of the input files. I need to change the number of map and / or Reduce >>> tasks and recover the execution time. S it turns out that nothing works locally >>> on my pc : >>> neither hadoop job-status command job_local_0001 (which return no job >>> found ) >>> nor localhost: 50030 >>> I will be very grateful if you can help m better understand these >>> problem >>> >>> >>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>> >>>> Are you working locally?What exactly is the issue? >>>> >>>> Regards, >>>> Mohammad Tariq >>>> >>>> >>>> >>>> On Wed, Dec 12, 2012 at 6:00 PM, imen Megdiche <[EMAIL PROTECTED] >>>> > wrote: >>>> >>>>> no >>>>> >>>>> >>>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>>> >>>>>> Any luck with "localhost:50030"?? >>>>>> >>>>>> Regards, >>>>>> Mohammad Tariq >>>>>> >>>>>> >>>>>> >>>>>> On Wed, Dec 12, 2012 at 5:53 PM, imen Megdiche < >>>>>> [EMAIL PROTECTED]> wrote: >>>>>> >>>> +
Mohammad Tariq 2012-12-12, 13:07
-
Re: Modify the number of map tasksimen Megdiche 2012-12-12, 13:16
For mapred-site.xml :
<configuration> <property> <name>mapred.map.tasks</name> <value>6</value> </property> </configuration> for core-site.xml : <configuration> <!-- <property> <name>fs.default.name</name> <value>hdfs://localhost:9100</value> </property> --> </configuration> on hdfs-site.xml nothing 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > Can I have a look at your config files? > > Regards, > Mohammad Tariq > > > > On Wed, Dec 12, 2012 at 6:31 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > >> i run the start-all.sh and all daemons starts without problems. But i the >> log of the tasktracker look like this : >> >> >> 2012-12-12 13:53:45,495 INFO org.apache.hadoop.mapred.TaskTracker: >> STARTUP_MSG: >> /************************************************************ >> STARTUP_MSG: Starting TaskTracker >> STARTUP_MSG: host = megdiche-OptiPlex-GX280/127.0.1.1 >> STARTUP_MSG: args = [] >> STARTUP_MSG: version = 1.0.4 >> STARTUP_MSG: build >> https://svn.apache.org/repos/asf/hadoop/common/branches/branch-1.0 -r >> 1393290; compiled by 'hortonfo' on Wed Oct 3 05:13:58 UTC 2012 >> ************************************************************/ >> 2012-12-12 13:53:47,009 INFO >> org.apache.hadoop.metrics2.impl.MetricsConfig: loaded properties from >> hadoop-metrics2.properties >> 2012-12-12 13:53:47,331 INFO >> org.apache.hadoop.metrics2.impl.MetricsSourceAdapter: MBean for source >> MetricsSystem,sub=Stats registered. >> 2012-12-12 13:53:47,336 INFO >> org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Scheduled snapshot >> period at 10 second(s). >> 2012-12-12 13:53:47,336 INFO >> org.apache.hadoop.metrics2.impl.MetricsSystemImpl: TaskTracker metrics >> system started >> 2012-12-12 13:53:48,165 INFO >> org.apache.hadoop.metrics2.impl.MetricsSourceAdapter: MBean for source ugi >> registered. >> 2012-12-12 13:53:48,192 WARN >> org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Source name ugi already >> exists! >> 2012-12-12 13:53:48,513 ERROR org.apache.hadoop.mapred.TaskTracker: Can >> not start task tracker because java.lang.IllegalArgumentException: Does not >> contain a valid host:port authority: local >> at org.apache.hadoop.net.NetUtils.createSocketAddr(NetUtils.java:162) >> at org.apache.hadoop.net.NetUtils.createSocketAddr(NetUtils.java:128) >> at >> org.apache.hadoop.mapred.JobTracker.getAddress(JobTracker.java:2560) >> at org.apache.hadoop.mapred.TaskTracker.<init>(TaskTracker.java:1426) >> at org.apache.hadoop.mapred.TaskTracker.main(TaskTracker.java:3742) >> >> 2012-12-12 13:53:48,519 INFO org.apache.hadoop.mapred.TaskTracker: >> SHUTDOWN_MSG: >> /************************************************************ >> SHUTDOWN_MSG: Shutting down TaskTracker at megdiche-OptiPlex-GX280/ >> 127.0.1.1 >> ************************************************************/ >> >> >> >> >> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >> >>> I would check if all the daemons are running properly or not, before >>> anything else. If some problem is found, next place to track is the log of >>> each daemon. >>> >>> The correct command to check the status of a job from command line is : >>> hadoop job -status jobID. >>> (Mind the 'space' after job and remove 'command' from the statement) >>> >>> HTH >>> >>> Regards, >>> Mohammad Tariq >>> >>> >>> >>> On Wed, Dec 12, 2012 at 6:14 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: >>> >>>> My goal is to analyze the response time of MapReduce depending on the size >>>> of the input files. I need to change the number of map and / or Reduce >>>> tasks and recover the execution time. S it turns out that nothing works locally >>>> on my pc : >>>> neither hadoop job-status command job_local_0001 (which return no job >>>> found ) >>>> nor localhost: 50030 >>>> I will be very grateful if you can help m better understand these >>>> problem >>>> >>>> >>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>> >>>>> Are you working locally?What exactly is the issue? +
imen Megdiche 2012-12-12, 13:16
-
Re: Modify the number of map tasksMohammad Tariq 2012-12-12, 13:22
I wonder how you are able to run the job without a JT. You must have this
on your mapred-site.xml file : <property> <name>mapred.job.tracker</name> <value>localhost:9001</value> </property> Also add "hadoop.tmp.dir" in core-site.xml, and "dfs.name.dir" & "dfs.data.dir" in hdfs-site.xml. Regards, Mohammad Tariq On Wed, Dec 12, 2012 at 6:46 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > For mapred-site.xml : > > <configuration> > > <property> > <name>mapred.map.tasks</name> > <value>6</value> > </property> > > </configuration> > > for core-site.xml : > <configuration> > > <!-- <property> > <name>fs.default.name</name> > <value>hdfs://localhost:9100</value> > </property> --> > > </configuration> > > on hdfs-site.xml nothing > > > > > > 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > >> Can I have a look at your config files? >> >> Regards, >> Mohammad Tariq >> >> >> >> On Wed, Dec 12, 2012 at 6:31 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: >> >>> i run the start-all.sh and all daemons starts without problems. But i >>> the log of the tasktracker look like this : >>> >>> >>> 2012-12-12 13:53:45,495 INFO org.apache.hadoop.mapred.TaskTracker: >>> STARTUP_MSG: >>> /************************************************************ >>> STARTUP_MSG: Starting TaskTracker >>> STARTUP_MSG: host = megdiche-OptiPlex-GX280/127.0.1.1 >>> STARTUP_MSG: args = [] >>> STARTUP_MSG: version = 1.0.4 >>> STARTUP_MSG: build >>> https://svn.apache.org/repos/asf/hadoop/common/branches/branch-1.0 -r >>> 1393290; compiled by 'hortonfo' on Wed Oct 3 05:13:58 UTC 2012 >>> ************************************************************/ >>> 2012-12-12 13:53:47,009 INFO >>> org.apache.hadoop.metrics2.impl.MetricsConfig: loaded properties from >>> hadoop-metrics2.properties >>> 2012-12-12 13:53:47,331 INFO >>> org.apache.hadoop.metrics2.impl.MetricsSourceAdapter: MBean for source >>> MetricsSystem,sub=Stats registered. >>> 2012-12-12 13:53:47,336 INFO >>> org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Scheduled snapshot >>> period at 10 second(s). >>> 2012-12-12 13:53:47,336 INFO >>> org.apache.hadoop.metrics2.impl.MetricsSystemImpl: TaskTracker metrics >>> system started >>> 2012-12-12 13:53:48,165 INFO >>> org.apache.hadoop.metrics2.impl.MetricsSourceAdapter: MBean for source ugi >>> registered. >>> 2012-12-12 13:53:48,192 WARN >>> org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Source name ugi already >>> exists! >>> 2012-12-12 13:53:48,513 ERROR org.apache.hadoop.mapred.TaskTracker: Can >>> not start task tracker because java.lang.IllegalArgumentException: Does not >>> contain a valid host:port authority: local >>> at org.apache.hadoop.net.NetUtils.createSocketAddr(NetUtils.java:162) >>> at org.apache.hadoop.net.NetUtils.createSocketAddr(NetUtils.java:128) >>> at >>> org.apache.hadoop.mapred.JobTracker.getAddress(JobTracker.java:2560) >>> at org.apache.hadoop.mapred.TaskTracker.<init>(TaskTracker.java:1426) >>> at org.apache.hadoop.mapred.TaskTracker.main(TaskTracker.java:3742) >>> >>> 2012-12-12 13:53:48,519 INFO org.apache.hadoop.mapred.TaskTracker: >>> SHUTDOWN_MSG: >>> /************************************************************ >>> SHUTDOWN_MSG: Shutting down TaskTracker at megdiche-OptiPlex-GX280/ >>> 127.0.1.1 >>> ************************************************************/ >>> >>> >>> >>> >>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>> >>>> I would check if all the daemons are running properly or not, before >>>> anything else. If some problem is found, next place to track is the log of >>>> each daemon. >>>> >>>> The correct command to check the status of a job from command line is : >>>> hadoop job -status jobID. >>>> (Mind the 'space' after job and remove 'command' from the statement) >>>> >>>> HTH >>>> >>>> Regards, >>>> Mohammad Tariq >>>> >>>> >>>> >>>> On Wed, Dec 12, 2012 at 6:14 PM, imen Megdiche <[EMAIL PROTECTED] >>>> > wrote: >>>> >>>>> My goal is to analyze the response time of MapReduce depending on the size +
Mohammad Tariq 2012-12-12, 13:22
-
Re: Modify the number of map tasksimen Megdiche 2012-12-12, 13:38
I changed the files
now when i run i have this response : 12/12/12 14:37:33 INFO ipc.Client: Retrying connect to server: localhost/ 127.0.0.1:9001. Already tried 0 time(s). 12/12/12 14:37:34 INFO ipc.Client: Retrying connect to server: localhost/ 127.0.0.1:9001. Already tried 1 time(s). 12/12/12 14:37:35 INFO ipc.Client: Retrying connect to server: localhost/ 127.0.0.1:9001. Already tried 2 time(s). 12/12/12 14:37:36 INFO ipc.Client: Retrying connect to server: localhost/ 127.0.0.1:9001. Already tried 3 time(s). 12/12/12 14:37:37 INFO ipc.Client: Retrying connect to server: localhost/ 127.0.0.1:9001. Already tried 4 time(s). 12/12/12 14:37:38 INFO ipc.Client: Retrying connect to server: localhost/ 127.0.0.1:9001. Already tried 5 time(s). 12/12/12 14:37:39 INFO ipc.Client: Retrying connect to server: localhost/ 127.0.0.1:9001. Already tried 6 time(s). 12/12/12 14:37:40 INFO ipc.Client: Retrying connect to server: localhost/ 127.0.0.1:9001. Already tried 7 time(s). 12/12/12 14:37:41 INFO ipc.Client: Retrying connect to server: localhost/ 127.0.0.1:9001. Already tried 8 time(s). 12/12/12 14:37:42 INFO ipc.Client: Retrying connect to server: localhost/ 127.0.0.1:9001. Already tried 9 time(s). Exception in thread "main" java.net.ConnectException: Call to localhost/ 127.0.0.1:9001 failed on connection exception: java.net.ConnectException: Connexion refusée at org.apache.hadoop.ipc.Client.wrapException(Client.java:1099) at org.apache.hadoop.ipc.Client.call(Client.java:1075) at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:225) at org.apache.hadoop.mapred.$Proxy1.getProtocolVersion(Unknown Source) at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:396) at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:379) at org.apache.hadoop.mapred.JobClient.createRPCProxy(JobClient.java:480) at org.apache.hadoop.mapred.JobClient.init(JobClient.java:474) at org.apache.hadoop.mapred.JobClient.<init>(JobClient.java:457) at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1260) at org.myorg.WordCount.run(WordCount.java:115) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) at org.myorg.WordCount.main(WordCount.java:120) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source) at java.lang.reflect.Method.invoke(Unknown Source) at org.apache.hadoop.util.RunJar.main(RunJar.java:156) Caused by: java.net.ConnectException: Connexion refusée at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) at sun.nio.ch.SocketChannelImpl.finishConnect(Unknown Source) at org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206) at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:489) at org.apache.hadoop.ipc.Client$Connection.setupConnection(Client.java:434) at org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:560) at org.apache.hadoop.ipc.Client$Connection.access$2000(Client.java:184) at org.apache.hadoop.ipc.Client.getConnection(Client.java:1206) at org.apache.hadoop.ipc.Client.call(Client.java:1050) ... 16 more 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > dfs.name.dir +
imen Megdiche 2012-12-12, 13:38
-
Re: Modify the number of map tasksMohammad Tariq 2012-12-12, 13:48
Uncomment the property in core-site.xml. That is a must. After doing this
you have to restart the daemons? Regards, Mohammad Tariq On Wed, Dec 12, 2012 at 7:08 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > I changed the files > now when i run i have this response : > > 12/12/12 14:37:33 INFO ipc.Client: Retrying connect to server: localhost/ > 127.0.0.1:9001. Already tried 0 time(s). > 12/12/12 14:37:34 INFO ipc.Client: Retrying connect to server: localhost/ > 127.0.0.1:9001. Already tried 1 time(s). > 12/12/12 14:37:35 INFO ipc.Client: Retrying connect to server: localhost/ > 127.0.0.1:9001. Already tried 2 time(s). > 12/12/12 14:37:36 INFO ipc.Client: Retrying connect to server: localhost/ > 127.0.0.1:9001. Already tried 3 time(s). > 12/12/12 14:37:37 INFO ipc.Client: Retrying connect to server: localhost/ > 127.0.0.1:9001. Already tried 4 time(s). > 12/12/12 14:37:38 INFO ipc.Client: Retrying connect to server: localhost/ > 127.0.0.1:9001. Already tried 5 time(s). > 12/12/12 14:37:39 INFO ipc.Client: Retrying connect to server: localhost/ > 127.0.0.1:9001. Already tried 6 time(s). > 12/12/12 14:37:40 INFO ipc.Client: Retrying connect to server: localhost/ > 127.0.0.1:9001. Already tried 7 time(s). > 12/12/12 14:37:41 INFO ipc.Client: Retrying connect to server: localhost/ > 127.0.0.1:9001. Already tried 8 time(s). > 12/12/12 14:37:42 INFO ipc.Client: Retrying connect to server: localhost/ > 127.0.0.1:9001. Already tried 9 time(s). > Exception in thread "main" java.net.ConnectException: Call to localhost/ > 127.0.0.1:9001 failed on connection exception: java.net.ConnectException: > Connexion refusée > at org.apache.hadoop.ipc.Client.wrapException(Client.java:1099) > at org.apache.hadoop.ipc.Client.call(Client.java:1075) > at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:225) > at org.apache.hadoop.mapred.$Proxy1.getProtocolVersion(Unknown Source) > at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:396) > at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:379) > at > org.apache.hadoop.mapred.JobClient.createRPCProxy(JobClient.java:480) > at org.apache.hadoop.mapred.JobClient.init(JobClient.java:474) > at org.apache.hadoop.mapred.JobClient.<init>(JobClient.java:457) > at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1260) > at org.myorg.WordCount.run(WordCount.java:115) > at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) > at org.myorg.WordCount.main(WordCount.java:120) > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source) > at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source) > at java.lang.reflect.Method.invoke(Unknown Source) > at org.apache.hadoop.util.RunJar.main(RunJar.java:156) > Caused by: java.net.ConnectException: Connexion refusée > at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) > at sun.nio.ch.SocketChannelImpl.finishConnect(Unknown Source) > at > org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206) > at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:489) > at > org.apache.hadoop.ipc.Client$Connection.setupConnection(Client.java:434) > at > org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:560) > at org.apache.hadoop.ipc.Client$Connection.access$2000(Client.java:184) > at org.apache.hadoop.ipc.Client.getConnection(Client.java:1206) > at org.apache.hadoop.ipc.Client.call(Client.java:1050) > ... 16 more > > > 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > >> dfs.name.dir > > > +
Mohammad Tariq 2012-12-12, 13:48
-
Re: Modify the number of map tasksimen Megdiche 2012-12-12, 14:01
thank you very much you re awsome.
Fixed 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > Uncomment the property in core-site.xml. That is a must. After doing this > you have to restart the daemons? > > Regards, > Mohammad Tariq > > > > On Wed, Dec 12, 2012 at 7:08 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > >> I changed the files >> now when i run i have this response : >> >> 12/12/12 14:37:33 INFO ipc.Client: Retrying connect to server: localhost/ >> 127.0.0.1:9001. Already tried 0 time(s). >> 12/12/12 14:37:34 INFO ipc.Client: Retrying connect to server: localhost/ >> 127.0.0.1:9001. Already tried 1 time(s). >> 12/12/12 14:37:35 INFO ipc.Client: Retrying connect to server: localhost/ >> 127.0.0.1:9001. Already tried 2 time(s). >> 12/12/12 14:37:36 INFO ipc.Client: Retrying connect to server: localhost/ >> 127.0.0.1:9001. Already tried 3 time(s). >> 12/12/12 14:37:37 INFO ipc.Client: Retrying connect to server: localhost/ >> 127.0.0.1:9001. Already tried 4 time(s). >> 12/12/12 14:37:38 INFO ipc.Client: Retrying connect to server: localhost/ >> 127.0.0.1:9001. Already tried 5 time(s). >> 12/12/12 14:37:39 INFO ipc.Client: Retrying connect to server: localhost/ >> 127.0.0.1:9001. Already tried 6 time(s). >> 12/12/12 14:37:40 INFO ipc.Client: Retrying connect to server: localhost/ >> 127.0.0.1:9001. Already tried 7 time(s). >> 12/12/12 14:37:41 INFO ipc.Client: Retrying connect to server: localhost/ >> 127.0.0.1:9001. Already tried 8 time(s). >> 12/12/12 14:37:42 INFO ipc.Client: Retrying connect to server: localhost/ >> 127.0.0.1:9001. Already tried 9 time(s). >> Exception in thread "main" java.net.ConnectException: Call to localhost/ >> 127.0.0.1:9001 failed on connection exception: >> java.net.ConnectException: Connexion refusée >> at org.apache.hadoop.ipc.Client.wrapException(Client.java:1099) >> at org.apache.hadoop.ipc.Client.call(Client.java:1075) >> at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:225) >> at org.apache.hadoop.mapred.$Proxy1.getProtocolVersion(Unknown Source) >> at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:396) >> at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:379) >> at >> org.apache.hadoop.mapred.JobClient.createRPCProxy(JobClient.java:480) >> at org.apache.hadoop.mapred.JobClient.init(JobClient.java:474) >> at org.apache.hadoop.mapred.JobClient.<init>(JobClient.java:457) >> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1260) >> at org.myorg.WordCount.run(WordCount.java:115) >> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) >> at org.myorg.WordCount.main(WordCount.java:120) >> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) >> at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source) >> at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source) >> at java.lang.reflect.Method.invoke(Unknown Source) >> at org.apache.hadoop.util.RunJar.main(RunJar.java:156) >> Caused by: java.net.ConnectException: Connexion refusée >> at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) >> at sun.nio.ch.SocketChannelImpl.finishConnect(Unknown Source) >> at >> org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206) >> at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:489) >> at >> org.apache.hadoop.ipc.Client$Connection.setupConnection(Client.java:434) >> at >> org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:560) >> at >> org.apache.hadoop.ipc.Client$Connection.access$2000(Client.java:184) >> at org.apache.hadoop.ipc.Client.getConnection(Client.java:1206) >> at org.apache.hadoop.ipc.Client.call(Client.java:1050) >> ... 16 more >> >> >> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >> >>> dfs.name.dir >> >> >> > +
imen Megdiche 2012-12-12, 14:01
-
Re: Modify the number of map tasksMohammad Tariq 2012-12-12, 14:07
You are always welcome. If you still need any help, you can go here :
http://cloudfront.blogspot.in/2012/07/how-to-configure-hadoop.html I have outlined the entire process here along with few small(but necessary) explanations. Regards, Mohammad Tariq On Wed, Dec 12, 2012 at 7:31 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > thank you very much you re awsome. > > Fixed > > > 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > >> Uncomment the property in core-site.xml. That is a must. After doing this >> you have to restart the daemons? >> >> Regards, >> Mohammad Tariq >> >> >> >> On Wed, Dec 12, 2012 at 7:08 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: >> >>> I changed the files >>> now when i run i have this response : >>> >>> 12/12/12 14:37:33 INFO ipc.Client: Retrying connect to server: localhost/ >>> 127.0.0.1:9001. Already tried 0 time(s). >>> 12/12/12 14:37:34 INFO ipc.Client: Retrying connect to server: localhost/ >>> 127.0.0.1:9001. Already tried 1 time(s). >>> 12/12/12 14:37:35 INFO ipc.Client: Retrying connect to server: localhost/ >>> 127.0.0.1:9001. Already tried 2 time(s). >>> 12/12/12 14:37:36 INFO ipc.Client: Retrying connect to server: localhost/ >>> 127.0.0.1:9001. Already tried 3 time(s). >>> 12/12/12 14:37:37 INFO ipc.Client: Retrying connect to server: localhost/ >>> 127.0.0.1:9001. Already tried 4 time(s). >>> 12/12/12 14:37:38 INFO ipc.Client: Retrying connect to server: localhost/ >>> 127.0.0.1:9001. Already tried 5 time(s). >>> 12/12/12 14:37:39 INFO ipc.Client: Retrying connect to server: localhost/ >>> 127.0.0.1:9001. Already tried 6 time(s). >>> 12/12/12 14:37:40 INFO ipc.Client: Retrying connect to server: localhost/ >>> 127.0.0.1:9001. Already tried 7 time(s). >>> 12/12/12 14:37:41 INFO ipc.Client: Retrying connect to server: localhost/ >>> 127.0.0.1:9001. Already tried 8 time(s). >>> 12/12/12 14:37:42 INFO ipc.Client: Retrying connect to server: localhost/ >>> 127.0.0.1:9001. Already tried 9 time(s). >>> Exception in thread "main" java.net.ConnectException: Call to localhost/ >>> 127.0.0.1:9001 failed on connection exception: >>> java.net.ConnectException: Connexion refusée >>> at org.apache.hadoop.ipc.Client.wrapException(Client.java:1099) >>> at org.apache.hadoop.ipc.Client.call(Client.java:1075) >>> at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:225) >>> at org.apache.hadoop.mapred.$Proxy1.getProtocolVersion(Unknown >>> Source) >>> at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:396) >>> at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:379) >>> at >>> org.apache.hadoop.mapred.JobClient.createRPCProxy(JobClient.java:480) >>> at org.apache.hadoop.mapred.JobClient.init(JobClient.java:474) >>> at org.apache.hadoop.mapred.JobClient.<init>(JobClient.java:457) >>> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1260) >>> at org.myorg.WordCount.run(WordCount.java:115) >>> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) >>> at org.myorg.WordCount.main(WordCount.java:120) >>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) >>> at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source) >>> at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source) >>> at java.lang.reflect.Method.invoke(Unknown Source) >>> at org.apache.hadoop.util.RunJar.main(RunJar.java:156) >>> Caused by: java.net.ConnectException: Connexion refusée >>> at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) >>> at sun.nio.ch.SocketChannelImpl.finishConnect(Unknown Source) >>> at >>> org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206) >>> at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:489) >>> at >>> org.apache.hadoop.ipc.Client$Connection.setupConnection(Client.java:434) >>> at >>> org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:560) >>> at >>> org.apache.hadoop.ipc.Client$Connection.access$2000(Client.java:184) +
Mohammad Tariq 2012-12-12, 14:07
-
Re: Modify the number of map tasksimen Megdiche 2012-12-12, 14:15
have you please commented the configuration of hadoop on cluster
thanks 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > You are always welcome. If you still need any help, you can go here : > http://cloudfront.blogspot.in/2012/07/how-to-configure-hadoop.html > I have outlined the entire process here along with few small(but > necessary) explanations. > > Regards, > Mohammad Tariq > > > > On Wed, Dec 12, 2012 at 7:31 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > >> thank you very much you re awsome. >> >> Fixed >> >> >> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >> >>> Uncomment the property in core-site.xml. That is a must. After doing >>> this you have to restart the daemons? >>> >>> Regards, >>> Mohammad Tariq >>> >>> >>> >>> On Wed, Dec 12, 2012 at 7:08 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: >>> >>>> I changed the files >>>> now when i run i have this response : >>>> >>>> 12/12/12 14:37:33 INFO ipc.Client: Retrying connect to server: >>>> localhost/127.0.0.1:9001. Already tried 0 time(s). >>>> 12/12/12 14:37:34 INFO ipc.Client: Retrying connect to server: >>>> localhost/127.0.0.1:9001. Already tried 1 time(s). >>>> 12/12/12 14:37:35 INFO ipc.Client: Retrying connect to server: >>>> localhost/127.0.0.1:9001. Already tried 2 time(s). >>>> 12/12/12 14:37:36 INFO ipc.Client: Retrying connect to server: >>>> localhost/127.0.0.1:9001. Already tried 3 time(s). >>>> 12/12/12 14:37:37 INFO ipc.Client: Retrying connect to server: >>>> localhost/127.0.0.1:9001. Already tried 4 time(s). >>>> 12/12/12 14:37:38 INFO ipc.Client: Retrying connect to server: >>>> localhost/127.0.0.1:9001. Already tried 5 time(s). >>>> 12/12/12 14:37:39 INFO ipc.Client: Retrying connect to server: >>>> localhost/127.0.0.1:9001. Already tried 6 time(s). >>>> 12/12/12 14:37:40 INFO ipc.Client: Retrying connect to server: >>>> localhost/127.0.0.1:9001. Already tried 7 time(s). >>>> 12/12/12 14:37:41 INFO ipc.Client: Retrying connect to server: >>>> localhost/127.0.0.1:9001. Already tried 8 time(s). >>>> 12/12/12 14:37:42 INFO ipc.Client: Retrying connect to server: >>>> localhost/127.0.0.1:9001. Already tried 9 time(s). >>>> Exception in thread "main" java.net.ConnectException: Call to localhost/ >>>> 127.0.0.1:9001 failed on connection exception: >>>> java.net.ConnectException: Connexion refusée >>>> at org.apache.hadoop.ipc.Client.wrapException(Client.java:1099) >>>> at org.apache.hadoop.ipc.Client.call(Client.java:1075) >>>> at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:225) >>>> at org.apache.hadoop.mapred.$Proxy1.getProtocolVersion(Unknown >>>> Source) >>>> at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:396) >>>> at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:379) >>>> at >>>> org.apache.hadoop.mapred.JobClient.createRPCProxy(JobClient.java:480) >>>> at org.apache.hadoop.mapred.JobClient.init(JobClient.java:474) >>>> at org.apache.hadoop.mapred.JobClient.<init>(JobClient.java:457) >>>> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1260) >>>> at org.myorg.WordCount.run(WordCount.java:115) >>>> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) >>>> at org.myorg.WordCount.main(WordCount.java:120) >>>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) >>>> at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source) >>>> at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source) >>>> at java.lang.reflect.Method.invoke(Unknown Source) >>>> at org.apache.hadoop.util.RunJar.main(RunJar.java:156) >>>> Caused by: java.net.ConnectException: Connexion refusée >>>> at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) >>>> at sun.nio.ch.SocketChannelImpl.finishConnect(Unknown Source) >>>> at >>>> org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206) >>>> at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:489) >>>> at >>>> org.apache.hadoop.ipc.Client$Connection.setupConnection(Client.java:434) +
imen Megdiche 2012-12-12, 14:15
-
Re: Modify the number of map tasksMohammad Tariq 2012-12-12, 14:48
Hi Imen,
I am sorry, I didn't get the question. Are you asking about creating a distributed cluster? Yeah, I have done that. Regards, Mohammad Tariq On Wed, Dec 12, 2012 at 7:45 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > have you please commented the configuration of hadoop on cluster > > thanks > > > 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > >> You are always welcome. If you still need any help, you can go here : >> http://cloudfront.blogspot.in/2012/07/how-to-configure-hadoop.html >> I have outlined the entire process here along with few small(but >> necessary) explanations. >> >> Regards, >> Mohammad Tariq >> >> >> >> On Wed, Dec 12, 2012 at 7:31 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: >> >>> thank you very much you re awsome. >>> >>> Fixed >>> >>> >>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>> >>>> Uncomment the property in core-site.xml. That is a must. After doing >>>> this you have to restart the daemons? >>>> >>>> Regards, >>>> Mohammad Tariq >>>> >>>> >>>> >>>> On Wed, Dec 12, 2012 at 7:08 PM, imen Megdiche <[EMAIL PROTECTED] >>>> > wrote: >>>> >>>>> I changed the files >>>>> now when i run i have this response : >>>>> >>>>> 12/12/12 14:37:33 INFO ipc.Client: Retrying connect to server: >>>>> localhost/127.0.0.1:9001. Already tried 0 time(s). >>>>> 12/12/12 14:37:34 INFO ipc.Client: Retrying connect to server: >>>>> localhost/127.0.0.1:9001. Already tried 1 time(s). >>>>> 12/12/12 14:37:35 INFO ipc.Client: Retrying connect to server: >>>>> localhost/127.0.0.1:9001. Already tried 2 time(s). >>>>> 12/12/12 14:37:36 INFO ipc.Client: Retrying connect to server: >>>>> localhost/127.0.0.1:9001. Already tried 3 time(s). >>>>> 12/12/12 14:37:37 INFO ipc.Client: Retrying connect to server: >>>>> localhost/127.0.0.1:9001. Already tried 4 time(s). >>>>> 12/12/12 14:37:38 INFO ipc.Client: Retrying connect to server: >>>>> localhost/127.0.0.1:9001. Already tried 5 time(s). >>>>> 12/12/12 14:37:39 INFO ipc.Client: Retrying connect to server: >>>>> localhost/127.0.0.1:9001. Already tried 6 time(s). >>>>> 12/12/12 14:37:40 INFO ipc.Client: Retrying connect to server: >>>>> localhost/127.0.0.1:9001. Already tried 7 time(s). >>>>> 12/12/12 14:37:41 INFO ipc.Client: Retrying connect to server: >>>>> localhost/127.0.0.1:9001. Already tried 8 time(s). >>>>> 12/12/12 14:37:42 INFO ipc.Client: Retrying connect to server: >>>>> localhost/127.0.0.1:9001. Already tried 9 time(s). >>>>> Exception in thread "main" java.net.ConnectException: Call to >>>>> localhost/127.0.0.1:9001 failed on connection exception: >>>>> java.net.ConnectException: Connexion refusée >>>>> at org.apache.hadoop.ipc.Client.wrapException(Client.java:1099) >>>>> at org.apache.hadoop.ipc.Client.call(Client.java:1075) >>>>> at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:225) >>>>> at org.apache.hadoop.mapred.$Proxy1.getProtocolVersion(Unknown >>>>> Source) >>>>> at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:396) >>>>> at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:379) >>>>> at >>>>> org.apache.hadoop.mapred.JobClient.createRPCProxy(JobClient.java:480) >>>>> at org.apache.hadoop.mapred.JobClient.init(JobClient.java:474) >>>>> at org.apache.hadoop.mapred.JobClient.<init>(JobClient.java:457) >>>>> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1260) >>>>> at org.myorg.WordCount.run(WordCount.java:115) >>>>> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) >>>>> at org.myorg.WordCount.main(WordCount.java:120) >>>>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) >>>>> at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source) >>>>> at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source) >>>>> at java.lang.reflect.Method.invoke(Unknown Source) >>>>> at org.apache.hadoop.util.RunJar.main(RunJar.java:156) >>>>> Caused by: java.net.ConnectException: Connexion refusée >>>>> at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) +
Mohammad Tariq 2012-12-12, 14:48
-
Re: Modify the number of map tasksimen Megdiche 2012-12-12, 15:25
have you a page in which you explain the steps.
2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > Hi Imen, > > I am sorry, I didn't get the question. Are you asking about > creating a distributed cluster? Yeah, I have done that. > > Regards, > Mohammad Tariq > > > > On Wed, Dec 12, 2012 at 7:45 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > >> have you please commented the configuration of hadoop on cluster >> >> thanks >> >> >> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >> >>> You are always welcome. If you still need any help, you can go here : >>> http://cloudfront.blogspot.in/2012/07/how-to-configure-hadoop.html >>> I have outlined the entire process here along with few small(but >>> necessary) explanations. >>> >>> Regards, >>> Mohammad Tariq >>> >>> >>> >>> On Wed, Dec 12, 2012 at 7:31 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: >>> >>>> thank you very much you re awsome. >>>> >>>> Fixed >>>> >>>> >>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>> >>>>> Uncomment the property in core-site.xml. That is a must. After doing >>>>> this you have to restart the daemons? >>>>> >>>>> Regards, >>>>> Mohammad Tariq >>>>> >>>>> >>>>> >>>>> On Wed, Dec 12, 2012 at 7:08 PM, imen Megdiche < >>>>> [EMAIL PROTECTED]> wrote: >>>>> >>>>>> I changed the files >>>>>> now when i run i have this response : >>>>>> >>>>>> 12/12/12 14:37:33 INFO ipc.Client: Retrying connect to server: >>>>>> localhost/127.0.0.1:9001. Already tried 0 time(s). >>>>>> 12/12/12 14:37:34 INFO ipc.Client: Retrying connect to server: >>>>>> localhost/127.0.0.1:9001. Already tried 1 time(s). >>>>>> 12/12/12 14:37:35 INFO ipc.Client: Retrying connect to server: >>>>>> localhost/127.0.0.1:9001. Already tried 2 time(s). >>>>>> 12/12/12 14:37:36 INFO ipc.Client: Retrying connect to server: >>>>>> localhost/127.0.0.1:9001. Already tried 3 time(s). >>>>>> 12/12/12 14:37:37 INFO ipc.Client: Retrying connect to server: >>>>>> localhost/127.0.0.1:9001. Already tried 4 time(s). >>>>>> 12/12/12 14:37:38 INFO ipc.Client: Retrying connect to server: >>>>>> localhost/127.0.0.1:9001. Already tried 5 time(s). >>>>>> 12/12/12 14:37:39 INFO ipc.Client: Retrying connect to server: >>>>>> localhost/127.0.0.1:9001. Already tried 6 time(s). >>>>>> 12/12/12 14:37:40 INFO ipc.Client: Retrying connect to server: >>>>>> localhost/127.0.0.1:9001. Already tried 7 time(s). >>>>>> 12/12/12 14:37:41 INFO ipc.Client: Retrying connect to server: >>>>>> localhost/127.0.0.1:9001. Already tried 8 time(s). >>>>>> 12/12/12 14:37:42 INFO ipc.Client: Retrying connect to server: >>>>>> localhost/127.0.0.1:9001. Already tried 9 time(s). >>>>>> Exception in thread "main" java.net.ConnectException: Call to >>>>>> localhost/127.0.0.1:9001 failed on connection exception: >>>>>> java.net.ConnectException: Connexion refusée >>>>>> at org.apache.hadoop.ipc.Client.wrapException(Client.java:1099) >>>>>> at org.apache.hadoop.ipc.Client.call(Client.java:1075) >>>>>> at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:225) >>>>>> at org.apache.hadoop.mapred.$Proxy1.getProtocolVersion(Unknown >>>>>> Source) >>>>>> at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:396) >>>>>> at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:379) >>>>>> at >>>>>> org.apache.hadoop.mapred.JobClient.createRPCProxy(JobClient.java:480) >>>>>> at org.apache.hadoop.mapred.JobClient.init(JobClient.java:474) >>>>>> at org.apache.hadoop.mapred.JobClient.<init>(JobClient.java:457) >>>>>> at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1260) >>>>>> at org.myorg.WordCount.run(WordCount.java:115) >>>>>> at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) >>>>>> at org.myorg.WordCount.main(WordCount.java:120) >>>>>> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) >>>>>> at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source) >>>>>> at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source) >>>>>> at java.lang.reflect.Method.invoke(Unknown Source) +
imen Megdiche 2012-12-12, 15:25
-
Re: Modify the number of map tasksMohammad Tariq 2012-12-12, 15:37
No. But the process is pretty simple. You just have to copy the set of conf
file across all the machines after a few minor changes. The process goes like this : Add the IPs and hostnames of each machines into the /etc/hosts file of rest of the files. Make ssh(passwordless) work across all the machines. Modify the core-site.xml, hdfs-site.xml and mapred-site.xml(as specified above). Add the name of all slave machines to the /conf/slaves file and the machine name, where you want to run the Secondary NameNode, in the /etc/master file. Copy these files across all the machines in your cluster and you are good to go. For a detailed info you can go here : http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-multi-node-cluster/ Regards, Mohammad Tariq On Wed, Dec 12, 2012 at 8:55 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: > have you a page in which you explain the steps. > > > > 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> > >> Hi Imen, >> >> I am sorry, I didn't get the question. Are you asking about >> creating a distributed cluster? Yeah, I have done that. >> >> Regards, >> Mohammad Tariq >> >> >> >> On Wed, Dec 12, 2012 at 7:45 PM, imen Megdiche <[EMAIL PROTECTED]>wrote: >> >>> have you please commented the configuration of hadoop on cluster >>> >>> thanks >>> >>> >>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>> >>>> You are always welcome. If you still need any help, you can go here : >>>> http://cloudfront.blogspot.in/2012/07/how-to-configure-hadoop.html >>>> I have outlined the entire process here along with few small(but >>>> necessary) explanations. >>>> >>>> Regards, >>>> Mohammad Tariq >>>> >>>> >>>> >>>> On Wed, Dec 12, 2012 at 7:31 PM, imen Megdiche <[EMAIL PROTECTED] >>>> > wrote: >>>> >>>>> thank you very much you re awsome. >>>>> >>>>> Fixed >>>>> >>>>> >>>>> 2012/12/12 Mohammad Tariq <[EMAIL PROTECTED]> >>>>> >>>>>> Uncomment the property in core-site.xml. That is a must. After doing >>>>>> this you have to restart the daemons? >>>>>> >>>>>> Regards, >>>>>> Mohammad Tariq >>>>>> >>>>>> >>>>>> >>>>>> On Wed, Dec 12, 2012 at 7:08 PM, imen Megdiche < >>>>>> [EMAIL PROTECTED]> wrote: >>>>>> >>>>>>> I changed the files >>>>>>> now when i run i have this response : >>>>>>> >>>>>>> 12/12/12 14:37:33 INFO ipc.Client: Retrying connect to server: >>>>>>> localhost/127.0.0.1:9001. Already tried 0 time(s). >>>>>>> 12/12/12 14:37:34 INFO ipc.Client: Retrying connect to server: >>>>>>> localhost/127.0.0.1:9001. Already tried 1 time(s). >>>>>>> 12/12/12 14:37:35 INFO ipc.Client: Retrying connect to server: >>>>>>> localhost/127.0.0.1:9001. Already tried 2 time(s). >>>>>>> 12/12/12 14:37:36 INFO ipc.Client: Retrying connect to server: >>>>>>> localhost/127.0.0.1:9001. Already tried 3 time(s). >>>>>>> 12/12/12 14:37:37 INFO ipc.Client: Retrying connect to server: >>>>>>> localhost/127.0.0.1:9001. Already tried 4 time(s). >>>>>>> 12/12/12 14:37:38 INFO ipc.Client: Retrying connect to server: >>>>>>> localhost/127.0.0.1:9001. Already tried 5 time(s). >>>>>>> 12/12/12 14:37:39 INFO ipc.Client: Retrying connect to server: >>>>>>> localhost/127.0.0.1:9001. Already tried 6 time(s). >>>>>>> 12/12/12 14:37:40 INFO ipc.Client: Retrying connect to server: >>>>>>> localhost/127.0.0.1:9001. Already tried 7 time(s). >>>>>>> 12/12/12 14:37:41 INFO ipc.Client: Retrying connect to server: >>>>>>> localhost/127.0.0.1:9001. Already tried 8 time(s). >>>>>>> 12/12/12 14:37:42 INFO ipc.Client: Retrying connect to server: >>>>>>> localhost/127.0.0.1:9001. Already tried 9 time(s). >>>>>>> Exception in thread "main" java.net.ConnectException: Call to >>>>>>> localhost/127.0.0.1:9001 failed on connection exception: >>>>>>> java.net.ConnectException: Connexion refusée >>>>>>> at org.apache.hadoop.ipc.Client.wrapException(Client.java:1099) >>>>>>> at org.apache.hadoop.ipc.Client.call(Client.java:1075) >>>>>>> at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:225) >> +
Mohammad Tariq 2012-12-12, 15:37
|