|
|
+
Hemanth Yamijala 2013-01-14, 05:07
-
Re: Compile error using contrib.utils.join package with new mapreduce APIHemanth Yamijala 2013-01-14, 14:15
Hi,
No. I didn't find any reference to a working sample. I also didn't find any JIRA that asks for a migration of this package to the new API. Not sure why. I have asked on the dev list. Thanks hemanth On Mon, Jan 14, 2013 at 6:25 PM, Michael Forage < [EMAIL PROTECTED]> wrote: > Thanks Hemanth**** > > ** ** > > I appreciate your response**** > > Did you find any working example of it in use? It looks to me like I’d > still be tied to the old API**** > > Thanks**** > > Mike**** > > ** ** > > *From:* Hemanth Yamijala [mailto:[EMAIL PROTECTED]] > *Sent:* 14 January 2013 05:08 > *To:* [EMAIL PROTECTED] > *Subject:* Re: Compile error using contrib.utils.join package with new > mapreduce API**** > > ** ** > > Hi,**** > > ** ** > > The datajoin package has a class called DataJoinJob ( > http://hadoop.apache.org/docs/stable/api/org/apache/hadoop/contrib/utils/join/DataJoinJob.html > )**** > > ** ** > > I think using this will help you get around the issue you are facing.**** > > ** ** > > From the source, this is the command line usage of the class:**** > > ** ** > > usage: DataJoinJob inputdirs outputdir map_input_file_format numofParts > mapper_class reducer_class map_output_value_class output_value_class > [maxNumOfValuesPerGroup [descriptionOfJob]]]**** > > ** ** > > Internally the class uses the old API to set the mapper and reducer passed > as arguments above.**** > > ** ** > > Thanks**** > > hemanth**** > > ** ** > > ** ** > > ** ** > > On Fri, Jan 11, 2013 at 9:00 PM, Michael Forage < > [EMAIL PROTECTED]> wrote:**** > > Hi**** > > **** > > I’m using Hadoop 1.0.4 and using the hadoop.mapreduce API having problems > compiling a simple class to implement a reduce-side data join of 2 files.* > *** > > I’m trying to do this using contrib.utils.join and in Eclipse it all > compiles fine other than:**** > > **** > > job.*setMapperClass*(MapClass.*class*);**** > > job.*setReducerClass*(Reduce.*class*);**** > > **** > > …which both complain that the referenced class no longer extends either > Mapper<> or Reducer<>**** > > It’s my understanding that for what they should instead extend DataJoinMapperBase > and DataJoinReducerBase in order **** > > **** > > Have searched for a solution everywhere but unfortunately, all the > examples I can find are based on the deprecated mapred API.**** > > Assuming this package actually works with the new API, can anyone offer > any advice?**** > > **** > > Complete compile errors:**** > > **** > > The method setMapperClass(Class<? extends Mapper>) in the type Job is not > applicable for the arguments (Class<DataJoin.MapClass>)**** > > The method setReducerClass(Class<? extends Reducer>) in the type Job is > not applicable for the arguments (Class<DataJoin.Reduce>)**** > > **** > > …and the code…**** > > **** > > *package* JoinTest;**** > > **** > > *import* java.io.DataInput;**** > > *import* java.io.DataOutput;**** > > *import* java.io.IOException;**** > > *import* java.util.Iterator;**** > > **** > > *import* org.apache.hadoop.conf.Configuration;**** > > *import* org.apache.hadoop.conf.Configured;**** > > *import* org.apache.hadoop.fs.Path;**** > > *import* org.apache.hadoop.io.LongWritable;**** > > *import* org.apache.hadoop.io.Text;**** > > *import* org.apache.hadoop.io.Writable;**** > > *import* org.apache.hadoop.mapreduce.Job;**** > > *import* org.apache.hadoop.mapreduce.Mapper;**** > > *import* org.apache.hadoop.mapreduce.Reducer;**** > > *import* org.apache.hadoop.mapreduce.Mapper.Context;**** > > *import* org.apache.hadoop.mapreduce.lib.input.FileInputFormat;**** > > *import* org.apache.hadoop.mapreduce.lib.input.TextInputFormat;**** > > *import* org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;**** > > *import* org.apache.hadoop.mapreduce.lib.output.TextOutputFormat;**** > > *import* org.apache.hadoop.util.Tool;**** > > *import* org.apache.hadoop.util.ToolRunner;**** > > **** > > *import* org.apache.hadoop.contrib.utils.join.DataJoinMapperBase;**** |