-Re: Hadoop's datajoin
Hemanth Yamijala 2010-07-12, 08:21
> I am trying to use the hadoop's datajoin for joining two relation. According to
> the Readme file of datajoin, it gives the following syntax:
> $HADOOP_HOME/bin/hadoop jar hadoop-datajoin-examples.jar
> org.apache.hadoop.contrib.utils.join.DataJoinJob datajoin/input datajoin/output
> Text 1 org.apache.hadoop.contrib.utils.join.SampleDataJoinMapper
> org.apache.hadoop.contrib.utils.join.SampleTaggedMapOutput Text
> But I do not find hadoop-datajoin-examples.jar anywhere in my Hadoop_home. Can
> anyone tell me how to produce it or where to find it?
Datajoin is a contrib module. So, you will typically find it under
contrib/datajoin/. The name could something slightly different - it
could have a version number and other things.
> Thanks in advance.