Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Hadoop >> mail # user >> Hadoop's datajoin

Copy link to this message
Re: Hadoop's datajoin

> I am trying to use the hadoop's datajoin for joining two relation. According to
> the Readme file of datajoin, it gives the following syntax:
> $HADOOP_HOME/bin/hadoop jar hadoop-datajoin-examples.jar
> org.apache.hadoop.contrib.utils.join.DataJoinJob datajoin/input  datajoin/output
> Text 1  org.apache.hadoop.contrib.utils.join.SampleDataJoinMapper
> org.apache.hadoop.contrib.utils.join.SampleDataJoinReducer
> org.apache.hadoop.contrib.utils.join.SampleTaggedMapOutput Text
> But I do not find hadoop-datajoin-examples.jar anywhere in my Hadoop_home. Can
> anyone tell me how to produce it or where to find it?

Datajoin is a contrib module. So, you will typically find it under
contrib/datajoin/. The name could something slightly different - it
could have a version number and other things.

> Thanks in advance.