Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
MapReduce >> mail # user >> CompositeInputFormat


Copy link to this message
-
RE: CompositeInputFormat
Sorry I should've specified that I need an example of CompositeInputFormat that uses the new API.
The example linked below uses old API objects like JobConf.

Any known examples of CompositeInputFormat using the new API?

Thanks in advance,

Andrew

From: Jay Vyas [mailto:[EMAIL PROTECTED]]
Sent: Thursday, July 11, 2013 5:10 PM
To: [EMAIL PROTECTED]
Subject: Re: CompositeInputFormat

Map Side joins will use the CompositeInputFormat.  They will only really be worth doing if one data set is small, and the other is large.
This is a good example : http://www.congiu.com/joins-in-hadoop-using-compositeinputformat/
the trick is to google for CompositeInputFormat.compose() .... :)

On Thu, Jul 11, 2013 at 5:02 PM, Botelho, Andrew <[EMAIL PROTECTED]<mailto:[EMAIL PROTECTED]>> wrote:
Hi,

I want to perform a JOIN on two sets of data with Hadoop.  I read that the class CompositeInputFormat can be used to perform joins on data, but I can't find any examples of how to do it.
Could someone help me out? It would be much appreciated. :)

Thanks in advance,

Andrew

--
Jay Vyas
http://jayunit100.blogspot.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB