Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Hadoop, mail # user - Different input streams


Copy link to this message
-
Re: Different input streams
Harsh J 2012-05-01, 08:57
Ajay,

Take a look at MultipleInputs: See Page 214 | Chapter 7: MapReduce
Types and Formats  of Hadoop: The Definitive Guide (2nd edition) by
Tom White (O'Reilly) and also
http://hadoop.apache.org/common/docs/current/api/org/apache/hadoop/mapreduce/lib/input/MultipleInputs.html

This class will solve your need, just use a common mapper with them.

On Tue, May 1, 2012 at 1:32 PM, Ajay Srivastava
<[EMAIL PROTECTED]> wrote:
> Hi,
>
> If there are two inputs to a hadoop job one is text and another is binary (Sequence file), is there a way to set InputFormatClass to these two different streams ?
> job.setInputFormatClass will set to one type of input. Does that mean a hadoop job can not take input in two different formats?
>
>
>
> Thanks.
> Ajay Srivastava

--
Harsh J