Serge Blazhievsky 2012-02-08, 20:24
-Re: Multiple Input for Avro jobs
Scott Carey 2012-02-08, 21:26
If you are after only multiple paths, path globs work.
For example to read both /logs/2012_01 and /logs/2012_02 use the glob path:
And to read the four paths /logs/2011_01, /logs/2011_02/, logs/2012_01, and
'*' is a wildcard as well, e.g. /logs/2011_*/
If you need a mapper instance per directory or different split assignment
there would be more work involved.
On 2/8/12 12:24 PM, "Serge Blazhievsky" <[EMAIL PROTECTED]> wrote:
> Hi all,
> I am trying to assign different mapper to different folders.
> Is there an equivalent of Multiinputs for avro
> MultipleInputs.addInputPath(job, new Path(input),
> AvroInputFormat<GenericRecord>.class, MapImpl.class);
Serge Blazhievsky 2012-02-08, 22:45
Scott Carey 2012-02-08, 22:53
Serge Blazhievsky 2012-02-08, 23:01