Hadoop, mail # user - CombineFileInputFormat in 0.20.2 version

Re: CombineFileInputFormat in 0.20.2 version
Aaron Kimball 2010-03-16, 17:34
The most obvious workaround is to use the old API (continue to use Mapper,
Reducer, etc. from org.apache.hadoop.mapred, not .mapreduce).
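
To illustrate why staying on the old API sidesteps the problem, here is a minimal sketch that does not use Hadoop at all. Every type in it is a hypothetical stand-in: OldInputFormat and NewInputFormat model the two unrelated InputFormat hierarchies (an interface in org.apache.hadoop.mapred, an abstract class in org.apache.hadoop.mapreduce), OldCombineFormat models the mapred.lib CombineFileInputFormat, and OldJobConf and NewJob model the bounded Class parameters of JobConf.setInputFormat and Job.setInputFormatClass. It only demonstrates the type constraint, not real job setup.

```java
public class ApiMismatchSketch {
    // Hypothetical stand-ins for the two unrelated InputFormat hierarchies.
    interface OldInputFormat {}                // models org.apache.hadoop.mapred.InputFormat
    abstract static class NewInputFormat {}    // models org.apache.hadoop.mapreduce.InputFormat

    // Models the old-API CombineFileInputFormat, which belongs to the old hierarchy only.
    static class OldCombineFormat implements OldInputFormat {}

    // Models the old-API job configuration: accepts any old-hierarchy format class.
    static class OldJobConf {
        Class<? extends OldInputFormat> inputFormat;
        void setInputFormat(Class<? extends OldInputFormat> cls) { inputFormat = cls; }
    }

    // Models the new-API job: accepts only new-hierarchy format classes.
    static class NewJob {
        Class<? extends NewInputFormat> inputFormat;
        void setInputFormatClass(Class<? extends NewInputFormat> cls) { inputFormat = cls; }
    }

    // Runtime equivalents of the compile-time bound checks above.
    static boolean acceptedByOldApi(Class<?> c) {
        return OldInputFormat.class.isAssignableFrom(c);
    }

    static boolean acceptedByNewApi(Class<?> c) {
        return NewInputFormat.class.isAssignableFrom(c);
    }

    public static void main(String[] args) {
        OldJobConf conf = new OldJobConf();
        conf.setInputFormat(OldCombineFormat.class);   // compiles: same hierarchy

        // The next line would be rejected by the compiler, because
        // OldCombineFormat is not a subclass of NewInputFormat:
        // new NewJob().setInputFormatClass(OldCombineFormat.class);

        System.out.println("old API accepts: " + acceptedByOldApi(OldCombineFormat.class));
        System.out.println("new API accepts: " + acceptedByNewApi(OldCombineFormat.class));
    }
}
```

Running main prints "old API accepts: true" followed by "new API accepts: false". In a real job the same shape applies: as long as the mapper, reducer, and job configuration all come from the old package, the input format class satisfies the bound.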

If you really want to use the new API, though, I unfortunately don't see a
super-easy path. You could try to apply the patch from MAPREDUCE-364 to your
version of Hadoop and recompile, but that might be tricky since the
filenames will most likely not line up (due to the project split).

- Aaron

On Tue, Mar 16, 2010 at 8:11 AM, Aleksandar Stupar wrote:

> Hi all,
> I want to use CombineFileInputFormat in version 0.20.2, but it can't be used
> with the Job class.
> Description:
> org.apache.hadoop.mapred.lib.CombineFileInputFormat cannot be used with
> org.apache.hadoop.mapreduce.Job
> because Job.setInputFormatClass requires a subclass of
>  org.apache.hadoop.mapreduce.InputFormat, and CombineFileInputFormat
> extends org.apache.hadoop.mapred.FileInputFormat.
> Also, CombineFileInputFormat uses deprecated classes.
> Are there any workarounds?
> Thanks,
> Aleksandar Stupar.