Re: CombineFileInputFormat in 0.20.2 version
The most obvious workaround is to stay on the old API: keep using Mapper,
Reducer, JobConf, etc. from org.apache.hadoop.mapred rather than
org.apache.hadoop.mapreduce.
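
To illustrate why staying on the old API works while mixing the two does not, here is a rough model that runs without Hadoop on the classpath. Every class in it (NewInputFormat, OldInputFormat, OldFileInputFormat, OldCombineFileInputFormat, NewJob, OldJobConf) is a hypothetical stand-in I made up for the sketch, not the real Hadoop type. It only mirrors the inheritance relationships: the old-API format sits in a separate hierarchy from the new-API InputFormat, so a setter bounded to the new type rejects it at compile time.

```java
public class ApiMismatchSketch {
    // Stand-in for org.apache.hadoop.mapreduce.InputFormat (new API, abstract class).
    static abstract class NewInputFormat {}

    // Stand-in for org.apache.hadoop.mapred.InputFormat (old API, interface).
    interface OldInputFormat {}

    // Stand-in for org.apache.hadoop.mapred.FileInputFormat.
    static abstract class OldFileInputFormat implements OldInputFormat {}

    // Stand-in for org.apache.hadoop.mapred.lib.CombineFileInputFormat.
    static abstract class OldCombineFileInputFormat extends OldFileInputFormat {}

    // Stand-in for the new-API Job: the setter is bounded to the new hierarchy.
    static class NewJob {
        private Class<? extends NewInputFormat> inputFormat;
        void setInputFormatClass(Class<? extends NewInputFormat> cls) { inputFormat = cls; }
        Class<? extends NewInputFormat> getInputFormatClass() { return inputFormat; }
    }

    // Stand-in for the old-API JobConf: the setter is bounded to the old hierarchy.
    static class OldJobConf {
        private Class<? extends OldInputFormat> inputFormat;
        void setInputFormat(Class<? extends OldInputFormat> cls) { inputFormat = cls; }
        Class<? extends OldInputFormat> getInputFormat() { return inputFormat; }
    }

    // Runtime equivalents of the compile-time bounds above.
    static boolean acceptedByNewApi(Class<?> c) {
        return NewInputFormat.class.isAssignableFrom(c);
    }

    static boolean acceptedByOldApi(Class<?> c) {
        return OldInputFormat.class.isAssignableFrom(c);
    }

    public static void main(String[] args) {
        // Old-style job configuration accepts the old-style input format.
        OldJobConf conf = new OldJobConf();
        conf.setInputFormat(OldCombineFileInputFormat.class);

        // The new-style job would be rejected by the compiler:
        // new NewJob().setInputFormatClass(OldCombineFileInputFormat.class);

        System.out.println("old API accepts it: "
                + acceptedByOldApi(OldCombineFileInputFormat.class));
        System.out.println("new API accepts it: "
                + acceptedByNewApi(OldCombineFileInputFormat.class));
    }
}
```

In a real 0.20.2 job the same idea means configuring everything through the old-API job configuration, so the input format, mapper and reducer all come from the same package tree.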

If you really want to use the new API, though, I unfortunately don't see a
super-easy path. You could try to apply the patch from MAPREDUCE-364 to your
version of Hadoop and recompile, but that might be tricky: the file paths in
the patch will most likely not match your source tree (due to the project
split), so expect to fix them up by hand.

- Aaron

On Tue, Mar 16, 2010 at 8:11 AM, Aleksandar Stupar wrote:

> Hi all,
> I want to use CombineFileInputFormat in version 0.20.2, but it can't be
> used with the Job class.
> Description:
> org.apache.hadoop.mapred.lib.CombineFileInputFormat cannot be used with
> org.apache.hadoop.mapreduce.Job
> because Job.setInputFormatClass requires a subclass of
> org.apache.hadoop.mapreduce.InputFormat, and CombineFileInputFormat
> extends org.apache.hadoop.mapred.FileInputFormat.
> Also, CombineFileInputFormat uses deprecated classes.
> Are there any workarounds?
> Thanks,
> Aleksandar Stupar.