Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Hadoop >> mail # user >> Inputs of Mapreduce

Copy link to this message
Re: Inputs of Mapreduce

Hadoop mapreduce innately takes in file line by line.
XML files are not comprised of single lines.
So you will have to pack a single xml document into a single line.
Or you can make your own input format, which you need to refer to a guide

2010/7/13 Khaled BEN BAHRI <[EMAIL PROTECTED]>

> Hello to all
> I'm novice in working with mapreduce and i'm developping a mapreduce
> function that take xml documents as inputs.
> How can i make input files and precise it to the map function
> Thanks for help
> Best regards
> Khaled