Hadoop >> mail # user >> The name of the current input file during a map

Re: The name of the current input file during a map
Thank you.

On Thu, Nov 26, 2009 at 2:10 AM, Amogh Vasekar <[EMAIL PROTECTED]> wrote:
> conf.get("map.input.file") is what you need.
> Amogh
> On 11/26/09 12:35 PM, "Saptarshi Guha" <[EMAIL PROTECTED]> wrote:
> Hello,
> I have a set of input files, part-r-*, which I will pass through another
> map (no reduce). The part-r-* files consist of key/value pairs, the keys
> being small and the values fairly large (MBs).
> I would like to index these, i.e. run a map whose output is the key and
> the /filename/, i.e. the part-r-* file to which that key belongs, so
> that if I need them again I can just access that file.
> Q: In the map stage, how do I retrieve the name of the file being
> processed? I'd rather not use the MapFileOutputFormat.
> Hadoop 0.21
> Regards
> Saptarshi
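
For anyone finding this thread later, here is a minimal sketch of the indexing step described above, written in plain Java so it runs without a Hadoop installation. It assumes the full input path has already been read from the job configuration (with the old mapred API that would typically be done once in configure(JobConf) via conf.get("map.input.file")). The class and method names are hypothetical, and the tab-separated record layout is only an illustration, not something specified in the thread.

```java
import java.net.URI;

// Hypothetical helper, not part of Hadoop: turns the value of
// "map.input.file" into the (key, filename) index record asked about above.
final class InputFileIndex {

    // Reduce a full input path such as
    // "hdfs://namenode:8020/user/demo/out/part-r-00003" to "part-r-00003".
    // Assumes a hierarchical URI or a plain path without spaces.
    static String baseName(String inputFile) {
        String path = URI.create(inputFile).getPath();
        return path.substring(path.lastIndexOf('/') + 1);
    }

    // Build one index line: the record's key, a tab, then the name of the
    // part-r-* file the key was read from.
    static String indexRecord(String key, String inputFile) {
        return key + "\t" + baseName(inputFile);
    }
}
```

In a real mapper, the map method would emit the key together with baseName(inputFile) rather than building a string. Whether "map.input.file" is populated can depend on the Hadoop version and on which API (mapred or mapreduce) the job uses, so it is worth checking that the value is not null before relying on it.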