Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig, mail # user - Way of determining the source of data


Copy link to this message
-
Way of determining the source of data
Ranjan Bagchi 2012-02-03, 01:11
Hi,

I've a bunch of [for example] apache logfiles that I'm searching through.  I can process them with:

logs = load 's3://bucket/directory/*' USING LogLoader as (remoteAddr, remoteLogname, user, time :chararray, method, uri :chararray, proto, status, bytes, referer, userAgent);

Is there any way of getting the name of the file from which logs was pulled added to the relation?

Thanks,

Ranjan