Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Plain View
Pig >> mail # user >> calling python udfs with varargs

Copy link to this message
calling python udfs with varargs

I have a simple python udf which takes a variable number of (string)
arguments and returns the first non-empty one.
I can see that the udf is invoked from pig but no arguments are being passed.

Here is the script:

from org.apache.pig.scripting import *

def firstNonempty(*args):
    print args
    for v in args:
     if len(v) != 0:
           return v
    return ''

if __name__ == "__main__":
   data = load 'input.txt' AS (string1:chararray, string2:chararray);
   data = foreach data generate firstNonempty(string1, string2) as id,
string1, string2;
   dump data;


Julien Le Dem 2011-10-17, 18:26
Stan Rosenberg 2011-10-17, 19:38
Julien Le Dem 2011-10-17, 20:01