|
|
-
Deserialization error when using Jython UDF in Pig 0.10 scriptChris Diehl 2012-06-08, 19:35
Hi All,
I recently downloaded and installed Pig 0.10 on our Hadoop cluster. After configuring things as I've done before to use Jython UDFs, I'm seeing deserialization errors. I've verified that my test code runs when I switch back to Pig 0.8. I'm successfully using Pig 0.10 with Jython on my MacBook Pro in local mode so I'm rather flummoxed as to what is up. What I've done to set up Pig 0.10 for using Jython UDFs: 1) Installed Jython 2.5.0 2) set PIG_CLASSPATH="<jython path>/jython.jar" 3) set JYTHON_HOME="<jython path>" 4) added <jython path>/bin to the path 5) put jython-2.5.0.jar into <pig path>/lib for good measure I'm not sure how to get around this issue. Anyone have any suggestions? In case it's illuminating, I've returned output from a very simple Pig job that loads data and attempts to pass it through a Jython UDF. Chris 2012-06-08 19:25:16,031 [main] INFO org.apache.pig.Main - Apache Pig version 0.10.0 (r1328203) compiled Apr 19 2012, 22:54:12 2012-06-08 19:25:16,032 [main] INFO org.apache.pig.Main - Logging error messages to: /opt/shared_storage/log_analysis_pig_python_scripts/pig_1339183516027.log 2012-06-08 19:25:18,506 [main] INFO org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to hadoop file system at: hdfs://X 2012-06-08 19:25:19,222 [main] INFO org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to map-reduce job tracker at: X 2012-06-08 19:25:21,153 [main] INFO org.apache.pig.scripting.jython.JythonScriptEngine - created tmp python.cachedir=/tmp/pig_jython_491028285262723042 *sys-package-mgr*: processing new jar, '/usr/java/jdk1.6.0_21/lib/tools.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/hadoop-core-0.20.2-cdh3u1.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/ant-contrib-1.0b3.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/aspectjrt-1.6.5.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/aspectjtools-1.6.5.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/commons-cli-1.2.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/commons-codec-1.4.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/commons-daemon-1.0.1.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/commons-el-1.0.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/commons-httpclient-3.0.1.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/commons-logging-1.0.4.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/commons-logging-api-1.0.4.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/commons-net-1.4.1.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/core-3.1.1.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/hadoop-fairscheduler-0.20.2-cdh3u1.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/hsqldb-1.8.0.10.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/jackson-core-asl-1.5.2.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/jackson-mapper-asl-1.5.2.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/jasper-compiler-5.5.12.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/jasper-runtime-5.5.12.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/jets3t-0.6.1.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/jetty-6.1.26.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/jetty-servlet-tester-6.1.26.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/jetty-util-6.1.26.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/jsch-0.1.42.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/junit-4.5.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/kfs-0.2.2.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/log4j-1.2.15.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/mockito-all-1.8.2.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/oro-2.0.8.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/servlet-api-2.5-20081211.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/servlet-api-2.5-6.1.14.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/slf4j-api-1.4.3.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/slf4j-log4j12-1.4.3.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/xmlenc-0.52.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/jsp-2.1/jsp-2.1.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/lib/jsp-2.1/jsp-api-2.1.jar' *sys-package-mgr*: processing new jar, '/usr/lib/hadoop-0.20/contrib/fairscheduler/hadoop-fairscheduler-0.20.2-cdh3u1.jar' *sys-package-mgr*: processing new jar, '/opt/shared_storage/jython_2.5.0/jython.jar' *sys-package-mgr*: processing new jar, '/opt/shared_storage/pig-0.10.0/lib/automaton.jar' *sys-package-mgr*: processing new jar, '/opt/shared_storage/pig-0.10.0/lib/jython-2.5.0.jar' *sys-package-mgr*: processing new jar, '/opt/shared_storage/pig-0.10.0/pig-0.10.0-withouthadoop.jar' *sys-package-mgr*: processing new jar, '/usr/java/jdk1.6.0_21/jre/lib/resources.jar' *sys-package-mgr*: processing new jar, '/usr/java/jdk1.6.0_21/jre/lib/rt.jar' *sys-package-mgr*: processing new jar, '/usr/java/jdk1.6.0_21/jre/lib/jsse.jar' *sys-package-mgr*: processing new jar, '/usr/java/jdk1.6.0_21/jre/lib/jce.jar' *sys-package-mgr*: processing new jar, '/usr/java/jdk1.6.0_21/jre/lib/charsets.jar' *sys-package-mgr*: processing new jar, '/usr/java/jdk1.6.0_21/jre/lib/ext/sunjce_provider.jar' *sys-package-mgr*: processing new jar, '/usr/java/jdk1.6.0_21/jre/lib/ext/localedata.jar' *sys-package-mgr*: processing new jar, '/usr/java/jdk1.6.0_21/jre/lib/e |