Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Plain View
Pig >> mail # user >> RE: Streaming Error in PIG


Copy link to this message
-
RE: Streaming Error in PIG
hey all,

I am trying to get the following command work in Pig but getting the ERROR grunt.Grunt: ERROR 2083: Error while trying to get next result in POStream.

My script work fine and I am able to print the databag fine but after that i need to parse out the columns to my requirement where it fails. Ive tested the AWK statement and it work fine.

databag = foreach data generate id, FLATTEN(py.bag_of_tuples(data))
;databag2 = STREAM databag THROUGH `awk -F'- ' '{print $1,$2,$3,substr($4,1,1),substr($4,2,1),$5}'`

;Can you help? Thanks.

(Sorry about previous mail no idea why it looked the way it did)
+
ingvay7@...) 2013-03-07, 03:56
+
jojo.mathis@...) 2013-03-07, 03:50
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB