Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Hadoop, mail # dev - python streaming error


Copy link to this message
-
python streaming error
springring 2013-01-12, 08:30
Hi,

     When I run code below as a streaming, the job error N/A and killed.  I run step by step, find it error when
" file_obj = open(file) " .  When I run same code outside of hadoop, everything is ok.

  1 #!/bin/env python
  2
  3 import sys
  4
  5 for line in sys.stdin:
  6     offset,filename = line.split("\t")
  7     file = "hdfs://user/hdfs/catalog3/" + filename
  8     print line
  9     print filename
 10     print file
 11     file_obj = open(file)
..................................