Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig, mail # user - Pig storage chararray unicode support


Copy link to this message
-
Pig storage chararray unicode support
Saket Joshi 2013-03-25, 21:17
Hi All,

I have a question regards to char encoding in pig. I am parsing some url
data for european sites for example i have a string "veste à capuche" but
after loading in pig using TextLoader/PigStorage the data get mangled as
"veste ? capuche" Certainly looks like the encoding is not being maintained
, the documentation for pig suggests that chararray supports utf-8. Has
anyone faced such a issue ? any pointers on how to solve this issue
Thanks,
Saket