Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Plain View
Pig >> mail # user >> Pig storage chararray unicode support

Copy link to this message
Pig storage chararray unicode support
Hi All,

I have a question regards to char encoding in pig. I am parsing some url
data for european sites for example i have a string "veste à capuche" but
after loading in pig using TextLoader/PigStorage the data get mangled as
"veste ? capuche" Certainly looks like the encoding is not being maintained
, the documentation for pig suggests that chararray supports utf-8. Has
anyone faced such a issue ? any pointers on how to solve this issue