Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig, mail # user - Pig 0.10 XmlLoader can't handle XML shorthand

Copy link to this message
Pig 0.10 XmlLoader can't handle XML shorthand
Zhu Wayne 2013-05-06, 17:04
Greetings! Did someone encounter the same issue?

Well-formated XML for <Sellers></Sellers> is fine:

grunt> register /usr/lib/pig/piggybank.jar;

grunt> a = load 'sample.xml' using
org.apache.pig.piggybank.storage.XMLLoader('Sellers') as (doc:chararray);

grunt> dump a;


            <Seller SellerName="Leebay-Brothers" SellerRating="3.9"
SellerPrice="3,499.99" ContactInfo="" ContactPhoneInfo=""/>


 Short-hand XML for <Seller/> is NOT good:

grunt> a = load 'sample.xml' using
org.apache.pig.piggybank.storage.XMLLoader('Seller') as (doc:chararray);

grunt> dump a;

I got nothing here.