
JsonLoader schema field order shouldn't matter
I'm using JsonLoader with Pig 0.10.0.

Suppose I have an input.json file that looks like this:

{"date": "2007-08-25", "id": 16}
{"date": "2007-09-08", "id": 17}
{"date": "2007-09-15", "id": 18}

And I use

a = LOAD 'input.json' USING JsonLoader('id:int,date:chararray');

I get errors when it tries to force the date fields into an integer.

Shouldn't this work regardless of the order of the fields in the schema?
JSON writers generally don't guarantee the ordering of keys.
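
A possible stopgap, assuming JsonLoader really does match fields by position rather than by name (which the error above suggests, but I haven't confirmed in the source): rewrite each record so its keys follow the schema order before loading. The helper name reorder_json_line and the idea of a Python preprocessing step are only illustrative; none of this is part of Pig.

```python
import json
import sys
from collections import OrderedDict

# Field order copied from the JsonLoader schema 'id:int,date:chararray'.
SCHEMA_FIELDS = ["id", "date"]


def reorder_json_line(line, fields=SCHEMA_FIELDS):
    """Re-serialize the JSON object in `line` with keys in `fields` order.

    Hypothetical helper: keys missing from the record come out as null,
    and keys not listed in `fields` are dropped.
    """
    record = json.loads(line)
    ordered = OrderedDict((name, record.get(name)) for name in fields)
    return json.dumps(ordered)


if __name__ == "__main__":
    # Read one JSON object per line from stdin, write reordered lines to stdout.
    for raw in sys.stdin:
        if raw.strip():
            print(reorder_json_line(raw))
```

Run as a filter over input.json and pointed at the output, the LOAD statement above should then see id before date in every record, if the positional assumption holds.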

One alternative (though annoying) would be to use Elephant Bird
instead, but I can't get that to compile against hadoop 2.0.0 and Pig