Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Avro, mail # user - Avro and Hadoop streaming


Copy link to this message
-
Avro and Hadoop streaming
Miki Tebeka 2011-06-02, 21:30
Greetings,

I'd like to use hadoop streaming with Avro files.
My plan is to write an inputformat class that emits json records, one
per line. This way the streaming application can read one record per
line.
(http://hadoop.apache.org/common/docs/r0.15.2/streaming.html#Specifying+Other+Plugins+for+Jobs)

I couldn't find any documentation/help about writing inputformat
classes. Can someone point me to the right direction?

Thanks,
--
Miki