Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
Avro >> mail # user >> Avro and Hadoop streaming


Copy link to this message
-
Avro and Hadoop streaming
Greetings,

I'd like to use hadoop streaming with Avro files.
My plan is to write an inputformat class that emits json records, one
per line. This way the streaming application can read one record per
line.
(http://hadoop.apache.org/common/docs/r0.15.2/streaming.html#Specifying+Other+Plugins+for+Jobs)

I couldn't find any documentation/help about writing inputformat
classes. Can someone point me to the right direction?

Thanks,
--
Miki
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB