MapReduce >> mail # user >> Custom File reader


Re: Custom File reader
Steven,

> I have a number of files which can be read and converted into a series of
> lines of text - however, the means of reading the
> file is not known to the standard Hadoop splitters. I understand that I can
> override FileInputFormat to set isSplitable to false -
> I am a little unclear on how to get the Job to use my version of
> that FileInputFormat, and nowhere do I see a place to
> override the code for reading the file and converting it to lines of text.
> Anyone know how to do this?

Could you look at the JobConf.setInputFormat() API to set your input format?

Thanks
Hemanth
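
As a rough sketch of where the file-reading code could go: in the old org.apache.hadoop.mapred API, the input format's getRecordReader() returns the RecordReader that actually reads the file, so that is the class to supply. One way to keep it manageable is to put the format-specific decoding in a plain class and have the reader delegate to it. The class below covers only that decoding step. CustomLineDecoder and the length-prefixed record layout are hypothetical stand-ins for whatever the files really contain, and the Hadoop wiring is outlined in comments only and has not been tested here.

```java
import java.io.DataInputStream;
import java.io.EOFException;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical decoder for an assumed file layout: a sequence of records,
// each a 4-byte big-endian length followed by that many UTF-8 bytes.
// Replace the body of readLines() with the logic for the real format.
//
// Hadoop wiring (old mapred API), outlined as comments:
//   class MyInputFormat extends FileInputFormat<LongWritable, Text> {
//     @Override
//     protected boolean isSplitable(FileSystem fs, Path file) { return false; }
//     @Override
//     public RecordReader<LongWritable, Text> getRecordReader(
//         InputSplit split, JobConf job, Reporter reporter) throws IOException {
//       // return a RecordReader that opens the file and emits decoded lines
//     }
//   }
//   conf.setInputFormat(MyInputFormat.class);  // conf is the job's JobConf
public class CustomLineDecoder {

    // Reads the whole stream and returns one String per record.
    public static List<String> readLines(InputStream in) throws IOException {
        DataInputStream data = new DataInputStream(in);
        List<String> lines = new ArrayList<>();
        while (true) {
            int length;
            try {
                length = data.readInt();
            } catch (EOFException e) {
                // End of input; a truncated length prefix is also treated as the end.
                break;
            }
            if (length < 0) {
                throw new IOException("Negative record length: " + length);
            }
            byte[] payload = new byte[length];
            data.readFully(payload); // throws EOFException if the record is cut short
            lines.add(new String(payload, StandardCharsets.UTF_8));
        }
        return lines;
    }
}
```

A RecordReader's next() method could open the split's file with FileSystem.open(), decode it this way, and hand back one line per call. For large files it would be better to decode one record per call instead of building the whole list in memory.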