|
|
-
Custom File readerSteve Lewis 2010-06-24, 19:44
I have a number of files which can be read and converted into a series of
lines of lext - however the means of reading the file is not known to the standard Hadoop splitters. I understand that I can Override FileInputFormat to set isSplitable to false - I am a little unclear on how to get the Job to Use my version of that FileInputFormat and nowhere do I see a place to override the code for reading the file and converting it to lines of text. Anyone know how to do this?? -- Steven M. Lewis PhD Institute for Systems Biology Seattle WA |