|
|
-
Is there a loader that loads a file as a line?
Jonathan Coveney 2012-06-21, 21:34
It can even be a bytearray. Basically I have a bunch of files, and I want one file -> one row. Is there an easy way to do this? Or will I need to provide a special fileinputformat etc?
-
Re: Is there a loader that loads a file as a line?
Prashant Kommireddi 2012-06-21, 22:09
I think you will need to implement a RecordReader/InputFormat of your own for this and use it with a LoadFunc. Not sure if Hadoop has a Reader that you could re-use for this.
How do you handle the case when a file exceeds block size?
On Thu, Jun 21, 2012 at 2:34 PM, Jonathan Coveney <[EMAIL PROTECTED]>wrote:
> It can even be a bytearray. Basically I have a bunch of files, and I want > one file -> one row. Is there an easy way to do this? Or will I need to > provide a special fileinputformat etc? >
-
Re: Is there a loader that loads a file as a line?
Mohammad Tariq 2012-06-21, 22:34
Hello Jonathan, Have a look at Hadoop's WholeFileInputFormat..Might fit into your requirements. Regards, Mohammad Tariq On Fri, Jun 22, 2012 at 3:39 AM, Prashant Kommireddi <[EMAIL PROTECTED]> wrote: > I think you will need to implement a RecordReader/InputFormat of your own > for this and use it with a LoadFunc. Not sure if Hadoop has a Reader that > you could re-use for this. > > How do you handle the case when a file exceeds block size? > > On Thu, Jun 21, 2012 at 2:34 PM, Jonathan Coveney <[EMAIL PROTECTED]>wrote: > >> It can even be a bytearray. Basically I have a bunch of files, and I want >> one file -> one row. Is there an easy way to do this? Or will I need to >> provide a special fileinputformat etc? >>
|
|
All projects made searchable here are trademarks of the Apache Software Foundation.
Service operated by
Sematext