Re: Problem in Input Format Class
Sonal Goyal 2011-09-30, 07:02
Are you trying to insert this data into HBase or are you trying to just
process this log file using Hadoop? I am not sure how your question is
related to HBase, so if it is unrelated, you can seek help on the mapreduce
mailing list.
For an MR job, you can use TextInputFormat and specify a custom separator.
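As a rough illustration of the per-line approach: with the stock TextInputFormat, each call to the mapper receives one line of the file as its value, and the mapper can split that line on the semicolon itself. The sketch below uses only the JDK so it runs without a cluster; the class name SemicolonRecord and the method parse are hypothetical helpers, not part of Hadoop. If the semicolon is meant to delimit whole records rather than fields, newer Hadoop releases read a "textinputformat.record.delimiter" configuration property in TextInputFormat, but its availability should be checked against the version in use.

```java
import java.util.Arrays;

// Hypothetical helper, not part of Hadoop: splits one line of the sample
// file into its semicolon-separated fields, as a mapper body might do
// with the Text value it receives from TextInputFormat.
final class SemicolonRecord {
    private SemicolonRecord() {}

    // String.split drops trailing empty strings, so the final ';' in the
    // sample line does not produce an extra empty field.
    static String[] parse(String line) {
        return line.split(";");
    }

    public static void main(String[] args) {
        String line = "1;00000003;310:012:8001:01;-05:00;04:04;2010;45;56164773;3;1;0;1;";
        String[] fields = parse(line);
        System.out.println(fields.length + " fields: " + Arrays.toString(fields));
    }
}
```

Inside a real mapper, the same split would be applied to value.toString(). Fields such as 310:012:8001:01 contain colons but no semicolons, so they come through as single fields.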
Crux: Reporting for HBase <https://github.com/sonalgoyal/crux>
Nube Technologies <http://www.nubetech.co>
On Fri, Sep 30, 2011 at 12:26 PM, Arsalan Bilal <[EMAIL PROTECTED]> wrote:
> No, I did not try Guava's Splitter.
> I am asking about an input format class that also takes a separator.
> For example, job.setInputFormatClass(<Class Format>, <Separator>);
> Which input format class would support a separator here?
> On Thu, Sep 29, 2011 at 8:39 PM, Buttler, David <[EMAIL PROTECTED]> wrote:
> > Have you considered just taking the line of text as is and using Guava's
> > Splitter?
> > Not sure how this is related to HBase
> > -----Original Message-----
> > From: Arsalan Bilal [mailto:[EMAIL PROTECTED]]
> > Sent: Thursday, September 29, 2011 7:24 AM
> > To: [EMAIL PROTECTED]
> > Subject: Problem in Input Format Class
> > I want to read a text file (sample given below) separated with semicolons (;)
> > using a mapper. Each record is separated with a semicolon (;).
> > Should I write my own custom input format class, or
> > is there an existing input format class that accepts a separator?
> > The input file looks like this:
> > 1;00000003;310:012:8001:01;-05:00;04:04;2010;45;56164773;3;1;0;1;
> Best Regards,
> Arsalan Bilal