pgaurav 2012-09-05, 08:42
Bejoy Ks 2012-09-05, 08:57
there are few sql database like products built on hadoop for large scale data
you may want to take a look at hbase or hive or combined both on how
they suit your needs
you want to do more on data analytics and machine learning then you
can take a look at mahout as well
On Wed, Sep 5, 2012 at 2:12 PM, pgaurav <[EMAIL PROTECTED]> wrote:
> Hi Guys,
> I’m 5 days old in hadoop world and trying to analyse this as a long term
> solution to our client.
> I could do some r&d on Amazon EC2 / EMR:
> Load the data, text / csv, to S3
> Write your mapper / reducer / Jobclient and upload the jar to s3
> Start a job flow
> I tried 2 sample code, word count and csv data process.
> My question is that to further analyse the data / reporting / search, what
> should be done? Do I need to implement in Mapper class itself? Do I need to
> dump the data to the database and then write some custom application? What
> is the standard way to analysing the data?
> View this message in context: http://old.nabble.com/Using-hadoop-for-analytics-tp34391246p34391246.html
> Sent from the Hadoop core-user mailing list archive at Nabble.com.
Bertrand Dechoux 2012-09-05, 08:57