Richard Pickens 2013-02-05, 18:12
Application of Cloudera Hadoop for Dataset analysis
Sharath Chandra Guntuku 2013-02-05, 10:43
Hi,
I am Sharath Chandra, an undergraduate student at BITS-Pilani, India. I would like some clarifications regarding the Cloudera Hadoop distribution. I am using a CDH4 Demo VM for now.

1. After I upload the files into the file browser, if I have to link two or three datasets using a key in those files, what should I do? Do I have to run a query over them?

2. My objective is this: I have some data collected over a few years, and I would now like to link all of it using keys, as in a database, and then run queries over it to find particular patterns. Later I would like to implement some machine learning algorithms on it for predictive analysis. Will this be possible on the demo VM?

I am totally new to this. Can I get some help on this? I would be very grateful.

------------------------------------------------------------------------------
Thanks and Regards,
*Sharath Chandra Guntuku*
Undergraduate Student (Final Year)
*Computer Science Department*
*Email*: [EMAIL PROTECTED]
*BITS-Pilani*, Hyderabad Campus
Jawahar Nagar, Shameerpet, RR Dist, Hyderabad - 500078, Andhra Pradesh
Suresh Srinivas 2013-02-05, 16:00