

Re: Application of Cloudera Hadoop for Dataset analysis
Please take this thread to the CDH mailing list.
On Tue, Feb 5, 2013 at 2:43 AM, Sharath Chandra Guntuku <[EMAIL PROTECTED]> wrote:

> Hi,
>
> I am Sharath Chandra, an undergraduate student at BITS-Pilani, India. I
> would like some clarifications regarding the Cloudera Hadoop
> distribution. I am using a CDH4 Demo VM for now.
>
> 1. After I upload the files into the file browser, if I have to link
> two or three datasets using a key in those files, what should I do? Do I
> have to run a query over them?
>
> 2. I have some data collected over a few years, and I would like to link
> all of it using keys, as in a database, and then run queries over it to
> find particular patterns. Later, I would like to implement some machine
> learning algorithms on it for predictive analysis. Will this be possible
> on the demo VM?
>
> I am totally new to this. Can I get some help on this? I would be very
> grateful for the same.
>
>
> ------------------------------------------------------------------------------
> Thanks and Regards,
> *Sharath Chandra Guntuku*
> Undergraduate Student (Final Year)
> *Computer Science Department*
> *Email*: [EMAIL PROTECTED]
>
> *BITS-Pilani*, Hyderabad Campus
> Jawahar Nagar, Shameerpet, RR Dist,
> Hyderabad - 500078, Andhra Pradesh
>

--
http://hortonworks.com/download/