Hadoop >> mail # user >> Right way to implement MR ?


Re: Right way to implement MR ?
You might want to start with http://hadoop.apache.org/common/docs/stable/mapred_tutorial.html.

Arun

On May 23, 2012, at 12:47 PM, samir das mohapatra wrote:

> Hi All,
>     How do I compare two input files in an M/R job?
>     Log file A is around 30 GB
>     and log file B is around 60 GB.
>
>  I want to know how I should define <K,V> inside the mapper.
>
> Thanks
>  samir.

--
Arun C. Murthy
Hortonworks Inc.
http://hortonworks.com/
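
For readers landing on this thread: one common way to frame the <K,V> question is a reduce-side comparison, sketched below in plain Python rather than against the Hadoop API. It assumes "compare" means finding lines that appear in only one of the two logs, which the original question does not actually state. The mapper emits the line as the key and a tag naming its source file as the value; the reducer then sees every tag for a given line. All function names here (map_record, shuffle, reduce_record, compare) are hypothetical, and shuffle() only stands in for the grouping the framework would do.

```python
from collections import defaultdict


def map_record(source_tag, line):
    """Mapper: key = the record itself, value = a tag naming the input file."""
    yield line.strip(), source_tag


def shuffle(pairs):
    """Stand-in for the framework's shuffle phase: group values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_record(key, tags):
    """Reducer: classify the record by which input files it was seen in."""
    seen = set(tags)
    if seen == {"A"}:
        return key, "only in A"
    if seen == {"B"}:
        return key, "only in B"
    return key, "in both"


def compare(lines_a, lines_b):
    """Run the map, shuffle, and reduce steps over two in-memory inputs."""
    pairs = []
    for line in lines_a:
        pairs.extend(map_record("A", line))
    for line in lines_b:
        pairs.extend(map_record("B", line))
    return dict(reduce_record(k, v) for k, v in shuffle(pairs).items())


if __name__ == "__main__":
    # Tiny illustrative inputs, not real log data.
    for record, status in sorted(compare(["x", "y"], ["y", "z"]).items()):
        print(record, status)
```

In an actual Hadoop job the tag would have to come from the input itself, for example by registering a separate mapper class per input path with MultipleInputs.addInputPath; whether that fits depends on how the two logs are laid out and on what "compare" needs to mean for them.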