Hadoop >> mail # user >> Re: Some issues!


Hey,

I have one more question. Suppose I have some cascading MapReduce jobs, and
some data collected in MRjob1 needs to be used in MRjob2. Is there any way
to do that?

Thanks

On Fri, Sep 4, 2009 at 1:54 PM, Amandeep Khurana <[EMAIL PROTECTED]> wrote:

> Or you can output the data as the keys, with NullWritable as the value.
> That way you'll get only unique data...
>
> On 9/4/09, zhang jianfeng <[EMAIL PROTECTED]> wrote:
> > Hi Sugandha ,
> >
> > If you only want the value, you need to set the key to NullWritable in
> > the reducer.
> >
> > e.g.
> > output.collect(NullWritable.get(), value);
> >
> >
> >
> > On Fri, Sep 4, 2009 at 12:46 AM, Sugandha Naolekar
> > <[EMAIL PROTECTED]> wrote:
> >
> >> Hello!
> >>
> >> I am running a simple MR job with a replication factor of 2. After it
> >> runs, the output is split into files named part-00000 and so on. Is
> >> there a way to keep the keys from being printed in the output files?
> >> The files currently contain key-value pairs; I want just the data and
> >> not the keys (integers).
> >>
> >>
> >>
> >>
> >> --
> >> Regards!
> >> Sugandha
> >>
> >
>
>
> --
>
>
> Amandeep Khurana
> Computer Science Graduate Student
> University of California, Santa Cruz
>
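
To illustrate the two NullWritable suggestions quoted above, here is a minimal sketch in plain Java, with no Hadoop dependency. It only models the behavior: Hadoop's TextOutputFormat writes "key TAB value" per record, but skips the key and the separator when the key is null or NullWritable (and likewise skips the value when it is NullWritable). The class name NullKeyOutputSketch, the NULL placeholder, and both method names are hypothetical and exist only for this example; they are not Hadoop APIs.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Plain-Java model of how a text output format treats a "null" key or value.
// Not Hadoop code: all names here are hypothetical stand-ins.
final class NullKeyOutputSketch {

    // Stand-in for NullWritable.get(): a singleton placeholder carrying no data.
    static final Object NULL = new Object();

    // Returns the line written for one record, or null if nothing is written.
    static String formatLine(Object key, Object value) {
        boolean nullKey = key == null || key == NULL;
        boolean nullValue = value == null || value == NULL;
        if (nullKey && nullValue) {
            return null; // nothing to write for this record
        }
        StringBuilder line = new StringBuilder();
        if (!nullKey) {
            line.append(key);
        }
        if (!nullKey && !nullValue) {
            line.append('\t'); // separator only when both sides are present
        }
        if (!nullValue) {
            line.append(value);
        }
        return line.toString();
    }

    // Models the "data as key, null value" variant: the shuffle groups
    // records by key, so the reducer sees each distinct key exactly once.
    static List<String> uniqueLines(List<String> records) {
        Map<String, Integer> grouped = new TreeMap<>();
        for (String record : records) {
            grouped.merge(record, 1, Integer::sum);
        }
        List<String> out = new ArrayList<>();
        for (String key : grouped.keySet()) {
            out.add(formatLine(key, NULL)); // one output line per distinct key
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(formatLine(7, "some data"));    // key, tab, value
        System.out.println(formatLine(NULL, "some data")); // value only
        System.out.println(uniqueLines(Arrays.asList("b", "a", "b")));
    }
}
```

In a real job the equivalent of formatLine(NULL, value) is the output.collect(NullWritable.get(), value) call shown in the quoted reply. The job's output key class would presumably also need to be declared as NullWritable in that case.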