Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig >> mail # user >> generate multiple output files?


Copy link to this message
-
Re: generate multiple output files?
very nice, thanks!
On Fri, Jan 11, 2013 at 2:52 PM, Dmitriy Ryaboy <[EMAIL PROTECTED]> wrote:

> Yang,
> Try MultiStorage:
>
> https://pig.apache.org/docs/r0.8.1/api/org/apache/pig/piggybank/storage/MultiStorage.html
>
>
> On Wed, Jan 9, 2013 at 2:37 PM, Yang <[EMAIL PROTECTED]> wrote:
>
> > let's say I have an input dataset, each row has 2 fields, the first field
> > is a value among 100 possible values. I want to just split the input
> > dataset into 100 outputs , based on the  value of the first field.
> >
> > is there a way to do that in pig? I see MultipleOutputs Format in Java
> API,
> > but have not found anything similar in PIG
> >
> > Thanks!
> > Yang
> >
>
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB