Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Plain View
Pig >> mail # user >> generate multiple output files?


+
Yang 2013-01-09, 22:37
Copy link to this message
-
Re: generate multiple output files?
Yang,
Try MultiStorage:
https://pig.apache.org/docs/r0.8.1/api/org/apache/pig/piggybank/storage/MultiStorage.html
On Wed, Jan 9, 2013 at 2:37 PM, Yang <[EMAIL PROTECTED]> wrote:

> let's say I have an input dataset, each row has 2 fields, the first field
> is a value among 100 possible values. I want to just split the input
> dataset into 100 outputs , based on the  value of the first field.
>
> is there a way to do that in pig? I see MultipleOutputs Format in Java API,
> but have not found anything similar in PIG
>
> Thanks!
> Yang
>
+
Yang 2013-01-18, 00:25
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB