Home | About | Sematext search-lucene.com search-hadoop.com
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig >> mail # user >> Basic questions


Copy link to this message
-
RE: Basic questions
Hi Mark,

Both PigStorage and BinStorage allow you to store complex types such as bags. So you can do something like this:

A = load 'mydata' as (x, y, z);
B = group A by x;
C = foreach B generate A;
store C;

If you don't like the formatting done by the store function, you can create a storage function that formats the data the way you like.

Olga

-----Original Message-----
From: Mark [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, September 15, 2010 5:05 PM
To: [EMAIL PROTECTED]
Subject: Basic questions

  Say I have a bunch of tuples that is a result of a GROUP, how can I
just store the values.. not the key?

As a side note, how can I output bags to be separated by tabs instead of
commas? How can I remove the annoying parentheses and brackets around my
output?

Thanks
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB