Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig >> mail # user >> Basic questions


Copy link to this message
-
RE: Basic questions
Not with standard storage functions that we have.

-----Original Message-----
From: Mark [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, September 15, 2010 5:33 PM
To: [EMAIL PROTECTED]
Subject: Re: Basic questions

  So if I have a tuple that looks like:

({(163906653),(61482111),(157530223)})

There is no way to get that to output as:

163906653/t61482111/t157530223

???

On 9/15/10 5:19 PM, Olga Natkovich wrote:
> Hi Mark,
>
> Both PigStorage and BinStorage allow you to store complex types such as bags. So you can do something like this:
>
> A = load 'mydata' as (x, y, z);
> B = group A by x;
> C = foreach B generate A;
> store C;
>
> If you don't like the formatting done by the store function, you can create a storage function that formats the data the way you like.
>
> Olga
>
> -----Original Message-----
> From: Mark [mailto:[EMAIL PROTECTED]]
> Sent: Wednesday, September 15, 2010 5:05 PM
> To: [EMAIL PROTECTED]
> Subject: Basic questions
>
>    Say I have a bunch of tuples that is a result of a GROUP, how can I
> just store the values.. not the key?
>
> As a side note, how can I output bags to be separated by tabs instead of
> commas? How can I remove the annoying parentheses and brackets around my
> output?
>
> Thanks
>