Re: Pig and Hive on the same data?
Ashutosh Chauhan 2009-09-30, 15:45
Pig doesn't mandate Ctrl-A or any other character as the field
delimiter. You can tell Pig which delimiter to use. For example, you can
specify Ctrl-A as the field delimiter as follows:
A = load 'mydata' using PigStorage('\u0001');
If you don't specify a delimiter, e.g. A = load 'mydata'; then tab is assumed
to be the delimiter.
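To keep a single copy of the data usable from both tools, the same
delimiter can also be passed to PigStorage when writing results back out.
Here is a minimal sketch of that round trip. The field names and types in
the AS clause and the output path 'myoutput' are hypothetical placeholders,
not something taken from your setup:

```pig
-- Load Ctrl-A (\u0001) delimited data, as in the example above.
-- The schema in the AS clause is hypothetical; adjust it to your data.
A = LOAD 'mydata' USING PigStorage('\u0001') AS (id:int, name:chararray);

-- Store with the same Ctrl-A delimiter so the output keeps the layout
-- of the input. 'myoutput' is a placeholder path.
STORE A INTO 'myoutput' USING PigStorage('\u0001');
```

Since the input and output use the same delimiter, a Hive table that expects
Ctrl-A delimited text should be able to read either location, assuming the
column order and types match the table definition.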
Also, if you have more questions on Pig, please post on the pig-user list to get
On Wed, Sep 30, 2009 at 10:55, dumbfounder <[EMAIL PROTECTED]> wrote:
> We would like to use the same data for Pig and Hive queries for
> has anyone done this without having 2 copies of the data? Hive seems to
> want to work with CTRL-A delimited data, and I don't see a way to specify
> CTRL-A as a delimiter for Pig. Is there another efficient regex that people
> have used for Pig, or has anyone figured out a way to use delimiters that
> aren't CTRL-A for Hive? Or are there any other outside the box ideas?
> Sent from the Hadoop core-user mailing list archive at Nabble.com.