Hadoop, mail # user - Pig and Hive on the same data?


Re: Pig and Hive on the same data?
Ashutosh Chauhan 2009-09-30, 15:45
Hi Chris,

Pig doesn't mandate Ctrl-A or any other character as the field
delimiter. You can tell Pig which delimiter to use. For example, you can
specify Ctrl-A as the field delimiter as follows:

A = load 'mydata' using PigStorage('\u0001');

If you don't specify a delimiter, e.g. A = load 'mydata'; then tab is
assumed to be the delimiter.
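
As a rough sketch of how the two cases look side by side, the script
below loads one Ctrl-A delimited file and one tab-delimited file. The
path 'mydata_tab' and the field names and types (id, name) are made up
for illustration and are not from your data; adjust them to match your
files.

```pig
-- Ctrl-A delimited input; the schema (id, name) is a hypothetical example.
A = load 'mydata' using PigStorage('\u0001') as (id:int, name:chararray);

-- No loader given: Pig falls back to PigStorage with tab as the delimiter.
-- 'mydata_tab' is a hypothetical path.
B = load 'mydata_tab' as (id:int, name:chararray);

-- Project the fields to check that they were split as expected.
C = foreach A generate id, name;
dump C;
```

If the fields in the dumped tuples come out merged into one, the
delimiter passed to PigStorage most likely does not match the file.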

Also, if you have more questions about Pig, please post them to the
pig-user list to get a faster response.

Thanks,
Ashutosh

On Wed, Sep 30, 2009 at 10:55, dumbfounder <[EMAIL PROTECTED]> wrote:

>
> We would like to use the same data for Pig and Hive queries for
> flexibility, has anyone done this without having 2 copies of the data?
> Hive seems to only want to work with CTRL-A delimited data, and I don't
> see a way to specify CTRL-A as a delimiter for Pig. Is there another
> efficient regex that people have used for Pig, or has anyone figured out
> a way to use delimiters that aren't CTRL-A for Hive? Or are there any
> other outside the box ideas?
> --
> View this message in context:
> http://www.nabble.com/Pig-and-Hive-on-the-same-data--tp25682735p25682735.html
> Sent from the Hadoop core-user mailing list archive at Nabble.com.