Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig, mail # user - STRSPLIT problems (or UDF shortcoming?)


Copy link to this message
-
Re: STRSPLIT problems (or UDF shortcoming?)
Dan Young 2012-05-17, 23:06
What version of pig are you using on EMR?
On May 17, 2012 5:02 PM, "Nerius Landys" <[EMAIL PROTECTED]> wrote:

> > Did you try to escape the backslash?
>
> I just tried this:
>
>  POSA = FOREACH TEST GENERATE STRSPLIT(startpos,'\\u002F');
>
> ... and still the same result.  By the way I'm using a forward slash
> for the separator character.
> I also tried this:
>
>  POSA = FOREACH TEST GENERATE STRSPLIT(startpos,'/',-1);
>
> ... and still getting null rows.
>
> If you look at my original post you'll see that the data contained in
> POSA and POSB should be identical.  There's something that's getting
> screwy during the processing stage, where processing functions are
> "concatenated" together.  If I save the output from each step to a
> file and load it back in, things work fine.  I demonstrated this in my
> original post.
>
> Very strange, but I really need to get this resolved.
>