Pig >> mail # user >> self cross doesn't work as expected


Re: self cross doesn't work as expected
It's not a bug. Pig works this way to avoid field name conflicts.
You need to load the data twice when you do a self join/group/cross.
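
The load-twice workaround could look roughly like the sketch below. It assumes the same A.txt and schema as in the quoted message; the alias names A1 and A2 are only illustrative.

```pig
-- Load the same file under two separate aliases so that Pig treats
-- them as distinct relations (A1 and A2 are hypothetical names).
A1 = load 'A.txt' as (id:chararray);
A2 = load 'A.txt' as (id:chararray);

-- Crossing two distinct aliases should pair every row of A1 with every row of A2.
B = cross A1, A2;
dump B;
```

With the three-line input from the quoted message, this should produce the nine pairs rather than the original three rows. The two columns can then be told apart as A1::id and A2::id.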
On 29 December 2012 12:44, Danfeng Li <[EMAIL PROTECTED]> wrote:

> If I self cross a relation, I get the original relation back, which is
> not what I expected.
>
> The input: A.txt
> 1
> 2
> 3
>
> The code:
> A = load 'A.txt' as (id:chararray);
> B = cross A, A;
> dump B;
> (1)
> (2)
> (3)
>
> C = foreach A generate id;
> D = cross A, C;
> dump D;
> (1,1)
> (1,2)
> (1,3)
> (2,1)
> (2,2)
> (2,3)
> (3,1)
> (3,2)
> (3,3)
>
> Is this a feature or a bug? I'm using Pig 0.10.0.
>
> Thanks.
> Dan
>