Pig, mail # dev - Re: Review Request 14897: PIG-3538 Implement LIMIT in Tez

Re: Review Request 14897: PIG-3538 Implement LIMIT in Tez
Alex Bain 2013-10-24, 01:42


(Updated Oct. 23, 2013, 6:42 p.m.)
Review request for pig, Cheolsoo Park, Daniel Dai, Mark Wagner, and Rohini Palaniswamy.
Summary (updated)

PIG-3538 Implement LIMIT in Tez
Bugs: PIG-3538
Repository: pig-git

Implement LIMIT in Tez by providing an implementation of visitLimit in TezCompiler.java.

  src/org/apache/pig/backend/hadoop/executionengine/tez/TezCompiler.java 0c20214

Diff: https://reviews.apache.org/r/14897/diff/
Testing (updated)

[abain@abain-ld pig]$ cat data/1.dat

[abain@abain-ld pig]$ cat test3.pig
a = load './1.dat' using PigStorage(',') as (id:int, fruit:chararray);
b = LIMIT a 2;
STORE b INTO 'foo';

I ran it with "pig -x tez -f test3.pig" and got the following (correct) results:

[abain@abain-ld pig]$ hadoop fs -ls /user/abain/foo
Found 2 items
-rw-r--r--   1 abain supergroup          0 2013-10-23 18:38 /user/abain/foo/_SUCCESS
-rw-r--r--   1 abain supergroup         17 2013-10-23 18:38 /user/abain/foo/part-r-00000

[abain@abain-ld pig]$ hadoop fs -cat /user/abain/foo/part-r-00000
1 orange
2 apple
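
For comparison, the expected behaviour of the script can be modelled outside Pig. The following is only a rough Python sketch, not the TezCompiler change itself: the helper names limit_rows and format_rows are hypothetical, the third input row is invented for illustration (the contents of 1.dat are not reproduced here), and it assumes the output is written tab-delimited, which is PigStorage's default. It also keeps the first n rows, which is just one valid outcome, since LIMIT without a preceding ORDER BY does not guarantee which tuples are returned.

```python
from itertools import islice


def limit_rows(lines, n, delimiter=","):
    """Parse delimited (id, fruit) records and keep at most n of them."""
    rows = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # ignore blank lines in the input
        id_text, fruit = line.split(delimiter, 1)
        rows.append((int(id_text), fruit))
    # Keeping the first n rows is one valid result of an unordered LIMIT.
    return list(islice(rows, n))


def format_rows(rows):
    """Render rows tab-delimited (assumed default PigStorage output)."""
    return ["%d\t%s" % (row_id, fruit) for row_id, fruit in rows]


# Hypothetical input; only the first two rows match the output shown above.
sample = ["1,orange", "2,apple", "3,banana"]

for out_line in format_rows(limit_rows(sample, 2)):
    print(out_line)
```

Two such lines with trailing newlines come to 17 bytes, which is consistent with the size of part-r-00000 in the listing above.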

Alex Bain