-Re: issue with partioning sdf
Aniket Mokashi 2012-02-02, 05:51
I think pig will use default partitioner for the same.
You can use following syntax--
A = load 'input_data';
B = group A by $0 PARTITION BY
org.apache.pig.test.utils.SimpleCustomPartitioner parallel 2;
Take a look-
On Wed, Feb 1, 2012 at 5:04 PM, Aleksandr Elbakyan <[EMAIL PROTECTED]>wrote:
> Hello All,
> I am trying to understand how does pig group partitioning work, I was not
> able to find any documentation regarding what happen under the hood.
> For example
> B = GROUP A BY age;
> Does pig partition data by age? Or it will partition by something else?
> Other question:
> If I want to create custom partitioner can I pass fields I want data be
> partition by or it will be the same as group by key?