Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Pig >> mail # user >> issue with partioning sdf

Copy link to this message
Re: issue with partioning sdf
I think pig will use default partitioner for the same.

You can use following syntax--
A = load 'input_data';
B = group A by $0 PARTITION BY
org.apache.pig.test.utils.SimpleCustomPartitioner parallel 2;

Take a look-


On Wed, Feb 1, 2012 at 5:04 PM, Aleksandr Elbakyan <[EMAIL PROTECTED]>wrote:

> Hello All,
> I am trying to understand how does pig group partitioning work, I was not
> able to find any documentation regarding what happen under the hood.
> For example
> B = GROUP A BY age;
> Does pig partition data by age? Or it will partition by something else?
> Other question:
> If I want to create custom partitioner can I pass fields I want data be
> partition by or it will be the same as group by key?
> Regards,
> Aleksandr
"...:::Aniket:::... Quetzalco@tl"