
Re: Improve Performance of Pig script
Can you please forward the script and the job counters? The cluster size
(number of map and reduce slots) would be good to know too.


On Mon, Apr 2, 2012 at 5:27 PM, sonia gehlot <[EMAIL PROTECTED]> wrote:

> Hi,
> I have a really large data set of about 10 to 15 billion rows. I want to
> compute some aggregates (sum, count distinct, max, etc.), but the script
> takes forever to run.
> What hints or properties should I set to improve performance?
> Please let me know.
> Thanks,
> Sonia