Pig >> mail # user >> Tracking parts of a job taking the most time


Re: Tracking parts of a job taking the most time
How about disabling multi-query execution and using the CurrentTime UDF to
print the time between each script block?
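
A rough sketch of that idea, assuming a Pig version that ships the built-in
CurrentTime() and a script launched with multi-query execution turned off
(pig -no_multiquery, or -M for short). The relation names, paths and the
plain file load are all hypothetical stand-ins for the real script. Which
moment CurrentTime() reports may differ between Pig versions, so treat the
printed values as approximate block boundaries rather than exact timings.

```pig
-- Run with: pig -no_multiquery timing_sketch.pig
-- With multi-query execution off, each STORE/DUMP runs as its own job, in order.
-- All names and paths here are hypothetical.

-- Block 1: the step to be timed (an HBase load in the original question;
-- a plain file load stands in for it here).
raw = LOAD 'input/data.tsv' AS (id:chararray, value:long);
STORE raw INTO 'tmp/block1_out';

-- Marker after block 1: one row carrying a label and a timestamp.
done1   = LOAD 'tmp/block1_out' AS (id:chararray, value:long);
one_row = LIMIT done1 1;
marker1 = FOREACH one_row GENERATE 'after block 1' AS label, CurrentTime() AS ts;
DUMP marker1;

-- Block 2: the next part of the script, followed by its own marker.
grouped = GROUP done1 BY id;
totals  = FOREACH grouped GENERATE group AS id, SUM(done1.value) AS total;
STORE totals INTO 'tmp/block2_out';

one_total = LIMIT totals 1;
marker2   = FOREACH one_total GENERATE 'after block 2' AS label, CurrentTime() AS ts;
DUMP marker2;
```

The difference between consecutive markers gives a rough per-block time.
Each marker is itself a small job, so it adds some overhead of its own, and
the second marker recomputes its input because nothing is shared between
jobs once multi-query execution is off.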

Johnny
On Tue, Jun 4, 2013 at 7:11 PM, John Meek <[EMAIL PROTECTED]> wrote:

> All,
>
> I have a 400-line Pig script which performs the calculations I need it to
> perform, however I need to figure out the amount of time that specific
> parts of the script take.
>
> For example, initial load from an HBase table - I'd like to know how much
> time the load takes before moving on to the next step.
>
> What's the easiest way to break this down?
>
>
> thanks,
> JM
>