Hive >> mail # user >> Scheduling Hive Jobs (Oozie vs. Pentaho vs. something else)


Re: Scheduling Hive Jobs (Oozie vs. Pentaho vs. something else)
William,

Oozie workflow jobs support Hive actions, and Oozie coordinator jobs
support time- and data-based activation of workflow jobs.
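
As a rough illustration of the trigger condition in the question (start
when all prerequisite partitions are populated, or when a set time has
been reached), here is a minimal Python sketch. It is not Oozie code and
does not use any Oozie or Hive API; the function name, its parameters,
and the partition strings are all hypothetical.

```python
from datetime import datetime


def should_start(required, populated, now, deadline=None):
    """Decide whether a downstream job may start (hypothetical helper).

    required  -- partitions the job depends on
    populated -- partitions known to contain data
    now       -- current time
    deadline  -- optional time at which the job starts regardless of data
    """
    # Data condition: every prerequisite partition has been populated.
    data_ready = set(required) <= set(populated)
    # Time condition: only applies when a deadline was given.
    time_reached = deadline is not None and now >= deadline
    return data_ready or time_reached


if __name__ == "__main__":
    required = ["clicks/dt=2011-11-29", "views/dt=2011-11-29"]
    populated = ["clicks/dt=2011-11-29"]
    deadline = datetime(2011, 11, 30, 2, 0)

    # One prerequisite is still missing and the deadline has not passed.
    print(should_start(required, populated, datetime(2011, 11, 29, 23, 0), deadline))
    # The deadline has passed, so the job starts even without all the data.
    print(should_start(required, populated, datetime(2011, 11, 30, 3, 0), deadline))
```

In a scheduler this check would run periodically or on partition-arrival
events; how partition arrival is detected is left open here.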

Cheers.

Alejandro

On Tue, Nov 29, 2011 at 4:27 PM, William Kornfeld <[EMAIL PROTECTED]> wrote:

>  We are building an application that involves chains of M/R jobs, most
> likely all will be written in Hive.  We need to start a Hive job when one
> or more prerequisite data sets appear (defined in the Hive sense as a new
> partition having been populated with data) - OR- a particular time has been
> reached.
>
> We know of two scheduling packages that appear to solve this problem:
> Oozie & Pentaho (to which my company has a license).
>
> Does anyone have actual experience using either of these (or something
> else) to schedule Hive jobs?
>
> William Kornfeld
> Baynote
>
>