Home | About | Sematext search-lucene.com search-hadoop.com
 Search Hadoop and all its subprojects:

Switch to Threaded View
Hive >> mail # user >> Scheduling Hive Jobs (Oozie vs. Pentaho vs. something else)

Copy link to this message
Re: Scheduling Hive Jobs (Oozie vs. Pentaho vs. something else)

Oozie workflow jobs support Hive actions and Oozie coordinator jobs support
time/data activation of workflow jobs.



On Tue, Nov 29, 2011 at 4:27 PM, William Kornfeld <[EMAIL PROTECTED]>wrote:

>  We are building an application that involves chains of M/R jobs, most
> likely all will be written in Hive.  We need to start a Hive job when one
> or more prerequisite data sets appear (defined in the Hive sense as a new
> partition having been populated with data) - OR- a particular time has been
> reached.
> We know of two scheduling packages that appear to solve this problem:
> Oozie & Pentaho (to which my company has a license).
> Does anyone have actual experience using either of these (or something
> else) to schedule Hive jobs?
> William Kornfeld
> Baynote