Last checkin on Azkaban was 11 months ago:
But, the last checkin for Hamake was June 2010. And it's still a cool
little Hadoop/Pig scheduler.
On Sun, Aug 19, 2012 at 2:49 PM, Michael Segel
<[EMAIL PROTECTED]> wrote:
> There has been some work to replace the use of queues with HBase.
> This would be used to feed processes off the queue to help balance out the load on the cluster.
> In one specific use case, this was effective because the time spent processing each mapper.map() iteration is a couple of orders of magnitude as the time it takes to pull the data from the 'queue' and to each node for processing.
> Again, YMMV, it is an interesting hack though....
> On Aug 19, 2012, at 11:46 AM, Robert Nicholson <[EMAIL PROTECTED]> wrote:
>> We have an application or a series of applications that listen to incoming feeds they then distribute this data in XML form to a number of queues. Another set of processes listen to these queues and process the messages. Order of processing is important in so far as related messages need to be processed in sequence hence today all related messages go to the same queue and are processed by the same queue consumer.
>> The idea would be replace the use of MQ with some kind of reliable distributed dispatch. Does Hadoop provide that?