Re: Best way to collect Hadoop logs across cluster
Mark Kerzner 2013-04-26, 00:08
Thank you for all the advice, it was indeed very useful.
On Thu, Apr 18, 2013 at 11:44 PM, Roman Shaposhnik <[EMAIL PROTECTED]> wrote:
> On Thu, Apr 18, 2013 at 9:23 PM, Mark Kerzner <[EMAIL PROTECTED]> wrote:
> > Hi,
> > my clusters are on EC2, and they disappear after the cluster's instances
> > are destroyed. What is the best practice to collect the logs for later
> > EC2 does exactly that with their EMR, how do they do it?
> Apache Flume could be extremely useful for this purpose. You
> can even configure it to deposit log data in realtime into