Re: Best way to collect Hadoop logs across cluster
Roman Shaposhnik 2013-04-19, 04:44
On Thu, Apr 18, 2013 at 9:23 PM, Mark Kerzner <[EMAIL PROTECTED]> wrote:
> my clusters are on EC2, and they disappear after the cluster's instances are
> destroyed. What is the best practice to collect the logs for later storage?
> EC2 does exactly that with their EMR, how do they do it?
Apache Flume could be extremely useful for this purpose. You
can even configure it to deposit log data in real time into