Yes data flow visualizations definitely sound like something that would be
good for Ambari. If you are interested in debugging Hadoop jobs there is
also the Hadoop Development Tools project
It is taking the Eclipse plugin for Hadoop and really improving it. I
know that there has been some work to try and get a debugger working over
there where you could walk through parts of your MR job line by line.
On 6/14/13 12:40 PM, "Chris Nauroth" <[EMAIL PROTECTED]> wrote:
>You might want to investigate contributing on Apache Ambari, which has
>features for visualization of jobs and end-to-end flows consisting of
>multiple dependent jobs.
>On Fri, Jun 14, 2013 at 8:20 AM, Saikat Kanjilal
>> Hi Folks,
>> I was wondering if anyone is currently working on or thinking about
>> debugging tools for mapreduce jobs, I was thinking about starting an
>> to build an end to end visual tool that shows all the steps in the
>> mapreduce workflow and data flows, variable content changing to speed up
>> debugging of jobs. Please ignore if something like this already
>> and if not I'd love to collaborate with folks to build something.