You are on the right track. TestDFSIO and TeraGen/TeraSort provide a good characterization of I/O and shuffle/sort performance. You would likely also want to run dstat (or vmstat, iostat, etc.) on the individual nodes and save the output.
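As a rough illustration of such a run, here is a minimal Python sketch that only builds the command lines and prints them. The jar paths, HDFS directories, file counts and sizes are placeholders I made up, not values from this thread; the jar names and locations in particular differ between Hadoop versions and distributions, so adjust them for your cluster.

```python
import shlex

# Hypothetical jar locations (assumption): adjust to your Hadoop install.
TESTS_JAR = "/path/to/hadoop-mapreduce-client-jobclient-tests.jar"
EXAMPLES_JAR = "/path/to/hadoop-mapreduce-examples.jar"


def dstat_cmd(csv_path, interval_s=5):
    """dstat with time, cpu, disk, net and memory stats, saved as CSV."""
    return ["dstat", "-tcdnm", "--output", csv_path, str(interval_s)]


def dfsio_cmd(mode, n_files=16, file_size_mb=1000):
    """TestDFSIO write or read pass; file size is given in MB."""
    if mode not in ("write", "read"):
        raise ValueError("mode must be 'write' or 'read'")
    return ["hadoop", "jar", TESTS_JAR, "TestDFSIO", "-" + mode,
            "-nrFiles", str(n_files), "-fileSize", str(file_size_mb)]


def teragen_cmd(rows, out_dir):
    """TeraGen writes `rows` rows of 100 bytes each into out_dir."""
    return ["hadoop", "jar", EXAMPLES_JAR, "teragen", str(rows), out_dir]


def terasort_cmd(in_dir, out_dir):
    """TeraSort sorts the TeraGen output; this stresses shuffle/sort."""
    return ["hadoop", "jar", EXAMPLES_JAR, "terasort", in_dir, out_dir]


def benchmark_plan(rows=10_000_000, base_dir="/benchmarks"):
    """Ordered list of benchmark commands (base_dir is a placeholder)."""
    gen_dir = base_dir + "/tera-in"
    sort_dir = base_dir + "/tera-out"
    return [
        dfsio_cmd("write"),
        dfsio_cmd("read"),
        teragen_cmd(rows, gen_dir),
        terasort_cmd(gen_dir, sort_dir),
    ]


# Start this on every node before the benchmarks and stop it afterwards.
print(" ".join(shlex.quote(a) for a in dstat_cmd("/tmp/dstat-node.csv")))
for cmd in benchmark_plan():
    print(" ".join(shlex.quote(a) for a in cmd))
```

Printing the commands instead of executing them keeps the sketch safe to try on any machine; once the paths are right, each list can be handed to subprocess.run. Keeping the dstat CSVs next to the job output makes it easier to line up node-level disk and network activity with each benchmark phase.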
HiBench does provide additional useful characterizations, such as mixed workloads using typical Hadoop ecosystem tools.

2013/9/2 Ravi Kiran <[EMAIL PROTECTED]>

> You can also look at
> a) https://github.com/intel-hadoop/HiBench
>
> Regards
> Ravi Magham
>
> On Mon, Sep 2, 2013 at 12:26 PM, ch huang <[EMAIL PROTECTED]> wrote:
>
>> hi, all:
>> i want to evaluate my hadoop cluster performance, what tool can i
>> use? (TestDFSIO, nnbench?)