Hadoop >> mail # user >> Matrix multiplication in Hadoop
Re: Matrix multiplication in Hadoop
Sounds like a job for next-gen MapReduce, native libraries, and GPUs. A
modern-day Dr. Frankenstein for sure.

On Saturday, November 19, 2011, Tim Broberg <[EMAIL PROTECTED]> wrote:
> Perhaps this is a good candidate for a native library, then?
> ________________________________________
> From: Mike Davis [[EMAIL PROTECTED]]
> Sent: Friday, November 18, 2011 7:39 PM
> Subject: Re: Matrix multiplication in Hadoop
> On Friday, November 18, 2011, Mike Spreitzer <[EMAIL PROTECTED]> wrote:
>>  Why is matrix multiplication ill-suited for Hadoop?
> IMHO, a huge issue here is the JVM's inability to fully support CPU
> vendor-specific SIMD instructions and, by extension, optimized BLAS routines.
> Running a large MM task using Intel's MKL rather than relying on generic
> compiler optimization is orders of magnitude faster on a single multicore
> processor. I see almost no way that Hadoop could win such a CPU-intensive
> task against an MPI cluster with even a tenth of the nodes running a
> decently tuned BLAS library. Racing even against a single CPU might be
> difficult, given the I/O overhead.
> Still, it's a reasonably common problem and we shouldn't murder the good
> in favor of the best. I'm certain a MM/LinAlg Hadoop library with even
> mediocre performance, relative to C, would get used.
> --
> Mike Davis
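
To make the I/O overhead point in the thread a bit more concrete, here is a minimal sketch of a naive one-pass map/shuffle/reduce matrix multiply, simulated in plain Python. It is not Hadoop code and not anything proposed by the posters; the function names (`map_phase`, `shuffle`, `reduce_phase`, `mapreduce_matmul`) are hypothetical and chosen only for illustration. It assumes small dense matrices stored as lists of lists, and it counts how many records would cross the shuffle.

```python
from collections import defaultdict


def map_phase(A, B):
    """Emit one record per (output cell, input element) pair.

    Assumption: A is n x m and B is m x p, both dense lists of lists.
    """
    n, m, p = len(A), len(B), len(B[0])
    # Every A[i][k] is needed by all p cells in output row i.
    for i in range(n):
        for k in range(m):
            for j in range(p):
                yield (i, j), ("A", k, A[i][k])
    # Every B[k][j] is needed by all n cells in output column j.
    for k in range(m):
        for j in range(p):
            for i in range(n):
                yield (i, j), ("B", k, B[k][j])


def shuffle(records):
    """Group values by output cell, as a shuffle stage would."""
    groups = defaultdict(list)
    for key, value in records:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Join the A and B values on k and sum the products for each cell."""
    result = {}
    for key, values in groups.items():
        a = {k: v for tag, k, v in values if tag == "A"}
        b = {k: v for tag, k, v in values if tag == "B"}
        result[key] = sum(a[k] * b[k] for k in a)
    return result


def mapreduce_matmul(A, B):
    """Return (C, number of shuffled records) for C = A * B."""
    records = list(map_phase(A, B))
    cells = reduce_phase(shuffle(records))
    n, p = len(A), len(B[0])
    C = [[cells[(i, j)] for j in range(p)] for i in range(n)]
    return C, len(records)


A = [[1, 2], [3, 4]]
B = [[5, 6], [7, 8]]
C, shuffled = mapreduce_matmul(A, B)
print(C)         # [[19, 22], [43, 50]]
print(shuffled)  # 16
```

In this naive formulation the 8 input elements turn into 16 shuffled records, and in general the shuffle carries 2*n*m*p records for n*m*p multiply-adds. That ratio is a property of this particular sketch; blocked formulations would shuffle less, so treat the numbers as illustrative only.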