Fast recursive matrix multiplication for multi-core architectures | Zendy

Gudula Rünger | Zendy; Michael Schwind | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Open Access

Fast recursive matrix multiplication for multi-core architectures

Author(s) -

Gudula Rünger,

Michael Schwind

Publication year - 2010

Publication title -

procedia computer science

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.334

H-Index - 76

ISSN - 1877-0509

DOI - 10.1016/j.procs.2010.04.009

Subject(s) - computer science , matrix multiplication , multiplication (music) , parallel computing , core (optical fiber) , matrix (chemical analysis) , theoretical computer science , telecommunications , mathematics , physics , combinatorics , quantum mechanics , quantum , materials science , composite material

In this article, we present a fast algorithm for matrix multiplication optimized for recent multicore architectures. The implementation exploits different methodologies from parallel programming, like recursive decomposition, efficient low-level implementations of basic blocks, software prefetching, and task scheduling resulting in a multilevel algorithm with adaptive features. Measurements on different systems and comparisons with GotoBLAS, Intel Math Kernel Library (IMKL), and AMD Core Math Library (AMCL) show that the matrix implementation presented has a very high efficiency

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research