z-logo
open-access-imgOpen Access
Fast recursive matrix multiplication for multi-core architectures
Author(s) -
Gudula Rünger,
Michael Schwind
Publication year - 2010
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2010.04.009
Subject(s) - computer science , matrix multiplication , multiplication (music) , parallel computing , core (optical fiber) , matrix (chemical analysis) , theoretical computer science , telecommunications , mathematics , physics , combinatorics , quantum mechanics , quantum , materials science , composite material
In this article, we present a fast algorithm for matrix multiplication optimized for recent multicore architectures. The implementation exploits different methodologies from parallel programming, like recursive decomposition, efficient low-level implementations of basic blocks, software prefetching, and task scheduling resulting in a multilevel algorithm with adaptive features. Measurements on different systems and comparisons with GotoBLAS, Intel Math Kernel Library (IMKL), and AMD Core Math Library (AMCL) show that the matrix implementation presented has a very high efficiency

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom