OpenMP Issues Arising in the Development of Parallel BLAS and LAPACK Libraries
Author(s) -
C. A. Addison,
Y. Ren,
Matthijs van Waveren
Publication year - 2003
Publication title -
scientific programming
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.269
H-Index - 36
eISSN - 1875-919X
pISSN - 1058-9244
DOI - 10.1155/2003/278167
Subject(s) - computer science , parallel computing , parallelism (grammar) , implementation , shared memory , exploit , computation , distributed memory , partition (number theory) , context (archaeology) , linear algebra , focus (optics) , data parallelism , speedup , theoretical computer science , algorithm , programming language , mathematics , paleontology , physics , geometry , computer security , optics , combinatorics , biology
Dense linear algebra libraries need to cope efficiently with a range of input problem sizes and shapes. Inherently this means that parallel implementations have to exploit parallelism wherever it is present. While OpenMP allows relatively fine grain parallelism to be exploited in a shared memory environment it currently lacks features to make it easy to partition computation over multiple array indices or to overlap sequential and parallel computations. The inherent flexible nature of shared memory paradigms such as OpenMP poses other difficulties when it becomes necessary to optimise performance across successive parallel library calls. Notions borrowed from distributed memory paradigms, such as explicit data distributions help address some of these problems, but the focus on data rather than work distribution appears misplaced in an SMP context
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom