Highly Parallel Sparse Cholesky Factorization
Author(s) -
John R. Gilbert,
Robert Schreiber
Publication year - 1992
Publication title -
siam journal on scientific and statistical computing
Language(s) - English
Resource type - Journals
eISSN - 2168-3417
pISSN - 0196-5204
DOI - 10.1137/0913067
Subject(s) - computer science , subroutine , cholesky decomposition , parallel computing , incomplete cholesky factorization , sparse matrix , factorization , vectorization (mathematics) , simd , algorithm , incomplete lu factorization , data structure , parallel algorithm , minimum degree algorithm , parallel programming model , matrix decomposition , programming paradigm , programming language , operating system , eigenvalues and eigenvectors , physics , quantum mechanics , gaussian
This paper develops and compares several fine-grained parallel algorithms to compute the Cholesky factorization of a sparse matrix. The experimental implementations are on the Connection Machine, a distributed-memory SIMD machine whose programming model conceptually supplies one processor per data element. In contrast to special-purpose algorithms in which the matrix structure conforms to the connection structure of the machine, this paper focuses on matrices with arbitrary sparsity structure.The most promising alternative is a supernodal, multifrontal algorithm whose inner loop performs several dense factorizations simultaneously on a two-dimensional grid of processors. The key subroutine is a fine-grained parallel, dense-factorization algorithm. The sparse code attains execution rates comparable to those of the dense subroutine. Although at present architectural limitations prevent the dense factorization from realizing its potential efficiency, it is concluded that a regular data parallel architecture ...
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom