z-logo
open-access-imgOpen Access
Mixed-precision iterative refinement using tensor cores on GPUs to accelerate solution of linear systems
Author(s) -
Azzam Haidar,
Harun Bayraktar,
Stanimire Tomov,
Jack Dongarra,
Nicholas J. Higham
Publication year - 2020
Publication title -
proceedings of the royal society a mathematical physical and engineering sciences
Language(s) - English
Resource type - Journals
eISSN - 1471-2946
pISSN - 1364-5021
DOI - 10.1098/rspa.2020.0110
Subject(s) - computer science , algorithm , rounding , leverage (statistics) , floating point , double precision floating point format , computational science , stability (learning theory) , machine learning , operating system
Double-precision floating-point arithmetic (FP64) has been the de facto standard for engineering and scientific simulations for several decades. Problem complexity and the sheer volume of data coming from various instruments and sensors motivate researchers to mix and match various approaches to optimize compute resources, including different levels of floating-point precision. In recent years, machine learning has motivated hardware support for half-precision floating-point arithmetic. A primary challenge in high-performance computing is to leverage reduced-precision and mixed-precision hardware. We show how the FP16/FP32 Tensor Cores on NVIDIA GPUs can be exploited to accelerate the solution of linear systems of equations Ax  =  b without sacrificing numerical stability. The techniques we employ include multiprecision LU factorization, the preconditioned generalized minimal residual algorithm (GMRES), and scaling and auto-adaptive rounding to avoid overflow. We also show how to efficiently handle systems with multiple right-hand sides. On the NVIDIA Quadro GV100 (Volta) GPU, we achieve a 4 × − 5 × performance increase and 5× better energy efficiency versus the standard FP64 implementation while maintaining an FP64 level of numerical stability.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom