The nonequispaced FFT on graphics processing units
Author(s) - Kunis Susanne, Kunis Stefan
Publication year - 2012
Publication title - PAMM (Proceedings in Applied Mathematics and Mechanics)
Language(s) - English
Resource type - Journals
ISSN - 1617-7061
DOI - 10.1002/pamm.201210003
Subject(s) - fast Fourier transform, CUDA, computer science, parallel computing, graphics, scheme (mathematics), memory bandwidth, computational science, general purpose computing on graphics processing units, bandwidth (computing), graphics processing unit, computer graphics (images), algorithm, mathematics, telecommunications, mathematical analysis
Without doubt, the fast Fourier transform (FFT) is one of the algorithms with a large impact on science and engineering. By means of appropriate approximations, this scheme has been generalized to arbitrary spatial sampling points. This so-called nonequispaced FFT is the core of the sequential NFFT3 library, and we discuss its computational costs in detail. Programmable graphics processing units, on the other hand, have evolved into highly parallel, multithreaded, many-core processors with enormous computational capacity and very high memory bandwidth. Using the so-called Compute Unified Device Architecture (CUDA), we parallelized the nonequispaced FFT by combining the CUDA FFT library with a dedicated parallelization of the approximation scheme. (© 2012 Wiley‐VCH Verlag GmbH & Co. KGaA, Weinheim)
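For orientation, the sum that a nonequispaced FFT approximates can be written down directly. The sketch below is not taken from the article or from the NFFT3 or CUDA code. It assumes the common convention f(x_j) = sum over k = -N/2, ..., N/2-1 of f_hat_k * exp(-2*pi*i*k*x_j) with nodes x_j in [-1/2, 1/2), and the function name ndft_direct is hypothetical. It evaluates the sum exactly in pure Python, which is useful only as a slow reference for checking a fast implementation.

```python
import cmath
import math


def ndft_direct(f_hat, nodes):
    """Directly evaluate a trigonometric polynomial at arbitrary nodes.

    Assumed convention (not stated in the abstract):
        f(x_j) = sum_{k=-N/2}^{N/2-1} f_hat[k + N/2] * exp(-2*pi*i*k*x_j)
    where N = len(f_hat) is even and the nodes lie in [-1/2, 1/2).
    """
    n = len(f_hat)
    if n % 2 != 0:
        raise ValueError("this sketch assumes an even number of coefficients")
    half = n // 2
    values = []
    for x in nodes:
        total = 0j
        for idx, coeff in enumerate(f_hat):
            k = idx - half  # map list index 0..N-1 to frequency -N/2..N/2-1
            total += coeff * cmath.exp(-2j * math.pi * k * x)
        values.append(total)
    return values


if __name__ == "__main__":
    # Four coefficients, three arbitrary (nonequispaced) nodes.
    coefficients = [0.5, -1.0, 2.0, 0.25]
    sample_nodes = [-0.5, -0.13, 0.3]
    for x, value in zip(sample_nodes, ndft_direct(coefficients, sample_nodes)):
        print(x, value)
```

The double loop costs on the order of N*M operations for N coefficients and M nodes. Avoiding that cost is the purpose of the approximation scheme the abstract refers to.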