Optimizing the computation of a parallel 3D finite difference algorithm for graphics processing units | Zendy

PorterSobieraj J. | Zendy; Cygert S. | Zendy; Kikoła D. | Zendy; Sikorski J. | Zendy; Słodkowski M. | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Optimizing the computation of a parallel 3D finite difference algorithm for graphics processing units

Author(s) -

PorterSobieraj J.,

Cygert S.,

Kikoła D.,

Sikorski J.,

Słodkowski M.

Publication year - 2014

Publication title -

concurrency and computation: practice and experience

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.309

H-Index - 67

eISSN - 1532-0634

pISSN - 1532-0626

DOI - 10.1002/cpe.3351

Subject(s) - stencil , computation , cuda , parallel computing , graphics processing unit , computer science , graphics , locality , computational science , series (stratigraphy) , algorithm , shared memory , computer graphics (images) , biology , paleontology , linguistics , philosophy

Summary This paper explores the possibilities of using a graphics processing unit for complex 3D finite difference computation via MUSTA‐FORCE and WENO algorithms. We propose a novel algorithm based on the new properties of CUDA surface memory optimized for 2D spatial locality and compare it with 3D stencil computations carried out via shared memory, which is currently considered to be the best approach. A case study was performed for the extensive generation of a time series of 3D grids of arbitrary size used in the computation of collisions between heavy nuclei in terms of relativistic hydrodynamics. It proved that implementation based on surface memory is as much as 23% faster than an equivalent implementation using shared memory. Copyright © 2014 John Wiley & Sons, Ltd.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research