Heterogeneous computing with OpenMP and Hydra | Zendy

Diener Matthias | Zendy; Kale Laxmikant V. | Zendy; Bodony Daniel J. | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Heterogeneous computing with OpenMP and Hydra

Author(s) -

Diener Matthias,

Kale Laxmikant V.,

Bodony Daniel J.

Publication year - 2020

Publication title -

concurrency and computation: practice and experience

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.309

H-Index - 67

eISSN - 1532-0634

pISSN - 1532-0626

DOI - 10.1002/cpe.5728

Subject(s) - computer science , cuda , host (biology) , parallel computing , lernaean hydra , code (set theory) , supercomputer , general purpose computing on graphics processing units , computer architecture , operating system , programming language , graphics , biology , history , ecology , set (abstract data type) , classics

Summary High‐performance computing relies on accelerators (such as GPGPUs) to achieve fast execution of scientific applications. Traditionally, these accelerators have been programmed with specialized languages, such as CUDA or OpenCL. In recent years, OpenMP emerged as a promising alternative for supporting accelerators, providing advantages such as maintaining a single code base for the host and different accelerator types and providing a simple way to extend support for accelerators to existing application codes. Efficiently using this support requires solving several challenges, related to performance, work partitioning, and concurrent execution on multiple device types. In this article, we discuss our experiences with using OpenMP for accelerators and present performance guidelines. We also introduce a library, Hydra, that addresses several of the challenges of using OpenMP for such devices. We apply Hydra to a scientific application, PlasCom2, that has not previously been able to use accelerators. Experiments on three architectures show that Hydra results in performance gains of up to 10× compared with CPU‐only execution. Concurrent execution on the host and GPU resulted in additional gains of up to 20% compared to running on the GPU only.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research