Open Access
Performance Portability of the Aeras Atmosphere Model to Next Generation Architectures using Kokkos
Author(s) - Jerry Watkins, Irina Tezaur
Publication year - 2016
Publication title - OSTI OAI (U.S. Department of Energy Office of Scientific and Technical Information)
Language(s) - English
Resource type - Reports
DOI - 10.2172/1563064
Subject(s) - software portability , speedup , computer science , parallel computing , supercomputer , gpu cluster , cuda , software , titan (rocket family) , code (set theory) , shared memory , code refactoring , assembly language , programming paradigm , computer architecture , operating system , programming language , aerospace engineering , set (abstract data type) , engineering
This report addresses the performance portability of the Aeras global atmosphere dynamical core (implemented within the Albany multi-physics code) to new and emerging architectures using the Kokkos library and programming model. We describe how the finite element assembly process for the 3D hydrostatic model in Aeras was refactored and highlight common issues encountered when developing for GPU architectures. After giving detailed build and execution instructions for Aeras with MPI, OpenMP, and CUDA on the Shannon cluster at Sandia National Laboratories and the Titan supercomputer at Oak Ridge National Laboratory, we evaluate the performance of the code on a canonical test case, the baroclinic instability problem. We show a speedup of up to 4 times on 8 OpenMP threads, but we were unable to achieve a speedup on the GPU due to memory constraints. We conclude by suggesting ways to improve the performance of the code in future optimization work.
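
To put the OpenMP figure in context, the sketch below applies the standard parallel-efficiency calculation (speedup divided by the number of threads) to the numbers quoted in the abstract. The function name `parallel_efficiency` is illustrative and does not come from the report. The abstract does not state the baseline for the speedup; a single-thread run is assumed here.

```python
def parallel_efficiency(speedup: float, workers: int) -> float:
    """Return speedup divided by worker count; 1.0 would be ideal scaling."""
    if workers <= 0:
        raise ValueError("workers must be positive")
    return speedup / workers


# Figures quoted in the abstract: up to 4x speedup on 8 OpenMP threads.
# Assumption: the speedup is measured against a single-thread run.
efficiency = parallel_efficiency(speedup=4.0, workers=8)
print(f"parallel efficiency: {efficiency:.0%}")  # prints "parallel efficiency: 50%"
```

Under that assumption, the best reported OpenMP result corresponds to at most about half of ideal linear scaling on 8 threads.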