z-logo
Premium
N ‐body computations using skeletal frameworks on multicore CPU/graphics processing unit architectures: an empirical performance evaluation
Author(s) -
Goli Mehdi,
González–Vélez Horacio
Publication year - 2013
Publication title -
concurrency and computation: practice and experience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.309
H-Index - 67
eISSN - 1532-0634
pISSN - 1532-0626
DOI - 10.1002/cpe.3076
Subject(s) - computer science , computation , graphics processing unit , cuda , graphics , implementation , multi core processor , central processing unit , parallel computing , architecture , general purpose computing on graphics processing units , computer architecture , computer engineering , computer hardware , programming language , operating system , art , visual arts
SUMMARY With the emergence of general‐purpose computation on graphics processing units, high‐level approaches that hide the conceptual complexity of the low‐level Compute Unified Device Architecture and Open Computing Language platforms are the subject of active research. However, these approaches may require a trade‐off in terms of achieved performance and utilisation on graphics processing units hardware and may impose algorithmic limitations. In this paper, we present and systematically evaluate the parallel performance of three implementations of the brute force, all‐pairs N ‐body algorithm with skeletal deployments based on the FastFlow, SkePU and Thrust frameworks. Our results indicate that the skeletal framework implementation achieves up to two orders of magnitude speed‐up over serial version with a Tesla M2050 with lower implementation complexity than low‐level Compute Unified Device Architecture programming. Copyright © 2013 John Wiley & Sons, Ltd.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here