Forward modeling of gravitational fields on hybrid multi-threaded cluster
Author(s) -
Carlos Couder-Castañeda,
C. Ortiz-Alemán,
Mauricio Gabriel Orozco-del-Castillo,
Mauricio Nava-Flores
Publication year - 2015
Publication title -
Geofísica Internacional
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.159
H-Index - 26
ISSN - 0016-7169
DOI - 10.1016/j.gi.2015.04.002
Subject(s) - bottleneck , computer science , cluster (spacecraft) , tensor (intrinsic definition) , gravitation , parallel computing , computational science , node (physics) , grid , gravimetric analysis , gravitational field , distributed memory , speedup , field (mathematics) , gpu cluster , shared memory , mathematics , physics , geometry , cuda , chemistry , organic chemistry , quantum mechanics , astronomy , pure mathematics , embedded system , classical mechanics , programming language
The analytic solution for the gravimetric tensor components, obtained from the gravitational potential equation of a three-dimensional volumetric assembly composed of unit prisms of constant density, demands a high computational cost, because the gravitational potential of each prism must be evaluated at every point of a previously defined observation grid, resulting in a large-scale computation. In this work we introduce a hybrid design and its parallel implementation, based on OpenMP and MPI, for the calculation of the vectorial components of the gravimetric field and the components of the gravimetric tensor. Computing time is drastically reduced, and the obtained performance leads to close-to-optimal speed-up ratios. The applied parallelization technique consists of decomposing the problem into groups of prisms and using separate memory allocations per processing core, avoiding the bottlenecks in access to a cluster node's main memory that generally arise when too many execution threads operate over the same region in OpenMP. Because OpenMP can only be used on shared-memory systems, MPI is required to distribute the calculation among cluster nodes, yielding a hybrid (OpenMP+MPI) code that is highly efficient and exhibits nearly perfect speed-up. Additionally, the numerical results were validated against their sequential counterpart.