Locality‐Conscious Nested‐Loops Parallelization
Author(s) - Parsa Saeed, Hamze Mohammad
Publication year - 2014
Publication title - ETRI Journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.295
H-Index - 46
eISSN - 2233-7326
pISSN - 1225-6463
DOI - 10.4218/etrij.14.0113.0266
Subject(s) - locality, nested loop join, parallel computing, computer science, loop tiling, loop fission, transformation (genetics), automatic parallelization, computation, reuse, loop fusion, locality of reference, data transformation, algorithm, cache, programming language, database, compiler, ecology, philosophy, linguistics, biochemistry, chemistry, biology, data warehouse, gene
To speed up data-intensive programs, two complementary techniques should be considered: nested-loop parallelization and data locality optimization. Effective parallelization distributes the computation and the data it needs across different processors, whereas data locality optimization tries to keep data on the same processor. Locality and parallelization may therefore demand different loop transformations, and an integrated approach that combines the two can produce much better results than either approach alone. This paper proposes a unified approach that integrates these two techniques to obtain an appropriate loop transformation. Applying this transformation yields coarse-grained parallelism by exploiting the largest possible groups of outer permutable loops, and data locality by satisfying dependences at the inner loops. These groups can be further tiled to improve data locality by exploiting data reuse in multiple dimensions.
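
As a rough illustration of the ideas named in the abstract (not the paper's own transformation algorithm, which is not reproduced here), the sketch below tiles a matrix-multiplication loop nest. Matrix multiplication is used only as a familiar example of a fully permutable nest; the function names `matmul_naive` and `matmul_tiled` and the `tile` parameter are assumptions made for this illustration. In the tiled version, the two outer tile loops (`ii`, `jj`) write disjoint blocks of the result, so their iterations could in principle be handed to different processors, while the inner loops work on a small block whose data is reused.

```python
def matmul_naive(a, b, n):
    """Reference i-j-k loop nest for C = A * B on n x n lists of lists."""
    c = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            for k in range(n):
                c[i][j] += a[i][k] * b[k][j]
    return c


def matmul_tiled(a, b, n, tile=2):
    """Same computation with all three loops tiled (hypothetical tile size).

    The ii/jj tile loops touch disjoint blocks of c, so they are the
    candidates for coarse-grained parallelism. The kk loop carries the
    reduction on c[i][j] and is kept sequential inside each (ii, jj) tile.
    """
    c = [[0] * n for _ in range(n)]
    for ii in range(0, n, tile):          # outer tile loop (parallelizable)
        for jj in range(0, n, tile):      # outer tile loop (parallelizable)
            for kk in range(0, n, tile):  # reduction dimension, sequential
                # Intra-tile loops: operate on a block small enough to reuse.
                for i in range(ii, min(ii + tile, n)):
                    for j in range(jj, min(jj + tile, n)):
                        for k in range(kk, min(kk + tile, n)):
                            c[i][j] += a[i][k] * b[k][j]
    return c
```

This sketch runs sequentially and only shows the loop structure; actually distributing the `ii`/`jj` tiles across processors, and choosing a tile size that fits a given cache, are left out.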
