Factors affecting the performance of parallel mining of minimal unique itemsets on diverse architectures | Zendy

Haglin D. J. | Zendy; Mayes K. R. | Zendy; Manning A. M. | Zendy; Feo J. | Zendy; Gurd J. R. | Zendy; Elliot M. | Zendy; Keane J. A. | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Factors affecting the performance of parallel mining of minimal unique itemsets on diverse architectures

Author(s) -

Haglin D. J.,

Mayes K. R.,

Manning A. M.,

Feo J.,

Gurd J. R.,

Elliot M.,

Keane J. A.

Publication year - 2009

Publication title -

concurrency and computation: practice and experience

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.309

H-Index - 67

eISSN - 1532-0634

pISSN - 1532-0626

DOI - 10.1002/cpe.1379

Subject(s) - computer science , parallel computing , workstation , message passing interface , implementation , interface (matter) , cluster (spacecraft) , code (set theory) , identification (biology) , computation , message passing , parallel algorithm , computer cluster , set (abstract data type) , distributed computing , operating system , algorithm , programming language , botany , bubble , maximum bubble pressure method , biology

Three parallel implementations of a divide‐and‐conquer search algorithm (called SUDA2) for finding minimal unique itemsets (MUIs) are compared in this paper. The identification of MUIs is used by national statistics agencies for statistical disclosure assessment. The first parallel implementation adapts SUDA2 to a symmetric multi‐processor cluster using the message passing interface (MPI), which we call an MPI cluster; the second optimizes the code for the Cray MTA2 (a shared‐memory, multi‐threaded architecture) and the third uses a heterogeneous ‘group’ of workstations connected by LAN. Each implementation considers the parallel structure of SUDA2, and how the subsearch computation times and sequence of subsearches affect load balancing. All three approaches scale with the number of processors, enabling SUDA2 to handle larger problems than before. For example, the MPI implementation is able to achieve nearly two orders of magnitude improvement with 132 processors. Performance results are given for a number of data sets. Copyright © 2009 John Wiley & Sons, Ltd.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research