z-logo
Premium
An improved approach for mining association rules in parallel using Spark Streaming
Author(s) -
Liu Longtao,
Wen Jiabao,
Zheng Zexun,
Su Hansong
Publication year - 2021
Publication title -
international journal of circuit theory and applications
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.364
H-Index - 52
eISSN - 1097-007X
pISSN - 0098-9886
DOI - 10.1002/cta.2935
Subject(s) - association rule learning , spark (programming language) , computer science , data mining , focus (optics) , streaming data , apriori algorithm , argo , oceanography , physics , optics , programming language , geology
Summary Parallel computing is an effective method to solve computationally large and data‐intensive problems. The traditional data mining algorithm cannot mining association rules for large amounts of streaming data in a timely and effectively. In order to improve the speed and accuracy of association rules mining, distributed and parallel algorithms have become a research focus. This paper proposes a parallel FP‐growth approach using Spark Streaming, called SSPFP, which can parallel mining frequent itemsets and association rules in real‐time streaming data. In this paper, the proposed SSPFP algorithm is applied to mining the association rules between temperature and salinity in marine Argo data. The experimental results indicate that SSPFP algorithm is efficient for association rules mining.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here