Premium
A survey on graphic processing unit computing for large‐scale data mining
Author(s) -
Cano Alberto
Publication year - 2017
Publication title -
wiley interdisciplinary reviews: data mining and knowledge discovery
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.506
H-Index - 47
eISSN - 1942-4795
pISSN - 1942-4787
DOI - 10.1002/widm.1232
Subject(s) - computer science , scalability , big data , data science , context (archaeology) , data intensive computing , volume (thermodynamics) , massively parallel , computation , data processing , scale (ratio) , data mining , parallelizable manifold , distributed computing , database , parallel computing , algorithm , biology , grid computing , mathematics , paleontology , physics , geometry , quantum mechanics , grid
General purpose computation using Graphic Processing Units (GPUs) is a well‐established research area focusing on high‐performance computing solutions for massively parallelizable and time‐consuming problems. Classical methodologies in machine learning and data mining cannot handle processing of massive and high‐speed volumes of information in the context of the big data era. GPUs have successfully improved the scalability of data mining algorithms to address significantly larger dataset sizes in many application areas. The popularization of distributed computing frameworks for big data mining opens up new opportunities for transformative solutions combining GPUs and distributed frameworks. This survey analyzes current trends in the use of GPU computing for large‐scale data mining, discusses GPU architecture advantages for handling volume and velocity of data, identifies limitation factors hampering the scalability of the problems, and discusses open issues and future directions. WIREs Data Mining Knowl Discov 2018, 8:e1232. doi: 10.1002/widm.1232 This article is categorized under: Technologies > Computer Architectures for Data Mining Technologies > Machine Learning Technologies > Computational Intelligence