Premium
Clustering of nonstationary data streams: A survey of fuzzy partitional methods
Author(s) -
Abdullatif Amr,
Masulli Francesco,
Rovetta Stefano
Publication year - 2018
Publication title -
wiley interdisciplinary reviews: data mining and knowledge discovery
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.506
H-Index - 47
eISSN - 1942-4795
pISSN - 1942-4787
DOI - 10.1002/widm.1258
Subject(s) - data stream mining , cluster analysis , computer science , concept drift , data mining , outlier , data stream clustering , data stream , fuzzy clustering , consensus clustering , fuzzy logic , workflow , artificial intelligence , machine learning , cure data clustering algorithm , database , telecommunications
Data streams have arisen as a relevant research topic during the past decade. They are real‐time, incremental in nature, temporally ordered, massive, contain outliers, and the objects in a data stream may evolve over time (concept drift). Clustering is often one of the earliest and most important steps in the streaming data analysis workflow. A comprehensive literature is available about stream data clustering; however, less attention is devoted to the fuzzy clustering approach, even though the nonstationary nature of many data streams makes it especially appealing. This survey discusses relevant data stream clustering algorithms focusing mainly on fuzzy methods, including their treatment of outliers and concept drift and shift. This article is categorized under Technologies > Machine Learning Technologies > Computational Intelligence Fundamental Concepts of Data and Knowledge > Data Concepts