Outlier Detection in Sensor Data using Ensemble Learning
Author(s) -
Nadeem Iftikhar,
Thorkil Baattrup-Andersen,
Finn Ebertsen Nordbjerg,
Karsten Jeppesen
Publication year - 2020
Publication title -
procedia computer science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.334
H-Index - 76
ISSN - 1877-0509
DOI - 10.1016/j.procs.2020.09.112
Subject(s) - computer science , outlier , anomaly detection , sliding window protocol , cluster analysis , data mining , artificial intelligence , pattern recognition (psychology) , construct (python library) , feature (linguistics) , series (stratigraphy) , class (philosophy) , ensemble learning , time series , sequence (biology) , machine learning , window (computing) , paleontology , linguistics , philosophy , genetics , biology , programming language , operating system
Analyzing sensor data from a production environment is quite challenging because of the high-dimensional nature of the data. In addition, the generated data is in the form of time-series, where the sequence of registrations may be of utmost significance. One of the main goals of the paper is to determine if the given time-series of feature combinations is normal or rare. This goal could successfully be achieved by combining multiple machine learning models. In this paper, a sliding window based ensemble method is proposed to detect outliers in a streaming fashion. The proposed method uses a combination of clustering algorithms to construct subgroups (clusters) representing different data structures. These structures are later used in a one-class classification algorithm to identfy the outliers. Thus, if a pattern does not belong to any of the common structures or clusters, it is an outlier. Further, based on the rare pattern classification, machine failures could be predicted in advance.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom