z-logo
open-access-imgOpen Access
Condition monitoring of wind turbines using machine learning based anomaly detection and statistical techniques for the extraction of 'healthy data'
Author(s) -
Xavier Chesterman,
Timothy Verstraeten,
Pieter-Jan Daems,
Jan Helsen
Publication year - 2021
Publication title -
proceedings of the annual conference of the prognostics and health management society
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.18
H-Index - 11
ISSN - 2325-0178
DOI - 10.36001/phmconf.2021.v13i1.2980
Subject(s) - anomaly detection , leverage (statistics) , computer science , turbine , wind power , data mining , support vector machine , anomaly (physics) , naive bayes classifier , artificial intelligence , machine learning , engineering , mechanical engineering , physics , electrical engineering , condensed matter physics
Premature failures caused by excessive wear are responsible for a large fraction of the maintenance costs of wind turbines. Therefore, it is crucial to be able to identify the propagation of these failures as early as possible. To this end, a novel condition monitoring method is proposed that uses statistical data analysis techniques and machine learning to construct a multivariate anomaly detection framework, based on high-frequency temperature SCADA data from wind turbines. This framework contains several steps. First, there is a preprocessing step in which relevant wind turbine states are extracted from the data. These states are the operating conditions and whether or not the turbine exhibits transient behavior. The second step entails anomaly detection on the temperature time series data. Fleet information is used to filter out exogenous (environmental) factors. Furthermore, multiple models are combined to get more stable and robust anomaly detections. By combining them the weaknesses of the individual models are alleviated resulting in a better overall performance. A limitation of machine learning-based anomaly detection on temperature data is the requirement that at least one year of “healthy” (meaning without anomalies) training data is available to account for seasonal effects. The lack of verified “healthy” data spread out evenly over the seasons generally means that the anomaly detection accuracy is severely compromised for the unrepresented seasons. This research uses smart retraining to reduce this limitation. Statistical techniques that leverage the information of the fleet are used to extract “healthy” data from at least one year, but preferably multiple years, of unverified data. This can then be used as training data for the machine learning-based models. To validate the pipeline, temperature and failure data of a real operational wind farm is used. Although the methodology is general in its scope, the validation case focusses specifically on generator bearing failures.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here