z-logo
open-access-imgOpen Access
Pre-processing of imbalanced samples and the effective contribution in fault diagnosis in wastewater treatment plants
Author(s) -
Yuhong Xu,
Wu Deng,
Bing Song,
Xinyang Deng,
Feijun Luo
Publication year - 2017
Publication title -
journal of hydroinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.654
H-Index - 50
eISSN - 1465-1734
pISSN - 1464-7141
DOI - 10.2166/hydro.2017.206
Subject(s) - support vector machine , artificial intelligence , relevance vector machine , relevance (law) , machine learning , fault (geology) , class (philosophy) , sample (material) , artificial neural network , computer science , key (lock) , data processing , data mining , engineering , pattern recognition (psychology) , chemistry , computer security , chromatography , seismology , political science , law , geology , operating system
Fault diagnosis by machine learning techniques is of great importance in wastewater treatment plants (WWTPs). A key factor influencing the accuracy of fault diagnosis lies in the imbalance between the sample data in minority classes (i.e. faulty situations) and that in majority classes (i.e. normal situations), which may cause misjudgments of faults and lead to failure in practical use. This study proposes a novel pre-processing method with a fast relevance vector machine (Fast RVM) reducing the data of majority class samples and the synthetic minority over-sampling technique expanding the minority class samples. A case study indicates that this pre-processing method could be a promising solution for imbalanced data classification in WWTPs and the pre-processed data can be well diagnosed by back-propagation neural networks, support vector machine, RVM and Fast RVM models.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom