z-logo
open-access-imgOpen Access
Multivariate data mining for estimating the rate of discolouration material accumulation in drinking water distribution systems
Author(s) -
S. R. Mounce,
Mirjam Blokker,
Stewart Husband,
W. R. Furnass,
Peter Schaap,
Joby Boxall
Publication year - 2015
Publication title -
journal of hydroinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.654
H-Index - 50
eISSN - 1465-1734
pISSN - 1464-7141
DOI - 10.2166/hydro.2015.140
Subject(s) - multivariate statistics , usable , data mining , self organizing map , computer science , compositional data , regression analysis , machine learning , artificial neural network , world wide web
Particulate material accumulates over time as cohesive layers on internal pipeline surfaces in water distribution systems (WDS). When mobilised, this material can cause discolouration. This paper explores factors expected to be involved in this accumulation process. Two complementary machine learning methodologies are applied to significant amounts of real world field data from both a qualitative and a quantitative perspective. First, Kohonen self-organising maps were used for integrative and interpretative multivariate data mining of potential factors affecting accumulation. Second, evolutionary polynomial regression (EPR), a hybrid data-driven technique, was applied that combines genetic algorithms with numerical regression for developing easily interpretable mathematical model expressions. EPR was used to explore producing novel simple expressions to highlight important accumulation factors. Three case studies are presented: UK national and two Dutch local studies. The results highlight bulk water iron concentration, pipe material and looped network areas as key descriptive parameters for the UK study. At the local level, a significantly increased third data set allowed K-fold cross validation. The mean cross validation coefficient of determination was 0.945 for training data and 0.930 for testing data for an equation utilising amount of material mobilised and soil temperature for estimating daily regeneration rate. The approach shows promise for developing transferable expressions usable for pro-active WDS management.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom