Premium
Reconstructing input for artificial neural networks based on embedding theory and mutual information to simulate soil pore water salinity in tidal floodplain
Author(s) -
Zheng Fawen,
Wan Yongshan,
Song Keunyea,
Sun Detong,
Hedgepeth Marion
Publication year - 2016
Publication title -
water resources research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.863
H-Index - 217
eISSN - 1944-7973
pISSN - 0043-1397
DOI - 10.1002/2014wr016875
Subject(s) - mutual information , floodplain , artificial neural network , computer science , mean squared error , hydrology (agriculture) , soil science , environmental science , algorithm , geology , mathematics , artificial intelligence , geotechnical engineering , statistics , geography , cartography
Soil pore water salinity plays an important role in the distribution of vegetation and biogeochemical processes in coastal floodplain ecosystems. In this study, artificial neural networks (ANNs) were applied to simulate the pore water salinity of a tidal floodplain in Florida. We present an approach based on embedding theory with mutual information to reconstruct ANN model input time series from one system state variable. Mutual information between system output and input was computed and the local minimum mutual information points were used to determine a time lag vector for time series embedding and reconstruction, with which the mutual information weighted average method was developed to compute the components of reconstructed time series. The optimal embedding dimension was obtained by optimizing model performance. The method was applied to simulate soil pore water salinity dynamics at 12 probe locations in the tidal floodplain influenced by saltwater intrusion using 4 years (2005–2008) data, in which adjacent river water salinity was used to reconstruct model input. The simulated electrical conductivity of the pore water showed close agreement with field observations ( RMSE ≤ 0.031 S / m andR 2 ≥ 0.897 ), suggesting the reconstructed input by the proposed approach provided adequate input information for ANN modeling. Multiple linear regression model, partial mutual information algorithm for input variable selection, k ‐NN algorithm, and simple time delay embedding were also used to further verify the merit of the proposed approach.