z-logo
open-access-imgOpen Access
Statistically reinforced machine learning for nonlinear patterns and variable interactions
Author(s) -
Ryo Masahiro,
Rillig Matthias C.
Publication year - 2017
Publication title -
ecosphere
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.255
H-Index - 57
ISSN - 2150-8925
DOI - 10.1002/ecs2.1976
Subject(s) - machine learning , a priori and a posteriori , artificial intelligence , permutation (music) , computer science , variable (mathematics) , nonlinear system , inference , tree (set theory) , parametric statistics , random forest , context (archaeology) , set (abstract data type) , mathematics , statistics , mathematical analysis , paleontology , philosophy , physics , epistemology , quantum mechanics , acoustics , biology , programming language
Most statistical models assume linearity and few variable interactions, even though real‐world ecological patterns often result from nonlinear and highly interactive processes. We here introduce a set of novel empirical modeling techniques which can address this mismatch: statistically reinforced machine learning. We demonstrate the behaviors of three techniques (conditional inference tree, model‐based tree, and permutation‐based random forest) by analyzing an artificially generated example dataset that contains patterns based on nonlinearity and variable interactions. The results show the potential of statistically reinforced machine learning algorithms to detect nonlinear relationships and higher‐order interactions. Estimation reliability for any technique, however, depended on sample size. The applications of statistically reinforced machine learning approaches would be particularly beneficial for investigating (1) novel patterns for which shapes cannot be assumed a priori, (2) higher‐order interactions which are often overlooked in parametric statistics, (3) context dependency where patterns change depending on other conditions, (4) significance and effect sizes of variables while taking nonlinearity and variable interactions into account, and (5) a hypothesis using parametric statistics after identifying patterns using statistically reinforced machine learning techniques.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here