z-logo
open-access-imgOpen Access
Identifying indicator species in ecological habitats using Deep Optimal Feature Learning
Author(s) -
Yi-Cheng Tsai,
Susan A. Baldwin,
R. Bhushan Gopaluni
Publication year - 2021
Publication title -
plos one
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.99
H-Index - 332
ISSN - 1932-6203
DOI - 10.1371/journal.pone.0256782
Subject(s) - machine learning , artificial intelligence , computer science , identification (biology) , support vector machine , feature (linguistics) , habitat , extractor , domain (mathematical analysis) , deep learning , abundance (ecology) , ecology , artificial neural network , feature extraction , biology , engineering , mathematics , process engineering , mathematical analysis , linguistics , philosophy
Much of the current research on supervised modelling is focused on maximizing outcome prediction accuracy. However, in engineering disciplines, an arguably more important goal is that of feature extraction, the identification of relevant features associated with the various outcomes. For instance, in microbial communities, the identification of keystone species can often lead to improved prediction of future behavioral shifts. This paper proposes a novel feature extractor based on Deep Learning, which is largely agnostic to underlying assumptions regarding the training data. Starting from a collection of microbial species abundance counts, the Deep Learning model first trains itself to classify the selected distinct habitats. It then identifies indicator species associated with the habitats. The results are then compared and contrasted with those obtained by traditional statistical techniques. The indicator species are similar when compared at top taxonomic levels such as Domain and Phylum , despite visible differences in lower levels such as Class and Order . More importantly, when our estimated indicators are used to predict final habitat labels using simpler models (such as Support Vector Machines and traditional Artificial Neural Networks), the prediction accuracy is improved. Overall, this study serves as a preliminary step that bridges modern, black-box Machine Learning models with traditional, domain expertise-rich techniques.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here