Distribution models using semi-structured community science data outperform unstructured-data models for a data-poor species, the Plain Tyrannulet
Author(s) -
Fabricio C. Gorleri,
Wesley M. Hochachka,
Juan Ignacio Areta
Publication year - 2021
Publication title -
ornithological applications
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.874
H-Index - 78
eISSN - 1938-5129
pISSN - 0010-5422
DOI - 10.1093/ornithapp/duab038
Subject(s) - sampling (signal processing) , arboreal locomotion , missing data , species distribution , data quality , amazon rainforest , data collection , computer science , data mining , habitat , statistics , ecology , machine learning , mathematics , biology , metric (unit) , operations management , filter (signal processing) , economics , computer vision
Modeling the distribution of a data-poor species is challenging due to a reliance on unstructured data that often lacks relevant information on sampling and produces coarse-resolution outputs of varying accuracy. Data on sampling effort associated with higher-quality, semi-structured data derived from some community science programs can be used to produce more precise models of distribution, albeit at a cost of using fewer data. Here, we used semi-structured data to model the seasonal ranges of the Plain Tyrannulet (Inezia inornata), a poorly known Austral–Neotropical migrant, and compared predictive performance to models built with the full unstructured dataset of the species. By comparing these models, we examined the relatively unexplored tradeoff between data quality and data quantity for modeling of a data-sparse species. We found that models using semi-structured data outperformed unstructured-data models in the predictive accuracy metrics (mean squared error, area under the curve, kappa, sensitivity, and specificity), despite using only 30% of the available detection records. Moreover, semi-structured models were more biologically accurate, indicating that the tyrannulet favors arboreal habitats in dry and hot lowlands during the breeding season (Chaco region) and is associated with proximity to rivers in tropical and wet areas during the nonbreeding season (Pantanal, Beni, and southwest Amazonia). We demonstrate that more detailed insights into distributional patterns can be gained from even small quantities of data when the data are analyzed appropriately. The use of semi-structured data promises to be of wide applicability even for data-poor bird species, helping refine information on distribution and habitat use, needed for effective assessments of conservation status.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom