Open Access
Automated detection of Hainan gibbon calls for passive acoustic monitoring
Author(s) -
Dufourq Emmanuel,
Durbach Ian,
Hansford James P.,
Hoepfner Amanda,
Ma Heidi,
Bryant Jessica V.,
Stender Christina S.,
Li Wenyong,
Liu Zhiwei,
Chen Qing,
Zhou Zhaoli,
Turvey Samuel T.
Publication year - 2021
Publication title -
remote sensing in ecology and conservation
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.191
H-Index - 21
ISSN - 2056-3485
DOI - 10.1002/rse2.201
Subject(s) - preprocessor , bioacoustics , computer science , spectrogram , convolutional neural network , classifier (uml) , artificial intelligence , artificial neural network , pattern recognition (psychology) , telecommunications
Abstract Extracting species calls from passive acoustic recordings is a common preliminary step to ecological analysis. For many species, particularly those occupying noisy, acoustically variable habitats, the call extraction process continues to be largely manual, a time‐consuming and increasingly unsustainable process. Deep neural networks have been shown to offer excellent performance across a range of acoustic classification applications, but are relatively underused in ecology. We describe the steps involved in developing an automated classifier for a passive acoustic monitoring project, using the identification of calls of the Hainan gibbon Nomascus hainanus , one of the world's rarest mammal species, as a case study. This includes preprocessing—selecting a temporal resolution, windowing and annotation; data augmentation; processing—choosing and fitting appropriate neural network models; and post‐processing—linking model predictions to replace, or more likely facilitate, manual labelling. Our best model converted acoustic recordings into spectrogram images on the mel frequency scale, using these to train a convolutional neural network. Model predictions were highly accurate, with per‐second false positive and false negative rates of 1.5% and 22.3%. Nearly all false negatives were at the fringes of calls, adjacent to segments where the call was correctly identified, so that very few calls were missed altogether. A post‐processing step identifying intervals of repeated calling reduced an 8‐h recording to, on average, 22 min for manual processing, and did not miss any calling bouts over 72 h of test recordings. Gibbon calling bouts were detected regularly in multi‐month recordings from all selected survey points within Bawangling National Nature Reserve, Hainan. We demonstrate that passive acoustic monitoring incorporating an automated classifier represents an effective tool for remote detection of one of the world's rarest and most threatened species. Our study highlights the viability of using neural networks to automate or greatly assist the manual labelling of data collected by passive acoustic monitoring projects. We emphasize that model development and implementation be informed and guided by ecological objectives, and increase accessibility of these tools with a series of notebooks that allow users to build and deploy their own acoustic classifiers.