z-logo
open-access-imgOpen Access
Acoustic detection of unknown bird species and individuals
Author(s) -
Ntalampiras Stavros,
Potamitis Ilyas
Publication year - 2021
Publication title -
caai transactions on intelligence technology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.613
H-Index - 15
ISSN - 2468-2322
DOI - 10.1049/cit2.12007
Subject(s) - spectrogram , autoencoder , abstraction , encoding (memory) , computer science , thresholding , encoder , range (aeronautics) , human echolocation , artificial intelligence , decoding methods , convolutional neural network , representation (politics) , pattern recognition (psychology) , machine learning , deep learning , biology , engineering , algorithm , philosophy , epistemology , neuroscience , politics , law , political science , image (mathematics) , aerospace engineering , operating system
Computational bioacoustics is a relatively young research area, yet it has increasingly received attention over the last decade because it can be used in a wide range of applications in a cost‐effective manner. This work focuses on the problem of detecting the novel bird calls and songs associated with various species and individual birds. To this end, variational autoencoders, consisting of deep encoding–decoding networks, are employed. The encoder encompasses a series of convolutional layers leading to a smooth high‐level abstraction of log‐Mel spectrograms that characterise bird vocalisations. The decoder operates on this latent representation to generate each respective original observation. Novel species/individual detection is carried out by monitoring and thresholding the expected reconstruction probability. We thoroughly evaluate the proposed method on two different data sets, including the vocalisations of 11 North American bird species and 16 Athene noctua individuals.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here