
Bioacoustic classification of avian calls from raw sound waveforms with an open-source deep learning architecture.

Francisco J Bravo Sanchez, Md Rahat Hossain, Nathan B English, Steven T Moore.

Abstract

The use of autonomous recordings of animal sounds to detect species is a popular conservation tool, constantly improving in fidelity as audio hardware and software evolve. Current classification algorithms utilise sound features extracted from the recording rather than the sound itself, with varying degrees of success. Neural networks that learn directly from raw sound waveforms have been implemented in human speech recognition, but their need for detailed labelled data has limited their use in bioacoustics. Here we test SincNet, an efficient neural network architecture that learns from the raw waveform using sinc-based filters. Results using an off-the-shelf implementation of SincNet on a publicly available bird sound dataset (NIPS4Bplus) show that the neural network converged rapidly, reaching accuracies of over 65% with limited data. Its performance is comparable with that of traditional methods after hyperparameter tuning, but it is more efficient. Learning directly from the raw waveform allows the algorithm to select automatically those elements of the sound that are best suited to the task, bypassing the onerous step of choosing feature extraction techniques and reducing possible biases. We use publicly released code and datasets to encourage others to replicate our results and to apply SincNet to their own datasets, and we review possible enhancements in the hope that algorithms that learn from the raw waveform will become useful bioacoustic tools.
© 2021. The Author(s).

Year:  2021        PMID: 34344970     DOI: 10.1038/s41598-021-95076-6

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


  3 in total

1.  A ResNet attention model for classifying mosquitoes from wing-beating sounds.

Authors:  Xutong Wei; Md Zakir Hossain; Khandaker Asif Ahmed
Journal:  Sci Rep       Date:  2022-06-20       Impact factor: 4.996

2.  Fast environmental sound classification based on resource adaptive convolutional neural network.

Authors:  Zheng Fang; Bo Yin; Zehua Du; Xianqing Huang
Journal:  Sci Rep       Date:  2022-04-22       Impact factor: 4.996

3.  Computational bioacoustics with deep learning: a review and roadmap.

Authors:  Dan Stowell
Journal:  PeerJ       Date:  2022-03-21       Impact factor: 2.984
