Literature DB >> 31370640

Deep metric learning for bioacoustic classification: Overcoming training data scarcity using dynamic triplet loss.

Anshul Thakur1, Daksh Thapar1, Padmanabhan Rajan1, Aditya Nigam1.   

Abstract

Bioacoustic classification often suffers from the lack of labeled data. This hinders the effective utilization of state-of-the-art deep learning models in bioacoustics. To overcome this problem, the authors propose a deep metric learning-based framework that provides effective classification, even when only a small number of per-class training examples are available. The proposed framework utilizes a multiscale convolutional neural network and the proposed dynamic variant of the triplet loss to learn a transformation space where intra-class separation is minimized and inter-class separation is maximized by a dynamically increasing margin. The process of learning this transformation is known as deep metric learning. The triplet loss analyzes three examples (referred to as a triplet) at a time to perform deep metric learning. The number of possible triplets increases cubically with the dataset size, making triplet loss more suitable than the cross-entropy loss in data-scarce conditions. Experiments on three different publicly available datasets show that the proposed framework performs better than existing bioacoustic classification methods. Experimental results also demonstrate the superiority of dynamic triplet loss over cross-entropy loss in data-scarce conditions. Furthermore, unlike existing bioacoustic classification methods, the proposed framework has been extended to provide open-set classification.

Entities:  

Year:  2019        PMID: 31370640     DOI: 10.1121/1.5118245

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  3 in total

1.  Computational bioacoustics with deep learning: a review and roadmap.

Authors:  Dan Stowell
Journal:  PeerJ       Date:  2022-03-21       Impact factor: 2.984

2.  Convolutional Neural Networks for the Identification of African Lions from Individual Vocalizations.

Authors:  Martino Trapanotto; Loris Nanni; Sheryl Brahnam; Xiang Guo
Journal:  J Imaging       Date:  2022-04-01

3.  Exploiting deep neural network and long short-term memory method-ologies in bioacoustic classification of LPC-based features.

Authors:  Cihun-Siyong Alex Gong; Chih-Hui Simon Su; Kuo-Wei Chao; Yi-Chu Chao; Chin-Kai Su; Wei-Hang Chiu
Journal:  PLoS One       Date:  2021-12-23       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.