| Literature DB >> 35725886 |
Xutong Wei1, Md Zakir Hossain2,3,4, Khandaker Asif Ahmed5.
Abstract
Mosquitoes are vectors of numerous deadly diseases, and mosquito classification task is vital for their control programs. To ease manual labor and time-consuming classification tasks, numerous image-based machine-learning (ML) models have been developed to classify different mosquito species. Mosquito wing-beating sounds can serve as a unique classifier for mosquito classification tasks, which can be adopted easily in field applications. The current study aims to develop a deep neural network model to identify six mosquito species of three different genera, based on their wing-beating sounds. While existing models focused on raw audios, we developed a comprehensive pre-processing step to convert raw audios into more informative Mel-spectrograms, resulting in more robust and noise-free extracted features. Our model, namely 'Wing-beating Network' or 'WbNet', combines the state-of-art residual neural network (ResNet) model as a baseline, with self-attention mechanism and data-augmentation technique, and outperformed other existing models. The WbNet achieved the highest performance of 89.9% and 98.9% for WINGBEATS and ABUZZ data respectively. For species of Aedes and Culex genera, our model achieved 100% precision, recall and F1-scores, whereas, for Anopheles, the WbNet reached above 95%. We also compared two existing wing-beating datasets, namely WINGBEATS and ABUZZ, and found our model does not need sophisticated audio devices, hence performed better on ABUZZ audios, captured on usual mobile devices. Overall, our model has potential to serve in mosquito monitoring and prevalence studies in mosquito eradication programs, along with potential implementation in classification tasks of insect pests or other sound-based classifications.Entities:
Mesh:
Year: 2022 PMID: 35725886 PMCID: PMC9209486 DOI: 10.1038/s41598-022-14372-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Models comparison with input: (a) raw audio waveforms, (b) pre-processed spectrograms.
Figure 4The architecture of our WbNet model.
WbNet evaluation metrics for different mosquito species.
| Species | WINGBEATS dataset | ABUZZ dataset | ||||
|---|---|---|---|---|---|---|
| Precision | Recall | F1-score | Precision | Recall | F1-score | |
| 91 | 90 | 91 | 100 | 100 | 100 | |
| 91 | 99 | 95 | 100 | 100 | 100 | |
| 67 | 69 | 68 | 100 | 95 | 97 | |
| 90 | 89 | 89 | 97 | 100 | 98 | |
| 96 | 93 | 94 | 100 | 100 | 100 | |
| 87 | 92 | 89 | 100 | 100 | 100 | |
Figure 2Confusion matrix of WbNet on (a) WINGBEATS, (b) ABUZZ.
Figure 3Data augmentation on ResNet-18 and WbNet model, percentage increased shown in red.