| Literature DB >> 35668486 |
María I González-Pérez1, Bastian Faulhaber2, Núria Busquets1, Sandra Talavera3, Mark Williams2, Josep Brosa1, Carles Aranda1,4, Nuria Pujol1, Marta Verdún1, Pancraç Villalonga2, Joao Encarnação2.
Abstract
BACKGROUND: Every year, more than 700,000 people die from vector-borne diseases, mainly transmitted by mosquitoes. Vector surveillance plays a major role in the control of these diseases and requires accurate and rapid taxonomical identification. New approaches to mosquito surveillance include the use of acoustic and optical sensors in combination with machine learning techniques to provide an automatic classification of mosquitoes based on their flight characteristics, including wingbeat frequency. The development and application of these methods could enable the remote monitoring of mosquito populations in the field, which could lead to significant improvements in vector surveillance.Entities:
Keywords: Aedes; Automatic classification; Culex; Deep learning; Genus and sex classification; Machine learning; Mosquito surveillance; Mosquito trap; Optical sensor
Mesh:
Year: 2022 PMID: 35668486 PMCID: PMC9169302 DOI: 10.1186/s13071-022-05324-5
Source DB: PubMed Journal: Parasit Vectors ISSN: 1756-3305 Impact factor: 4.047
Fig. 1a Prototype sensor (top) fitted to a BG-Mosquitaire trap (bottom). b Side view diagram of sensor and trap to illustrate operation. The exterior of the sensor unit (1) is formed by an inlet tube with a diameter of approximately 100 mm (2), sensor housing (3) and outlet tube (4). The housing contains an optical emitter (5), which projects collimated beams of light through the transparent flight tube (6) and onto an optical receiver (7) to create a sensing zone (8) within the flight tube. The trap (9) contains a suction fan (10), a removable catch bag (11) made of textile mesh and a perforated lid (12). The fan produces a flow of air downward through the inlet tube, flight tube and catch bag and upward through the perforated lid as indicated by the blue arrows. An insect (13) which flies close to the entrance of the inlet tube may then be sucked downwards through the sensing zone where it will be recorded and then trapped in the catch bag. As the mosquito passes through the sensing zone it casts a shadow upon the optical receiver according to the so-called optical extinction mode of operation. As the insect flaps its wings within the sensing zone, the light falling on the optical receiver is modulated, giving rise to changes in the amplitude in the recorded waveform
Fig. 2a Example of a recorded mosquito flight with ADC sample number (0 to 1023) on the x-axis and amplitude on the y-axis, scaled to a range of [− 1, 1], which equates to the full-scale range of the ADC. A high pass filter in the optical receiver attenuates frequencies < 300 Hz to remove electronic offsets and low-frequency noise, which also attenuates the signal due to the body of the insect. Baseline correction has been applied by subtracting the average value of the recording from each data point in the recording. b Power spectral density (PSD) plot of a typical mosquito flight. The wingbeat fundamental peak is labelled as f1. The fundamental frequency is indicated by the vertical arrow and the fundamental peak power by the horizontal arrow. The various peaks to the right of f1 are harmonics of f1, i.e. at frequencies of 2*f1, 3*f1, etc. The power density has units of (units2/Hz) on a logarithmic (dB) scale. A level of 0 dB/Hz corresponds to a white noise signal time domain signal with a power density of 1.0 unit2/Hz. The fundamental peak power density levels in this study are typically < − 40 dB/Hz, i.e. < 1 × 10–4 units2/Hz. The noise floor of the system, i.e. with sensor active but with no insect in the sensing zone, is < − 85 dB/Hz from 0 to 300 Hz and < − 90 dB/Hz from 300 Hz
Fig. 3a Scatterplot of wingbeat fundamental frequency and peak power for the full dataset showing Aedes genus in red and Culex in blue. b Scatter plot of wingbeat fundamental frequency and peak power for Aedes genus showing females in red and males in blue. c Scatter plot of wingbeat fundamental frequency and peak power for Culex genus showing females in red and males in blue
Accuracy results for genus classification with best results per feature indicated by a superscript letter
| Feature | Algorithm | ||||
|---|---|---|---|---|---|
| LR (%) | GB (%) | RF (%) | SVM (%) | DNN (%) | |
| Fundamental frequency | 55.2 | 67.3a | 65.9 | 65.5 | 66.1 |
| Fundamental peak power | 68.9 | 70.1a | 69.6 | 69.8 | 70.0 |
| Fundamental frequency and peak power | 70.1 | 77.7 | 77.2 | 77.2 | 77.8a |
| PSD | 84.8 | 92.3a | 89.0 | 90.5 | 90.3 |
| Spectrogram | 90.5 | 93.2 | 91.2 | 93.4 | 94.2a |
| MFCC | 89.3 | 93.2a | 90.2 | 93.0 | 93.2a |
Accuracy results for sex classification of Aedes with best results per feature indicated by a superscript letter
| Feature | Algorithm | ||||
|---|---|---|---|---|---|
| LR (%) | GB (%) | RF (%) | SVM (%) | DNN (%) | |
| Fundamental frequency | 95.5 | 95.5 | 95.5 | 95.5 | 95.5 |
| Fundamental peak power | 86.9 | 89.5a | 89.5 | 89.2 | 89.3 |
| Fundamental frequency and peak power | 98.2 | 96.7 | 97.0 | 98.5 | 97.9a |
| PSD | 97.0 | 98.8a | 97.9 | 98.8a | 98.2 |
| Spectrogram | 99.4a | 98.8 | 98.8 | 99.1 | 98.8 |
| MFCC | 99.4a | 99.4a | 98.8 | 98.8 | 98.8 |
Accuracy results for sex classification of Culex with best results per feature indicated by a superscript letter
| Feature | Algorithm | ||||
|---|---|---|---|---|---|
| LR (%) | GB (%) | RF (%) | SVM (%) | DNN (%) | |
| Fundamental frequency | 98.0 | 98.0 | 98.0 | 98.0 | 98.0 |
| Fundamental peak power | 83.4 | 81.3 | 81.5 | 83.1 | 83.6a |
| Fundamental frequency and peak power | 98.7a | 98.7a | 98.5 | 98.7a | 98.7a |
| PSD | 99.7 | 99.2 | 99.2 | 100a | 99.7 |
| Spectrogram | 100a | 99.7 | 99.7 | 100a | 100a |
| MFCC | 100a | 100a | 100a | 100a | 100a |
Summary of machine learning classification results
| Classification task | Using the test set | Using the training dataset | Error analysis indication | ||||
|---|---|---|---|---|---|---|---|
| Best test accuracy (%) | Best feature | Best algorithm | No. of samples | Training accuracy (%) | Validation accuracy (%) | ||
| Genus | 94.2 | Spectrogram | DNN | 2016 | 100 | 95 | Slight overfitting: more training samples |
| Sex | 99.4 | Spectrogram | LR | 1008 | 99.5 | 99.5 | No overfitting |
| MFCC | LR, GB | ||||||
| Sex | 100 | PSD | SVM | 1170 | 100 | 100 | No error |
| Spectrogram | LR, SVM, DNN | ||||||
| MFCC | All algorithms | ||||||