| Literature DB >> 30640961 |
Daniel Motta1, Alex Álisson Bandeira Santos1, Ingrid Winkler1, Bruna Aparecida Souza Machado1,2, Daniel André Dias Imperial Pereira1, Alexandre Morais Cavalcanti1, Eduardo Oyama Lins Fonseca2, Frank Kirchner3, Roberto Badaró1,2.
Abstract
Dengue, chikungunya and Zika are arboviruses transmitted by mosquitos of the genus Aedes and have caused several outbreaks in world over the past ten years. Morphological identification of mosquitos is currently restricted due to the small number of adequately trained professionals. We implemented a computational model based on a convolutional neural network (CNN) to extract features from mosquito images to identify adult mosquitoes from the species Aedes aegypti, Aedes albopictus and Culex quinquefasciatus. To train the CNN to perform automatic morphological classification of mosquitoes, we used a dataset that included 4,056 mosquito images. Three neural networks, including LeNet, AlexNet and GoogleNet, were used. During the validation phase, the accuracy of the mosquito classification was 57.5% using LeNet, 74.7% using AlexNet and 83.9% using GoogleNet. During the testing phase, the best result (76.2%) was obtained using GoogleNet; results of 52.4% and 51.2% were obtained using LeNet and AlexNet, respectively. Significantly, accuracies of 100% and 90% were achieved for the classification of Aedes and Culex, respectively. A classification accuracy of 82% was achieved for Aedes females. Our results provide information that is fundamental for the automatic morphological classification of adult mosquito species in field. The use of CNN's is an important method for autonomous identification and is a valuable and accessible resource for health workers and taxonomists for the identification of some insects that can transmit infectious agents to humans.Entities:
Mesh:
Year: 2019 PMID: 30640961 PMCID: PMC6331110 DOI: 10.1371/journal.pone.0210829
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Representation of the training (phase 1), validating (phase 2) and testing (phase 3) processes of the mosquito classification model.
Fig 2Images used for the validation phase during the development of the model used in this study.
(A and C) Ae. aegypti females photographed using a digital camera; (B and D) Ae. aegypti males photographed using a digital camera; (E) an Ae. aegypti female photographed using a mobile phone; (F) an Ae. aegypti male photographed using a mobile phone.
Parameters used as inputs for the full factorial experiments in this study.
| Parameters | Levels | Level | |||
|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | ||
| Number of epochs | 1 | 200 | |||
| Seed | 1 | 10 | |||
| % application images used at training | 4 | 30 | 40 | 50 | 60 |
| Solver algorithm | 4 | SGD | NAG | Adam | AdaGrad |
| Learning rate | 3 | 0.001 | 0.01 | 0.1 | |
| LR decay function | 4 | Step Down | Exponential | Sigmoid | Polynomial |
Dataset structure and sample sizes of the mosquito images used in this study.
| Class name | Total number of images | Non-application images (used in training) | Application images | Application images used for training and validation | Application images used for testing |
|---|---|---|---|---|---|
| 947 | 723 | 224 | 210 | 14 | |
| 282 | 201 | 81 | 67 | 14 | |
| 1050 | 821 | 229 | 215 | 14 | |
| 436 | 327 | 109 | 95 | 14 | |
| 965 | 895 | 70 | 56 | 14 | |
| 376 | 266 | 110 | 96 | 14 |
Fig 3Effects of the first level parameters on the accuracy of the testing phase (average accuracy percentage).
Fig 4Effects of the second level parameters on the accuracy of the testing phase (average accuracy percentage).
Fig 5Histograms representing the outcome data: (A) percent accuracy–validation, (B) percent accuracy–testing, (C) loss function.
Fig 6Representation of the training process of each CNN: (A) LeNet, (B) AlexNet and (C) GoogLeNet (orange line: accuracy validation; green line: loss validation; blue line: loss training). A–LeNet (after 200 epochs): percent accuracy (validation) 57.50%, loss (validation) 2.07, loss (training) 0.03; B–AlexNet (after 200 epochs): percent accuracy (validation) 74.69%, loss (validation) 0.83, loss (training) 0.11; C–GoogLeNet (after 200 epochs): percent accuracy (validation) 83.88%, loss (validation) 1.03, loss (training) 0.00.
Confusion matrix showing the results of the testing phase for the LeNet neural network.
| Class | Accuracy (%) | ||||||
|---|---|---|---|---|---|---|---|
| 7 | 1 | 2 | 4 | 0 | 0 | 50.00 | |
| 5 | 6 | 3 | 0 | 0 | 0 | 42.86 | |
| 1 | 2 | 10 | 0 | 1 | 0 | 71.43 | |
| 2 | 0 | 0 | 7 | 3 | 2 | 50.00 | |
| 1 | 0 | 1 | 1 | 9 | 2 | 64.29 | |
| 2 | 3 | 2 | 0 | 2 | 5 | 35.71 |
Confusion matrix showing the results of the testing phase for the AlexNet neural network.
| Class | Accuracy (%) | ||||||
|---|---|---|---|---|---|---|---|
| 5 | 1 | 6 | 0 | 1 | 1 | 35.71 | |
| 2 | 7 | 2 | 1 | 0 | 2 | 50.00 | |
| 1 | 1 | 11 | 1 | 0 | 0 | 78.57 | |
| 0 | 2 | 3 | 9 | 0 | 0 | 64.29 | |
| 2 | 4 | 0 | 0 | 2 | 6 | 14.29 | |
| 2 | 1 | 0 | 0 | 2 | 9 | 64.29 |
Confusion matrix showing the results of the testing phase for the GoogLeNet neural network.
| Class | Accuracy (%) | ||||||
|---|---|---|---|---|---|---|---|
| 9 | 3 | 2 | 0 | 0 | 0 | 64.29 | |
| 1 | 12 | 1 | 0 | 0 | 0 | 85.71 | |
| 0 | 1 | 12 | 1 | 0 | 0 | 85.71 | |
| 0 | 1 | 0 | 13 | 0 | 0 | 92.86 | |
| 1 | 1 | 0 | 0 | 8 | 4 | 57.14 | |
| 0 | 0 | 0 | 1 | 3 | 10 | 71.43 |