| Literature DB >> 31412028 |
Masabho P Milali1,2, Maggy T Sikulu-Lord3, Samson S Kiware1,2, Floyd E Dowell4, George F Corliss5, Richard J Povinelli5.
Abstract
BACKGROUND: Near infrared spectroscopy (NIRS) is currently complementing techniques to age-grade mosquitoes. NIRS classifies lab-reared and semi-field raised mosquitoes into < or ≥ 7 days old with an average accuracy of 80%, achieved by training a regression model using partial least squares (PLS) and interpreted as a binary classifier. METHODS ANDEntities:
Mesh:
Year: 2019 PMID: 31412028 PMCID: PMC6693756 DOI: 10.1371/journal.pone.0209451
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Box plots when PLS (A and C) and ANN (B and D) were applied to estimate the actual age of out of the sample An. gambiae (A and B) and An. arabiensis (C and D), respectively.
Fig 2Number of An. gambiae s.s (A and B) and An. arabiensis (C and D) in two age classes (less than or greater/equal seven days) when PLS (A and C) and ANN (B and D) regression models, respectively, interpreted as binary classifiers.
Performance analysis of PLS and ANN regression models on estimating the age of An. gambiae and An. arabiensis.
Results from ten-fold Monte Carlo cross-validation.
| Species | Model estimation | Metric | Model architecture | P-value | P-value | |
|---|---|---|---|---|---|---|
| PLS | ANN | |||||
| Actual age | RMSE | 3.7 ± 0.2 | 1.6 ± 0.2 | < 0.001 | < 0.001 | |
| Age class | Accuracy (%) | 83.9 ± 2.3 | 93.7± 1.0 | < 0.001 | < 0.001 | |
| Sensitivity (%) | 89.0 ± 2.1 | 92.5 ± 1.6 | 0.005 | 0.047 | ||
| Specificity (%) | 75.8 ± 5.2 | 95.6 ± 1.8 | < 0.001 | < 0.001 | ||
| Actual age | RMSE | 4.5 ± 0.1 | 2.8 ± 0.2 | < 0.001 | < 0.001 | |
| Age class | Accuracy (%) | 80.3 ± 2.1 | 90.2 ± 1.7 | < 0.001 | < 0.001 | |
| Sensitivity (%) | 90.5 ± 1.9 | 91.7 ± 3.3 | 0.58 | 0.60 | ||
| Specificity (%) | 60.3 ± 4.2 | 88.4 ± 3.9 | < 0.001 | < 0.001 | ||
Mean actual age estimation of mosquitoes in out of the sample test sets by ANN and PLS regression models.
Column “N” represents the number of mosquitoes in each age group.
| Actual age | Model Prediction | |||||
|---|---|---|---|---|---|---|
| PLS | N | ANN | PLS | N | ANN | |
| 1 | 1.9 ± 3.2 | 43 | 1.3 ± 2.5 | 2.4 ± 2.8 | 29 | 1.0 ± 1.4 |
| 3 | 5.8 ± 3.9 | 40 | 3.7 ± 3.5 | 5.0 ± 2.2 | 45 | 2.4 ± 1.3 |
| 5 | 9.3 ± 3.3 | 39 | 6.1 ± 2.1 | 6.5 ± 2.1 | 35 | 5.0 ± 0.9 |
| 7 | 8.7 ± 2.9 | 47 | 8.1 ± 2.4 | 10.5 ± 3.3 | 41 | 6.9 ± 1.7 |
| 9 | 9.9 ± 3.7 | 35 | 10.2 ± 1.7 | 9.2 ± 2.5 | 35 | 8.5 ± 1.2 |
| 11 | 12.2 ± 3.4 | 45 | 11.5 ± 1.8 | 8.7 ± 3.9 | 29 | 10.8 ± 1.3 |
| 15 | 13.6 ± 4.3 | 37 | 14.9 ± 1.9 | 13.6 ± 3.3 | 36 | 14.3 ± 2.2 |
| 20 | 17.3 ± 3.4 | 38 | 18.2 ± 2.4 | 15.8 ± 3.6 | 28 | 18.6 ± 2.3 |
| 25 | 19.9 ± 6.7 | 38 | 23.2 ± 6.4 | |||
Fig 3Box plot of directly trained PLS (A and C) and ANN (B and D) binary classifiers for estimating age classes of An.gambiae (A and B) and An. arabiensis (C and D) in out of sample testing sets.
Fig 4The number of correct and false predictions in each estimated age-class when directly trained PLS (A and C) and ANN (B and D) binary classifiers were applied to classify age of An. gambiae (A and B) and An. arabiensis (C and D) in testing sets. Results from ten replicates.
Comparison of the accuracy of ANN and PLS classification models on ten replicates.
| Species | Metric | Model architecture | P-value | P-value | |
|---|---|---|---|---|---|
| PLS | ANN | ||||
| Accuracy (%) | 93.6 ± 1.2 | 99.4 ± 1.0 | < 0.001 | < 0.001 | |
| Sensitivity (%) | 94.4 ± 1.6 | 99.3 ± 1.4 | < 0.001 | < 0.001 | |
| Specificity (%) | 92.4 ± 1.9 | 99.5 ± 0.7 | < 0.001 | < 0.001 | |
| Accuracy (%) | 88.7 ± 1.1 | 99.0 ± 0.6 | < 0.001 | < 0.001 | |
| Sensitivity (%) | 95.4 ± 1.4 | 99.5 ± 0.5 | < 0.001 | < 0.001 | |
| Specificity (%) | 75.2 ± 3.4 | 98.3 ± 1.3 | < 0.001 | < 0.001 | |
Fig 5Error distribution per actual age of An. gambiae and An. arabiensis when ANN and PLS regressors applied to estimate the actual ages of mosquitoes in training and test data sets, showing a uniform distribution of errors (un-biased estimating) across actual ages of mosquitoes for the ANN regressor and an un-uniform distribution of errors (biased estimating) for the PLS regressor.