| Literature DB >> 32228670 |
Oselyne T W Ong1, Elise A Kho2, Pedro M Esperança3, Chris Freebairn4, Floyd E Dowell5, Gregor J Devine6, Thomas S Churcher3.
Abstract
BACKGROUND: Practical, field-ready age-grading tools for mosquito vectors of disease are urgently needed because of the impact that daily survival has on vectorial capacity. Previous studies have shown that near-infrared spectroscopy (NIRS), in combination with chemometrics and predictive modeling, can forecast the age of laboratory-reared mosquitoes with moderate to high accuracy. It remains unclear whether the technique has utility for identifying shifts in the age structure of wild-caught mosquitoes. Here we investigate whether models derived from the laboratory strain of mosquitoes can be used to predict the age of mosquitoes grown from pupae collected in the field.Entities:
Keywords: Age; Asian tiger mosquito; Chemometrics; Near-infrared; Spectroscopy
Mesh:
Year: 2020 PMID: 32228670 PMCID: PMC7106667 DOI: 10.1186/s13071-020-04031-3
Source DB: PubMed Journal: Parasit Vectors ISSN: 1756-3305 Impact factor: 3.876
Predictive power of models derived from different Ae. albopictus populations
| Actual age (days) | No. scanned | Standard PLS | Resampling PLS | ||
|---|---|---|---|---|---|
| Age in days | Classed as old (%) | Age in days | Classed as old (%) | ||
| Using laboratory-derived models to predict the age of laboratory-reared mosquitoes | |||||
| Number of components | 8 | 10 | |||
| 2 | 41 | 3.58 (0.28) | 0 | 2.13 (0.22) | 4 |
| 5 | 42 | 7.62 (0.30) | 36 | 7.27 (0.23) | 9 |
| 8 | 42 | 8.35 (0.27) | 60 | 8.07 (0.21) | 57 |
| 12 | 42 | 8.47 (0.33) | 57 | 9.84 (0.22) | 58 |
| 15 | 44 | 14.0 (0.33) | 100 | 14.7 (0.26) | 68 |
| Overall accuracy | RMSD = 2.38 | – | RMSD = 2.89 | AUC = 0.88 | |
| Using field-derived models to predict the age of field-derived reared mosquitoes | |||||
| Number of components | 8 | 10 | |||
| 1 | 50 | 4.58 (0.32) | 6 | 3.31 (0.28) | 0 |
| 7 | 50 | 7.71 (0.44) | 41 | 7.28 (0.36) | 0 |
| 14 | 100 | 11.7 (0.34) | 90 | 12.7 (0.26) | 100 |
| Overall accuracy | RMSD = 3.41 | – | RMSD = 2.82 | AUC = 0.97 | |
| Using laboratory-derived models to predict the age of field-derived reared mosquitoes | |||||
| Number of components | 8 | 10 | |||
| 1 | 50 | 6.50 (0.20) | 12 | 8.23 (0.18) | 36 |
| 7 | 50 | 6.92 (0.20) | 18 | 9.40 (0.27) | 74 |
| 14 | 100 | 7.00 (0.15) | 24 | 9.18 (0.22) | 75 |
| Overall accuracy | RMSD = 5.84 | – | RMSD = 5.42 | AUC = 0.60 | |
Notes: The true age of mosquito groups is shown on the left while the mean predicted age (and variability, given as standard error of the mean, SEM) is shown on the right using standard Partial Least Squares (PLS) regression or a resampling PLS framework. Mosquitoes are classified as young (< 8 days) or old (≥ 8 days). First line of each section of the table shows the number of components used in the different models. Accuracy of age estimates is shown by the average difference between the true and predicted age measured in days (root-mean-square deviation, RMSD), with lowest values indicating a more accurate model. The ability to classifying mosquitoes as young or old is given by the area under the curve (AUC), with higher values indicating greater accuracy
Fig. 1The ability of NIRS to predict the age of Ae. albopictus mosquitoes in days. a The best-fit regression coefficient function for the resampling PLS model trained on laboratory-reared mosquitoes showing the most informative regions of the spectrum. Grey lines show best-fit model for each of the 100 dataset randomisations whilst black line indicates the average. b Ability of the model to predict age of laboratory-reared mosquitoes. Boxplot thick horizontal black line shows the median/50th-percentile whilst the box edges, inner and outer whiskers show 25th/75th, 15th/85th and 5th/95th percentiles, respectively. Grey dashed line shows model with 100% accuracy. c Ability of the model trained on laboratory mosquitoes to predict the age of field-derived mosquitoes. Results can be compared to the simple PLS method presented in Additional file 1: Figure S1
Fig. 2The ability of NIRS to classify Ae. albopictus mosquitoes as being young or old. a The best-fit regression coefficient function for a model trained on laboratory-reared mosquitoes showing the most informative regions of the spectrum. Grey lines show best-fit model for each of the 100 dataset randomisations whilst black line indicates average. b Ability of the model to predict age classification of laboratory-reared mosquitoes. Histogram of the estimated linear predictor for the test observations colour-coded by the true class (green, true young mosquitoes; blue, true old mosquitoes). Vertical black line indicates optimum threshold for classifying mosquitoes as old or young (“left” predicted to be young, “right” predicted to be old). The shaded area where two distributions overlap corresponds to misclassified test observations, false negatives to the left and false positives to the right of the optimal classification threshold. c The corresponding confusion matrix for the best model trained and predicting laboratory-reared mosquitoes showing the different error rates: tnr, true negative rate; fnr, false negative rate (specificity); fpr, false positive rate; and tpr, true positive rate (sensitivity). d The receiver operating characteristic (ROC) curve for the best-fit model predicting laboratory-reared mosquitoes showing the false positive and true positive rates achievable for different classification probability thresholds (shifting the black vertical line (b) left or right) whilst the overall performance is given by the area under the ROC curve (AUC). The pink dashed line denotes a model with no predictive ability (a random chance of correct prediction) whilst a perfect model with 100% sensitivity and specificity would be in the top left corner (coordinates 0, 1). The solid line shows the average ROC curve; boxplots show the variability for 100 randomisations of the training, validation and testing datasets (box edges, inner and outer whiskers show 25th/75th, 15th/85th and 5th/95th percentiles, respectively; black line inside the box showing the median/50th-percentile). e The ROC curve showing the ability of the model trained on laboratory rerared mosquitoes to predict the age classification of mosquitoes reared in the field-derived environment. f The corresponding confusion matrix of the best model
Fig. 3Differences between laboratory and field-derived spectra. a Principal components analysis showing the difference between scores calculated for PC-1 (which explains 88% of the variation) and PC-2 (8% of the variation) for laboratory (blue; square) and field-derived (red; circle) mosquitoes. b Loading plot from PCA showing that water bands at 1450 nm and 1930 nm accounted for 88% of the total variance observed. R denotes reflectance c difference between undried (dashed line) and dried (solid line) mosquito spectra. In b and c blue horizontal lines indicate peaks associated with water