| Literature DB >> 29954424 |
Pedro M Esperança1, Andrew M Blagborough2, Dari F Da3, Floyd E Dowell4, Thomas S Churcher5.
Abstract
BACKGROUND: The proportion of mosquitoes infected with malaria is an important entomological metric used to assess the intensity of transmission and the impact of vector control interventions. Currently, the prevalence of mosquitoes with salivary gland sporozoites is estimated by dissecting mosquitoes under a microscope or using molecular methods. These techniques are laborious, subjective, and require either expensive equipment or training. This study evaluates the potential of near-infrared spectroscopy (NIRS) to identify laboratory reared mosquitoes infected with rodent malaria.Entities:
Keywords: Anopheles stephensi; Machine learning; Near-infrared spectroscopy; Partial least squares; Plasmodium berghei; Predictive modelling; Vector borne diseases; Vector control monitoring
Mesh:
Year: 2018 PMID: 29954424 PMCID: PMC6027764 DOI: 10.1186/s13071-018-2960-z
Source DB: PubMed Journal: Parasit Vectors ISSN: 1756-3305 Impact factor: 3.876
Sample sizes and replicate scans. Summary of the number of spectra collected by experiment (sporozoites or oocysts), the number of parasites in the mosquitoes (intensity of infection) and number of times the same mosquito was scanned in different positions (replications). Values in parentheses indicate the percentage of the total samples with that intensity of infection or sample repetition
| Intensity of infection | Replications | |||
|---|---|---|---|---|
| No. of sporosoites/oocysts | No. of samples (%) | No. of replications | No. of samples (%) | |
| Sporozoites | 0 | 162 (54) | 1 | 99 (33) |
| 1–10 | 24 (8) | 2 | 82 (27) | |
| 11–100 | 35(12) | 3 | 39 (13) | |
| 101–1000 | 35(12) | 4 | 80 (27) | |
| > 1000 | 44 (14) | total: 300 | ||
| total: 300 | ||||
| Oocysts | 0 | 29 (37) | 1 | 1 (1) |
| > 0 | 50 (63) | 2 | 1 (1) | |
| total: 79 | 3 | 77 (98) | ||
| total: 79 | ||||
Fig. 1An illustration of near-infrared mosquito spectra. The colours of the 30 spectra denote the salivary gland sporozoite infection intensity on a log scale: 0 (orange); 1–10 (purple); 11–100 (green); 101–1000 (blue); >1000 (red)
Fig. 2The ability of NIRS to determine sporozoite prevalence. Results are for a binomial GLM with 8 PLS components. a The receiver operating characteristic (ROC) curve for the best-fit model showing the false positive and true positive rates achievable for different classification probability thresholds whilst the overall performance is given by the area under the ROC curve (AUC). The dashed line denotes a model with no predictive ability (a random chance of correctly predicting sporozoite presence) whilst a perfect model with 100% sensitivity and specificity would be in the top left corner (coordinates 0, 1). The solid line shows the average ROC curve whilst the boxplots show the variability for 100 randomisations of the training, validation and testing datasets (with box edges, inner and outer whiskers showing 25th/75th, 15th/85th and 5th/95th percentiles, respectively; and the black line inside the box showing the median/50th-percentile). b The best fit coefficient functions for each of the 100 dataset randomisations (grey lines) and the corresponding average (black line). c The histogram of the estimated linear predictor for the test observations, colour-coded by the true class, shows the model’s ability to separate the two infection groups. The vertical black line indicates the optimum threshold for classifying mosquitoes as infectious or not. The shaded area where the two distributions overlap corresponds to misclassified test observations - false negatives to the left and false positives to the right of the optimal classification threshold. The confusion matrix (inset) shows the different error rates: tnr, true negative rate; fnr, false negative rate (specificity); fpr, false positive rate; and tpr, true positive rate (sensitivity)
Fig. 3The inability of NIRS to determine sporozoite intensity. Results are for a multinomial GLM with 8 PLS components. a Average receiver operating characteristic (ROC) curves for one-versus-all classification models and corresponding area under the ROC curve. b The average predicted class probabilities of test observations show that uninfectious mosquitoes are easily distinguished from infectious (as there is a large difference between the largest and second-largest average predicted class probabilities). Conversely, the model has difficulty in distinguishing between the different infection groups accurately (small differences between average predicted class probabilities). c The breakdown of predicted classes of test observations gives the matrix of misclassification rates. Panel b shows the average predicted probabilities for test observations (the probability that they belong to each of the 5 classes, as given by the model). To perform classification in multinomial GLMs we choose the class for which the model gives the highest predicted probability and subsequently compute the misclassification rates (which is shown in c). The two panels give complementary information: whereas b hints at the difficulty in separating the different classes and can be used to pinpoint which classes are most confounded, c shows the actual proportions of misclassified test observation
Improvements to the accuracy of NIRS in detecting presence of sporozoites and oocysts according to the number of times the mosquito was scanned. Mosquitoes are repositioned after each scan. Area under ROC curve (AUC) and misclassification rate (MR) when averaging one (no averaging), two and all available scans; see Table 1 for the number of replicates available. Improvements in performance, from one to all scans, are given in level for AUC and in percentage points for misclassification rate. Values in brackets indicate the percentage improvement in accuracy. Results are for a binomial GLM with 8 PLS components in both sporozoites and oocysts models
| No. of scans averaged | |||||
|---|---|---|---|---|---|
| One | Two | All | Change | ||
| Sporosoites | AUC | 0.72 | 0.75 | 0.81 | +0.09 (↑12.5%) |
| MR | 41% | 33% | 28% | -13 p.p. | |
| Oocysts | AUC | 0.70 | 0.68 | 0.69 | -0.01 (↓1.43%) |
| MR | 36% | 34% | 38% | +2 p.p. | |