| Literature DB >> 28125584 |
Hsiao-Han Chang1, Colin J Worby1, Adoke Yeka2,3, Joaniter Nankabirwa3,4, Moses R Kamya3,4, Sarah G Staedke5, Grant Dorsey6, Maxwell Murphy6, Daniel E Neafsey7, Anna E Jeffreys8, Christina Hubbart8, Kirk A Rockett8,9, Roberto Amato9, Dominic P Kwiatkowski8,9, Caroline O Buckee1, Bryan Greenhouse6.
Abstract
As many malaria-endemic countries move towards elimination of Plasmodium falciparum, the most virulent human malaria parasite, effective tools for monitoring malaria epidemiology are urgent priorities. P. falciparum population genetic approaches offer promising tools for understanding transmission and spread of the disease, but a high prevalence of multi-clone or polygenomic infections can render estimation of even the most basic parameters, such as allele frequencies, challenging. A previous method, COIL, was developed to estimate complexity of infection (COI) from single nucleotide polymorphism (SNP) data, but relies on monogenomic infections to estimate allele frequencies or requires external allele frequency data which may not available. Estimates limited to monogenomic infections may not be representative, however, and when the average COI is high, they can be difficult or impossible to obtain. Therefore, we developed THE REAL McCOIL, Turning HEterozygous SNP data into Robust Estimates of ALelle frequency, via Markov chain Monte Carlo, and Complexity Of Infection using Likelihood, to incorporate polygenomic samples and simultaneously estimate allele frequency and COI. This approach was tested via simulations then applied to SNP data from cross-sectional surveys performed in three Ugandan sites with varying malaria transmission. We show that THE REAL McCOIL consistently outperforms COIL on simulated data, particularly when most infections are polygenomic. Using field data we show that, unlike with COIL, we can distinguish epidemiologically relevant differences in COI between and within these sites. Surprisingly, for example, we estimated high average COI in a peri-urban subregion with lower transmission intensity, suggesting that many of these cases were imported from surrounding regions with higher transmission intensity. THE REAL McCOIL therefore provides a robust tool for understanding the molecular epidemiology of malaria across transmission settings.Entities:
Mesh:
Year: 2017 PMID: 28125584 PMCID: PMC5300274 DOI: 10.1371/journal.pcbi.1005348
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
The observational model for categorical method.
| 1− | 0 | |||
| 1− | ||||
| 0 | 1− | |||
Fig 1True vs. estimated values of COI (A) and allele frequencies (B) using COIL and THE REAL McCOIL.
Each blue dot represents a sample. The black bar and the grey box show the median and 25% to 75% quantile. THE REAL McCOIL estimated allele frequencies and COI better than COIL, especially when the average COI was high and the majority of infections were polygenomic.
Fig 2Estimates of COI in Nagongera, Walukuba, and Kihihi.
(A) Estimates of COI by COIL, THE REAL McCOIL, and msp2. For THE REAL McCOIL, the point estimates of COI shown are medians from the posterior distributions. The COI estimated by THE REAL McCOIL in Nagongera and Walukuba were similar, and much higher than that in Kihihi (median COI = 5 [Nagongera], 4.5 [Walukuba], and 1 [Kihihi]; permutation test, p-values = 0.158 [Nagongera vs. Walukuba], 0.002 [Nagongera vs. Kihihi], 0.0006 [Walukuba vs. Kihihi]). Allele counts > 5 in msp2 typing were grouped into a single category due to difficulties in accurately distinguishing artifacts from true alleles at high complexities of infection. The dashed red lines represent the medians of COI in three regions. (B) The spatial distribution of estimated COI by THE REAL McCOIL in three regions. Small random noise was added to the location of samples in the map. COI of samples collected from the West of Walukuba was higher than those from the East of Walukuba (medians = 5 [West] and 3 [East], p-value = 0.027).
Fig 3F.
(A) Estimated COI by THE REAL McCOIL was negatively associated with F. (B) F in Kihihi was higher than Nagongera and Walukuba. The F values shown were calculated using population allele frequencies estimated from categorical method of THE REAL McCOIL.