| Literature DB >> 29685152 |
Daibin Zhong1, Cristian Koepfli2, Liwang Cui3, Guiyun Yan4.
Abstract
Multiplicity of infection (MOI), also termed complexity of infection (COI), is defined as the number of genetically distinct parasite strains co-infecting a single host, which is an important indicator of malaria epidemiology. PCR-based genotyping often underestimates MOI. Next generation sequencing technologies provide much more accurate and genome-wide characterization of polyclonal infections. However, complete haplotype characterization of multiclonal infections remains a challenge due to PCR artifacts and sequencing errors, and requires efficient computational tools. In this review, the advantages and limitations of current molecular approaches to determine multiplicity of malaria parasite infection are discussed.Entities:
Keywords: Amplicon ultra-deep sequencing; Malaria parasite; Molecular marker; Multiplicity of infection; Next generation sequencing; Single nucleotide polymorphism
Mesh:
Year: 2018 PMID: 29685152 PMCID: PMC5914063 DOI: 10.1186/s12936-018-2322-5
Source DB: PubMed Journal: Malar J ISSN: 1475-2875 Impact factor: 2.979
The advantages and limitations of bioinformatics tools current available for analysis of polyclonal infections and estimation of MOI
| Software/approach | Implementation | Data source | Advantage | Limitation | References |
|---|---|---|---|---|---|
|
| Fws statistics | SNPs | Provide a summary statistic of relative inbreeding and correlation with MOI | Does not provide a direct estimate of COI | Manske et al. [ |
|
| Perl scripts | TDS | estimate MOI, SNP combinations and proportion in a specific genomic region | Partial genomic region, sufficient density of SNPs in an amplicon is required | Assefa et al. [ |
|
| Perl scripts web interface | SNPs | Use genome-wide SNPs to infer the number of strains and their proportions | Does not infer haplotypes; sequence read counts are assumed to be unbiased and the SNPs are unlinked | Galinsky et al. [ |
|
| Perl scripts | TDS | No reference sequence required, direct estimation of amplicon haplotype cluster and frequency | Partial genomic region, threshold for haplotype cluster calls needs to be determined by empirical methods in each study | Hathaway et al. [ |
|
| R package | TDS | Direct estimation of haplotype cluster and frequency based quality filtering of sequencing reads | Partial genomic region, enough high polymorphism in the amplicon is required and unable to deal with indel of amplicon | Lerch et al. [ |
|
| R package | SNPs | Infer inbreeding coefficients using WGS to assess within-sample parasite infections | Does not infer haplotypes, sequence read counts are assumed to be unbiased and SNPs need to be unlinked | O’Brien et al. [ |
|
| Perl scripts web interface | SNPs | Infer the number of strains and their proportions | Does not infer haplotypes, sequence read counts are assumed to be unbiased and SNPs need to be unlinked | Chang et al. [ |
|
| R package | SNPs | Estimate the number of strains, their relative proportions, and the haplotypes | Reference panel required, more reference strains are better. Multiallelic variant not considered | Zhu et al. [ |
SNPs single nucleotide polymorphism, TDS targeted deep sequencing, WGS whole genome sequencing, MOI multiplicity of infection