| Literature DB >> 22721170 |
A Taylor Bright1, Ryan Tewhey, Shira Abeles, Raul Chuquiyauri, Alejandro Llanos-Cuentas, Marcelo U Ferreira, Nicholas J Schork, Joseph M Vinetz, Elizabeth A Winzeler.
Abstract
BACKGROUND: Malaria caused by Plasmodium vivax is an experimentally neglected severe disease with a substantial burden on human health. Because of technical limitations, little is known about the biology of this important human pathogen. Whole genome analysis methods on patient-derived material are thus likely to have a substantial impact on our understanding of P. vivax pathogenesis and epidemiology. For example, it will allow study of the evolution and population biology of the parasite, allow parasite transmission patterns to be characterized, and may facilitate the identification of new drug resistance genes. Because parasitemias are typically low and the parasite cannot be readily cultured, on-site leukocyte depletion of blood samples is typically needed to remove human DNA that may be 1000X more abundant than parasite DNA. These features have precluded the analysis of archived blood samples and require the presence of laboratories in close proximity to the collection of field samples for optimal pre-cryopreservation sample preparation.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22721170 PMCID: PMC3410760 DOI: 10.1186/1471-2164-13-262
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Enrichment of Sal1 DNA using whole genome baits. Mock P. vivax infections were created by combining Sal1 reference DNA and human DNA. These samples were then whole genome captured using the Sal1 derived baits. Pre and Post bars refer to the percent P. vivax DNA in the mock sample before and after whole genome capture.
genomic DNA yield from whole blood samples using whole-genome capture
| Isolate | Parasitemia | % | ||||
|---|---|---|---|---|---|---|
| SA-96 | 2,300 | 0.04% | 3.2 | 0.53% | 52.89% | 99.8 |
| SA-97 | 2800 | 0.05% | 7 | 1.34% | 60.52% | 45.2 |
| SA-98 | 8700 | 0.16% | 7.6 | 3.95% | 78.40% | 19.8 |
| SA-94 | 6100 | 0.11% | 2.4 | na | 80.13% | na |
| SA-95 | 1800 | 0.03% | 1.7 | na | 20.25% | na |
| IQ07a | na | na | na | 41.87% | na | na |
| Acre3b | na | na | na | 1.42% | na | na |
Abbreviations: RBCs, red blood cells; WGC, whole-genome capture; na, not available.
apositive control - leukocyte filtered, bnegative control.
Sequencing statistics for whole-genome capture samples
| | Number of bases sequenced (billion) | | % PCR duplicates | | % genome covered by 5 or more reads |
|---|---|---|---|---|---|
| Isolate | % | Coverage (X) | |||
| IQ07a | 2.3 | 41.87% | 1.86% | 35.16 | 95.33% |
| Acre3b | 1.4 | 1.42% | 4.38% | 0.77 | 0.80% |
| SA-94c | 6.0 | 80.13% | 13.60% | 150.31 | 97.21% |
| SA-95c | 5.3 | 20.25% | 46.17% | 21.89 | 84.73% |
| SA-96c | 4.7 | 52.89% | 32.29% | 63.11 | 93.35% |
| SA-97c | 5.2 | 60.52% | 26.37% | 86.75 | 94.37% |
| SA-98c | 6.3 | 78.40% | 12.14% | 160.44 | 96.51% |
apositive control - leukocyte filtered, bnegative control, cwhole-genome capture.
Figure 2genomic coverage after whole genome capture. (A) Percent of the P. vivax genome covered by various coverage thresholds for the five whole genome capture samples, the IQ07 positive control (shaded region), and the Acre3 negative control (orange). (B) The percent of each chromosome that was able to be confidently genotyped, ie callable, for the whole genome captured samples compared to the IQ07 positive control (upper scale). The Acre3 negative control is presented on the lower scale.
Figure 3Confidently genotyped loci at various sequencing depths. SA-94 (black) and IQ07 (gray) were randomly downsampled and the mean percent of the genome that is callable of 5 randomly produced data sets for each coverage threshold is presented.
SNVs identifiedby whole genome sequencing
| Isolate | Isolate | MSP7 gene family | All MSP gene families | SERA gene family | Metabolic genes | ||||
|---|---|---|---|---|---|---|---|---|---|
| | | ||||||||
| IQ07a | 16861 | 1.82 | 1.64 | 2.7 | 4.1 | 7.07 | 5.54 | 0.45 | 0.6 |
| Acre3b | 0 | na | na | na | na | na | na | na | na |
| SA-94 | 17871 | 1.93 | 1.57 | 3.2 | 4.41 | 6.94 | 5.6 | 0.37 | 0.59 |
| SA-95 | 18916 | 2.03 | 1.77 | 2.48 | 4.1 | 7.3 | 5.03 | 0.35 | 0.64 |
| SA-96 | 17946 | 1.79 | 1.81 | 3.23 | 4.21 | 6.64 | 5.61 | 0.54 | 0.7 |
| SA-97 | 18321 | 1.86 | 1.34 | 2.81 | 4.19 | 7.23 | 5.91 | 0.44 | 0.68 |
| SA-98 | 16596 | 1.55 | 1.28 | 2.89 | 3.98 | 6.94 | 5.04 | 0.49 | 0.68 |
Abbreviations: SNV, single nucleotide variant; Syn, synonymous; Non-syn, non-synonymous; na, not available.
apositive control - leukocyte filtered, bnegative control.