| Literature DB >> 25159520 |
Christopher G Jacob, John C Tan, Becky A Miller, Asako Tan, Shannon Takala-Harrison, Michael T Ferdig, Christopher V Plowe1.
Abstract
BACKGROUND: Analysis of single nucleotide polymorphisms (SNPs) derived from whole-genome studies allows for rapid evaluation of genome-wide diversity, and genomic epidemiology studies of Plasmodium falciparum provide insights into parasite population structure, gene flow, drug resistance and vaccine development. In areas with adequate cold chain facilities, large volumes of leukocyte-depleted patient blood can be frozen for use in parasite genomic analyses. In more remote endemic areas smaller volumes of infected blood are taken by finger prick, and dried and stored on filter paper. These dried blood spots do not generally yield enough concentrated parasite DNA for whole-genome sequencing.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25159520 PMCID: PMC4153902 DOI: 10.1186/1471-2164-15-719
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
SNP call accuracy and SNP call rate of 3D7 cultured parasites
| SNP call accuracy | SNP call rate | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Heuristic algorithm | D-score method | Heuristic algorithm | D-score method | |||||||||||||
| WGA | + | - | + | - | + | - | + | - | ||||||||
| LD | + | - | + | - | + | - | + | - | + | - | + | - | + | - | + | - |
| 1,000 p/μl | 99.0 | 96.2 | 99.6 | 94.1 | 93.5 | 76.4 | 97.4 | 69.0 | 75.4 | 36.8 | 68.7 | 15.6 | 56.0 | 40.7 | 63.2 | 32.3 |
| 10,000 p/μl | 99.4 | 99.4 | 99.7 | 99.5 | 94.4 | 95.8 | 98.6 | 94.1 | 76.5 | 87.5 | 87.6 | 62.9 | 51.7 | 59.1 | 70.8 | 50.4 |
| 100,000 p/μl | - | 99.6 | - | 99.7 | - | 98.7 | - | 98.8 | - | 94.4 | - | 92.0 | - | 76.0 | - | 78.9 |
| 500,000 p/μl | - | 99.7 | - | 99.8 | - | 98.4 | - | 98.9 | - | 94.5 | - | 95.3 | - | 64.4 | - | 72.8 |
To test the accuracy and call rate of the new heuristic algorithm cultured 3D7 strain parasites were genotyped under differing leukocyte depletion, whole-genome amplification, and parasite concentrations. We then compared the previously describe D-score method to the heuristic algorithm. (WGA = whole-genome amplification, LD = leukocyte depletion, p/μl = parasites per microliter).
Figure 1Sample intensities and intensity distributions of varying center bases. A) Distribution of raw probe intensities. Dotted lines indicate global mean intensity values for T (black) and G (green) center position alleles. B) Average raw intensities of a single SNP for 24 field samples. Dotted lines indicate area of low contrast where points between the lines would not be distinguishable. Black points were called a T allele by genome sequencing and green diamonds were called a G allele. C) Average intensities after global mean adjustment, all sequenced calls match microarray calls after running through algorithm.
SNP call accuracy and SNP call rate of NF54 purified DNA
| ID | Parasite DNA (ng) | Call accuracy | Call rate |
|---|---|---|---|
| NF54-1 | 219 | 99.4% | 86.6% |
| NF54-2 | 212 | 99.5% | 81.5% |
| NF54-3 | 210 | 99.4% | 78.6% |
| NF54-4 | 203 | 99.4% | 73.8% |
| NF54-5 | 170 | 99.4% | 85.7% |
| NF54-6 | 168 | 99.4% | 77.3% |
| NF54-7 | 156 | 99.2% | 76.9% |
| NF54-8 | 152 | 99.3% | 69.0% |
| NF54 (WGA) | 222 | 99.6% | 91.6% |
| NF54 (WGA) | 138 | 99.5% | 85.1% |
NF54 reference strain DNA was typed at varying concentrations and call accuracy and call rate was calculated. Two samples underwent whole genome amplification (WGA) prior to genotyping and the concentration reflects post-WGA quantities.
Figure 2SNP call accuracy and SNP call rate of low quantity reference DNA. Black and grey bars show SNP accuracy of purified 3D7 and Dd2 strain parasites at low DNA quantities (1 – 40 ng). Lines show SNP call rate of each typed sample. DNA quantity was significantly associated with SNP calling accuracy (p = 0.025), and DNA quantity with SNP call rate (p = 1.57e-05).
Figure 3SNP call accuracy and SNP call rate of field samples. A) SNP calling accuracy (grey bars) and SNP call rates (black line) for 24 field isolate from Southeast Asia. B) Correlation of SNP call accuracy and DNA quantity (p = 0.79). C) Correlation of SNP call rate with DNA quantity (p = 9.8e-08).
Figure 4Microarray probe statistics. A) Histogram of Inter-SNP distances of all SNPs. B) Boxplot showing distribution of inter-SNP distances where allele variability has been shown. C) Histogram of G + C content of probes. D) Histogram showing distribution of probe lengths.