| Literature DB >> 19440377 |
Kevin K A Tetteh1, Lindsay B Stewart, Lynette Isabella Ochola, Alfred Amambua-Ngwa, Alan W Thomas, Kevin Marsh, Gareth D Weedall, David J Conway.
Abstract
BACKGROUND: Endemic human pathogens are subject to strong immune selection, and interrogation of pathogen genome variation for signatures of balancing selection can identify important target antigens. Several major antigen genes in the malaria parasite Plasmodium falciparum have shown such signatures in polymorphism-versus-divergence indices (comparing with the chimpanzee parasite P. reichenowi), and in allele frequency based indices. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2009 PMID: 19440377 PMCID: PMC2679211 DOI: 10.1371/journal.pone.0005568
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Scheme of the 26 genes studied showing positions of individual nucleotide polymorphisms among 14 P. falciparum isolates as vertical lines above each gene.
Fixed differences between P. falciparum and P. reichenowi are shown as vertical lines below each gene. Positions of insertions, deletions, and repetitive sequence are also indicated on each gene, as well as verified and predicted protein domains. Stop codons within genes are shown for MRSP3 (in P reichenowi only) and for PF10_0348 (in P. falciparum R033 isolate). Horizontal bars above some genes indicate portions not analysed.
HKAr indices summarizing polymorphism (π, among 14 P. falciparum laboratory isolates) and divergence (K, from P. reichenowi) of 26 merozoite stage genes together with MK tests on synonymous (syn) and nonsynonymous (non-syn) polymorphic and fixed differences.
| Gene | Locus | Nt |
| K (10−3) | HKAr ( | Syn | Non-syn | MK test | ||
| Fixed | Poly | Fixed | Poly | P value | ||||||
|
| PFF0615c | 990 | 1.0 | 35.9 | 0.028 | 12 | 0 | 23 | 3 | 0.54 |
|
| Pf13_0338 | 2319◊ | 1.2 | 41.6 | 0.029 | 39 | 0 | 53 | 9 |
|
|
| PFE0395c | 900 | 3.4 | 21.3 | 0.160 | 6 | 0 | 11 | 11 | 0.06 |
|
| PF14_0201 | 2619◊ | 0.5 | 30.6 | 0.016 | 31 | 1 | 48 | 6 | 0.25 |
|
| PFF0995c | 1329◊ | 1.7 | 45.6 | 0.037 | 15 | 0 | 44 | 8 | 0.18 |
|
| PF10_0346 | 867◊ | 17.5 | 59.3 | 0.295 | 5 | 3 | 27 | 35 | 0.46 |
|
| PF10_0347 | 1230 | 1.1 | 37.2 | 0.030 | 14 | 1 | 30 | 3 | 1.00 |
|
| PF10_0348 | 1935ץ | 56.8 | 82.8 | 0.686 | 16 | 72 | 58 | 167 | 0.18 |
|
| PF10_0352 | 1107◊ | 0.9 | 54.5 | 0.017 | 14 | 2 | 44 | 4 | 0.64 |
|
| PF13_0197 | 1044 | 5.6 | 59.8 | 0.094 | 13 | 1 | 44 | 17 | 0.17 |
|
| PF13_0196 | 1143 | 0.4 | 43.8 | 0.009 | 8 | 0 | 41 | 3 | 1.00 |
|
| MAL13P1.174 | 819 | 0.5 | 36.9 | 0.014 | 10 | 1 | 19 | 1 | 1.00 |
|
| PF13_0193 | 897ץ | 0.2 | 53.9 | 0.028 | 6 | 0 | 32 | 0 | - |
|
| MAL13P1.173 | 897◊ | 1.2 | 26.9 | 0.045 | 11 | 0 | 12 | 4 | 0.12 |
|
| Pf13_0191 | 870◊ | 0.3 | 19.7 | 0.015 | 7 | 0 | 10 | 1 | 1.00 |
|
| PF13_0192 | 1446◊ | 1.1 | 29.6 | 0.037 | 8 | 0 | 33 | 5 | 0.57 |
|
| PF13_0194 | 567 | 0.0 | 5.3 | 0.000 | 1 | 0 | 2 | 0 | - |
|
| PFL1385c | 1761◊ | 1.4 | 48.2 | 0.029 | 29 | 0 | 54 | 6 | 0.17 |
|
| PFB0340c | 609◊ | 16.0 | 98.2 | 0.163 | 7 | 4 | 32 | 23 | 1.0 |
|
| MAL7P1.208 | 1326◊ | 1.0 | 26.8 | 0.036 | 10 | 1 | 23 | 5 | 0.66 |
|
| PF13_0348 | 3132◊ | 0.7 | 45.1 | 0.016 | 64 | 1 | 76 | 9 |
|
|
| PF10_0144 | 894 | 0.8 | 9.0 | 0.089 | 5 | 2 | 2 | 2 | 0.58 |
|
| PF14_0102 | 2271◊ | 2.0 | 29.1 | 0.069 | 18 | 1 | 45 | 11 | 0.17 |
|
| PFE0080c | 1053 | 1.5 | 29.5 | 0.051 | 8 | 0 | 22 | 3 | 0.56 |
|
| PFE0075c | 1119 | 0.0 | 26.8 | 0.000 | 17 | 0 | 13 | 0 | - |
|
| PFD0955w | 894 | 0.5 | 28.5 | 0.018 | 13 | 0 | 12 | 3 | 0.23 |
Nt, number of aligned nucleotides.
◊, repeats removed from gene sequences.
•, internal stop codon in P. reichenowi mrsp3 gene, and in an allele of P. falciparum PF10_0348 (codons removed from analysis).
complex codons in rap1 and rap2 not analysed.
less sequence aligned when P.reichenowi added (Pf10_0347 N = 1200, Pf10_0348 N = 1866, Pf10_0352 N = 1104, MRSP1 N = 1140, MRSP2 N = 816, MRSP3 N = 705, MSP9 N = 1758, SERA5 N = 561, Pf34 N = 882); Sequences submitted to Genbank (Accession numbers are listed in Supplementary Table S1).
Figure 2Scatterplot of McDonald-Kreitman (MK) skew (log2 transformation of the neutrality index with 0 representing no skew) and Hudson-Kreitman-Aguade ratio (HKAr) of polymorphism (in P. falciparum) and divergence (between P. falciparum and P. reichenowi) for each of the genes studied in Table 1.
Two of the 26 genes had no polymorphism and are not plotted. There was no significant correlation between the two indices (Spearman's ρ = −0.26). Six genes at the outer fringe of the distributions, and one in the middle, are labelled and were selected for population based analysis.
Figure 3Sample size optimisation for Tajima's D test assessed using a large sample of ama1 gene sequences from a Gambian P. falciparum population.
100 random samples of each size (n) were taken from a set of 114 ama1 sequences covering the region encoding the ectodomain of the protein (nucleotides 442–1743). The median values and 95% confidence limits (3rd and 97th centile values of 100 estimates) of π, θ, and Tajima's D indices were calculated for each sample size.
Analysis of polymorphism in 9 merozoite stage expressed P. falciparum genes in a Gambian population sample.
| Gene | Locus | No. of isolates | Nt | π (×103) | K (×103) | HKAr ( | McDonald-Kreitman | Tajima's D | Fu & Li F | ||||
| SYN | NONSYN | MK | |||||||||||
| Fixed | Poly | Fixed | Poly | p-value | |||||||||
|
| PF10_0348 | 66 | 1896ץ | 38.7 | 72.3 | 0.535 | 17 | 68 | 57 | 150 | 0.18 |
|
|
|
| PF13_0197 | 85 | 1050 | 6.4 | 62.0 | 0.103 | 13 | 3 | 45 | 28 | 0.16 | 0.28 | 0.19 |
|
| PF13_0348 | 81 | 1200◊ | 0.3 | 47.7 | 0.006 | 30 | 2 | 27 | 8 | 0.09 |
|
|
|
| PFB0340c | 56 | 624◊ | 11.2 | 93.6 | 0.119 | 11 | 2 | 36 | 24 | 0.12 | 0.76 | 0.75 |
|
| PFE0395c | 88 | 903 | 3.6 | 21.3 | 0.169 | 6 | 2 | 11 | 12 | 0.24 | 0.50 | 0.55 |
|
| PF13_0338 | 87 | 2319◊ | 1.3 | 41.9 | 0.031 | 37 | 6 | 56 | 26 | 0.06 | −1.73 |
|
|
| MAL7P1.208 | 68 | 1323◊ | 0.7 | 26.4 | 0.027 | 11 | 2 | 22 | 5 | 1.00 | −0.87 | −1.98 |
|
| PF10_0323 | 78 | 990 | 1.4 | 50.0 | 0.028 | 14 | 2 | 33 | 11 | 0.48 | −1.37 | −1.15 |
|
| PF11_0344 | 114 | 1302 | 16.3 | 40.0 | 0.420 | 11 | 3 | 21 | 61 | 0.0002 |
|
|
Nt, number of aligned nucleotide positions analysed.
•, stop codon in 7 alleles of PF10_0348 (codon removed from analysis).
◊, repeats removed from gene sequences for analysis.
only one region of msp7 gene (of most dense polymorphism) was studied in the population.
5 complex codons in ama1 not analysed.
less sequence aligned when P. reichenowi added (Pf10_0348 N = 1824, msp7 N = 1041, sera5 N = 600).
p<0.05.
p<0.01.
p<0.001.
Values significantly different to 0 are shown in italics; for ama1 Tajima's D is significant for Domain II (p<0.05) and Domain III (p<0.001), and Fu and Li's F is significant for Domain III (p<0.02); Sequences submitted to Genbank (Accession numbers for population datasets of allele sequences are listed in Supplementary Table S2).
Figure 4Correlations between the allele frequency-based Tajima's D index and two different polymorphism-versus-divergence indices for 9 merozoite stage genes (N>55 alleles of each gene sampled from a Gambian P. falciparum population, and interspecies comparison with P. reichenowi).
A. Scatterplot showing strong positive correlation (Spearman's ρ = 0.92, P<0.001) between the HKA ratio (HKAr) index and the allele frequency-based Tajima's D index. B. Scatterplot indicating no correlation (Spearman's ρ = −0.07, N.S.) between the McDonald Kreitman (MK) skew and Tajima's D index.