| Literature DB >> 23264727 |
Phonepadith Xangsayarath1, Morakot Kaewthamasorn, Kazuhide Yahata, Shusuke Nakazawa, Jetsumon Sattabongkot, Rachanee Udomsangpetch, Osamu Kaneko.
Abstract
Plasmodium falciparum SURFIN(4.1) is a type I transmembrane protein thought to locate on the merozoite surface and to be responsible for a reversible adherence to the erythrocyte before invasion. In this study, we evaluated surf(4.1) gene segment encoding extracellular region for polymorphism, the signature of positive selection, the degree of linkage disequilibrium, and temporal change in allele frequency distribution in P. falciparum isolates from Thailand in 1988-89, 2003, and 2005. We found that SURFIN(4.1) is highly polymorphic, particularly at the C-terminal side of the variable region located just before a predicted transmembrane region. A signature of positive diversifying selection on the variable region was detected by multiple tests and, to a lesser extent, on conserved N-terminally located cysteine-rich domain by Tajima's D test. Linkage disequilibrium between sites over a long distance (> 1.5 kb) was detected, and multiple SURFIN(4.1) haplotype sequences detected in 1988/89 still circulated in 2003. Few of the single amino acid polymorphism allele frequency distributions were significantly different between the 1988/89 and 2003 groups, suggesting that the frequency distribution of SURFIN(4.1) extracellular region remained stable over 14 years.Entities:
Keywords: Plasmodium falciparum; allele frequency distribution; positive diversifying selection
Year: 2012 PMID: 23264727 PMCID: PMC3521052 DOI: 10.2149/tmh.2012-12
Source DB: PubMed Journal: Trop Med Health ISSN: 1348-8945
Fig. 1.Schematic of the SURFIN4.1 and SURFIN4.2 domain structure. Extracellular region is divided into 4 parts: N-terminal (Nter), CRD, and variable regions 1 and 2 (Var1 and Var2). WR domain in the intracellular region is indicated. Positions are after 3D7 line sequences based on the previous report [3, 5].
Oligonucleotide primers used for sequencing of surf4.1
| Name | Sequence |
|---|---|
| F6_7G8 | GATGAAAACGATGCATTTGTTC |
| F6_842 | GTGGATAACTATGCATTTGTTC |
| F6 | GATGGACTTAATGGATTTGATC |
| F7 | TATTGTTCTGGAGATGAATGTG |
| F8 | GAGAGAATGTAGTTTCTACTGCTGG |
| R5 | TCCCTTCATATCTTTTGGATTTA |
| R6 | ACGCAATATCATTATGATGAGG |
| R6_D10 | AAGACAATTTCATTATGATCAGG |
| R6_7G8 | CAACGCAATTTCATTATAATCTGG |
| R6_842 | ACCGCAATTTCATTGTTTTCTGA |
| R8 | CATTTGTTATATAAGAATGAGTACCTTCA |
Fig. 2.Sliding window plot of nucleotide diversity and amino acid polymorphism of SURFIN4.1 extracellular region in Plasmodium falciparum Asian isolates. Window length of 90 bp and step size of 3 bp is used for the sliding window plot (top). The number of the amino acid type at each amino acid position (middle) and a scheme of SURFIN4.1 extracellular region (bottom) is shown in scale to visualize the location of the polymorphic sites. SURFIN4.1 extracellular region was divided into 4 parts: N-terminal (Nter), CRD, and variable regions 1 and 2 (Var1 and Var2). A total of 62 sequences from Thai isolates and 1 sequence from Myanmar isolate were used. Nucleotide and amino acid positions are 3D7 line sequences.
Fig. 3.Sliding window plots of Tajima’s D, Fu and Li’s D* and F* for Plasmodium falciparum surf4.1 sequence encoding extracellular region in Thai isolates. Thirty-seven sequences from the 1988/89 group are used. Sites above the solid line (percentage point α = 0.975) and broken line (α = 0.995 for Tajima’s D and α = 0.99 for Fu and Li’s D* and F*) depart significantly from neutrality (two-tailed), suggesting diversifying selection. Nucleotide numbers are after 3D7 line sequence. Window length is 90 bp and step size is 3 bp. Asterisks indicate positions showing a negative peak value with 0.05 ≤ p < 0.1.
Nucleotide diversity of Plasmodium falciparum surf from Asian isolates (n = 63)
| region (position) | Number of sites (base) | indels | k (SE) | π (SE) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Extracellular | 2304 | 9 | 149.87 | 125.77 | 1811.79 | 24.10 | 492.21 | 0.069 | 0.074 | 0.051 | 1.45 | 0.002 |
| (4–2310) | (6.86) | (7.47) | (8.59) | (2.91) | (8.59) | (0.003) | (0.005) | (0.006) | ||||
| Nter | 147 | 0 | 0.69 | 0.69 | 117.86 | 0.00 | 29.14 | 0.005 | 0.006 | 0.000 | ∞ | ns |
| (4–150) | (0.45) | (0.49) | (2.02) | (0.00) | (2.06) | (0.003) | (0.004) | (0.000) | ||||
| CRD | 435 | 0 | 3.13 | 2.40 | 354.58 | 0.73 | 80.42 | 0.007 | 0.007 | 0.009 | 0.78 | ns |
| (151–585) | (1.13) | (0.91) | (3.53) | (0.41) | (3.48) | (0.003) | (0.003) | (0.006) | 0.00006 | |||
| Variable region | 1722 | 9 | 146.06 | 122.68 | 1339.35 | 23.37 | 382.65 | 0.092 | 0.100 | 0.065 | 1.54 | |
| (586–2310) | (6.80) | (6.44) | (6.80) | (2.87) | (7.05) | (0.005) | (0.006) | (0.008) | ||||
| Var1 | 918 | 3 | 35.56 | 30.25 | 722.26 | 5.31 | 195.74 | 0.040 | 0.044 | 0.028 | 1.57 | ns |
| (586–1506) | (3.40) | (3.40) | (5.18) | (1.34) | (5.40) | (0.004) | (0.005) | (0.007) | ||||
| Var2 | 804 | 6 | 110.50 | 92.44 | 617.09 | 18.06 | 186.92 | 0.157 | 0.174 | 0.106 | 1.64 | 0.0003 |
| (1507–2310) | (5.49) | (5.10) | (4.18) | (2.38) | (4.33) | (0.009) | (0.011) | (0.016) |
Extracellular, extracellular region; Nter, N-terminal segment; CRD, cysteine-rich domain; Var1, variable region 1; Var2, variable region 2; sites, sites nucleotide analyzed; indels, insertion/deletion polymorphism; k, the average number of nucleotide differences; N and S, average numbers of non-synonymous and synonymous sites; π, pairwise nucleotide diversity; dN, number of non-synonymous substitutions over number of non-synonymous sites; dS, number of synonymous substitutions over number of synonymous sites; SE, standard error computed using the Nei-Gojobori method with Jukes-Cantor correction. SE was estimated using the bootstrap method with 500 replication. The number of synonymous (Sd) and non-synonymous (Nd) differences was calculated using the Nei-Gojobori method. p value indicates the statistical difference between dN and dS, tested using one-tail Z-test with 500 bootstrap pseudosamples implemented in MEGA. ns indicates not significant by two-tailed Fisher’s exact tests. Number is after 3D7 line sequence.
Test of neutrality for Plasmodium falciparum surf from Thai 1988/89 isolates (n = 37)
| region | nucleotide position | number of sites (base) | η | two variants | more than two variants | π | θ | Tajima’s | Fu and Li’s | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| (singleton) | (not singleton) | |||||||||||
| Extracellular | 4-2310 | 2304 | 425 | 396 | 15 | 352 | 29 | 0.06 | 0.04 | 2.13* | 1.76** | 2.25** |
| Nter | 4-150 | 147 | 2 | 2 | 0 | 2 | 0 | 0.004 | 0.003 | 0.68 | 0.78 | 0.87 |
| CRD | 151-585 | 435 | 9 | 8 | 0 | 7 | 1 | 0.007 | 0.004 | 1.58 | 1.31 | 1.64 |
| Variable region | 586-2310 | 1722 | 414 | 386 | 15 | 343 | 28 | 0.08 | 0.05 | 2.14* | 1.75** | 2.25** |
| Var1 | 586-1506 | 918 | 105 | 102 | 11 | 88 | 3 | 0.038 | 0.027 | 1.61 | 1.12 | 1.53 |
| Var2 | 1507-2310 | 804 | 309 | 284 | 4 | 255 | 25 | 0.137 | 0.08 | 2.31* | 1.95** | 2.47** |
Extracellular , extracellular region; Nter, N-terminal segment; CRD, cysteine-rich domain; Var1, variable region 1; Var2, variable region 2; sites, nucleotide sites analyzed; η, the total number of mutations; S, number of segregating sites; π, observed nucleotide diversity; θ, the expected nucleotide diversity under neutrality derived from S. * indicates p < 0.05 and ** indicates p < 0.02. Sequence number is after 3D7 line sequence.
Fig. 4.Polymorphic amino acid sites of SURFIN4.1 extracellular region among 62 Thai isolates. Only polymorphic sites were selected and aligned, then haplotype numbers were given to each sequence as described in the text. Amino acid residues are shown with green, pink or yellow for dimorphic sites, and green, pink and yellow for trimorphic sites. Fourth or fifth substitutions are shown with cyan or black colors. Singleton amino acid substitutions are shown with red color. The arrow and asterisk on the bottom of the scheme indicate amino acid position for which significant change was detected in the frequency distribution and the position where Fu and Li’s D* and F* showed negative peak value, respectively. Thin lines under the scheme for the 1988/89 group connect the sites for which recombination was detected.
Fig. 5.Linkage disequilibrium of Plasmodium falciparum surf4.1 sequence encoding extracellular region in Thai isolates. The number of polymorphic sites analyzed was 283. Red filled square, p < 0.001 after Bonferroni correction (more conservative); asterisks, p < 0.001; black filled triangle, 0.001 ≤ p < 0.01; dark gray filled triangle, 0.01 ≤ p < 0.05; light gray open triangle, not significant. Correlation coefficients for │D'│ and r2 were –0.189 and –0.273, respectively.