| Literature DB >> 30314440 |
Hương Giang Lê1,2, Jung-Mi Kang1,2, Mya Moe3, Hojong Jun4, Thị Lam Thái1,2, Jinyoung Lee4, Moe Kyaw Myint3, Khin Lin3, Woon-Mok Sohn1, Ho-Joon Shin5, Tong-Soo Kim4, Byoung-Kuk Na6,7.
Abstract
BACKGROUND: Plasmodium falciparum circumsporozoite protein (PfCSP) is one of the most extensively studied malaria vaccine candidates, but the genetic polymorphism of PfCSP within and among the global P. falciparum population raises concerns regarding the efficacy of a PfCSP-based vaccine efficacy. In this study, genetic diversity and natural selection of PfCSP in Myanmar as well as global P. falciparum were comprehensively analysed.Entities:
Keywords: Circumsporozoite protein; Genetic polymorphism; Myanmar; Natural selection; Plasmodium falciparum
Mesh:
Substances:
Year: 2018 PMID: 30314440 PMCID: PMC6186114 DOI: 10.1186/s12936-018-2513-0
Source DB: PubMed Journal: Malar J ISSN: 1475-2875 Impact factor: 2.979
Fig. 1Genetic polymorphisms of the N-terminal non-repeat region in Myanmar and global PfCSP. a Polymorphic pattern of the N-terminal non-repeat region in Myanmar PfCSP. Only a haplotype of the N-terminal non-repeat region was identified in 51 Myanmar PfCSP sequences. All Myanmar PfCSP had 19 amino acids length insertion (NNGDNGREGKDEDKRDGNN) in the middle of the region. The dots represent residues identical to the reference sequence of 3D7 (XM_001351086). The dashes represent gaps to maximize the alignment. The predicted T-cell epitope region was shaded with pink. The conserved motif (KLKQP) involved in the sporozoite invasion of mosquito salivary gland and in binding to hepatocytes prior to invasion was underlined. b Polymorphic patterns of the N-terminal non-repeat region in global PfCSP. The N-terminal non-repeat region of PfCSP was largely conserved in global PfCSP. Amino acid changes at seven positions and insertion of 19 amino acids length (NNGDNGREGKDEDKRDGNN) were the major variations observed in global PfCSP. The frequencies of the amino acid polymorphisms and insertion observed in global PfCSP differed by geographical origins. PNG Papua New Guinea
Fig. 2Genetic polymorphisms in the central repeat region of Myanmar PfCSP. a Polymorphic characters in the central repeat region of Myanmar PfCSP. Sequence alignment revealed that the central repeat region of Myanmar PfCSP showed polymorphic characters with 14 distinct haplotypes. Residues identical to those of the 3D7 PfCSP reference sequence (XM_001351086) are shown by dots. The dashes represent gaps introduced to maximize the alignment. The NANP repeat and NVDP repeat are indicated as one and two, respectively. The two new tetrapeptides, NTNP and NANS, are shaded with yellow. The number of haplotypes indicated the number of Myanmar PfCSP belonging to each haplotype. b Frequency of number of repeats in the central region among Myanmar PfCSP
Fig. 3Frequency comparison of NANP repeats in the central repeat region of global PfCSP from different geographic regions. The numbers of NANP repeats differed in the global PfCSP population. In general, Asian PfCSP had a higher number of NANP repeats than African, Oceanian, and South American PfCSP
Fig. 4Polymorphic patterns of the C-terminal non-repeat region in Myanmar and global PfCSP. a Amino acid polymorphisms in the C-terminal non-repeat region of Myanmar PfCSP. The three different haplotypes (H1–H3) of the C-terminal non-repeat region were identified in Myanmar PfCSP. Th2R and Th3R regions are shaded with sky blue and yellow, respectively. Amino acid changes at particular amino acid positions are indicated as red. The number of haplotypes is the number of PfCSP sequences belonging to each haplotype. The dots represent residues identical to the reference sequence of 3D7 (XM_001351086). b Comparative analysis of polymorphic patterns of the C-terminal non-repeat region in global PfCSP. The pattern of amino acid changes differed by country or geographic continent, but most amino acid changes were concentrated in the Th2R and Th3R regions in global PfCSP. A logo plot was constructed for each PfCSP population using the WebLogo program. PNG Papua New Guinea
Genetic polymorphism and tests of neutrality in C-terminal region of global PfCSP
| Country |
|
| S | Total No. of mutations | H | Hd ± SD | π ± SD | dN–dS | Tajima’s D | Fu and Li’s D | Fu and Li’s F |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||||
| Myanmar | 51 | 0.92 | 3 | 3 | 3 | 0.329 ± 0.072 | 0.004 ± 0.003 | 0.005 ± 0.003 | 0.764a | 0.889a | 0.992a |
| Vietnam | 23 | 4.39 | 14 | 16 | 23 | 1.000 ± 0.013 | 0.019 ± 0.001 | 0.024 ± 0.008 | 0.050a | 0.862a | 0.722a |
| Thailand | 223 | 1.74 | 12 | 13 | 8 | 0.485 ± 0.039 | 0.009 ± 0.001 | 0.011 ± 0.004 | − 0.483a | 1.475d | 0.886a |
| Philippines | 42 | 5.79 | 18 | 19 | 9 | 0.821 ± 0.041 | 0.028 ± 0.001 | 0.033 ± 0.009 | 1.019a | 1.649b | 1.701c |
| Iran | 21 | 0.93 | 2 | 2 | 2 | 0.467 ± 0.075 | 0.004 ± 0.001 | 0.006 ± 0.004 | 1.564a | 0.858a | 1.205a |
| India | 15 | 3.71 | 18 | 19 | 13 | 0.981 ± 0.031 | 0.019 ± 0.004 | 0.023 ± 0.007 | − 1.486a | − 1.173a | − 1.448a |
| Total | 375 | 2.73 | 25 | 29 | 38 | 0.653 ± 0.027 | 0.013 ± 0.001 | 0.016 ± 0.005 | − 1.032a | − 0.235a | − 0.698a |
|
| |||||||||||
| PNG | 94 | 1.17 | 5 | 5 | 4 | 0.532 ± 0.034 | 0.006 ± 0.001 | 0.007 ± 0.004 | 0.419a | − 0.012a | 0.151a |
| Vanuatu | 116 | 0.55 | 2 | 2 | 2 | 0.276 ± 0.046 | 0.003 ± 0.001 | 0.003 ± 0.002 | 0.706a | 0.674a | 0.801a |
| Solomon Islands | 51 | 0.83 | 4 | 4 | 3 | 0.397 ± 0.066 | 0.004 ± 0.0007 | 0.005 ± 0.003 | − 0.1426a | − 1.209a | − 1.029a |
| Total | 261 | 1.09 | 5 | 5 | 4 | 0.522 ± 0.012 | 0.005 ± 0.0002 | 0.007 ± 0.004 | 0.626a | − 0.214a | 0.087a |
|
| |||||||||||
| Kenya | 18 | 5.83 | 17 | 20 | 12 | 0.922 ± 0.051 | 0.028 ± 0.003 | 0.035 ± 0.011 | 0.010a | 0.295a | 0.247a |
| Cameroon | 9 | 4.56 | 12 | 15 | 7 | 0.944 ± 0.07 | 0.022 ± 0.005 | 0.028 ± 0.009 | − 0.842a | − 0.689a | − 0.813a |
| Gambia | 44 | 5.83 | 18 | 21 | 20 | 0.944 ± 0.019 | 0.028 ± 0.002 | 0.036 ± 0.009 | 0.677a | 1.006a | 1.068a |
| Ghana | 33 | 5.02 | 18 | 22 | 19 | 0.932 ± 0.028 | 0.024 ± 0.003 | 0.031 ± 0.009 | − 0.257a | 0.474a | 0.278a |
| Tanzania | 60 | 6.45 | 22 | 26 | 30 | 0.969 ± 0.008 | 0.031 ± 0.001 | 0.039 ± 0.011 | 0.499a | 0.852a | 0.863a |
| Senegal | 11 | 4.54 | 10 | 12 | 9 | 0.964 ± 0.051 | 0.019 ± 0.002 | 0.024 ± 0.007 | 0.475a | 1.149a | 1.106a |
| Total | 175 | 21.76 | 150 | 169 | 58 | 0.968 ± 0.004 | 0.105 ± 0.021 | 0.169 ± 0.055 | − 0.831a | 2.648b | 1.205a |
|
| |||||||||||
| Venezuela | 10 | 6.44 | 14 | 16 | 7 | 0.911 ± 0.077 | 0.031 ± 0.003 | 0.034 ± 0.014 | 0.645a | 0.799a | 0.857a |
| Brazil | 42 | 2.33 | 8 | 8 | 3 | 0.459 ± 0.08 | 0.011 ± 0.002 | 0.014 ± 0.005 | 0.721a | 1.309d | 1.317a |
| Total | 52 | 3.74 | 16 | 18 | 9 | 0.630 ± 0.065 | 0.018 ± 0.002 | 0.022 ± 0.007 | − 0.188a | 0.431a | 0.257a |
PNG Papua New Guinea, n number of sequences analysed, S number of Segregating sites, K average number of nucleotide differences, H number of Haplotypes, Hd haplotype diversity, π observed average pairwise nucleotide diversity, dN rate of non-synonymous mutations, dS rate of synonymous mutations, SD standard deviation
aP > 0.1; b P < 0.02; c P < 0.05; d 0.05 < P < 0.1
Fig. 5Nucleotide diversity of C-terminal non-repeat region among global PfCSP. Sliding window analysis presented nucleotide diversity (π) values in global PfCSP sequences. The values of nucleotide diversity per site (π) were calculated using the DnaSP with a window length of 10 bp and step size of 5 bp. PNG Papua New Guinea
Recombination events in global PfCSP
| Country | Ra | Rb | Rm |
|---|---|---|---|
| Myanmar | 0.000 | 0.001 | 0 |
| Vietnam | 0.504 | 116 | 3 |
| Thailand | 0.000 | 0.001 | 2 |
| Philippines | 0.082 | 16.9 | 1 |
| India | 0.000 | 0.001 | 0 |
| Iran | 0.043 | 8.2 | 2 |
| PNG | 0.024 | 4.9 | 1 |
| Solomon Islands | 0.000 | 0.001 | 0 |
| Vanuatu | 0.000 | 0.001 | 0 |
| Ghana | 0.071 | 19.5 | 6 |
| Tanzania | 0.171 | 35.1 | 6 |
| Cameroon | 0.077 | 15.9 | 2 |
| Senegal | 0.172 | 41.2 | 2 |
| Kenya | 0.110 | 22.7 | 6 |
| Gambia | 0.116 | 23.9 | 5 |
| Brazil | 0.000 | 0.001 | 0 |
| Venezuela | 0.1403 | 28.9 | 2 |
The R and Rm were estimated excluding the sites containing alignment gaps or those segregating for three nucleotides
Ra recombination parameter between adjacent sites, Rb recombination parameter for entire gene, Rm minimum number of recombination events between adjacent sites, PNG Papua New Guinea
Fig. 6Haplotype network analysis of global PfCSP. Haplotype network of the PfCSP was constructed using the NETWORK program version 5.0.0.3 with the Median Joining algorithm. The network showed 103 haplotypes found in 817 global PfCSP sequences. Branch lengths were the proportion of divergence. The size of each node indicated the proportion of the total haplotype frequencies. The color of each node corresponded to a different geographic region