| Literature DB >> 32883283 |
Tuấn Cường Võ1,2, Hương Giang Lê1,2, Jung-Mi Kang1,2, Mya Moe3, Haung Naw1,2, Moe Kyaw Myint3, Jinyoung Lee4, Woon-Mok Sohn1, Tong-Soo Kim4, Byoung-Kuk Na5,6.
Abstract
BACKGROUND: Circumsporozoite surface protein (CSP) of malaria parasites has been recognized as one of the leading vaccine candidates. Clinical trials of vaccines for vivax malaria incorporating Plasmodium vivax CSP (PvCSP) have demonstrated their effectiveness in preventing malaria, at least in part. However, genetic diversity of pvcsp in the natural population remains a major concern.Entities:
Keywords: Circumsporozoite protein; Genetic polymorphism; Myanmar; Natural selection; Plasmodium vivax
Mesh:
Substances:
Year: 2020 PMID: 32883283 PMCID: PMC7650223 DOI: 10.1186/s12936-020-03366-7
Source DB: PubMed Journal: Malar J ISSN: 1475-2875 Impact factor: 2.979
Fig. 1Map of the study sites. Blood samples were collected from P. vivax infected patients, who resided in villages or townships in Naung Cho, Pyin Oo Lwin and Tha Beik Kyin, Myanmar and used in this study
Fig. 2Polymorphic patterns of the N-terminal non-repeat region of Myanmar pvcsp sequences. a VK210 variants. Seven distinct haplotypes of the N-terminal non-repeat region was identified in 143 Myanmar VK210 sequences. The dots represent residues identical to the reference sequence of Sal I (GU339059). Amino acid changes at particular amino acid positions are indicated as red. The dashes represent gaps to maximize the alignment. b VK247 variants. Seven distinct haplotypes of the N-terminal non-repeat region was identified in 28 Myanmar VK247 sequences. The dots represent residues identical to the reference sequence of PNG (M69059). Amino acid changes at particular amino acid positions are indicated as red. The dashes represent gaps to maximize the alignment. RI means KLKQP motif that is involved in the sporozoite invasion of mosquito salivary gland and in binding to hepatocytes prior to invasion. The total number of sequences for each hapoltype is listed in the right panel
Fig. 3Comparative analysis of polymorphic patterns of N-terminal non-repeat region in global pvcsp. a Global VK210 variants. b Global VK247 variants. The pattern of amino acid changes differed by country or geographic continent. A logo plot was constructed for each pvcsp population using the WebLogo program. Sal I, Salvador I (GU339059); PNG, Papua New Guinea (M69059). RI region was marked as an underline
Fig. 4Genetic polymorphism in the central repeat region (CRR) of Myanmar VK210 variants. Sequence alignment revealed that the CRR of Myanmar VK210 variants showed polymorphic characters with 118 distinct haplotypes. A total of 47 different types of peptide repeat motifs (PRMs) were identified in CRR of Myanmar VK210 variants. Differences in numbers, types, and combinations of PRMs produced the high levels of genetic diversity of the CRR in Myanmar VK210 variants
Fig. 5Frequency comparison of peptide repeat motifs (PRMs) in the central repeat region (CRR) of global VK210 variants. The numbers of PRMs differed in the global VK210 varaints
Fig. 6Genetic polymorphism in the central repeat rsegion (CRR) of Myanmar VK247 varaints. Sequence alignment revealed that the CRR of Myanmar VK247 variants showed polymorphic characters with 23 distinct haplotypes. A total of 8 different types of peptide repeat motifs (PRMs) were identified in CRR of Myanmar VK247 variants. Differences in numbers, types, and combinations of PRMs produced the high levels of genetic diversity of the CRR in Myanmar VK247 variants
Fig. 7Frequency comparison of peptide repeat motifs (PRMs) in the central repeat region of global VK247 variants. The numbers of PRMs differed in the global VK210 varaints
Fig. 8Genetic polymorphism of the C-terminal non-repeat region of Myanmar pvcsp sequences. a VK210 variants. A total of 27 distinct haplotypes of the C-terminal non-repeat region was identified in 143 Myanmar VK210 sequences. The dots represent residues identical to the reference sequence of Sal I (GU339059). Amino acid changes at particular amino acid positions are indicated as red. The dashes represent gaps to maximize the alignment. b VK247 variants. Ten distinct haplotypes of the C-terminal non-repeat region was identified in 28 Myanmar VK247 sequences. The dots represent residues identical to the reference sequence of PNG (M69059). Amino acid changes at particular amino acid positions are indicated as red. The dashes represent gaps to maximize the alignment. The total number of sequences for each hapoltype is listed in the right panel
Fig. 9Frequency of octa-peptide insertion and GGNA motif in the C-terminal non-repeat region of global pvcsp. a Frequency of ANKKAEDA octa-peptide insertion in the C-terminal non-repeat region of global VK210 variants. b Frequency of GGNA repeat motifs in the C-terminal non-repeat region of global VK210 variants. c Frequency of ANKKAGDA insertion in the C- terminal region of global VK247 variants. d) Frequency of GGNA repeat motifs in the C- terminal region of global VK247 variants
Genetic polymorphism and tests of neutrality in the N-terminal and C-terminal regions of Myanmar pvcsp
| Variant | Region | n | K | S | Eta | H | Hd ± SD | π ± SD | dN-dS | Tajima’s D | Fu and Li’s D | Fu and Li’s F |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| VK210 | N-terminal | 143 | 0.098 | 5 | 6 | 7 | 0.096 ± 0.034 | 0.0024 ± 0.0009 | 0.0008 | − 1.9359c | − 3.9086b | − 3.8409b |
| C-terminal | 143 | 0.209 | 10 | 11 | 11 | 0.186 ± 0.044 | 0.0035 ± 0.0009 | −0.0035 | − 2.2251d | − 3.5089b | − 3.6329b | |
| VK247 | N-terminal | 28 | 0.405 | 1 | 2 | 3 | 0.405 ± 0.094 | 0.0090 ± 0.0021 | 0.0117 | − 0.4445a | − 0.7114a | − 0.7369a |
| C-terminal | 28 | 0.495 | 6 | 6 | 6 | 0.331 ± 0.114 | 0.0067 ± 0.0027 | − 0.0150 | − 1.9719c | − 2.5946c | − 2.8039c |
n number of analysed sequences, K average number of nucleotide differences, S number of segregating sites, Eta total number of mutations, H number of haplotypes, Hd haplotype diversity, π observed average pairwise nucleotide diversity, SD standard deviation, dN rate of non-synonymous mutations, dS rate of synonymous mutations
aP > 0.1
bP < 0.02
cP < 0.05
dP < 0.01
Genetic polymorphism and tests of neutrality in the N-terminal and C-terminal non-repeat regions of global VK210 variants
| Region | Country | n | S | Eta | H | Hd ± SD | π ± SD | dN-dS | Tajima’s D | Fu and Li’s D | Fu and Li’s F | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N-terminal | Myanmar | 143 | 0.098 | 5 | 6 | 7 | 0.096 ± 0.034 | 0.0024 ± 0.0009 | 0.0008 | − 1.9359c | − 3.9086b | − 3.8409b |
| Cambodia | 31 | 0.125 | 1 | 1 | 2 | 0.125 ± 0.077 | 0.0030 ± 0.0018 | 0.0039 | − 0.7737a | 0.5907a | 0.2450a | |
| India | 79 | 1.972 | 23 | 28 | 28 | 0.661 ± 0.063 | 0.0470 ± 0.0074 | − 0.0909 | − 2.0261c | − 1.3952a | − 1.9514e | |
| Iran | 39 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
| South Korea | 39 | 0.051 | 1 | 1 | 2 | 0.051 ± 0.048 | 0.0012 ± 0.0011 | 0.0016 | − 1.1264a | − 1.7662a | − 1.8293a | |
| Brazil | 41 | 0.049 | 1 | 1 | 2 | 0.049 ± 0.046 | 0.0012 ± 0.0011 | − 0.0059 | − 1.1219a | − 1.7816a | − 1.8406a | |
| Mexico | 11 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Sudan | 30 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Vanuatu | 21 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
| C-terminal | Myanmar | 143 | 0.209 | 10 | 11 | 11 | 0.186 ± 0.044 | 0.0035 ± 0.0009 | − 0.0035 | − 2.2251d | − 3.5089b | − 3.6329b |
| Cambodia | 31 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
| India | 79 | 0.276 | 8 | 9 | 6 | 0.123 ± 0.051 | 0.0043 ± 0.0020 | − 0.0149 | − 2.1953d | − 4.4419b | − 4.3524b | |
| Iran | 39 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
| South Korea | 39 | 0.103 | 2 | 2 | 3 | 0.101 ± 0.065 | 0.0008 ± 0.0005 | − 0.0037 | − 1.4889a | − 2.4148e | − 2.4864e | |
| Brazil | 41 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Mexico | 11 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Sudan | 30 | 0.067 | 1 | 1 | 2 | 0.067 ± 0.061 | 0.0009 ± 0.0008 | − 0.0041 | − 1.1470a | − 1.6821a | − 1.7655a | |
| Vanuatu | 21 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
n number of analysed sequences, K avarage number of nucleotide differences, S number of segregating sites, Eta total number of mutations, H number of haplotypes, Hd haplotype diversity, π observed average pairwise nucleotide diversity, SD standard deviation, dN rate of non-synonymous mutations, dS rate of synonymous mutations
aP > 0.1
bP < 0.02
cP < 0.05
dP < 0.01
e0.05 < P < 0.1
Genetic polymorphism and tests of neutrality in N-terminal and C-terminal regions of global pvcsp VK247 variants
| Region | Country | n | S | Eta | H | Hd ± SD | π ± SD | dN-dS | Tajima’s D | Fu andLi’s D | Fu and Li’s F | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| N-terminal | Myanmar | 28 | 0.405 | 1 | 2 | 3 | 0.405 ± 0.094 | 0.0090 ± 0.0021 | 0.0117 | − 0.4445a | − 0.7114a | − 0.7369a |
| Cambodia | 10 | 0.2 | 1 | 1 | 2 | 0.200 ± 0.154 | 0.0029 ± 0.0022 | 0.0038 | − 1.1117a | − 1.2434a | − 1.3467a | |
| Iran | 11 | 1.309 | 3 | 3 | 2 | 0.436 ± 0.133 | 0.0190 ± 0.0058 | − 0.0543 | 0.9518a | 1.1271a | 1.2185a | |
| Mexico | 8 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Colombia | 25 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
| C-terminal | Myanmar | 28 | 0.495 | 6 | 6 | 6 | 0.331 ± 0.114 | 0.0067 ± 0.0027 | − 0.0150 | − 1.9719c | − 2.5946c | − 2.8039c |
| Cambodia | 10 | 0.400 | 2 | 2 | 2 | 0.200 ± 0.154 | 0.0039 ± 0.0030 | − 0.0172 | − 1.4009a | − 1.5866a | − 1.7190a | |
| Iran | 11 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Mexico | 8 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Colombia | 25 | 0.500 | 2 | 2 | 3 | 0.44 ± 0.095 | 0.0039 ± 0.0010 | 0.0017 | − 0.1215a | − 0.6754a | − 0.6012a |
n number of analysed sequences, K average number of nucleotide differences, S number of segregating sites, Eta total number of mutations, H number of haplotypes, Hd haplotype diversity, π observed average pairwise nucleotide diversity, SD standard deviation, dN rate of non-synonymous mutations, dS rate of synonymous mutations
aP > 0.1
bP < 0.02
cP < 0.05
dP < 0.01
e0.05 < P < 0.1