| Literature DB >> 32620822 |
Napaporn Kuamsab1,2, Chaturong Putaporntip3, Urassaya Pattanawong1, Somchai Jongwutiwes1.
Abstract
Plasmodium vivax merozoite surface protein 3 (PvMSP3) is encoded by a multi-gene family. Of these, PvMSP3α, PvMSP3β and PvMSP3γ, are considered to be vaccine targets. Despite comprehensive analyses of PvMSP3α and PvMSP3β, little is known about structural and sequence diversity in PvMSP3γ. Analysis of 118 complete pvmsp3γ sequences from diverse endemic areas of Thailand and 9 reported sequences has shown 86 distinct haplotypes. Based on variation in insert domains, pvmsp3γ can be classified into 3 types, i.e. Belem, Salvador I and NR520. Imperfect nucleotide repeats were found in six regions of the gene; none encoded tandem amino acid repeats. Predicted coiled-coil heptad repeats were abundant in the protein and displayed variation in length and location. Interspersed phase shifts occurred in the heptad arrays that may have an impact on protein structure. Polymorphism in pvmsp3γ seems to be generated by intragenic recombination and driven by natural selection. Most P. vivax isolates in Thailand exhibit population structure, suggesting limited gene flow across endemic areas. Phylogenetic analysis has suggested that insert domains could have been subsequently acquired during the evolution of pvmsp3γ. Sequence and structural diversity of PvMSP3γ may complicate vaccine design due to alteration in predicted immunogenic epitopes among variants.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32620822 PMCID: PMC7335089 DOI: 10.1038/s41598-020-67222-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
The number of haplotypes, haplotype diversity and nucleotide diversity in pvmsp3γ by endemic area.
| Province | n | No. nucleotide | M | S | H | π ± S.E. | |
|---|---|---|---|---|---|---|---|
| Tak | 31 | 1,755-2,925 | 225 | 198 | 24 | 0.981 ± 0.014 | 0.147 ± 0.005 |
| Ubon Ratchathani | 32 | 1,773-2,922 | 181 | 166 | 30 | 0.996 ± 0.009 | 0.138 ± 0.004# |
| Chanthaburi | 31 | 1,791-2,925 | 155 | 142 | 21 | 0.929 ± 0.036 | 0.124 ± 0.004# |
| Yala & Narathiwat | 24 | 1,842-2,895 | 131 | 127 | 5 | 0.587 ± 0.102 | 0.079 ± 0.003## |
| Total | 118 | 1,755-2,925 | 252 | 217 | 77 | 0.968 ± 0.010 | 0.151 ± 0.005 |
n = number of isolates; M = the number of mutations; S = the number of segregating sites; h = haplotype diversity; π = nucleotide diversity. M and S are computed from all nonrepeat regions.
Tests of the hypothesis that π in Tak population equals the corresponding values in other populations: # p < 0.05; ## p < 0.001 (Z-tests).
Figure 1Domain organization of pvmsp3γ.
Nucleotide substitutions, the number of haplotypes (H), nucleotide diversity (π) and the rates of nucleotide substitutions at synonymous sites (dS) and nonsynonymous sites (dN) in pvmsp3γ.
| Domain | No. nucleotide | n# | M§ | S§ | H | ||||
|---|---|---|---|---|---|---|---|---|---|
| Total | Repeats | ||||||||
| Conserved I | 216 | — | 127 | 103 | 82 | 53 | 0.097 ± 0.012 | 0.067 ± 0.013 | |
| Conserved II | 129 | — | 127 | 52 | 44 | 17 | 0.098 ± 0.015 | 0.120 ± 0.025 | 0.080 ± 0.032 |
| Conserved III | 216 | — | 127 | 56 | 50 | 31 | 0.070 ± 0.010 | 0.055 ± 0.011 | 0.083 ± 0.027 |
| Conserved IV | 204 | — | 127 | 46 | 45 | 30 | 0.065 ± 0.009 | 0.060 ± 0.010 | 0.068 ± 0.025 |
| Variable I | 726-834 | 426-513 | 127 | 221 | 207 | 64 | 0.220 ± 0.010 | 0.283 ± 0.021 | 0.242 ± 0.046 |
| Variable II | 99-108 | — | 127 | 75 | 52 | 21 | 0.178 ± 0.026 | 0.100 ± 0.033 | |
| Variable III | 90-120 | 57-75 | 127 | 16 | 14 | 23 | 0.136 ± 0.023 | 0.151 ± 0.041 | 0.181 ± 0.108 |
| Insert A | 291-378 | 219-306 | 99 | 33 | 28 | 27 | 0.185 ± 0.015 | 0.103 ± 0.026 | 0.112 ± 0.042 |
| Insert B | 318-357 | 318-357 | 92 | — | — | 17 | 0.137 ± 0.012 | — | — |
| Insert C | 462 | 258 | 99 | 79 | 65 | 31 | 0.114 ± 0.009 | 0.103 ± 0.018 | 0.100 ± 0.022 |
#Thai isolates (n = 118) and worldwide strains (n = 9) including Belem (GenBank accession no. AF099663), Brazil I (AFMK01001074), North Korea I (AFNJ01000104), Salvador I (XM001613144), Indonesia I (KC907565), Chesson (KC907566), India VII (KC907567), Panama I (KC907568) and Vietnam II (KC907569). The numbers of isolates, haplotypes, mutations and segregating sites are represented by n, H, M and S, respectively. §Repeats are excluded. Tests of the hypothesis that mean dS equals that for dN: *p < 0.05; **p < 0.0005 (Z test).
Figure 2Distribution of major allelic types of pvmsp3γ in Thailand.
Distribution of coiled-coil repeat motifs and heptad phase shift in PvMSP3γ.
| Type | n* | % Heptad repeats in protein | % Phase shift in heptad repeats | Types of phase shift (%) | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Range | Mean | +1 | +2 | +3 | +4 | +5 | +6 | |||
| Belem | 72 | 38.4 - 57.8 | 50.7 | 12.3 | 1.1 | 0.7 | 4.1 | 2.1 | 2.1 | 2.2 |
| Salvador I | 6 | 41.0 - 44.7 | 42.5 | 10.0 | 0 | 1.9 | 3.9 | 2.3 | 1.9 | 0 |
| NR520 | 8 | 35.6 - 44.3 | 40.1 | 12.0 | 2.2 | 4.4 | 2.9 | 1.5 | 0 | 1.1 |
| Total | 86 | 35.6 - 57.8 | 49.1 | 12.2 | 1.1 | 0.9 | 4.0 | 2.1 | 2.0 | 2.1 |
*Number of haplotypes. Types of phase shift are based on the number of amino acids (n < 7) that interrupt heptad repeat arrays. Phase shifts +1, +3 and +4 are skip, stammer and stutter, respectively. Sequences are from 118 Thai isolates and 9 sequences in the GenBank database as listed in the footnote of Table 2.
Distribution of recombination breakpoints in pvmsp3γ among Thai isolates.
| Recombination | No. recombination breakpoints# | Distance (bp) | |
|---|---|---|---|
| Range | Median | ||
| Within or between conserved domains | 6 | 126-2782 | 1484 |
| Within or between variable domains | 16 | 49-2533 | 343 |
| Within or between insert domains | 9 | 106-437 | 246 |
| Between conserved and variable domains | 19 | 148-2758 | 808 |
| Between conserved and insert domains | 9 | 555-2279 | 1374 |
| Between variable and insert domains | 11 | 390-1350 | 791 |
| Total | 70 | 49-2782 | 630 |
*Consensus from 3 of 7 tests (RDP, GENCONV, Bootscan, MaxChi, Chimera, SiScan and TOPAL).
#Number counted from recombination events.
Recombination involved conserved domains = (34/765) x 100 = 0.44 site per 100 nucleotides.
Recombination involved variable domains = (46/939) x 100 to (46/1035) x 100 = 0.44-0.49 site per 100 nucleotides.
Recombination involved insert domains = (29/1110) x 100 = 0.26 site per 100 nucleotides.
Genetic differentiation of P. vivax populations in Thailand based on the pvmsp3γ locus.
| Tak | Ubon Ratchathani | Chanthaburi | Yala & Narathiwat | |
|---|---|---|---|---|
| Tak | 0.0631 | <10−5 | <10−5 | |
| Ubon Ratchathani | 0.0087 | <10−5 | <10−5 | |
| Chanthaburi | 0.0390 | 0.0353 | <10−5 | |
| Yala & Narathiwat | 0.2093 | 0.2007 | 0.2333 |
F indices and their respective p values are in lower and upper diagonals, respectively.
Figure 3Bayesian tree of 77 representative distinct pvmsp3γ sequences inferred from concatenated conserved domains containing702 nucleotide sites in comparison with pcymsp3 (GenBank accession nos. for PCY Cambodian, PCY PT1, PCY R0, PCY Ceylonensis, PCY Mulligan, PCY B, PCY Berok and PCY Gombak are KC907553-KC907559 and KC907561, respectively) and pimsp3 (GenBank accession no. KC907508, Leaf Monkey II). Values on the branches indicate percentage of Bayesian posterior probability; only values greater than 0.50 are shown. Scale bar is the number of nucleotide substitutions per site. The pvmsp3γ sequence types are shown on the right with their orthologues.