| Literature DB >> 34828264 |
Lana Bitencourt Chaves1, Glaucia de Oliveira Guimarães1, Daiana de Souza Perce-da-Silva2, Dalma Maria Banic3, Paulo Renato Rivas Totino4, Ricardo Luiz Dantas Machado5, Rodrigo Nunes Rodrigues-da-Silva6, Lilian Rose Pratt-Riccio4, Cláudio Tadeu Daniel-Ribeiro4, Josué da Costa Lima-Junior1.
Abstract
The Plasmodium vivax Cysteine-Rich Protective Antigen (PvCyRPA) has an important role in erythrocyte invasion and has been considered a target for vivax malaria vaccine development. Nonetheless, its genetic diversity remains uncharted in Brazilian malaria-endemic areas. Therefore, we investigated the pvcyrpa genetic polymorphism in 98 field isolates from the Brazilian Amazon and its impact on the antigenicity of predicted B-cell epitopes. Genetic diversity parameters, population genetic analysis, neutrality test and the median-joining network were analyzed, and the potential amino acid polymorphism participation in B-cell epitopes was investigated. One synonymous and 26 non-synonymous substitutions defined fifty haplotypes. The nucleotide diversity and Tajima's D values varied across the coding gene. The exon-1 sequence had greater diversity than those of exon-2. Concerning the prediction analysis, seven sequences were predicted as linear B cell epitopes, the majority contained in conformational epitopes. Moreover, important amino acid polymorphism was detected in regions predicted to contain residues participating in B-cell epitopes. Our data suggest that the pvcyrpa gene presents a moderate polymorphism in the studied isolates and such polymorphisms alter amino acid sequences contained in potential B cell epitopes, an important observation considering the antigen potentiality as a vaccine candidate to cover distinct P. vivax endemic areas worldwide.Entities:
Keywords: Brazilian Amazon; Plasmodium vivax; PvCyRPA; genetic diversity; malaria; prediction; vaccine candidate
Mesh:
Substances:
Year: 2021 PMID: 34828264 PMCID: PMC8623135 DOI: 10.3390/genes12111657
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Polymerase chain reaction (PCR) primers used for the amplification of the pvcyrpa gene.
| PCR Primers | Direction | Sequence (5′–3′) | Gene Length (bp) | |
|---|---|---|---|---|
| Exon-1 | PvCyRPA_F1 | Forward primer | TGCAATTTTCCTCTTCTTCTTCC | 600 |
| PvCyRPA_R1 | Reverse primer | CCCCATGTTCTTCCCTTGTCTT | ||
| Exon-2 | PvCyRPA_F2 | Forward primer | CCTGATAAACTACGAAGAGCTCCA | 466 |
| PvCyRPA_R2 | Reverse primer | CCTCGTATAGTAAAGCGTGTGT |
Specific primers used for conventional PCR and sequencing of the pvcyrpa gene.
Figure 1Molecular detection of the pvcyrpa gene by PCR. (A) Schematic representation of pvcyrpa gene, showing the fragments (exons 1 and 2) that were successfully amplified by two different primers sets (PvCyRPA F1/R1 and PvCyRPA F2/R2). (B) PCR amplification of the pvcyrpa gene. The figure shows agarose gel of pvcyrpa fragments amplified using PvCyRPA_F1/R1 primer and PvCyRPA_F2/R2 primer sets, respectively: Lane 1 and 9:100 bp molecular marker; Lane 2: negative control (water); Lane 3: P. falciparum in vitro culture (amplification with pvcyrpa primers); Lane 4: P. falciparum in vitro culture (amplification with p126 primers); Lane 5 and 7: PCR positive control (P. vivax infected sample); Lanes 6 and 8: P. vivax samples.
Polymorphisms in the PvCyRPA of P. vivax isolates from the Brazilian Amazon.
| Sal-1 a | Substitutions b | Isolates | |||||
|---|---|---|---|---|---|---|---|
| CZS (31) | ML (17) | GJ (4) | MAO (37) | OIA (9) | Total (98) | ||
| G69 | G69 | 12 (39%) | 7 (41%) | 2 (50%) | 19 (51%) | 3 (33%) | 43 (44%) |
| E86 | E86 | 4 (13%) | 1 (6%) | 0 | 0 | 2 (22%) | 7 (7%) |
| T90 | T90 | 14 (45%) | 10 (59%) | 2 (50%) | 19 (51%) | 2 (22%) | 47 (48%) |
| K93 | K93 | 14 (45%) | 9 (53%) | 2 (50%) | 19 (51%) | 3 (33%) | 47 (48%) |
| H95 | H95 | 14 (45%) | 9 (53%) | 2 (50%) | 19 (51%) | 3 (33%) | 47 (48%) |
| R122 | R122 | 24 (77%) | 15 (88%) | 4 (100%) | 30 (81%) | 7 (78%) | 80 (82%) |
| E126 | E126 | 14 (45%) | 7 (41%) | 2 (50%) | 17 (46%) | 3 (33%) | 43 (44%) |
| G127 | G127 | 14 (45%) | 7 (41%) | 2 (50%) | 17 (46%) | 2 (22%) | 42 (43%) |
| I129 | I129 | 9 (29%) | 6 (35%) | 2 (50%) | 11 (30%) | 3 (33%) | 31 (32%) |
| K131 | K131 | 23 (74%) | 13 (76%) | 4 (100%) | 30 (81%) | 7 (78%) | 77 (79%) |
| Q142 | Q142 | 17 (55%) | 8 (47%) | 2 (50%) | 15 (41%) | 6 (67%) | 48 (49%) |
| Q142 | 1 (3%) | 2 (12%) | 0 | 0 | 0 | 3 (3%) | |
| D145 | D145 | 9 (29%) | 6 (35%) | 2 (50%) | 11 (30%) | 3 (33%) | 31 (32%) |
| D145 | 8 (26%) | 2 (12%) | 0 | 6 (16%) | 3 (33%) | 19 (19%) | |
| D149 | D149 | 23 (74%) | 13 (76%) | 4 (100%) | 16 (43%) | 6 (67%) | 62 (63%) |
| K150 | K150 | 6 (19%) | 5 (29%) | 2 (50%) | 0 | 0 | 13 (13%) |
| A154 | A154 | 20 (65%) | 11 (65%) | 4 (100%) | 19 (51%) | 6 (67%) | 60 (61%) |
| E159 | E159 | 23 (74%) | 14 (82%) | 4 (100%) | 19 (51%) | 6 (67%) | 66 (67%) |
| N170 | N170 | 12 (39%) | 8 (47%) | 2 (50%) | 7 (19%) | 0 | 29 (30%) |
| K185 | K185 | 10 (32%) | 6 (35%) | 2 (50%) | 0 (0%) | 1 (11%) | 19 (19%) |
| A187 | A187 | 2 (6%) | 2 (12%) | 0 | 2 (5%) | 0 | 6 (6%) |
| V220 | V220 | 2 (6%) | 2 (12%) | 0 | 13 (35%) | 2 (22%) | 19 (19%) |
| D232 | D232 | 9 (29%) | 3 (18%) | 3 (75%) | 7 (19%) | 3 (33%) | 25 (26%) |
| D259 | D259 | 12 (39%) | 8 (47%) | 1 (25%) | 18 (49%) | 2 (22%) | 41 (42%) |
| V260 | V260 | 12 (39%) | 8 (47%) | 1 (25%) | 18 (49%) | 2 (22%) | 41 (42%) |
| T261 | T261 | 12 (39%) | 8 (47%) | 1 (25%) | 18 (49%) | 2 (22%) | 41 (42%) |
| G264 | G264 | 14 (45%) | 8 (47%) | 1 (25%) | 18 (49%) | 2 (22%) | 43 (44%) |
| E267 | *** | 14 (45%) | 8 (47%) | 1 (25%) | 18 (49%) | 2 (22%) | 43 (44%) |
| V287 | V287 | 4 (13%) | 2 (12%) | 0 | 1 (3%) | 0 | 7 (7%) |
a Reference Sequence Salvador-1; b Substitutions: The first letter represents the amino acid in the Sal-1 reference sequence and the last in bold the replacing amino acid; Study areas: CZS, Cruzeiro do Sul; ML, Mâncio Lima; GJ, Guajará; MAO, Manaus; OIA, Oiapoque. The number of samples per locality is indicates in parentheses; ***: Deletion of amino acid in position E267; N (%): Frequency of substitution in each locality.
Figure 2Frequency of substitutions in the PvCyRPA of P. vivax isolates from Brazilian Amazon. The two colors (black and gray) represent the two alleles within a population: reference allele and variant allele, respectively. The identification code represents the amino acid in the Sal-1 reference sequence (first character), followed by the position of this residue (number), and the replacing amino acid observed at the same position (last character).
Diversity parameters and natural selection for the pvcyrpa gene in P. vivax isolates from the Brazilian Amazon.
| Diversity | Entire Coding | Exon-1 | Exon-2 |
|---|---|---|---|
|
| |||
| No. of segregating sites (S) | 32 | 22 | 10 |
| No. of haplotypes (h) | 20 | 11 | 11 |
| Haplotype diversity (Hd) | 0.955 ± 0.022 | 0.875 ± 0.031 | 0.867 ± 0.038 |
| Nucleotide diversity (π) | 0.01248 ± 0.00037 | 0.01489 ± 0.00050 | 0.00921 ± 0.00048 |
| Tajima’s test (D) | 2.24953 * | 2.17555 * | 1.96672 ns1 |
| Tajima’s test (Z) | 0.774 | 1.330 | 0.884 |
|
| |||
| No. of segregating sites (S) | 32 | 22 | 10 |
| No. of haplotypes (h) | 11 | 8 | 5 |
| Haplotype diversity (Hd) | 0.926 ± 0.045 | 0.846 ± 0.062 | 0.750 ± 0.088 |
| Nucleotide diversity (π) | 0.01272 ± 0.00064 | 0.01517 ± 0.00095 | 0.00938 ± 0.00077 |
| Tajima’s test (D) | 1.63472 ns | 1.59064 ns | 1.47204 ns |
| Tajima’s test (Z) | 1.014 | 1.391 | 0.743 |
|
| |||
| No. of segregating sites (S) | 20 | 13 | 7 |
| No. of haplotypes (h) | 3 | 2 | 2 |
| Haplotype diversity (Hd) | 0.833 ± 0.222 | 0.667 ± 0.204 | 0.500 ± 0.265 |
| Nucleotide diversity (π) | 0.01168 ± 0.00338 | 0.01444 ± 0.00442 | 0.00792 ± 0.00420 |
| Tajima’s test (D) | 1.18178 ns | 2.24818 * | −0.81734 ns |
| Tajima’s test (Z) | 0.502 | 0.686 | 0.440 |
|
| |||
| No. of segregating sites (S) | 27 | 18 | 9 |
| No. of haplotypes (h) | 19 | 11 | 6 |
| Haplotype diversity (Hd) | 0.943 ± 0.021 | 0.886 ± 0.031 | 0.818 ± 0.024 |
| Nucleotide diversity (π) | 0.01116 ± 0.00030 | 0.01285 ± 0.00050 | 0.00887 ± 0.00034 |
| Tajima’s test (D) | 2.75595 ** | 2.60865 ** | 2.42821 * |
| Tajima’s test (Z) | 0.908 | 1.258 | 0.685 |
|
| |||
| No. of segregating sites (S) | 28 | 18 | 10 |
| No. of haplotypes (h) | 9 | 7 | 6 |
| Haplotype diversity (Hd) | 1.000 ± 0.052 | 0.917 ± 0.092 | 0.889 ± 0.091 |
| Nucleotide diversity (π) | 0.01136 ± 0.00113 | 0.01361 ± 0.00171 | 0.00830 ± 0.00233 |
| Tajima’s test (D) | 0.74327 ns | 1.13901 ns | −0.01607 ns |
| Tajima’s test (Z) | 0.932 | 1.156 | 0.402 |
InDels were not included in the analysis; ns not significant (p > 0.10); ns1 not significant (0.10 > p > 0.05); * p < 0.05; ** p < 0.01.
Figure 3Median-joining network of pvcyrpa haplotypes. Each circle represents a unique haplotype and the color of the circles represents the geographic origins of each haplotype, while the size of the circle represents the frequency of each haplotype. Lines separating haplotypes represent mutational steps.
Comparison of amino acid variations of the PvCyRPA protein between the Brazilian Amazon and genome sequences available worldwide.
| Isolates | Exon-1 | Exon-2 | ||||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Codon Number/Amino Acid Residue | ||||||||||||||||||||||||||||||||
| 63 | 69 | 86 | 90 | 93 | 95 | 122 | 125 | 126 | 127 | 129 | 131 | 142 | 145 | 147 | 149 | 150 | 154 | 159 | 170 | 180 | 185 | 187 | 220 | 232 | 260 | 261 | 264 | 287 | 361 | |||
| Sal-1 | I | G | E | T | K | H | R | R | E | G | I | K | Q | D | Q | D | K | A | E | N | L | K | A | V | D | D | V | T | G | V | Y | |
| Brazilian Amazon | Cruzeiro do Sul | • | G/A | E/Q | T/K | K/E | H/N | R/K | • | E/K | G/E | I/S | K/E | Q/R/K | D/G/N | • | D/G | K/R | A/D | E/D | N/D | • | K/E | A/V | V/I | D/N | D/E | V/E | T/P | G/S | V/I | • |
| Mâncio Lima | • | G/A | E/Q | T/K | K/E | H/N | R/K | • | E/K | G/E | I/S | K/E | Q/R/K | D/G/N | • | D/G | K/R | A/D | E/D | N/D | • | K/E | A/V | V/I | D/N | D/E | V/E | T/P | G/S | V/I | • | |
| Guajará | • | G/A | • | T/K | K/E | H/N | R/K | • | E/K | G/E | I/S | K/E | Q/K | D/G | • | D/G | K/R | A/D | E/D | N/D | • | K/E | • | • | D/N | D/E | V/E | T/P | G/S | • | • | |
| Manaus | • | G/A | • | T/K | K/E | H/N | R/K | • | E/K | G/E | I/S | K/E | Q/K | D/G/N | • | D/G | • | A/D | E/D | N/D | • | • | A/V | I | D/N | D/E | V/E | T/P | G/S | V/I | • | |
| Oiapoque | • | G/A | E/Q | T/K | K/E | H/N | R/K | • | E/K | G/E | I/S | K/E | Q/K | D/G/N | • | D/G | • | A/D | E/D | • | • | K/E | • | V/I | D/N | D/E | V/E | T/P | G/S | • | • | |
| GenBank | gb|KMZ81773.1 IndiaVII |
|
|
|
|
|
| K |
|
|
|
|
|
|
|
| G |
|
|
|
| H |
|
|
| N |
|
|
|
|
|
|
| gb|KNA00954.1 North Korean | T |
|
| K |
|
| K |
| K | E |
| E | R |
|
| G |
| D | D |
|
|
| V | I |
|
|
|
|
|
| H | |
| gb|KMZ94334.1 Mauritania I |
|
|
|
|
|
| K |
|
| S | E | K | G |
| G |
|
|
|
|
|
|
|
| N | E | E | P | S |
|
| ||
| gb|KMZ87926.1 Brazil I |
| A |
| K | E | N | K |
| K | E |
| E | K | N |
| G |
| D | D |
|
|
| V |
| N |
|
|
|
|
|
| |
| Sanger Institute | SCO 66052.1 |
| A |
| N |
| N | K | T | K | E |
| E | T |
| K | G |
| D | D | D |
|
| V | I |
|
|
|
|
| I |
|
| SCO 71483.1 |
|
| K |
|
| K |
|
|
|
| E | R |
|
| G |
|
| D |
|
|
| V |
| N |
|
|
|
|
|
| ||
| SGX 76259.1 |
| A | Q | K | E | N | K |
| K | E |
| E | K |
|
|
|
|
| D |
|
| E | P |
|
| E | E | P |
|
|
| |
| Mexico | Southern Mexican |
| A |
|
| E | N |
|
| K | E |
|
|
| N |
|
|
| D | D | D |
|
|
|
|
| E | E | P | S |
|
|
The amino acid variants of the PvCyRPA protein were compared to the reference Sal-1 reference sequence (PVX_090240). • Indicates identical amino acid residues compared to the Sal-1 strain. Codons from 69 to 187 and from 220 to 363 correspond to exon-1 and exon-2, respectively.
Figure 4Homology analysis of PvCyRPA protein across isolates. (A) Circular alignment of amino acid sequences of PvCyRPA protein with reference sequence Sal-1 and P01 strain. The outer circle shows the amino acid scale. Green and gray bars on the second circle show the percent matching among all sequences used in the analysis. Inner circles show the sequence alignment in which each amino acid was represented by a different color. The legend presents the ordination to outer to inner rings: Scale; Percent Matching; Consensus OIA; Consensus MAO; Consensus GJ; Consensus ML; Consensus CZS; P01 strain; Sal-1 strain. (B) The pairwise distance among all sequences studied.
B-cell epitope mapping of PvCyRPA (Salvador-1 Strain).
| Sequence | Start | End | Lenght | Conformational Epitope | Overlapped Prediction | ||||
|---|---|---|---|---|---|---|---|---|---|
| Ellipro | BCPred | ABCpred | BepiPred | Emini | |||||
| PvCyRPA-B1 | 81 | 91 | 11 | Yes | X | X | X | X | - |
| PvCyRPA-B2 | 119 | 129 | 11 | No | X | - | X | X | X |
| PvCyRPA-B3 | 134 | 151 | 18 | No | X | X | X | X | X |
| PvCyRPA-B4 | 181 | 192 | 12 | Yes | X | - | - | X | X |
| PvCyRPA-B5 | 241 | 249 | 9 | Yes | X | X | - | X | - |
| PvCyRPA-B6 | 257 | 272 | 16 | Yes | X | X | - | - | X |
| PvCyRPA-B7 | 312 | 330 | 19 | Yes | X | X | X | - | - |
Full-length protein was analyzed for B-cell linear epitopes by the Ellipro algorithm, following confirmation by the overlap between the predictions of the algorithms BCPred, ABCpred, BepiPred, and Emini. (X): confirmed; (-): no confirmed.
Predicted B cell epitopes of PvCyRPA and related polymorphisms.
| Epitope | Sequence | Epitope |
|---|---|---|
| PvCyRPA(I81–L91) | Sal-1 | INSTWETQTTL |
| E86Q | INSTW | |
| T90K | INSTWETQT | |
| T90N | INSTWETQT | |
| PvCyRPA(Y119–I129) | Sal-1 | YKQRSKREGTI |
| R122K | YKQ | |
| R125T | YKQRSK | |
| E126K | YKQRSKR | |
| G127E | YKQRSKRE | |
| I129S | YKQRSKREGT | |
| PvCyRPA(N134–151) | Sal-1 | NSVTGTIYQKEDVQIDKE |
| Q142R | NSVTGTIY | |
| Q142K | NSVTGTIY | |
| Q142T | NSVTGTIY | |
| D145G | NSVTGTIYQKE | |
| D145N | NSVTGTIYQKE | |
| Q147K | NSVTGTIYQKEDV | |
| D149G | NSVTGTIYQKEDVQI | |
| K150R | NSVTGTIYQKEDVQID | |
| PvCyRPA(S181–F192) | Sal-1 | SYEYKTANKDNF |
| K185E | SYEY | |
| A187V | SYEYKT | |
| A187P | SYEYKT | |
| PvCyRPA(R241–R249) | Sal-1 | RISTNNTAR |
| PvCyRPA(T257–C272) | Sal-1 | TLDVTNEGKKEYKFKC |
| D259E | TL | |
| V260E | TLD | |
| T261P | TLDV | |
| G264S | TLDVTNE | |
| PvCyRPA(T312–G330) | Sal-1 | TEQNAIVVKPKVQNDDLNG |
Sequences containing polymorphism inserted on predicted epitopes and marked in red letters.