| Literature DB >> 30735814 |
Lynette Isabella Ochola-Oyier1, Kevin Wamae2, Irene Omedo2, Christabel Ogola2, Abneel Matharu2, Jean Pierre Musabyimana2, Francis K Njogu3, Kevin Marsh3.
Abstract
Erythrocyte surface proteins have been identified as receptors of Plasmodium falciparum merozoite proteins. The ligand-receptor interactions enable the parasite to invade human erythrocytes, initiating the clinical symptoms of malaria. These interactions are likely to have had an evolutionary impact on the genes that encode the ligand and receptor proteins. We used sequence data from Kilifi, Kenya to detect departures from neutrality in a paired analysis of P. falciparum merozoite ligands and their erythrocyte receptor genes from the same population. We genotyped parasite and human DNA obtained from 93 individuals with severe malaria. We examined six merozoite ligands EBA175, EBL1, EBA140, MSP1, Rh4 and Rh5, and their corresponding erythrocyte receptors, glycophorin (Gyp) A, GypB, GypC, band 3, complement receptor (CR) 1 and basigin, focusing on the regions involved in the ligand-receptor interactions. Positive Tajima's D values (>1) were observed only in the MSP1 C-terminal region and EBA175 region II, while negative values (<-1) were observed in EBL-1 region II, Rh4, basigin exons 3 and 5, CR1 exon 5, Gyp B exons 2, 3 and 4 and Gyp C exon 2. Additionally, ebl-1 region II and basigin exon 3 showed extreme negative values in all three tests, Tajima's D, Fu & Li D* and F*, ≤ - 2. A large majority of the erythrocyte receptor and merozoite genes have a negative Tajima's D even when compared with previously published whole genome data. Thus, highlighting EBA175 region II and MSP1-33, as outlier genes with a positive Tajima's D (>1). Both these genes contain multiple polymorphisms, which in the case of EBA175 may counteract receptor polymorphisms and/or evade host immune responses and in MSP1 the polymorphisms may primarily evade host immune responses.Entities:
Keywords: Erythrocyte; Ligand; Malaria; Receptor; Selection
Mesh:
Substances:
Year: 2019 PMID: 30735814 PMCID: PMC6403450 DOI: 10.1016/j.meegid.2019.02.004
Source DB: PubMed Journal: Infect Genet Evol ISSN: 1567-1348 Impact factor: 3.342
Fig. 1Scheme of the six erythrocyte receptor and merozoite ligand genes. The curly brackets indicate the regions of the genes identified as being involved in the receptor-ligand interaction that were sequenced. The scheme indicates the positions of the exons and the lengths are relative to differentiate the bigger from the smaller genes. These regions were amplified and sequenced to determine whether they were under selection.
Summary of DNA polymorphisms and population genetic statistics of the merozoite ligands and erythrocyte receptor genes.
p < .05; p < .02; S- segregating sites; Sn- singletons; π - nucleotide diversity; Gyp A exon 3 was sequenced but the data was a mix of Gyp A and Gyp B sequences and was unresolvable, ebl-1 region I contained 2 stop codons and ebl—1 region II contained 1 stop codon, Bold text represent the merozoite ligands, grey shade highlights the exon+ intron analyses.
The frequency of the merozoite ligand and erythrocyte receptor haplotypes.
| Gene | Region sequenced | Haplotypes | Frequency, n (%) | Gene | Exons sequenced | Haplotypes | Frequency, n (%) |
|---|---|---|---|---|---|---|---|
| EBA140 | REGION II ( | INGKK | 20 (47.6) | Gyp C | exon 2 | PP | 53 (98.2) |
| codons 185,239,257, | VSGKK | 8 (19.1) | codon 20 | PR | 1 (1.9) | ||
| 261,285 | VSGTK | 6 (14.3) | |||||
| VNGRE | 3 (7.1) | ||||||
| INGRE | 2 (4.8) | ||||||
| ISGKK | 1 (2.4) | ||||||
| VNGKK | 1 (2.4) | ||||||
| VSVTK | 1 (2.4) | ||||||
| EBA175 | REGION II ( | KEEKKSISENK | 4 (13.3) | Gyp A | exon 2 ( | TT,DG,SS | 30 (56.6) |
| codons 226, 274, 279, 286, | KEKEKPISENK | 4 (13.3) | codon 23 | SS,GG,SS | 7 (13.2) | ||
| 388, 390, 401,402,403, | KEEKNSISKNK | 3 (10) | TT,GG,SS | 7 (13.2) | |||
| 404,405 | EKEKKPISENK | 2 (6.7) | TT,EE,SS | 3 (5.7) | |||
| KEEEKPISENK | 2 (6.7) | TT,DD,SS | 2 (3.8) | ||||
| KEEKKPISENK | 2 (6.7) | TT,DG,SF | 2 (3.8) | ||||
| KEEKNSKM | 2 (6.7) | TT,DD,SF | 1 (1.9) | ||||
| KKEENSKM | 2 (6.7) | SL,GG,SS | 1 (1.9) | ||||
| KKEKKPISENK | 2 (6.7) | ||||||
| KKEKNSISKNK | 2 (6.7) | ||||||
| EEKEKSISENK | 1 (3.3) | ||||||
| KEEEKPISKNK | 1 (3.3) | ||||||
| KEEKNSISENK | 1 (3.3) | ||||||
| KEKENPISENK | 1 (3.3) | ||||||
| KKEKNSNM | 1 (3.3) | ||||||
| EBL1 | Region I ( | QFF* | 15 (57.7) | Gyp B | exon 2 ( | EE,LL,TT,EE | 33 (91.7) |
| codons 39,187,188,189, | QLSK | 4 (15.4) | codons 13, 20, 23, | EE,WW,SS,GG | 1 (2.8) | ||
| 191,192,193 | *LSK | 4 (15.4) | 24 | EE,LW,SS,EG | 1 (2.8) | ||
| *FF* | 3 (11.5) | EK,LL,TT,EE | 1 (2.8) | ||||
| Region II ( | IYGKHN | 32 (84.2) | exon 3 ( | EE,TT,LL,RR,FF,TT | 36 (80) | ||
| codons 342,347,658, | ICGKHN | 2 (5.3) | codons 47, 48, 51, | EE,MM,LL,RR,FF,TT | 5 (11.1) | ||
| 664,690 | IYG*H* | 2 (5.3) | 54,55, 56 | EE,TM,LL,RR,FF,TT | 3 (6.7) | ||
| IYCKYN | 1 (2.6) | KK,GG,FF,LL,LL,SS | 1 (2.2) | ||||
| MYGKHN | 1 (2.6) | exon 4 ( | AA,II,SS | 16 (64) | |||
| codon 59, 66, 84 | AA,II,TT | 5 (20) | |||||
| AA,II,ST | 2 (8) | ||||||
| AA,MM,SS | 1 (4) | ||||||
| TT,II,SS | 1 (4) | ||||||
| MSP1 | 42 kDa fragment ( | KDD | 19 (40.4) | Band 3 | exons 17 & 18 contained 1 synonymous SNP | ||
| codons 1472,1516,1533 | KDN | 15 (31.9) | |||||
| DD | 6 (12.8) | ||||||
| HD | 4 (8.5) | ||||||
| DN | 3 (6.4) | ||||||
| 19 kDa fragment ( | EKSNGL | 22 (44.9) | |||||
| codons 1620,1667,1675, | QKSNGL | 10 (20.4) | |||||
| 1676,1677,1692 | QKSNGF | 8 (16.3) | |||||
| ETSSRL | 3 (6.1) | ||||||
| EKSSRL | 2 (4.1) | ||||||
| QKNNGL | 2 (4.1) | ||||||
| QTSNGF | 1 (2) | ||||||
| QTSNRL | 1 (2) | ||||||
| Rh4 | exon 2 (n = 45) | NKK | 25 (55.6) | CR1 | exon 5 ( | TT | 77 (96.3) |
| codons 435,438,500 | NQK | 16 (35.6) | codon 173 | TA | 3 (3.7) | ||
| KQK | 2 (4.4) | ||||||
| KKK | 1 (2.2) | ||||||
| NQI | 1 (2.2) | ||||||
| Rh5 | exon 2 ( | DYHYIK | 19 (48.7) | Basigin | exon 3 ( | AA,II,DD,AA,LL | 56 (91.8) |
| codons 133,147,148,203, | DYHYIN | 6 (15.4) | codons 28, 93,115, | AA,II,DD,AA,LV | 1 (1.6) | ||
| 410,429 | DHDYIN | 5 (12.8) | 130,135 | AA,II,DD,PP,LL | 2 (1.6) | ||
| DHDYIK | 4 (10.3) | AA,II,DY,PP,LL | 3 (1.6) | ||||
| DYHCIK | 2 (5.1) | AA,IM,DD,AA,LL | 4 (1.6) | ||||
| DYHCIN | 1 (2.6) | AE,II,DD,AA,LL | 5 (1.6) | ||||
| DYHYMK | 1 (2.6) | exon 5 ( | LL | 68 (98.5) | |||
| NYHCIN | 1 (2.6) | codon 90 | FF | 1 (1.5) | |||
| exon 6 ( | EE | 65 (98.5) | |||||
| codon 146 | DD | 1 (1.5) | |||||
CR1 exon 4 and Gyp B exon 5 did not contain any SNPs.
Fig. 2Scatter plot showing the correlations between the summary statistics indices for each erythrocyte receptor-merozoite ligand pair. The merozoite ligand Tajima's D value was plotted against its erythrocyte receptor Tajima's D value, the same was done for the Fu & Li D* and F* plots. The values used from the erythrocyte receptors with data from more than one exon were as follows: for band 3 the average of both exons 17 and 18 plotted against MSP1–19 since it is the region that primarily interacts with the receptor, BSG only exon 5 was considered since previous literature has shown altering codon 90 lowers binding (Crosnier et al., 2011) and it was plotted against Rh5, Gyp B- was an average of exons 2 and 3, similar to Gyp A for which exons 2 and 3 are considered to be involved in binding EBA175, plotted against EBL-1 regions and Gyp A exon 2 was plotted against EBA175. (A) Shows the correlation of the Tajima's D statistic for each receptor-ligand pair, (B) the correlation of the Fu & Li’s D* statistic for each receptor-ligand pair and (C) is the correlation of the Fu & Li’s F* statistic for each receptor-ligand pair. The Pearson product-moment correlation coefficient, r, is shown on the graphs.
Fig. 3The structure of the Rh5-BSG interaction. The Rh5 structure is shown in yellow and represents amino acid residues from 160 to 242 & 302–504 and the SNPs identified in this study are highlighted in red. It is only the SNP at codon C203Y of Rh5 that is involved in the interaction with basigin, while I410M and K429N fall outside the binding region with basigin. The pink structure represents basigin from amino acid residues 23–203 with the SNP (L90F) identified in this study highlighted in black and the Ok variant locus (92) is also shown. Both codons 90 and 92 appear to be involved in the interaction with Rh5. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)