| Literature DB >> 30770839 |
Katrina J Spensley1, Paul S Wikramaratna2, Bridget S Penman3, Andrew Walker4, Adrian L Smith4, Oliver G Pybus4, Létitia Jean5, Sunetra Gupta4, José Lourenço6.
Abstract
Despite a dramatic increase in our ability to catalogue variation among pathogen genomes, we have made far fewer advances in using this information to identify targets of protective immunity. Epidemiological models predict that strong immune selection can cause antigenic variants to structure into genetically discordant sets of antigenic types (e.g. serotypes). A corollary of this theory is that targets of immunity may be identified by searching for non-overlapping associations of amino acids among co-circulating antigenic variants. We propose a novel population genetics methodology that combines such predictions with phylogenetic analyses to identify genetic loci (epitopes) under strong immune selection. We apply this concept to the AMA-1 protein of the malaria parasite Plasmodium falciparum and find evidence of epitopes among certain regions of low variability which could render them ideal vaccine candidates. The proposed method can be applied to a myriad of multi-strain pathogens for which vast amounts of genetic data has been collected in recent years.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30770839 PMCID: PMC6377634 DOI: 10.1038/s41598-018-37288-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Counts of amino acid associations between pairs of sites with high non-overlap (scaled MI > 0.312).
| Association | Amino Acid Combinations | Scaled MI | |||
|---|---|---|---|---|---|
| 496 vs 503 |
|
|
|
| 0.89 |
| 602 | 3 | 14 | 561 | ||
| 503 vs 512 |
|
|
|
| 0.61 |
| 565 | 52 | 40 | 524 | ||
| 451 vs 485 |
|
|
|
| 0.57 |
| 604 | 103 | 15 | 476 | ||
| 496 vs 512 |
|
|
|
| 0.53 |
| 552 | 68 | 54 | 523 | ||
| 283 vs 285 |
|
|
|
| 0.52 |
| 242 | 71 | 1 | 883 | ||
| 439 vs 451 |
|
|
|
| 0.5 |
| 660 | 72 | 46 | 390 | ||
| 206 vs 225 |
|
|
|
| 0.5 |
| 96 | 806 | 290 | 6 | ||
| 439 vs 485 |
|
|
|
| 0.41 |
| 588 | 144 | 30 | 406 | ||
| 448 vs 451 |
|
|
|
| 0.39 |
| 705 | 196 | 2 | 295 | ||
| 404 vs 405 |
|
|
|
| 0.34 |
| 108 | 635 | 367 | 84 | ||
Figure 1Degree of non-overlap between amino acid residues at pairs of dimorphic sites. Each site on the x-axis (positions 149–534) that is dimorphic is compared with all other dimorphic sites. Those with high scores above a stringent scaled MI threshold are shown by bars spanning both sites (red and green). Scaled MI threshold is based on the top 1% of observed scaled MI values (CI 98% of the observed distribution, upper limit of 0.312, horizontal dotted grey line). Red bars mark pairs of sites of interest, and the green bar marks the pair that is not of interest once phylogenetic relationships are taken into account.
Figure 2Empirical phylogenetic allelic structure, simulated parsimony score and scaled MI. (A) Example, ML phylogenetic trees of AMA-1 with branches coloured on the basis of observed combination of amino acid variants found at tips for different pairs of sites. Pink and cyan are used for the two most common amino acid variants (which are discordant for these sites, see Table 1) and black for the least common. (B) Empirical pairs of sites described in Fig. 1 and Table 1 are shown in relation to a null distribution of site-pairs from 1000 simulations that scored most highly (rank 1) on S-score (the product of scaled MI measuring non-overlap, and geometric mean of parsimony score indicating the minimum number of genetic changes required to explain the observed ancestral relationships, GMPS). Color key (scale) on the right is the two-dimensional density of GMPS and scaled MI of the simulated site-pairs. (A,B) Pairs of sites identified in red and green in the two panels match the pairs in Fig. 1.
Figure 3Surface representation of the crystal structure of AMA-1 with specific sites highlighted. The highlighted amino acids are the same in both panels and the ones relevant and described in Table 1 and Figs 1 and 2. The right panel is a 90 degree rotation of the left panel, in which Lys 512 and Met 496 are revealed. Amino acids are the consensus of all sequences.
Country-specific counts of amino acid associations between positions 439, 451 and 485 within unique sequences obtained from The Gambia, Kenya and Mali.
| Association | Country and Amino Acid Combinations | Scaled MI | ||||
|---|---|---|---|---|---|---|
| 439 vs 451 |
|
|
|
| ||
|
| 46 | 3 | 0 | 31 | 0.701 | |
|
| 46 | 3 | 9 | 27 | 0.402 | |
|
| 395 | 25 | 11 | 211 | 0.637 | |
|
|
|
|
| |||
| 439 vs 485 |
| 39 | 7 | 1 | 30 | 0.549 |
|
| 45 | 4 | 1 | 35 | 0.682 | |
|
| 359 | 61 | 17 | 205 | 0.453 | |
| 451 vs 485 |
|
|
|
| ||
|
| 39 | 4 | 1 | 35 | 0.673 | |
|
| 46 | 9 | 0 | 31 | 0.515 | |
|
| 366 | 40 | 10 | 244 | 0.608 | |
Counts of amino acid associations between positions 439, 451 and 485 in a population study (duplicate sequences not excluded) in Kilifi, Kenya.
| Association | Amino Acid Combinations | Scaled MI | |||
|---|---|---|---|---|---|
| 439 vs 451 |
|
|
|
| 0.564 |
| 61 | 3 | 9 | 55 | ||
| 439 vs 485 |
|
|
|
| 0.772 |
| 60 | 4 | 1 | 63 | ||
| 451 vs 485 |
|
|
|
| 0.644 |
| 61 | 9 | 0 | 59 | ||