| Literature DB >> 25102049 |
Jaclyn K Mann1, John P Barton2, Andrew L Ferguson3, Saleha Omarjee1, Bruce D Walker4, Arup Chakraborty5, Thumbi Ndung'u6.
Abstract
Viral immune evasion by sequence variation is a major hindrance to HIV-1 vaccine design. To address this challenge, our group has developed a computational model, rooted in physics, that aims to predict the fitness landscape of HIV-1 proteins in order to design vaccine immunogens that lead to impaired viral fitness, thus blocking viable escape routes. Here, we advance the computational models to address previous limitations, and directly test model predictions against in vitro fitness measurements of HIV-1 strains containing multiple Gag mutations. We incorporated regularization into the model fitting procedure to address finite sampling. Further, we developed a model that accounts for the specific identity of mutant amino acids (Potts model), generalizing our previous approach (Ising model) that is unable to distinguish between different mutant amino acids. Gag mutation combinations (17 pairs, 1 triple and 25 single mutations within these) predicted to be either harmful to HIV-1 viability or fitness-neutral were introduced into HIV-1 NL4-3 by site-directed mutagenesis and replication capacities of these mutants were assayed in vitro. The predicted and measured fitness of the corresponding mutants for the original Ising model (r = -0.74, p = 3.6×10-6) are strongly correlated, and this was further strengthened in the regularized Ising model (r = -0.83, p = 3.7×10-12). Performance of the Potts model (r = -0.73, p = 9.7×10-9) was similar to that of the Ising model, indicating that the binary approximation is sufficient for capturing fitness effects of common mutants at sites of low amino acid diversity. However, we show that the Potts model is expected to improve predictive power for more variable proteins. Overall, our results support the ability of the computational models to robustly predict the relative fitness of mutant viral strains, and indicate the potential value of this approach for understanding viral immune evasion, and harnessing this knowledge for immunogen design.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25102049 PMCID: PMC4125067 DOI: 10.1371/journal.pcbi.1003776
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
HIV-1 NL4-3 Gag mutants selected for testing of predicted energy costs (E) by an in vitro HIV-1 replication capacity assay.
| Mutant | Gag subunit | Category of pairs/triple | E |
| 186I | p24 | 78.74 | |
| 269E | p24 | 43.43 | |
| 186I269E | p24 | Sector 3 | Infinity |
| 295E | p24 | 22.81 | |
| 186I295E | p24 | Sector 3, high E | Infinity |
| 181R | p24 | 44.62 | |
| 310T | p24 | 6.26 | |
| 181R310T | p24 | Sector 3, high E | Infinity |
| 182S | p24 | 25.13 | |
| 198V | p24 | Infinity | |
| 182S198V | p24 | Sector 3, high E | Infinity |
| 179G | p24 | 56.09 | |
| 229K | p24 | 44.63 | |
| 179G229K | p24 | Sector 3, high E | 97.01 |
| 174G | p24 | Infinity | |
| 243P | p24 | 66.65 | |
| 174G243P | p24 | Sector 3, high E | Infinity |
| 168I | p24 | 38.58 | |
| 315G | p24 | 19.11 | |
| 168I315G | p24 | HLA-associated, high E | Infinity |
| 331R | p24 | 11.77 | |
| 186I331R | p24 | HLA-associated, high E | Infinity |
| 302R | p24 | 11.10 | |
| 302R315G | p24 | HLA-associated, high E | Infinity |
| 315G331R | p24 | HLA-associated, high E | Infinity |
| 190I | p24 | 41.52 | |
| 190I302R | p24 | HLA-associated, high E | Infinity |
| 219Q | p24 | 6.73 | |
| 242N | p24 | 8.68 | |
| 219Q242N | p24 | p24, low E, compensatory | 10.80 |
| 146P | p24 | 7.22 | |
| 147L | p24 | 3.42 | |
| 146P147L | p24 | p24, low E, compensatory | 6.58 |
| 326S | p24 | 4.59 | |
| 310T326S | p24 | p24, low E, sector 3 | 10.53 |
| 173T | p24 | 5.92 | |
| 173T286K | p24 | 4.89 | |
| 173T286K147L | p24 | p24, low E, triple | 4.12 |
| 12K | p17 | 3.74 | |
| 12K54A | p17 | p17, low E | 4.84 |
| 86F | p17 | 8.00 | |
| 92M | p17 | 8.74 | |
| 86F92M | p17 | p17, high E | Infinity |
Energy cost predicted by original Ising model [11] taking into account differences in NL4-3 and the multiple sequence alignment used in model generation. E is 2.98 for wild-type NL4-3 p24 and 3.43 for wild-type NL4-3 p17.
Mutation pairs within an immunologically vulnerable group of co-evolving residues, termed sector 3, that we previously identified qualitatively [12].
E>90 or E = ∞ were considered high E values and E<15 were considered low E values.
Energy costs (E) of HIV-1 NL4-3 Gag mutants predicted by computational models.
| Mutant | Gag subunit | Ising E | Regularized Ising E | Regularized Potts E |
| 186I | p24 | 78.74 | 9.98 | 11.24 |
| 269E | p24 | 43.43 | 11.46 | 12.18 |
| 186I269E | p24 | Infinity | 17.77 | 18.97 |
| 295E | p24 | 22.81 | 9.05 | 11.03 |
| 186I295E | p24 | Infinity | 15.36 | 17.79 |
| 181R | p24 | 44.62 | 13.55 | 12.12 |
| 310T | p24 | 6.26 | 5.87 | 7.20 |
| 181R310T | p24 | Infinity | 15.74 | 14.87 |
| 182S | p24 | 25.13 | 7.11 | 9.68 |
| 198V | p24 | Infinity | 12.32 | - |
| 182S198V | p24 | Infinity | 15.77 | - |
| 179G | p24 | 56.09 | 11.14 | 11.57 |
| 229K | p24 | 44.63 | 10.52 | 11.68 |
| 179G229K | p24 | 97.01 | 17.99 | 18.81 |
| 174G | p24 | Infinity | 15.47 | 11.71 |
| 243P | p24 | 66.65 | 11.1 | 11.08 |
| 174G243P | p24 | Infinity | 22.9 | 18.32 |
| 168I | p24 | 38.58 | 9.8 | 10.30 |
| 315G | p24 | 19.11 | 6.85 | 10.64 |
| 168I315G | p24 | Infinity | 14.78 | 16.39 |
| 331R | p24 | 11.77 | 7.37 | 9.17 |
| 186I331R | p24 | Infinity | 13.68 | 15.85 |
| 302R | p24 | 11.1 | 7.75 | 9.23 |
| 302R315G | p24 | Infinity | 12.4 | 15.30 |
| 315G331R | p24 | Infinity | 10.56 | 15.22 |
| 190I | p24 | 41.52 | 8.2 | 11.41 |
| 190I302R | p24 | Infinity | 12.28 | 16.12 |
| 219Q | p24 | 6.73 | 5.65 | 6.90 |
| 242N | p24 | 8.68 | 6.7 | 8.05 |
| 219Q242N | p24 | 10.8 | 8.04 | 10.07 |
| 146P | p24 | 7.22 | 5.62 | 6.26 |
| 147L | p24 | 3.42 | 4.25 | 6.54 |
| 146P147L | p24 | 6.58 | 5.77 | 4.74 |
| 326S | p24 | 4.59 | 4.78 | 5.69 |
| 310T326S | p24 | 10.53 | 7.72 | 8.81 |
| 173T | p24 | 5.92 | 5.81 | 7.02 |
| 173T286K | p24 | 4.89 | 6.56 | 7.75 |
| 173T286K147L | p24 | 4.12 | 5.93 | 6.78 |
| 12K | p17 | 3.74 | 1.91 | 4.38 |
| 12K54A | p17 | 4.84 | 3.19 | 5.63 |
| 86F | p17 | 8 | 4.53 | 6.00 |
| 92M | p17 | 8.74 | 6.01 | 9.43 |
| 86F92M | p17 | Infinity | 9.52 | 12.57 |
E is 2.98 for wild-type NL4-3 p24 and 3.43 for wild-type NL4-3 p17.
E is 3.67 for wild-type NL4-3 p24 and 1.64 for wild-type NL4-3 p17.
E is 4.43 for wild-type NL4-3 p24 and 2.81 for wild-type NL4-3 p17.
The 198V mutation was not observed within the MSA used to fit the Potts model, precluding the fitted model from assigning an energy to viral strains containing this point mutation.
Figure 1Replication capacities of NL4-3 viruses encoding mutations in HIV-1 Gag.
Graphs show replication capacities of NL4-3 viruses encoding (A) Gag p24 mutation pairs with high E values that were previously identified to be in vulnerable co-evolving groups [12] and single mutations within these pairs; (B) Gag p24 HLA-associated pairs with high E values and single mutations within these pairs; (C) Gag p24 pairs/triple with low E values as well as single mutations within these combinations; and (D) Gag p17 pairs including single mutations within the pairs. Those mutants that (i) were not viable or (ii) were not viable unless further mutations developed (indicated with an asterisk), were assigned a replication capacity of zero. Mutation pairs and triples are shown in grey while single mutations within these combinations are shown in black. Replication capacities of mutant viruses are expressed relative to the replication capacity of wild-type NL4-3 virus (RC = 1). Bars represent the mean of three independent experiments and error bars represent standard deviation from the mean.
Figure 2Relationship between predicted E values and replicative capacities of HIV-1 NL4-3 Gag mutants.
Scatter plots showing strong correlations between measured replication capacities of mutants and E values predicted by (A) original Ising (Pearson's correlation, r = −0.74 and , two-tailed test, n = 30), (B) regularized Ising (Pearson's correlation, and , two-tailed test, n = 43) and (C) regularized Potts (Pearson's correlation, and , two-tailed test, n = 41) models. In the original Ising model (panel A), mutants with E values of infinity (n = 13) are excluded from the correlation.
Figure 3Classification of HIV-1 NL4-3 Gag mutants as unfit/fit using predicted E values.
Graphs show the ability of E classifiers, predicted by regularized Ising (panel A) and Potts (panel B) models, to correctly classify HIV-1 NL4-3 Gag mutants into unfit (RC<0.5) and fit (RC>0.5) categories. The measured replication capacities of mutants classified as fit or unfit according to their predicted E values were compared with the Mann-Whitney test and p values are shown.