| Literature DB >> 29636501 |
Sophie M Andrews1, Yonghong Zhang2, Tao Dong3, Sarah L Rowland-Jones1, Sunetra Gupta4, Joakim Esbjörnsson5,6.
Abstract
The HIV-1 envelope mutates rapidly to evade recognition and killing, and is a major target of humoral immune responses and vaccine development. Identification of common epitopes for vaccine development have been complicated by genetic variation on both virus and host levels. We studied HIV-1 envelope gp120 evolution in 12 Chinese former plasma donors infected with a purportedly single founder virus, with the aim of identifying common antibody epitopes under immune selection. We found five amino acid sites under significant positive selection in ≥50% of the study participants, and 22 sites consistent with antibody-mediated selection. Despite strong selection pressure, some sites housed a limited repertoire of amino acids. Structural modelling revealed that most of the variable amino acid sites were located on the exposed distal edge of the Gp120 trimer, whilst invariant sites clustered within the centre of the protein complex. Two sites, flanking the V3 hypervariable loop, represent novel antibody sites. Analysis of HIV-1 evolution in hosts infected with a narrow-source virus may provide insight and novel understanding of common epitopes under antibody-mediated selection. If verified in functional studies, such epitopes could be suitable as targets in vaccine development.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29636501 PMCID: PMC5893620 DOI: 10.1038/s41598-018-23913-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Demographic and clinical characteristics of the SM cohort study participants sampled.
| Patient | Sex1 | HLA Type | 2010 | 2011 | 2012 | 2014 | |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| A1 | A2 | B1 | B2 | C1 | C2 | VL2 | CD4 abs3 | CD4%4 | ART5 | VL2 | CD4 abs3 | CD4%4 | ART5 | VL2 | CD4 abs3 | CD4%4 | ART5 | VL2 | CD4 abs3 | CD4%4 | ART5 | ||
| SM007 | F | 1 | 2 | 40 | 57 | 6 | 8 | 7388 | 362 | 23 | N | 1445 | 300 | 24 | N | 2884 | 253 | 16 | N | 8733 | 253 | 15 | Y |
| SM021 | F | 2 | 29 | 44 | 38 | 4 | 7 | 35827 | 266 | 18 | N | N/R6 | N/R6 | N/R6 | — | 14782 | 206 | 20 | N | N/R6 | N/R6 | N/R6 | N/R6 |
| SM039 | F | 2 | 11 | 49 | 39 | 7 | 7 | 1612 | 492 | 30 | N | 791 | 346 | 28 | N | 1732 | 319 | 18 | Y | N/R6 | N/R6 | N/R6 | N/R6 |
| SM176 | F | 11 | 32 | 38 | 51 | 7 | 14 | 43569 | 429 | 30 | N | 435669 | N/R6 | N/R6 | N | 27348 | 95 | 3 | N | 50 | 495 | 28 | Y |
| SM209 | M | 24 | 31 | 51 | 54 | 1 | 14 | 44584 | 334 | 33 | N | 1195 | 354 | 32 | N | 4034 | 478 | 22 | N | 11922 | 242 | 14 | Y |
| SM335 | M | 30 | 31 | 13 | 40 | 6 | 3 | N/R6 | 294 | 36 | N | 7751 | 257 | 33 | N | 71263 | 228 | 16 | Y | 88620 | 59 | 6 | Y |
| SM358 | F | 2 | 11 | 49 | 50 | 3 | 6 | 50 | 469 | 25 | N | 1025 | 390 | 27 | N | 1716 | 387 | 25 | N | N/R6 | N/R6 | N/R6 | N/R6 |
| SM446 | M | 2 | 26 | 38 | 51 | 12 | 14 | N/R6 | 255 | 27 | N | 7226 | 374 | 31 | N | 3217 | 187 | 25 | N | N/R6 | N/R6 | N/R6 | N/R6 |
| SM505 | M | 11 | 2 | 35 | 52 | 12 | 12 | 8871 | 478 | 19 | N | 175519 | 337 | 30 | N | 210735 | 402 | 24 | N | 180 | 593 | 30 | Y |
| SM514 | F | 2 | 31 | 40 | 40 | 3 | 15 | 30403 | 389 | 11 | N | N/R5 | 1233 | 66 | N | 5166 | 216 | 7 | N | N/R6 | N/R6 | N/R6 | N/R6 |
| SM536 | F | 11 | 30 | 13 | 15 | 6 | 7 | 4812 | 567 | 31 | N | 23956 | 397 | 24 | N | 12637 | 351 | 13 | N | N/R6 | N/R6 | N/R6 | N/R6 |
| SM538 | M | 2 | 24 | 13 | 40 | 3 | 3 | 409157 | 92 | 10 | N | 50 | 73 | 12 | N | 50 | 52 | 9 | Y | N/R6 | N/R6 | N/R6 | N/R6 |
1Sex: F = Female; M = Male.
2VL = Viral load in RNA copies per ml.
3CD4 abs = CD4 count in cells per µl.
4CD4% = CD4 cells as percentage of lymphocytes.
5ART = On antiretroviral treatment: Y = Yes; N = No.
6N/R = Data not recorded or available.
Figure 1Patient-specific selection pressure within the HIV-1 envelope Gp120 protein. Ten of the 12 study participants yielded sequences from two or more time-points and were subjected to evolutionary analyses (Fig. S1A). All time-points with available sequences for the ten study participants were included in the analyses. Mean dN/dS ratios for each codon within each study participant. A dN/dS estimate greater than 1 indicates positive selection whilst an estimate of less than 1 indicates negative selection. Sites under significant selection shown in blue whilst sites that have not reached statistical significance are shown in red and are assumed neutral. Differences in the number of codons across study participants are the result of length variation in the V4 and V5 hypervariable loops.
Figure 2Summary of the selection pressure within the HIV-1 envelope Gp120 protein among the study participants. (A) The proportion (absolute) of study participants showing evidence of significant positive or negative selection across each aligned codon of the amplicon. Positive selection is shown in dark blue, whilst negative selection is shown in light blue. Sites under significant positive selection in at least 50% of study participants are shown in purple. Variable loops V3, V4 and V5 are contained within the three boxes, respectively. The dotted lines denote 50% of study participants. Antibody epitope clustering is shown in grey, whereby intensity denotes number of epitopes spanning that residue as reported in the LANL Immunology Database (http://www.hiv.lanl.gov/content/immunology). Sequences have been aligned to the HXB2 Gp120 reference sequence (accession number K03455), and position is relative to this alignment. (B) Homology-modelled structure of the SM cohort consensus Gp120 sequence in surface representation. Variable loops V3, V4 and V5 are shown in grey and sites under significant positive selection in 50% or more study participants are shown in purple. Structure has been modelled on a glycosylated HIV-1 Gp120 trimer (RCSB PDB 3J5M)[74]. For clarity, two molecules in the trimer are shown in line representation in grey.
Figure 3Mapping of variant frequencies. Unique variants identified in the SM cohort envelope Gp120 sequences. Major variants present in greater than 15% of the sequences are shown in purple whilst minor variants (<15%) are shown in grey. Position is relative to HXB2 Gp120 (accession number K03455).
Figure 4Overall presence of major variants. (A) Presence of major variants in individual study participants of the SM cohort at each time-point sampled. Those shown in dark purple are also under significant positive selection pressure within the specified individual, whilst those in light purple are either under significant negative selection or under no significant selection pressure. (B) Homology modelled structure of the SM cohort consensus Gp120 sequence, showing the variants under significant positive selection in 50% or more of the study participants in purple, and all invariant sites in orange. All variants except positions 345 and 424 are visible on the surface of the protein. Variable loops V3, V4 and V5 are shown in grey. Structure has been modelled on a glycosylated HIV-1 Gp120 trimer (RCSB PDB 3J5M)[74]. For clarity, two molecules in the trimer are shown in line representation in grey.
Figure 5Biophysical properties of amino acids found in sites housing major variants. The biophysical properties of the amino acids found in each site, coloured according to the Lesk format[81]. The table show the composition of each position as the total number of each amino acid found at that site.
Antibody epitope sequences corresponding to the sites identified on the surface of the HIV-1 Envelope. Data made available by the Los Alamos National Laboratory (LANL) Immunology Database (http://www.hiv.lanl.gov/content/immunology). Position is relative to HXB2 Gp120 (accession number K03455). Additional information about the neutralising antibody positions and associations can be found with references in the LANL HIV Genome Browser (https://www.hiv.lanl.gov/content/sequence/genome_browser/browser.html).
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| F223Y | 223 | GV4H3 | APAGFAIL | mouse | N | |
| 493-156 | EPIPIHYCAPAGFAILKCNN | mouse | N | |||
| J1 | GFAILKCNNK | mouse | N | |||
| T236S | 236 | 1006-30-D | KGSCKNVSTV |
| Y | This position has a strong co-variation with potency and structural proximity. This position is part of a common glycosylation site at position 234. |
| E268K | 268 | B12 | RPVVSTQLLLNGSLAEEEVV | mouse | Y | This is an antibody b12 signature site in which E or S has been associated with sensitivity, and K or R has been associated with resistance. Other antibodies associated with this site include VRC01. |
| 110.E | NGSLAEEEVVIRSVNFTDNA | mouse | Y | |||
| V270I | 270 | B12 | RPVVSTQLLLNGSLAEEEVV | mouse | N | |
| 110.E | NGSLAEEEVVIRSVNFTDNA | mouse | N | |||
| T278S | 278 | 110.E | NGSLAEEEVVIRSVNFTDNA | mouse | Y | Antibodies associated with this site include: 1B2530, 3BNC60, 3BNC117, 8ANC131, 8ANC195, N6, NIH45–46, VRC01, VRC03, and VRC-PG04. |
| 110.C | VIRSVNFTDN | mouse | Y | |||
| D279N | 279 | 110.E | NGSLAEEEVVIRSVNFTDNA | mouse | Y | This site has been associated with the loss of transmitted-founder sequence, suggested to represent antibody-driven selection (Hraber |
| 110.C | VIRSVNFTDN | mouse | Y | |||
| V283I | 283 | NA3 | NA3 | NA3 | Y | Antibodies associated with this site include: HJ16, N6, VRC01, VRC03, VRC13, VRC16, VRC27, and VRC-PG04. |
| E290K4 | 290 | NA3 | NA3 | NA3 | N | |
| E293G | 293 | IIIB-V3-26 | SVEINCTRPNNNTRKSI | mouse | N | |
| L333I4 | 333 | NA3 | NA3 | NA3 | N | |
| S334N | 334 | NA3 | NA3 | NA3 | Y | This site has been described as a supersite of vulnerability for antibody neutralisation[ |
| R335S/K | 335 | P1H6 | SSNWKE | mouse | Y | Antibodies associated with this site include the PCDN antibodies. |
| T336A | 336 | P1H6 | SSNWKE | mouse | N | |
| N340K | 340 | P1H6 | SSNWKE | mouse | N | |
| P3B2 | WKEM(D/N)R | mouse | Y | Site shown to be under significant selection following 3BNC117 immunotherapy. | ||
| Q344R | 344 | P3B2 | WKEM(D/N)R | mouse | ||
| 838-D | KSITK |
| ||||
| 1006-15D | KSITKG |
| ||||
| 1027-15D | KSITKGP |
| ||||
| I345V | 345 | 838-D | KSITK |
| N | |
| 1006-15D | KSITKG |
| ||||
| 1027-15D | KSITKGP |
| ||||
| P369L | 369 | 4D7/4 | IFKQSSGGDPEIVTHSFNCGG | mouse | Y | This is an antibody b12 signature site in which A or P has been associated with sensitivity, and I, L, or Q has been associated with resistance. Other antibodies associated with this site include: CH103, and IGg1b12. |
| 36.1(ARP 329) | FKQSSGGDPEIVTHSFNCGGE | mouse | ||||
| V424I | 424 | 2D3 | RIKQIINMWQEVGKAMYAPPI | mouse | N | |
| JL413 | KQIINMWQEVGKAMYA |
| ||||
| 5C2E5 | QFINMWQEVK | mouse | ||||
| G3-211 | IINMWQKVGKAMYAP | mouse | ||||
| R444S | 444 | polyclonal | KAMYAPPISGQIRCSSNITG | mouse | Y | The S444 has been associated with increased susceptibility to neutralisation by 10-996. |
| N448K | 448 | polyclonal | KAMYAPPISGQIRCSSNITG | mouse | Y | Antibodies associated with this site include: 2G12, 3BC176, 3BC315, and PGT151-PGT158. |
| D474N | 474 | polyclonal | LTRDGGNNNNESEIFRPGGGD |
| Antibodies associated with this site include: 12A21, 8ANC131, 8ANC134, HJ16, N6, NIH45-46, VRC01, VRC03, VRC16, VRC27, and VRC-PG04. | |
| 9201 | GGGDMRDNWRSE | mouse | ||||
| 1C1 | GGGDMRDNWRSELYKYKVVK | mouse | Y | |||
| H11 | GGDMRD | mouse | ||||
| W2 | GGDMRDNWRSELYKYKVVKI | mouse | ||||
| D476K | 476 | 9201 | GGGDMRDNWRSE | mouse | Y | Antibodies associated with this site include: 8ANC131, 8ANC134, A16, N6, NIH45–46, VRC01, VRC27, and VRC-PG04. |
| 1C1 | GGGDMRDNWRSELYKYKVVK | mouse | ||||
| H11 | GGDMRD | mouse | ||||
| W2 | GGDMRDNWRSELYKYKVVKI | mouse |
1Name of monoclonal antibody or “polyclonal” (if a general response is being studied) as listed in the LANL Immunology database.
2Y = Yes; N = No.
3NA = Not available.
4Novel site identified in the current study.