| Literature DB >> 31683782 |
Alexis Kafando1, Christine Martineau2, Mohamed El-Far3, Eric Fournier4, Florence Doualla-Bell5, Bouchra Serhir6, Adama Kazienga7, Mohamed Ndongo Sangaré8, Mohamed Sylla9, Annie Chamberland10, Hugues Charest11,12, Cécile L Tremblay13,14,15.
Abstract
<span class="abstract_title">BACKGROUND: <span class="Species">HIV-1 transmitted/founder viruses (TF) are selected during the acute phase of infection from a multitude of virions present during transmission. They possess the capacity to establish infection and viral dissemination in a new host. Deciphering the discrete genetic determinant of infectivity in their envelope may provide clues for vaccine design.Entities:
Keywords: HIV-1; acute/early infection; amino acids; cytoplasmic domain; genetic signatures; lentivirus lytic peptide segment 1; recent viruses, envelope; signal peptide; transmitted/founder viruses
Mesh:
Substances:
Year: 2019 PMID: 31683782 PMCID: PMC6893788 DOI: 10.3390/v11111012
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Figure 1Flow chart showing the numbers of HIV-1 positives samples analyzed and consensus envelope sequences of HIV-1-infected individuals include in the study. From a total of 243 sequences obtained, 95% (231) of subtype B HIV-1 sequences were included in the current study. The non-B HIV-1 subtype envelope sequences representing 8.7% (four for transmitted/founder (TF) and eight for recent (RC) viruses) of the LSPQ serobank HIV-1 positive samples were excluded for subsequent genetic analysis because they may have influenced the results. The HIV-1 envelope sequences of transmitted/founder (TF) viruses derived from acute infection, recent viruses (RC) for early infection and chronic viruses for chronic infection.
Figure 2Ridge plot comparing the HIV-1 envelope variable loop number of N-glycosylation sites between TF, RC and chronic (CH) viruses. The boxes represent a density plot of number of N-glycosylation sites for: Env V1 loop (a), Env V2 loop (b), Env V1V2 loop (c), Env V3 loop (d), Env V4 loop (e) and Env V5 loop (f) for CH, RC and TF viruses respectively. In box, the top (green), middle (blue) and bottom (yellow) represent respectively number of N-glycosylation sites for CH, RC and TF viruses envelope sequences respectively. The X-axis represents sequence loops number of N-glycosylation sites and the Y-axis the density of sequences number of N-glycosylation sites for each timeline category of viruses (CH, RC and TF). As shown in Figure 2d, the differences in Env V3 loop numbers of N-glycosylation sites between CH and TF, p = 0.026; RC and TF, p = 0.004 and CH and RC, p = 0.05 were statistically significant using Wald test with logistic regression model.
Figure 3Ridge plot comparing the clade B HIV-1 envelope variable loop length between CH, RC and TF viruses. The boxes represent the density plot of: Env V1 loop lengths (a), Env V2 loop lengths (b), Env V1V2 loop length (c), Env V3 loop length (d), Env V4 loop length (e) and Env V5 loop length (f) for CH, RC and TF viruses respectively. For each box, the top (green), middle (blue) and bottom (yellow) represent, respectively, TF, RC and CH viruses sequences. X-axis presents sequence loop lengths and the Y-axis the loop length density for each timeline category of viruses (CH, RC and TF). As presented in Figure 2f, the differences in the HIV-1 Env GP120 V5 loop lengths between RC and TF, p = 0.003 and CH and RC, p = 0.004 are statistically significant using Wald Test with regression logistic model.
Figure 4Ridge plot comparing the clade B HIV-1 envelope variable region gp120 loop 3 (V3) net positive between CH, RC and TF viruses.
Summary statistics of the important amino acids change among the HIV-1 envelope sequences of chronic and transmitted/founder viruses.
| HXB2 | Env | Amino Acids | CH | TF | Chi2 Test | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Position | Amino Acids | Subregion/Domain | Alignment Position | Change | Genetic Signature | YES | NO | Total | YES | NO | Total | OR | 95% CI | B-H adjusted |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 62 | D | C1 |
| E | D62E | 20 | 85 | 105 | 4 | 94 | 98 | 5.52 | 1.89–16.04 | 0.006 |
| 514 | G | ECD |
| T | G514T | 9 | 22 | 31 | 26 | 12 | 38 | 0.18 | 0.06–0.52 | 0.006 |
| 24 | M | SP |
| I | M24I | 38 | 66 | 104 | 16 | 81 | 97 | 2.91 | 1.50–5.64 | 0.006 |
| 743 | R | HIR/KE |
| R | 743R | 45 | 60 | 105 | 62 | 35 | 97 | 0.42 | 0.24–0.74 | 0.008 |
| 153 | E | V1 |
| E | 153E | 93 | 11 | 104 | 71 | 26 | 97 | 3.09 | 1.44–6.59 | 0.008 |
| 744 | R | HIR/KE |
| R | 744R | 78 | 27 | 105 | 53 | 44 | 97 | 2.39 | 1.32–4.32 | 0.008 |
| 717 | F | HIR/KE |
| F | 717F | 86 | 19 | 105 | 62 | 35 | 97 | 2.55 | 1.34–4.85 | 0.008 |
| 717 | F | HIR/KE |
| L | F717L | 19 | 86 | 105 | 35 | 62 | 97 | 0.39 | 0.20–0.74 | 0.008 |
| 154 | I | V1 |
| M | I154M | 35 | 70 | 105 | 52 | 46 | 98 | 0.44 | 0.25–0.77 | 0.008 |
| 744 | R | HIR/KE |
| T | R744T | 15 | 90 | 105 | 30 | 67 | 97 | 0.37 | 0.18–0.74 | 0.008 |
| 841 | R | LLP-1 |
| L | R841L | 81 | 24 | 105 | 56 | 39 | 95 | 2.35 | 1.27–4.31 | 0.009 |
| 621 | Q | GP 41 Loop |
| D | Q621D | 16 | 88 | 104 | 31 | 67 | 98 | 0.39 | 0.20–0.77 | 0.009 |
| 464 | L | V5 |
| N | L464N | 24 | 45 | 69 | 42 | 31 | 73 | 0.39 | 0.20–0.77 | 0.009 |
| 543 | Q | FPPR |
| Q | 543Q | 63 | 42 | 105 | 76 | 22 | 98 | 0.43 | 0.23–0.79 | 0.009 |
Table 1 presents the important HIV-1 envelope sequence polymorphisms of chronically infected individuals compared to TF viruses of individuals infected that are considered as genetic signatures. Bold typeface define the important Env genetic signatures identified and discussed in manuscript. The gray character referred to sequences alignment position of amino acid change. Abbreviations: V1—variable loop 1, CP—cytoplasmic domain/tail, EC—endocytosis domain, SP—Signal peptide, C2—constant domain 2, C3—constant domain 3, ECD-Loop—Ectodomain-loop region, CP-KE—cytoplasmic domain-Kennedy epitope, ECD-CHR—Ectodomain-C-hepta-repeat, CP-LLP-1—Cytoplasmic Tail-Lentivirus Lytic peptide 1. MSD—Membrane-spanning domain.
Figure 5Genetic signature identified under the HIV-1 envelope GP41 Lentivirus Lytic peptide 1 (LLP-1) associate to clade B HIV-1 TF viruses compared to chronic (CH) using WebLogo. The X axis represents amino acids (AA) composing the LLP-1 sequence (direction N to C). The Y axis represents the normalized AA frequency identified at each position of the LLP-1 sequence for each category of infection. The top line box represents the chronic (CH) viruses envelope GP41 LLP-1 sequence (N = 105) and the bottom line for TF viruses (N = 98). As indicated for Weblogo analysis, the overall height of each stack indicates the sequence conservation at that position (measured in bits), whereas the height of symbols within the stack reflects the relative frequency of the corresponding amino or nucleic acid at that position. The isoleucine (I) amino acid signature was localized at position 959 of alignment (HXB2 position R841I) and identified by a red asterisk.
Figure 6Genetic signature identified under the HIV-1 envelope signal peptide (SP) associate to clade B HIV-1 chronic compared to TF viruses using WebLogo. The X axis represents sequences and amino acids (AA) identities composing the SP (direction N to C). The X axis represents amino acids (AA) composing the LLP-1 sequence (direction N to C). The Y axis represents the normalized AA frequency identified at each position of the SP sequence for each category of infection. The top line box represents chronic (CH) viruses envelope SP sequence (N = 105) and the bottom line for TF viruses (N = 98). As indicated for Weblogo analysis, the overall height of each stack indicates the sequence conservation at that position (measured in bits), whereas the height of symbols within the stack reflects the relative frequency of the corresponding amino or nucleic acid at that position. The isoleucine (I) amino acid signature was localized at position six of alignment (HXB2 position K6I) and identified by a red asterisk.
Figure 7Genetic signature identified under the HIV-1 envelope GP120 V1 loop associate to clade B HIV-1 chronic compared to recent viruses. The X axis represents amino acids (AA) composing the V1 loop sequence (direction N to C). The Y axis represents the normalized AA frequency identified at each position of the V1 sequence for each category of infection. The top line box represents chronic virus envelope V1 loop sequences (N = 105) and the bottom for RC viruses within 136 days MDRI and TF (N = 126). As indicated for Weblogo analysis, the overall height of each stack indicates the sequence conservation at that position (measured in bits), whereas the height of symbols within the stack reflects the relative frequency of the corresponding amino or nucleic acid at that position. The glutamic acid (E) amino signature was localized at position 206 of alignment (HXB2 position 153E) and was identified by a red asterisk.
Figure 8Genetic signature identified under the HIV-1 envelope signal peptide (SP) associate to clade B HIV-1 chronic compared to recent viruses. The X axis represents amino acids (AA) composing the Env SP sequence (direction N to C). The Y axis represents the normalized AA frequency identified at each position of the SP sequence for each category of infection. The top line box represents chronic virus envelope SP sequences (N = 105) and the bottom for RC viruses within 136 MDRI and TF (N = 126). As indicated for Weblogo analysis, the overall height of each stack indicates the sequence conservation at that position (measured in bits), whereas the height of symbols within the stack reflects the relative frequency of the corresponding amino or nucleic acid at that position. The isoleucine (I) amino signature was localized at position 40 of alignment (HXB2 position M24I) and is identified by a red asterisk.
Figure 9Genetic signature identified under the HIV-1 envelope GP41 cytoplasmic tail (CT or CD) loop domain associate to clade B HIV-1 chronic compared to recent viruses. The X axis represents amino acids (AA) composing the GP41 CT loop sequence (direction N to C). The Y axis represents the normalized AA frequency identified at each position of the GP41 CT loop sequence for each category of infection. The top line box represents chronic virus envelope GP41 CT loop sequences represented individuals (N = 105) and the bottom for RC viruses within 136 MDRI and TF (N = 126). As indicated for Weblogo analysis, the overall height of each stack indicates the sequence conservation at that position (measured in bits), whereas the height of symbols within the stack reflects the relative frequency of the corresponding amino or nucleic acid at that position. The aspartic acid (D) amino signature was localized at position 738 of alignment (HXB2 position Q621D) and identified by a red asterisk.
Figure 10Genetic signature identified under the HIV-1 envelope GP41 cytoplasmic tail (CT or CD) between NF-κB activation (NA) and Kennedy Epitope (KE) domains associate to clade B HIV-1 chronic compared to recent viruses. The X axis represents amino acids (AA) composing the GP41 CT NA and KE sequence (direction N to C). The Y axis represents the normalized AA frequency identified at each position of the GP41 CT, NA and KE sequence for each category of infection. The top line box represents chronic virus envelope GP41 CT NA and KE sequences represented individuals (N = 105) and the bottom for RC viruses within 136 MDRI and TF (N = 126). As indicated for Weblogo analysis, the overall height of each stack indicates the sequence conservation at that position (measured in bits), whereas the height of symbols within the stack reflects the relative frequency of the corresponding amino or nucleic acid at that position. The valine (V) signature was localized at position 876 of alignment and position 751 referring to HXB2 numbering (D751V) and identified by a red asterisk.
Major amino acid signatures among HIV-1 envelope sequences between chronic and recent viruses.
| HXB2 | Env | Amino Acid | CH | TF + RC | Chi2 test | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Position | Amino acid | Subregion/Domain | Alignment position | Change | Genetic signature | YES | NO | TOTAL | YES | NO | TOTAL | OR | 95% CI | B-H Adjusted |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 6 | K | SP |
| I | K6I | 83 | 22 | 105 | 69 | 56 | 125 | 3.06 | 1.70, 5.48 | 0.0003 |
| 33 | K | C1 |
| Q | K33Q | 39 | 55 | 94 | 22 | 102 | 124 | 3.28 | 1.78, 6.06 | 0.0003 |
| 717 | F | EC (YSPL) and HIR/KE |
| F | F717F | 86 | 19 | 105 | 73 | 52 | 125 | 3.22 | 1.75, 5 | 0.0003 |
| 717 | F | EC (YSPL) and HIR/KE |
| L | F717L | 19 | 86 | 105 | 52 | 73 | 125 | 0.31 | 0.16, 0.56 | 0.0003 |
| 747 | R | HIR/KE NA |
| R | R747R | 45 | 60 | 105 | 85 | 40 | 125 | 0.35 | 0.20, 0.60 | 0.0003 |
| 132 | T | V2 |
| T | T132T | 86 | 19 | 105 | 75 | 51 | 126 | 3.07 | 1.67, 5.64 | 0.0004 |
| 154 | I | V1 |
| M | I154M | 35 | 70 | 105 | 73 | 53 | 126 | 0.36 | 0.21, 0.62 | 0.0004 |
| 360 | I | C3 |
| V | I360V | 52 | 52 | 104 | 33 | 93 | 126 | 2.81 | 1.62, 4.88 | 0.0004 |
| 737 | R | HIR/KE |
| T | R737T | 15 | 90 | 105 | 44 | 81 | 125 | 0.3 | 0.16, 0.58 | 0.0006 |
| 841 | R | LLP-1 |
| I | R841I | 10 | 95 | 105 | 35 | 87 | 122 | 0.26 | 0.12, 0.55 | 0.0006 |
| 744 | R | HIR/KE |
| R | R744R | 78 | 27 | 105 | 65 | 60 | 125 | 2.66 | 1.52, 4.65 | 0.0009 |
| 236 | T | Loop D |
| S | T236T | 9 | 95 | 104 | 33 | 93 | 126 | 0.26 | 0.12, 0.58 | 0.001 |
Table 2 presents the statistically significant clade B HIV-1 envelope sequence polymorphisms among chronic compared to recent viruses of individuals HIV-1 infected that are considered as genetic signatures. Bold typeface define the important Env genetic signatures identified and discussed in manuscript. The gray character referred to sequences alignment position of amino acid changes. Abbreviations: V1—variable loop 1, CP—cytoplasmic domain/tail, EC—endocytosis domain, SP—Signal peptide, C—constant domain 2, C3—constant domain 3, ECD-Loop—Ectodomain-loop region, CP-KE—cytoplasmic domain-Kennedy epitope HIR-KE— Highly Immunogenic region (HIR) also called Kennedy epitope. ECD-CHR—Ectodomain-C-hepta-repeat, CP-LLP-1—Cytoplasmic tail-lentivirus lytic peptide 1.