| Literature DB >> 34690973 |
Gisele Umviligihozo1, Erick Muok1, Emmanuel Nyirimihigo Gisa1, Rui Xu2, Dario Dilernia2, Kimberley Herard2, Heeyah Song2, Qianhong Qin2, Jean Bizimana1, Paul Farmer2, Jonathan Hare3, Jill Gilmour4, Susan Allen5, Etienne Karita1, Eric Hunter2,5, Ling Yue2.
Abstract
Most studies of HIV-1 transmission have focused on subtypes B and C. In this study, we determined the genomic sequences of the transmitted founder (TF) viruses from acutely infected individuals enrolled between 2005 and 2011 into IAVI protocol C in Rwanda and have compared these isolates to viruses from more recent (2016-2019) acute/early infections in three at risk populations - MSM, high risk women (HRW), and discordant couples (DC). For the Protocol C samples, we utilized near full-length single genome (NFLG) amplification to generate 288 HIV-1 amplicons from 26 acutely infected seroconverters (SC), while for the 21 recent seroconverter samples (13 from HRW, two from DC, and six from MSM), we PCR amplified overlapping half-genomes. Using PacBio SMRT technology combined with the MDPseq workflow, we performed multiplex sequencing to obtain high accuracy sequences for each amplicon. Phylogenetic analyses indicated that the majority of recent transmitted viruses from DC and HRW clustered within those of the earlier Protocol C cohort. However, five of six sequences from the MSM cohort branched together and were greater than 97% identical. Recombination analyses revealed a high frequency (6/26; 23%) of unique inter-subtype recombination in Protocol C with 19% AC and 4% CD recombinant viruses, which contrasted with only 6.5% of recombinants defined by sequencing of the pol gene previously. The frequency of recombinants was significantly higher (12/21; 57%) in the more recent isolates, although, the five related viruses from the MSM cohort had identical recombination break points. While major drug resistance mutations were absent from Protocol C viruses, 4/21 of recent isolates exhibited transmitted nevirapine resistance. These results demonstrate the ongoing evolution and increased prevalence of recombinant and drug resistant transmitted viruses in Rwanda and highlight the importance of defining NFLG sequences to fully understand the nature of TF viruses and in particular the prevalence of unique recombinant forms (URFs) in transmission cohorts.Entities:
Keywords: HIV subtype; HIV-1 transmission; antiretroviral drug resistance mutations; genetic bottleneck; inter-subtype recombination; near full-length genome sequencing
Year: 2021 PMID: 34690973 PMCID: PMC8529237 DOI: 10.3389/fmicb.2021.734929
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
cDNA and PCR Primers.
| Primer name | Primer sequence (5'-3') | Position in HXB2 | Application |
|---|---|---|---|
| OFM19 | 5'-GCACTCAAGGCAAGCTTTATTGAGGCTTA-3' | 9,604–9,632 | cDNA synthesis for NFLSGA |
| 1.3'3'plCb | 5'-ACTACTTAAAGCACTCAAGGCAAGCTTTATTG-3' | 9,611–9,642 | cDNA synthesis for NFLSGA |
| 1U5Cc | 5'-CCTTGAGTGCTCTAAGTAGTGTGTGCCCGTCTGT-3' | 538–571 | First round NFLSGA forward primer |
| 1.3'3'plCb | 5'-ACTACTTAAAGCACTCAAGGCAAGCTTTATTG-3' | 9,611–9,642 | First round NFLSGA reverse primer |
| 2U5Cd | 5'-AGTAGTGTGTGCCCGTCTGTTGTGTGACTC-3' | 552–581 | Second round NFLSGA forward primer |
| 2.3'3'plCb | 5'-TAGAGCACTCAAGGCAAGCTTTATTGAGGCTTA-3' | 9,604–9,636 | Second round NFLSGA reverse primer |
| 1half_R1_For | 5'-TTTGACTAGCGGAGGCTAGAA-3' | 761–781 | 5'HF first round PCR forward primer |
| 1half_R1_Rev | 5'-TTCTATGGAGACYCCATGACCC-3' | 5,304–5,283 | 5'HF cDNA synthesis and first round PCR reverse primer |
| 2half_R1_For | 5'-GGGTTTATTACAGGGACAGCAGAG-3' | 4,900–4,923 | 3'HF first round PCR forward primer |
| OFM19 | 5'-GCACTCAAGGCAAGCTTTATTGAGGCTTA-3' | 9,604–9,632 | 3'HF cDNA synthesis and first and second round PCR reverse primer |
| 1half_R2_For | 5'-TTTGACTAGCGGAGGCTAGAAGGA-3' | 761–784 | 5'HF second round PCR forward primer |
| 1half_R2_Rev | 5'-TCCCCTARTGGGATGTGTACTTCTGAAC-3' | 5,195–5,222 | 5'HF second round PCR reverse primer |
| 2half_R2_For | 5'-GCAAAACTACTCTGGAAAGGTGAAGGG-3' | 4,944–4,970 | 3'HF second round PCR forward primer |
Transmission variants and subtypes in protocol C individuals.
| PCID | Coded ID | Sample date | EDI | VL | Transmission variants | Sequence type | Subtype by NFLG | Subtype by pol |
|---|---|---|---|---|---|---|---|---|
| 175,019 | R49M | 30-Mar-05 | 10 | 3,000,000 | Single | TF | A1 | A1 |
| 175,042 | R463F | 9-Mar-07 | 10 | 152,000,000 | Single | TF | A1 | A1 |
| 175,079 | R3103M | 19-Sep-08 | 13 | 750,001 | Single | TF | A1 | A1 |
| 175,089 | R3584M | 15-Dec-09 | 13 | 223,440 | Single | TF | A1C | A1 |
| 175,005 | R53F | 16-Jun-05 | 14 | 2,090,040 | Single | TF | A1 | A1 |
| 175,062 | R1135M | 11-Feb-08 | 14 | 1,256,940 | Single | TF | A1 | A1 |
| 175,065 | R269M | 24-Mar-08 | 14 | 31,700,000 | Single | Con | A1 | A1 |
| 175,072 | R3137M | 28-May-08 | 14 | 9,061,540 | Single | Con | C | C |
| 175,053 | R254F | 8-Aug-07 | 15 | 1,876,000 | Single | TF | A1C | C |
| 175,090 | R3469F | 15-Apr-10 | 15 | 2,920,000 | >2 | TF(P1) | A1 | A1 |
| 175,093 | R2302M | 9-Jul-10 | 15 | 3,760,000 | Single | TF | A1 | A1 |
| 175,094 | R3843F | 14-Oct-10 | 16 | 4,400,000 | Single | TF | A1 | A1 |
| 175,059 | R977F | 14-Dec-07 | 17 | 7,290,000 | Single | TF | A1 | A1 |
| 175,038 | R880F | 12-Jan-07 | 21 | 730,000 | Single | TF | A1 | A1 |
| 175,097 | R3894M | 18-Mar-11 | 23 | 6,934,997 | Single | TF | CD | CD |
| 175,092 | R3671F | 8-Jun-10 | 25 | 3,940,000 | Single | NA | A1 | A1 |
| 175,074 | R873F | 24-Jul-08 | 37 | 219,920 | 2 | TF(P1) | A1 | A1 |
| 175,071 | R1077F | 12-Jun-08 | 41 | 4,702,444 | 3 | TF(P2) | A1 | A1 |
| 175,008 | R50M | 25-May-05 | 45 | 117,608 | >2 | NA | C | C |
| 175,020 | R57F | 12-Oct-05 | 46 | 134,472 | >2 | NA | A1 | A1 |
| 175,014 | R59M | 21-Oct-05 | 46 | 806,290 | Single | TF | A1 | A1 |
| 175,017 | R44M | 3-May-05 | 49 | 123,560 | Single | TF | A1C | A1 |
| 175,011 | R63M | 14-Dec-05 | 50 | 184,270 | Single | Con | A1C | A1C |
| 175,012 | R40F | 31-Mar-05 | 53 | 1,398,004 | Single | TF | A1C | A1 |
| 175,027 | R72M | 10-Aug-06 | 67 | 425,000 | >2 | NA | A1 | A1 |
| 175,010 | R65M | 27-Jan-06 | 73 | 148,220 | Single | TF | A1 | A1 |
PCID, protocol C identification number of the participant; Coded ID, coded identification number used in two previous publications (Haaland et al., 2009; Yue et al., 2015); Sample date, date of sample collection; EDI, time since estimated date of infection; VL, viral load of sample; Transmission variants, number of genetic variants transmitted from partner; Sequence type, sequence defined as TF: transmitted founder virus sequence amplicon identical to consensus identified in NFLG amplicons; (P1), (P2): TF sequence identified in subpopulation 1 or 2 of multiple variants, Con: consensus sequence of NFLG amplicons: Subtype by NFLG: subtype as deduced from the NFLG sequence, Subtype by pol: Subtype defined previously from pol gene sequencing.
Summary of recent infection samples and derived near full-length sequences.
| Coded ID | Sample date | EDI | VL | Subtype | Risk group* |
|---|---|---|---|---|---|
| BUS71F | 10-July-19 | 91.5 | 328,000 | A1 | FSW |
| GIT84F | 1-August-19 | 131 | 102,000 | A1 | FSW |
| GWE47F | 10-October-18 | 111.5 | 150,000 | A1 | FSW |
| GWE68F | 25-June-19 | 216 | 120,000 | A1 | FSW |
| KAG34F | 10-November-17 | 88 | 191,000 | A1/C/D | FSW |
| KIN18F | 17-March-17 | 46 | 113,000 | A1 | FSW |
| MAS1F | 18-August-16 | 44.5 | 553,000 | A1 | FSW |
| MAS21F | 30-March-17 | 164.5 | 856,000 | A1 | DC |
| MAS6M | 22-November-16 | 174.5 | 202,000 | A1/C | DC |
| MAS7F | 23-November-16 | 46 | 47,200 | A1/C | FSW |
| MAT81F | 24-July-19 | 181 | 292,000 | A1/C/D | FSW |
| NGA76F | 17-July-19 | 90 | 141,000 | A1/C | FSW |
| NGA77F | 17-July-19 | 90 | 473,000 | A1/C | FSW |
| PSF24M | 2-June-17 | 96 | 306,000 | A1/C | MSM |
| PSF33M | 20-October-17 | 194 | 160,000 | A1/C | MSM |
| PSF36F | 4-January-18 | 94 | 702,000 | A1/C | FSW |
| PSF38M | 17-September-18 | 174 | 145,000 | A1/C | MSM |
| PSF39M | 17-February-18 | 72 | 100,000 | A1/C | MSM |
| PSF3M | 21-October-16 | 51 | 86,500 | A1/C | MSM |
| PSF80M | 19-July-19 | 17 | 683,000 | C | MSM |
| REM29F | 24-August-17 | 91 | 148,000 | A1 | FSW |
Coded ID, coded identification number of the participant; sample date, date of sample collection; EDI, time since estimated date of infection; VL, viral load of sample; subtype, subtype defined by near full-length genomic sequence; risk group, risk group of the individual, FSW: female sex worker; DC, HIV discordant couple; and MSM, men having sex with men.
Figure 1Phylogenetic analysis of virus sequences from Protocol C acute infections. Neighbor-joining tree representing the near full-length genome (NFLG) sequences of viruses from 26 acutely infected recipients with reference subtypes A1, A6, C, and D. Subtype and recombinants are indicated with specific colors (Subtype A, Blue; Subtype C, Red; Recombinant AC, Purple; Recombinant CD, Orange). The IDs of subjects infected with more than one viral variant from their partner are highlighted in red. Nodes with bootstrap values>0.9 are denoted with an asterisk.
Figure 2Multi-variants transmission. The phylogenetic tree and aligned sequence highlighter analysis show 175,071 contains three viral populations at the first visit time-point (EDI=41). Tick marks indicate nucleotide change from the master at same position according to the alignment. The color codes are as follows: A: green, T: red, G: yellow, C: blue, and gaps: gray.
Figure 3Recombination analysis of Protocol C and recent infections. The positions of recombination cross-overs and the subtype origin of genomic regions are shown based on the jphMM analytical tool. Nucleotide positions refer to the equivlent positions on the HXB-2 HIV-1 genome. (A) The six unique recombinant forms (URFs) from the Protocol C acute infection cohort. (B) The recombination structure of recent Rwandan seroconvertor viruses. PSF-3M illustrates the identical recombinant structure found in all five early isolates from MSM (PSF-3M, -24M, -33M, -38M, and -39M). Subtype sequences are denoted by the same colors as in Figure 1.
Figure 4Phylogenetic analysis of virus sequences from recent infections in the context of Protocol C acute infections. Neighbor-joining tree representing NFLG sequences of viruses from 21 recent infections and 26 acute Protocol C recipients. Origin of the viruses is denoted by color – Red, Protocol C; Blue, Discordant Couple; Magenta, FSW; Cyan, MSM. Recombinant viruses are highlighted by appended Red (Protocol C) and Blue (Recent Infection) circles. Nodes with bootstrap values>0.9 are denoted with an asterisk.