| Literature DB >> 35215928 |
Sheila N Balinda1, Anne Kapaata1, Rui Xu2, Maria G Salazar1, Allison T Mezzell3, Qianhong Qin2, Kimberly Herard2, Dario Dilernia2, Anatoli Kamali4, Eugene Ruzagira1, Freddie M Kibengo1, Heeyah Song2, Christina Ochsenbauer5, Jesus F Salazar-Gonzalez1, Jill Gilmour6, Eric Hunter2,7, Ling Yue2, Pontiano Kaleebu1.
Abstract
Detailed characterization of transmitted HIV-1 variants in Uganda is fundamentally important to inform vaccine design, yet studies on the transmitted full-length strains of subtype D viruses are limited. Here, we amplified single genomes and characterized viruses, some of which were previously classified as subtype D by sub-genomic pol sequencing that were transmitted in Uganda between December 2006 to June 2011. Analysis of 5' and 3' half genome sequences showed 73% (19/26) of infections involved single virus transmissions, whereas 27% (7/26) of infections involved multiple variant transmissions based on predictions of a model of random virus evolution. Subtype analysis of inferred transmitted/founder viruses showed a high transmission rate of inter-subtype recombinants (69%, 20/29) involving mainly A1/D, while pure subtype D variants accounted for one-third of infections (31%, 9/29). Recombination patterns included a predominance of subtype D in the gag/pol region and a highly recombinogenic envelope gene. The signal peptide-C1 region and gp41 transmembrane domain (Tat2/Rev2 flanking region) were hotspots for A1/D recombination events. Analysis of a panel of 14 transmitted/founder molecular clones showed no difference in replication capacity between subtype D viruses (n = 3) and inter-subtype mosaic recombinants (n = 11). However, individuals infected with high replication capacity viruses had a faster CD4 T cell loss. The high transmission rate of unique inter-subtype recombinants is striking and emphasizes the extraordinary challenge for vaccine design and, in particular, for the highly variable and recombinogenic envelope gene, which is targeted by rational designs aimed to elicit broadly neutralizing antibodies.Entities:
Keywords: HIV-1; T/F; recombinant; subtype D
Mesh:
Year: 2022 PMID: 35215928 PMCID: PMC8874453 DOI: 10.3390/v14020334
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Uganda protocol C cohort and HIV-1 characteristics.
| SN | SAMPLE-ID | Sample Date | EDI * | Days Post Infection ***** | Initial Viral Load(Copies/mL) | IMC-VRC Score ** | Gender | Age *** | Subtype | GenBank No. |
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 194065 | 12-Jan-10 | 01-Dec-09 | 42 | 254,000 | N/A **** | M | 41 | A/D | MW006066 |
| 2 | 194289 | 29-Mar-10 | 14-Feb-10 | 43 | 1,394,000 | N/A | F | 24 | D | MW006068 |
| 3 | 193006 | 07-Apr-09 | 15-Feb-08 | 51 | 112,000 | 1.84 | M | 24 | A/D | MW006062 |
| 4 | 194604 | 28-Mar-11 | 12-Feb-11 | 44 | 254,516 | N/A | F | 35 | D | MW006076 |
| 5 | 194603 | 21-Jul-10 | 30-May-10 | 52 | 177,000 | 1.38 | F | 33 | A/D | MW006075 |
| 6 | 191696 | 11-Sep-07 | 23-Jul-07 | 50 | 435,000 | 0.72 | M | 29 | A/D | MW006053 |
| 7 | 194535 | 16-Jun-11 | 30-Apr-11 | 47 | 289,930 | 0.08 | M | 39 | D | MW006073 |
| 8 | 275026 | 10-Sep-08 | 21-Jul-08 | 51 | 412,000 | N/A | F | 21 | A/D | MW006079 |
| 9 | 194319 | 09-Mar-09 | 18-Feb-09 | 19 | 602,000 | N/A | F | 26 | A/D | MW006070 |
| 10 | 192023 | 28-Jan-08 | 06-Dec-07 | 53 | 996,000 | N/A | M | 27 | A/D | MW006059 |
| 11 | 191923 | 24-Jan-08 | 30-Nov-07 | 55 | 198,000 | 0.17 | F | 31 | A/D | MW006054 |
| 12 | 193004 | 30-Oct-07 | 26-Sep-07 | 34 | 6690 | N/A | F | 25 | A/D | MW006060 |
| 13 | 193005 | 06-Nov-07 | 11-Oct-07 | 26 | 90,500 | N/A | M | 22 | D | MW006061 |
| 14 | 194020 | 12-Jun-09 | 07-May-09 | 36 | 77,700 | 1.16 | M | 33 | A/D | MW006064 |
| 15 | 194346 | 31-Mar-09 | 28-Feb-09 | 31 | 259,000 | 3.58 | M | 29 | A/D | MW006071 |
| 16 | 191639 | 03-Apr-08 | 13-Feb-08 | 50 | 102,000 | 1.32 | M | 50 | A/D | MW006052 |
| 17 | 270015 | 19-Dec-06 | 08-Dec-06 | 11 | 19,300 | N/A | M | 58 | A/D | MW006077 |
| 18 | 193008 | 15-Jun-09 | 23-May-09 | 23 | 50,500 | 1.46 | M | 27 | A/D | MW006063 |
| 19 | 191955 | 26-Mar-07 | 03-Mar-07 | 23 | 4190 | 0.64 | F | 39 | A/D | MW006055 |
| 20 | 191997 | 15-Jan-08 | 19-Nov-07 | 57 | 4610 | 0.37 | M | 31 | A/D | MW006057 |
| 21 | 192002 | 24-Jul-07 | 04-Jun-07 | 50 | 2420 | 1.54 | F | 27 | D | MW006058 |
| 22 | 275031 | 05-Jun-09 | 11-May-09 | 25 | 3750 | 0.87 | M | 31 | D | MW006081 |
| 23 | 191996 | 19-Sep-08 | 26-Jul-08 | 55 | 201 | 2.64 | F | 37 | A/D | MW006056 |
| 24 | 194307 | 2-Mar-09 | 3-Feb-09 | 27 | 930,000 | N/A | F | 21 | A/D | MW006068 |
| 25 | 194374 | 04-Feb-10 | 24-Dec-09 | 42 | 43,300 | N/A | M | 33 | D | MW006072 |
| 26 | 194584 | 05-Jul-10 | 03-Jun-10 | 32 | 76,700 | N/A | F | 33 | A/D | MW006079 |
| 27 | 270535 | 17-Dec-08 | 05-Oct-08 | 73 | 51,200 | N/A | M | 31 | D | MW006078 |
| 28 | 194140 | 28-May-09 | 07-Apr-09 | 51 | 45,600 | N/A | F | 26 | A/D | MW006067 |
| 29 | 275027 | 01-Oct-08 | 01-Aug-08 | 61 | 14,700 | N/A | F | 22 | A/D | MW006080 |
* Estimated date of infection ** VRC measured by P33-labeled reverse transcriptase assay (see Methods) *** Age at time of sample collection **** N/A: not available ***** No. of days between EDI and the sample date.
Figure 1The study area in Masaka, a rural setting in the southern region of the country, and Entebbe, an urban setting in the Wakiso District in central Uganda, are labelled as red dots on the map. The image was rendered using QGIS.
Primers used for cDNA synthesis, amplification and infectious molecular clone (IMC) generation.
| Primer Name | Primer Sequence | Primer Use |
|---|---|---|
| UG1.3′3_PlCb | 5′-ACTACTTAAAGCACTCAAGGCAAGCTTTATTG-3′ | cDNA synthesis for near full-length |
| UGOFM19 | 5′-GCACTCAAGGCAAGCTTTATTGAGGCTTA-3′ | cDNA synthesis for near full-length |
| 1U5Cc | 5′-CCTTGAGTGCTTTAAGTAGTGTGTGCCCGTCTGT-3′ | Near full-length genome PCR |
| UG1.3′3_PlCb | 5′-ACTACTTAAAGCACTCAAGGCAAGCTTTATTG-3′ | Near full-length genome PCR |
| 2U5Cd | 5′-AGTAGTGTGTGCCCGTCTGTTGTGTGACTC-3′ | Near full-length genome PCR |
| UG2.3′3′plCb | 5′-TAAAGCACTCAAGGCAAGCTTTATTGAGGCTTA-3′ | Near full-length genome PCR |
| 5FIV-R1 | 5′-CTYTTTCTCCTGTATGCAGACCCC-3′ | cDNA synthesis for 5′ half genome |
| b5r1 | 5′-CTTGCCACACAATCATCACCTGCCAT-3′ | cDNA synthesis for 5′ half genome |
| 1.R3.B3R | 5′-ACTACTTGAAGCACTCAAGGCAAGCTTTAT-3′ | cDNA synthesis for 3′ half genome |
| RVDA-F1 | 5′-GGGTCTCTCDGTTAGACCAGAT-3′ | 5′ half genome PCR |
| b3F1 | 5′-ACAGCAGTACAAATGGCAGTATT-3′ | 3′ half genome PCR |
| 5FV-R22 | 5′-CCTAGTGGGATGTGTACTTCTGAAC-3′ | 5′ half genome PCR |
| b3f3 | 5′-TGGAAAGGTGAAGGGGCAGTAGTAATAC-3′ | 3′ half genome PCR |
| 2.R3.B6R | 5′-TGAAGCACTCAAGGCAAGCTTTATTGAGGC-3′ | 3′ half genome PCR |
|
| ||
| con1.rev | 5′-TCTGATGCTTTTTGTCTGGTGT-3′ | Generation of IMC (overlap of Fragments 1 & 2) |
| con1.fwd | 5′-ACACCAGACAAAAAGCATCAGA-3′ | Generation of IMC (overlap of Fragments 1 & 2) |
| 318.con2.fwd | 5′-CCATGTGTAAAGCTAACCCCACTC-3′ | Generation of IMC (overlap of Fragments 2 & 3) |
| 318.con2.rev | 5′-GGTTAGCTTTACACATGGTTTTAGAC-3′ | Generation of IMC overlap of Fragments 2 & 3) |
| AmpR.PvuI.fwd | 5′-CGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTT-3′ | Generation of IMC (ampR) |
| AmpR.PvuI.rev | 5′-TTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTT-3′ | Generation of IMC (ampR) |
Figure 2Schematic showing the strategy of constructing synthetic IMC showing 100 bp overlaps in POL and ENV genes.
Figure 3Maximum likelihood phylogenetic trees of half genome sequences from 29 subjects. (A) 5′ half genomes. (B) 3′ half genomes. Single genome amplification (SGA sequences from each subject fell into distinct monophyletic lineages with low genetic diversity and 100% bootstrap support (black asterisks); nodes with bootstrap support ≥80% shown with grey asterisks. Branches colored in red denote sequences with intra-patient maximum diversity >0.6%; some have sub-lineages with 100% bootstrap support. HIV-1 subtypes A1, D, A/D and subtype B HXB2 reference sequences from the LANL database are shown in grey. The scale bar represents 10% genetic distance.
Diversity analysis of HIV-1 half genomes derived from 29 recent sero-converters.
| Subject ID | HIV Half Genomes (a) | Total Number HIV Genomes | Minimum nt Length of Viral Genome | Maximum nt Length of Viral Genome | Nucleotide Sequence Diversity (mean %) | Nucleotide Sequence Diversity (Range %) | Number of Genomes Analysed (b) | Mean HD | Max HD | Poisson Estimated Days Since MRCA (c) | Lambda (d) | Goodness of Fit | HD Fit to Poisson | Star Phylogeny | Deviation from Star Phylogeny | Number of Transmitted Viruses |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 275026 | 5′ | 10 | 4474 | 4474 | 0.41 | 0.41–0.82 | 10 | 16 | 29 | 153 (103–202) | 16.27 | 2.00 × 10−16 | No | No | Multiple strains | ≥2 |
| 3′ | 10 | 4595 | 4628 | 0.57 | 0.24–0.97 | 10 | 24 | 41 | 217 (175–258) | 23.98 | 2.00 × 10−16 | No | No | Multiple strains | ≥2 | |
| 3′ lng1 | 7 | 4595 | 4616 | 0.41 | 0.24–0.55 | 7 | 18 | 25 | 160 (131, 189) | 17.62 | 0.8246 | Yes | No | Selection | Lng 1 | |
| 270015 | 5′ | 7 | 4449 | 4449 | 0.10 | 0–0.27 | 7 | 4.6 | 12 | 43 (6, 81) | 4.571 | 1.80 × 10−5 | No | Yes | ≥2 | |
| 3′ | 6 | 4622 | 4623 | 0.16 | 0–0.46 | 6 | 8 | 21 | 73 (22, 167) | 8 | 2.00 × 10−16 | No | Yes | ≥2 | ||
| 3′ lng1 | 5 | 4622 | 4623 | 0.03 | 0–0.07 | 5 | 2 | 3 | 18 (8, 29) | 2 | 0.6198 | Yes | Yes | Lng 1 | ||
| 192023 | 5′ | 6 | 4448 | 4448 | 0.48 | 0.16–0.99 | 6 | 19 | 34 | 184 (80, 288) | 19.47 | 2.00 × 10−16 | No | No | Multiple strains | ≥2 |
| 3′ | 6 | 4604 | 4607 | 0.54 | 0.17–0.72 | 6 | 23 | 32 | 206 (154, 258) | 22.6 | 2.00 × 10−16 | No | No | Multiple strains | ≥2 | |
| 193004 | 5′ | 8 | 4460 | 4494 | 0.87 | 0.36–1.41 | 8 | 36 | 60 | 340 (242, 439) | 36.46 | 2.00 × 10−16 | No | No | Multiple strains | ≥2 |
| 3′ | 8 | 4636 | 4673 | 1.67 | 0.51–3.39 | 8 | Too high | No | No | ≥2 | ||||||
| 193005 | 5′ | 6 | 4482 | 4485 | 1.22 | 0.31–2.30 | 6 | 49 | 87 | 456 (187, 724) | 48.67 | 2.00 × 10−16 | No | No | Multiple strains | ≥2 |
| 3′ | 6 | 4469 | 4678 | 0.20–11.01 | 6 | Too high | No | No | Multiple strains | ≥2 | ||||||
| 191696 | 5′ | 3 | 4450 | 4450 | 0.64 | 0.34–0.89 | 3 | N/A | No | No | Multiple strains | ≥2 | ||||
| 3′ | 3 | 4575 | 4602 | 0.54 | 0.13–0.77 | 3 | N/A | No | No | Multiple strains | ≥2 | |||||
| 191997 | 5′ | 9 | 4466 | 4468 | 0.20 | 0.02–0.33 | 9 | 5.8 | 13 | 62 (42, 83) | 5.778 | 0.00941 | No | No (f) | Selection | ≥2 |
| 3′ | 10 | 4651 | 4664 | 0.25 | 0–0.41 | 10 | 11 | 18 | 97 (75, 119) | 10.78 | 2.00 × 10−16 | No | No | Selection | ≥2 | |
| 194037 | 5′ | 12 | 4456 | 4456 | 0.04 | 0–0.11 | 12 | 1.7 | 5 | 16 (9, 23) | 1.667 | 0.9321 | Yes | Yes | 1 | |
| 3′ | 12 | 4596 | 4599 | 0.06 | 0–0.15 | 12 | 2.6 | 7 | 24 (14, 34) | 2.636 | 0.9332 | Yes | Yes | 1 | ||
| 194584 | 5′ | 14 | 4455 | 4456 | 0.08 | 0–0.20 | 14 | 1.6 | 4 | 17 (11, 23) | 1.571 | 0.7648 | Yes | Yes (f) | 1 | |
| 3′ | 7 | 4580 | 4580 | 0.07 | 0.02–0.15 | 7 | 3.3 | 7 | 31 (15, 46) | 3.333 | 0.9464 | Yes | Yes | 1 | ||
| 270535 | 5′ | 8 | 4459 | 4460 | 0.11 | 0.04–0.18 | 8 | 4.9 | 8 | 47 (34, 59) | 4.929 | 0.6833 | Yes | Yes | 1 | |
| 3′ | 5 | 4576 | 4601 | 0.11 | 0.09–0.13 | 5 | 5.2 | 6 | 48 (40,56) | 5.2 | 0.3723 | Yes | Yes | 1 | ||
| 191639 | 5′ | 6 | 4458 | 4459 | 0.04 | 0–0.09 | 6 | 2 | 4 | 19 (7, 31) | 2 | 0.7987 | Yes | Yes | 1 | |
| 3′ | 8 | 4618 | 4619 | 0.19 | 0.07–0.43 | 8 | 3.6 | 8 | 38 (20, 56) | 3.607 | 0.8119 | Yes | yes (f) | 1 | ||
| 194603 | 5′ | 6 | 4461 | 4461 | 0.10 | 0–0.22 | 6 | 3 | 7 | 33 (5, 60) | 3 | 0.8227 | Yes | Yes (f) | 1 | |
| 3′ | 6 | 4623 | 4626 | 0.19 | 0.11–0.22 | 6 | 5.3 | 7 | 55 (40, 70) | 5.333 | 0.8391 | Yes | yes (f) | 1 | ||
| 275031 | 5′ | 4 | 4547 | 4465 | 0.04 | 0.02–0.07 | 4 | 2 | 3 | 19 (4, 33) | 2 | 0.6822 | Yes | Yes | 1 | |
| 3′ | 6 | 4605 | 4606 | 0.07 | 0.02–0.13 | 6 | 2.3 | 4 | 24 (13, 35) | 2.333 | 0.5462 | Yes | yes (f) | 1 | ||
| 194604 | 5′ | 10 | 4459 | 4459 | 0.03 | 0–0.09 | 10 | 1.2 | 4 | 11 (4, 19) | 1.2 | 0.5375 | Yes | Yes | 1 | |
| 3′ | 10 | 4575 | 4575 | 0.07 | 0.02–0.18 | 10 | 3.3 | 8 | 30 (17, 40) | 3.289 | 0.7812 | Yes | No | Selection | 1 | |
| 192002 | 5′ | 4 | 4191 | 4477 | 0.16 | 0.07–0.26 | 4 | 7 | 11 | 66 (22, 110) | 7 | 0.9283 | Yes | Yes | 1 | |
| 3′ | 8 | 4549 | 4595 | 0.17 | 0.02–0.33 | 8 | 8.2 | 15 | 75 (48, 102) | 8.214 | 0.898 | Yes | No | Selection | 1 | |
| 194535 | 5′ | 7 | 4459 | 4459 | 0.08 | 0.02–0.11 | 7 | 3.4 | 5 | 32 (21, 44) | 3.429 | 0.6776 | Yes | Yes | 1 | |
| 3′ | 7 | 4624 | 4818 | 0.17 | 0.11–0.30 | 6 | 5.8 | 8 | 51 (43, 58) | 5.8 | 0.6561 | Yes | No (f) | Selection | 1 | |
| 194319 | 5′ | 7 | 4461 | 4461 | 0.05 | 0–0.09 | 7 | 2.3 | 4 | 22 (17, 26) | 2.286 | 0.7703 | Yes | No | Early selection | 1 |
| 3′ | 7 | 4575 | 4593 | 0.10 | 0.07–0.13 | 7 | 4.4 | 6 | 40 (32, 48) | 4.381 | 0.4586 | Yes | No | Early selection | 1 | |
| 193006 | 5′ | 6 | 4452 | 4452 | 0.07 | 0.02–0.11 | 6 | 3 | 5 | 28 (15, 42) | 3.0 | 0.8046 | Yes | Yes | 1 | |
| 3′ | 6 | 4435 | 4489 | 0.11 | 0.04–0.18 | 6 | 5.2 | 8 | 49 (29, 68) | 5.2 | 0.4767 | Yes | No | Early stochastic | 1 | |
| 194020 | 5′ | 5 | 4463 | 4463 | 0.06 | 0–0.13 | 5 | 2.8 | 6 | 27 (-3, 56) | 2.8 | 0.6693 | Yes | Yes | 1 | |
| 3′ | 5 | 4623 | 4623 | 0.19 | 0.14–0.22 | 5 | 8.5 | 10 | 77 (68, 87) | 8.5 | 0.4971 | Yes | No | Selection | 1 | |
| 194346 | 5′ | 5 | 4460 | 4460 | 0.06 | 0–0.11 | 5 | 2.8 | 5 | 27 (9, 44) | 2.8 | 0.7886 | Yes | Yes | 1 | |
| 3′ | 5 | 4599 | 4602 | 0.14 | 0.11–0.20 | 5 | 6.8 | 10 | 62 (44, 80) | 6.8 | 0.8729 | Yes | No | Selection | 1 | |
| 194374 | 5′ | 7 | 4455 | 4458 | 0.09 | 0–0.16 | 7 | 3.8 | 7 | 36 (24, 49) | 3.81 | 0.7509 | Yes | No | Early selection | 1 |
| 3′ | 6 | 4638 | 4638 | 0.07 | 0–0.17 | 6 | 3.3 | 8 | 30 (10, 50) | 3.267 | 0.657 | Yes | No | Early selection | 1 | |
| 191955 | 5′ | 6 | 4473 | 4475 | 0.10 | 0.07–0.13 | 6 | 4.7 | 6 | 44 (33, 55) | 4.667 | 0.4987 | Yes | Yes | 1 | |
| 3′ | 6 | 4630 | 4642 | 0.11 | 0–0.26 | 6 | 5 | 12 | 45 (6, 85) | 5 | 0.00015 | No | Yes | 1 | ||
| 191923 | 5′ | 11 | 4461 | 4461 | 0.11 | 0–0.36 | 11 | 1.8 | 6 | 20 (9, 31) | 1.818 | 0.3695 | Yes | Yes | 1 | |
| 3′ | 11 | 4490 | 4613 | 0.12 | 0–0.24 | 11 | 5.2 | 11 | 47 (30, 64) | 5.164 | 2.08 × 10−13 | No | No | Selection | 1 | |
| 194065 | 5′ | 11 | 4459 | 4459 | 0.06 | 0–0.16 | 11 | 2.9 | 7 | 28 (14, 41) | 2.909 | 0.4872 | Yes | Yes | 1 | |
| 3′ | 11 | 4591 | 4594 | 0.22 | 0–0.33 | 11 | 9.2 | 15 | 84 (68, 101) | 9.2 | 0.00063 | No | No | Selection | 1 | |
| 194289 | 5′ | 11 | 4425 | 4446 | 0.05 | 0–0.20 | 11 | 2.2 | 9 | 21 (6, 35) | 2.182 | 1.23 × 10−8 | No | Yes | yes | 1 |
| 3′ | 9 | 4599 | 4614 | 0.18 | 0–0.26 | 9 | 7.6 | 11 | 70 (57, 82) | 7.639 | 0.00011 | No | No | Selection | 1 | |
| 193008 | 5′ | 5 | 4491 | 4492 | 0.09 | 0.04–0.16 | 5 | 4.2 | 7 | 39 (21, 58) | 4.2 | 0.9419 | Yes | No | Early stochastic | 1 |
| 3′ | 6 | 4667 | 4670 | 0.18 | 0.02–0.29 | 6 | 8.7 | 13 | 78 (53, 103) | 8.667 | 0.00063 | No | No | Selection | 1 | |
| 275027 | 5′ | 9 | 4455 | 4455 | 0.15 | 0–0.29 | 9 | 6.4 | 12 | 60 (42, 79) | 6.389 | 2.00 × 10−16 | No | No | Selection | 1 |
| 3′ | 2 | 4302 | 4641 | 0.26 | N/A | 2 | N/A | ? | ||||||||
| 191996 | 5′ | 4 | 4459 | 4461 | 0.51 | 0.04–0.75 * | 4 | N/A. Two out of four sequences were hypermutated by APOBEC3G (A3G) | N/A | A3G Hypermutation | ? | |||||
| 3′ | 3 | 4633 | 4634 | 0.13 | 0.11–0.15 | 3 | N/A | Yes | ||||||||
| 194140 | 5′ | 2 | 4457 | 4457 | 0 | N/A | 2 | N/A | ? | |||||||
| 3′ | 2 | 4677 | 4677 | 0.02 | 2 | |||||||||||
Four samples listed at the bottom of the table were excluded from this analysis because of the extremely low number of 3′ half genome sequences. (a) Genomes (5′ half or 3′ half) from all amplicons or selected sequences identifying the most predominant transmitted lineage (lng1) were analyzed separately. (b) Number of genomes analyzed for conformance to a Poisson model of random diversification. The Poisson-Fitness v2 tool (https://www.hiv.lanl.gov/content/sequence/POISSON_FITTER/pfitter.html, accessed on 20–29 March 2021) automatically screens for enrichment of APOBEC G-to-A substitutions. APOBEC positions were removed when appropriate. (c) Poisson model prediction of the minimum number of days needed to achieve the observed intra-strain diversity. (d) Lambda is the parameter of the Poisson distribution that best fits the HD frequency counts. Lambda value should be close to the mean HD. (e) Low goodness-of-fit (GOF) p values (<0.05) indicate divergence from a Poisson; when the maximum Hamming Distance is too high, the Poisson-Fitter may fail. Conversely, a high p value indicates conformance to a Poisson distribution. (f) APOBEC sites were removed.
Figure 4Highlighter plots (A–H) of two individuals (194584 and 194289, 5′ and 3′ half genomes) with single virus transmission (one unambiguous and one with immune selection), plus two individuals (275026 and 270015, 5′ and 3′ half genomes) with multivariant transmission (one with two sub-lineages and one with a single divergent sequence).
Figure 5(A) Mosaic structures of 20 HIV-1 inter-subtype recombinant T/F viruses. (B) Recurrent recombination breakpoints (hotspots) in the envelope region. Subtypes D (green), A1 (red) and C (brown).
Inter-subtype recombination events and identification of recombination hot spots in envelope genes of HIV-1 transmitted/founder variants from Uganda.
| Unique Identifier | No. of Break Points | Breakpoint Interval | Length, No. Nucleotides | Directionality of Recombination | Envelope Region of Recombination Events | Recombination Hotspot # | vpu | Subtype Composition of Overlapping vpu, tat and Rev Genes/Exons | |
|---|---|---|---|---|---|---|---|---|---|
| Start–End, HXB2 Numbering | tat-1, rev-1 | tat-2, rev-2 | |||||||
| 194319-env | 1 | 8214–8252 | 38 | A1/D |
| #1 | A1/D | D | D |
| 194140-env | 1 | 8251–8313 | 62 | A1/D |
| #1 | A1 | D | D |
| 194065-env | 1 | 8258–8358 | 100 | A1/D |
| #1 | A1 | D | D |
| 191696-env | 1 | 8251–8303 | 52 | A1/D |
| #1 | A1/D | D | D |
| 194584-env | 2 | 6990–7202 | 212 | D/A1 | V3, CD4/co-receptor binding | D | D | D | |
| 8252–8330 | 78 | A1/D |
| #1 | |||||
| 194020-env | 2 | 6975–7057 | 82 | D/A1 | CD4 contact, near V3 | D | D | D | |
| 8252–8301 | 49 | A1/D |
| #1 | |||||
| 191997-env | 2 | 7116–7135 | 19 | D/A1 | V3 loop | D | D | D | |
| 8213–8305 | 92 | A1/D |
| #1 | |||||
| 191996-env | 2 | 6336–6364 | 28 | D/A1 | Signal peptide/C1 | #2 | D | D | D |
| 8272–8299 | 27 | A1/D |
| #1 | |||||
| 275026-env | 3 | 6326–6360 | 34 | D/A1 | Signal peptide/C1 | #2 | D | D | D |
| 8270–8333 | 63 | A1/D |
| #1 | |||||
| 8722–8740 | 18 | D/A1 | LLP-1 | ||||||
| 194603-env | 3 | 7329–? | ? | A1/D | CD4 binding | A1 | D | D | |
| 7473–? | ? | D/A1 | Coreceptor binding | ||||||
| 8386–8399 | 13 | A1/D |
| #1 | |||||
| 270015-env | 4 | 6309–6351 | 42 | D/A1 | Signal peptide/C1 | #2 | D | D | D |
| 6622–6674 | 52 | A1/D | V1 loop | ||||||
| 7470–7506 | 36 | D/A1 | CD4/coreceptor binding | ||||||
| 8251–8297 | 46 | A1/D |
| #1 | |||||
| 194346-env | 4 | 6339–6365 | 26 | D/A1 | Signal peptide/C1 | #2 | D | D | D |
| 6427–6468 | 41 | A1/D | Signal peptide/C1 | #2 | |||||
| 6745–6779 | 34 | D/A1 | V2 | ||||||
| 8246–8296 | 50 | A1/D |
| #1 | |||||
| 193006-env | 1 | 8031–8074 | 43 | A1/D | gp41 fusion domain | A1/D | D | D | |
| 193005-env | 2 | 7313–7327 | 14 | D/A1 | CD4 binding loop | D | D | D | |
| 7519–7540 | 21 | A1/ D | Coreceptor binding | ||||||
| 193004-env | 2 | 6426–6475 | 49 | D/A1 | Signal peptide/C1 | #2 | D | D | D |
| 6795–6889 | 94 | A1/ D | V2 loop | ||||||
| 191923-env | 2 | 7018–7055 | 37 | D/A1 | CD4 contact, near V3 | D | D | D | |
Figure 6Viral replicative capacity (VRC) scores of 14 infectious molecular clones generated, including the relevant controls, MJ4,NL4.3 and R880F.
Figure 7Infection with high replicating viruses resulted in lower CD4+ T cell counts in the first 4–6 years of infection. (A) Infection with high VRC resulted in faster drop to CD+T cell counts >350 cells/mm3. (B) Infection with high VRC resulted in faster drop to CD+ T cell counts >500 cells/mm3. * p values were calculated using a Mann–Whitney U test for small numbers.
Figure 8(A) A correlation of SPVL versus VRC. (B) Differences between low and high VRC and SPVL.
Histidine signatures across the various clades within the Envelope gene.
| Amino Acid (Single Nucleotide Code) | Sample ID | |
|---|---|---|
| D | H | 194604 |
| D | H | 194535 |
| D | H | 192023 |
| D | H | 191923 |
| D | H | 194020 |
| D | H | 270015 |
| D | H | 191997 |
| D | H | 275031 |
| D | H | 191996 |
| D | H | 194307 |
| D | H | 194374 |
| D | N/A | 193004 |
| D | N/A | 193005 |
| D | Q | 192002 |
| D | Q | 275027 |
| D | G | 194289 |
| D | I | 194584 |
| D | L | 270535 |
| D | T/P | 275026 |
| D | Del | 191639 |
| D | D | 191955 |
| A1 | N | 193006 |
| A1 | N | 194603 |
| A1 | N | 191696 |
| A1 | N | 194319 |
| A1 | N | 194140 |
| A1 | D | 194346 |
| A1 | D | 194065 |
| A1 | Y | 193008 |