| Literature DB >> 34685559 |
Imogen A Wright1, Kayla E Delaney1, Mary Grace K Katusiime2, Johannes C Botha1, Susan Engelbrecht1, Mary F Kearney2, Gert U van Zyl1.
Abstract
HIV-1 proviral single-genome sequencing by limiting-dilution polymerase chain reaction (PCR) amplification is important for differentiating the sequence-intact from defective proviruses that persist during antiretroviral therapy (ART). Intact proviruses may rebound if ART is interrupted and are the barrier to an HIV cure. Oxford Nanopore Technologies (ONT) sequencing offers a promising, cost-effective approach to the sequencing of long amplicons such as near full-length HIV-1 proviruses, but the high diversity of HIV-1 and the ONT sequencing error render analysis of the generated data difficult. NanoHIV is a new tool that uses an iterative consensus generation approach to construct accurate, near full-length HIV-1 proviral single-genome sequences from ONT data. To validate the approach, single-genome sequences generated using NanoHIV consensus building were compared to Illumina® consensus building of the same nine single-genome near full-length amplicons and an average agreement of 99.4% was found between the two sequencing approaches.Entities:
Keywords: HIV; consensus; nanopore; proviral; single-genome
Mesh:
Year: 2021 PMID: 34685559 PMCID: PMC8534097 DOI: 10.3390/cells10102577
Source DB: PubMed Journal: Cells ISSN: 2073-4409 Impact factor: 6.600
Study participant details, treatment histories and identified proviral genome status.
| PID | Age ART Initiated (Months) | ART Regimen | Time on ART (Years) | No. of Proviral Genomes Sequenced with ONT | |
|---|---|---|---|---|---|
| Identified as Intact | Identified as Defective | ||||
| 333716 | 2.3 | AZT/3TC/LPV/r | 8.55 | 0 | 1 |
| 339606 | 8.5 | AZT/3TC/LPV/r | 7.93 | 2 | 1 |
| 339266 | 9.23 | AZT/3TC/LPV/r | 8.2 | 4 | 0 |
| 340116 | 9.32 | AZT/3TC/LPV/r | 8.83 | 1 | 0 |
Therapy included Zidovudine (AZT), lamivudine (3TC) and lopinavir with low dose ritonavir (LPV/r).
Near full-length single-genome amplification primers.
|
| |||
|
|
|
|
|
| Li_OuterF +,* | Forward | 623–649 | AAATCTCTAGCAGTGGCGCCCGAACAG |
| Li_OuterR | Reverse | 9662–9686 | TGAGGGATCTCTAGTTACCAGAGTC |
|
| |||
|
|
|
|
|
| Li_InnerF * | Forward | 769–793 | GCGGAGGCTAGAAGGAGAGAGATGG |
| Li_InnerR + | Reverse | 9604–9632 | GCACTCAAGGCAAGCTTTATTGAGGCTTA |
| NFL-alt_in_F # | Forward | 642–664 | CCG AAC AGG GAC BHG AAA GCG AA |
+ Salminen et al., 1995, * Li et al., 2010, # Katusiime et al., 2020.
Figure 1The HIV reference sequence contains many variable regions, particularly in the envelope gene (ENV), where conventional reference mapping may not be able to capture the high diversity. NanoHIV consists of three mapping steps: (1) An initial step with a standard gap penalty of 4, which results in a gap on the generated consensus where the variable region may be; (2) A remapping to the consensus with a lower gap penalty of 1, which bridges these variable regions; (3) A final mapping with the conventional gap penalty of 4 to remove any artefacts introduced by step 2.
Total aligned similarity between intact and defective HIV proviral genomes sequenced with Illumina® MiSeq™ and ONT.
| Patient Identifier | Sample Identifier | Proviral Genome Status | Total Aligned Similarity Percentage (%) |
|---|---|---|---|
| 340116 | P4D1 | Intact | 99.3 |
| 339606 | P3D8 | Intact | 98.6 |
| P3G7 | Intact | 99.4 | |
| P3G8 | Defective | 99.6 | |
| 333716 | P2D4 | Defective | 99.1 |
| 339266 | P2C7 | Intact | 99.6 |
| P1C7 | Intact | 99.6 | |
| P1C8 | Intact | 99.7 | |
| P5D4 | Intact | 99.6 |
Figure 2A neighbour-joining phylogenetic tree (generated using Dnapars from the PHYLIP package [54]) indicating the genetic distance between Illumina® and ONT consensus sequences generated using SHIVER (Illumina®) and NanoHIV (ONT). There is generally good clustering between consensus sequences from the same sample but several very similar sequences from the same donor clustered more closely by pipeline than by sample, indicating potential artefacts in both pipelines. The tree is rooted on the subtype C reference (GenBank ID AY772699.1) used as the initial reference for the first round of NanoHIV.