| Literature DB >> 26202126 |
James C Iles1, Richard Njouom2, Yacouba Foupouapouognigni2, David Bonsall3, Rory Bowden4, Amy Trebes4, Paolo Piazza4, Ellie Barnes3, Jacques Pépin5, Paul Klenerman3, Oliver G Pybus6.
Abstract
The importance of recombination in the evolution and genetic diversity of the hepatitis C virus (HCV) is currently uncertain. Only a small number of intergenotypic recombinants have been identified so far, and each has core and envelope genes classified as belonging to genotype 2. Here, we investigated two putative genotype 4/1 recombinants from southern Cameroon using a number of approaches, including standard Sanger sequencing, genotype-specific PCR amplification, and non-HCV-specific Illumina RNA sequencing (RNA-seq). Recombination between genotypes 1 and 4 was confirmed in both samples, and the parental lineages of each recombinant belong to HCV subtypes that are cocirculating at a high prevalence in Cameroon. Using the RNA-seq approach, we obtained a complete genome for one sample, which contained a recombination breakpoint at the E2/P7 gene junction. We developed and applied a new method, called Deep SimPlot, which can be used to visualize and identify viral recombination directly from the short sequence reads created by next-generation sequencing in conjunction with a consensus sequence.Entities:
Mesh:
Year: 2015 PMID: 26202126 PMCID: PMC4572555 DOI: 10.1128/JCM.00483-15
Source DB: PubMed Journal: J Clin Microbiol ISSN: 0095-1137 Impact factor: 5.948
Details of HCV recombinants proposed in the literature
| Genotype or subtype | Location of estimated recombination breakpoint(s) | Source | |
|---|---|---|---|
| Gene(s) | Nucleotide position(s) | ||
| 1a/1c | E1, E2 | 1407, 2050 | |
| 1a/1c | Core, E1, E2, NS2, NS3 | 801, 1261, 2181, 3041, 3781 | |
| 1b/1a | NS5B | 8320 | |
| 1b/1a | Core | 387 | |
| NS2/NS3 | 3420–3440 | ||
| NS2/NS3 | 3405–3416 | ||
| NS2/NS3 | 3443 | ||
| NS2/NS3 | 3399 | ||
| NS2 | 3298–3305 | ||
| NS2/NS3 | 3429 | ||
| NS2/NS3 | 3175 | ||
| 4d/4a | Between E2 and NS5A | Unknown | |
| 6a/6o | NS5B | 8345 | |
| 6e/6h | NS5B | 8356 | |
| 6e/6o | NS5B | 8358 | |
| 6n/6o | NS5B | 8372 | |
Nucleotide positions are relative to isolate H77.
Entries in bold are intergenotypic recombinants.
Details of primers used in this study
| Primer name | Sequence | Specificity | Positions |
|---|---|---|---|
| 5′UTR-Ex-400F | CCTTGTGGTACTGCCTGATAG | Generic | 282–299 |
| 5′UTR-In-405F | CCTGATAGGGTGCTTGCGAG | Generic | 295–311 |
| Gg-767-Rex-Core | CAYGTRAGGGTATCGATGAC | Generic | 721–705 |
| G1-667-Rin-Core | GTCABTGGGGCCCCAACTAG | Genotype 1 | 671–655 |
| G4-668-Rin-Core | ATCATTTGGRCCCCAAGAC | Genotype 4 | 671–656 |
| NS5B-Ex-Fwd | TGGGGTTCTCRTATGAYACCCGCTGYTTTG | Generic | 8248–8274 |
| NS5B-Ex-Rev | AATACCTVGTCATAGCCTCCGTGA | Generic | 8637–8617 |
| NS5B-In-Fwd | GAYACCCGCTGYTTTGACTC | Generic | 8262–8278 |
| NS5B-In-Rev | TACCTNGTCATAGCCTCCGTGAAGACTC | Generic | 8635–8611 |
| CoreOS | ACTGCCTGATAGGGTGCTTGCGAG | Generic | 291–311 |
| CoreOAS | ATGTACCCCATGAGGTCGGC | Generic | 748–732 |
| CoreIS | AGGTCTCGTAGACCGTGCATC ATG | Generic | 324–344 |
| CoreIAS | CAYGTRAGGGTATCGATGAC | Generic | 721–705 |
| Pr3 | TATGAYACCCGCTGYTTTGCTC | Generic | 8259–8278 |
| Pr4 | GCNGARTAYCTVGTCATAGCCTC | Generic | 8641–8622 |
| Pr5 | GCTAGTCATAGCCTCCGT | Generic | 8633–8619 |
Location numbering is relative to isolate H77 (GenBank accession number AF009606).
FIG 1Estimated maximum-likelihood midpoint-rooted phylogeny of subgenomic core sequences. Nodes with bootstrap support of >70% are labeled with their bootstrap support values. The sequences generated in this study are highlighted in red. The branch lengths are in units of expected substitutions per site (see scale bar). Reference sequences are labeled with their subtype and accession number.
FIG 2Estimated maximum-likelihood midpoint-rooted phylogeny of subgenomic NS5B sequences. See Fig. 1 legend for further details.
FIG 3(a) Schematic of the RNA-seq data obtained for sample EBW034. A total of 9 genome regions were assembled from 14 reads and are colored in red. All regions showed the closest sequence similarity to subtype 1l. (b) Bootscan plot of the EBW436 consensus genome, generated using the Oxford HCV automated subtyping tool. Top, bootscan plot, which displays the bootstrap support for the clustering of EBW436 with genotype reference genomes in 400-nt sliding windows across the viral genome. The curves are colored according to the genotype key on the right. Bottom, schematic of the RNA-seq consensus genome obtained for sample EBW436. Genome regions with an association to a particular genotype supported by >70% bootstrap support (also shown at top as a red horizontal dashed line) are colored according to the genotype in question.
FIG 4Deep SimPlot analysis of the mapped reads from sample EBW436 compared to three reference genomes: subtype 1e (red), subtype 4f (green), and subtype 4q (blue). (a) Proportion of sites that differ between each read and reference genome (p-distance). Markers are colored according to the appropriate reference genome. The curves represent a sliding average of the points within a 500-nt window. (b and c) Relative difference of each read to two reference genomes, subtypes 1e and 4f (b), and subtypes 1e and 4q (c).