| Literature DB >> 29566006 |
Virginie Sauvage1, Laure Boizeau1, Daniel Candotti1, Mathias Vandenbogaert2, Annabelle Servant-Delmas1, Valérie Caro2, Syria Laperche1.
Abstract
Until recently, the method of choice to characterize viral diversity consisted in cloning PCR amplicons of full-length viral genomes and Sanger-sequencing of multiple clones. However, this is extremely laborious, time-consuming, and low-throughput. Next generation short-read sequencing appears also limited by its inability to directly sequence full-length viral genomes. The MinION™ device recently developed by Oxford Nanopore Technologies can be a promising alternative by applying long-read single-molecule sequencing directly to the overall amplified products generated in a PCR reaction. This new technology was evaluated by using hepatitis B virus (HBV) as a model. Several previously characterized HBV-infected clinical samples were investigated including recombinant virus, variants that harbored deletions and mixed population. Original MinION device was able to generate individual complete 3,200-nt HBV genome sequences and to identify recombinant variants. MinION was particularly efficient in detecting HBV genomes with multiple large in-frame deletions and spliced variants concomitantly with non-deleted parental genomes. However, an average-12% sequencing error rate per individual reads associated to a low throughput challenged single-nucleotide resolution, polymorphism calling and phasing mutations directly from the sequencing reads. Despite this high error rate, the pairwise identity of MinION HBV consensus genome was consistent with Sanger sequencing method. MinION being under continuous development, further studies are needed to evaluate its potential use for viral infection characterization.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29566006 PMCID: PMC5864009 DOI: 10.1371/journal.pone.0194366
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
HBV genetic diversity in three infected individuals characterized by sequencing whole genome PCR products using three different methods.
| Sample | HBV genotype | Direct Sanger sequencing | Molecular cloning | Direct MinION sequencing |
|---|---|---|---|---|
| B5584 | D1/E | Max. sequence length: 3,209-nt | Max. sequence length: 3,209-nt | Max. sequence length: 3,201 nt |
| D1/E recombinant sequence | Individual D1/E recombinant sequences | Individual D1/E recombinant sequences | ||
| Single nucleotide ambiguities across whole genome | D1 and E parental sequences not detected | D1 and E parental sequences not detected | ||
| Single nucleotide ambiguities across whole genome | ||||
| 1,229-nt deletion (pos. 2,471–489)—putative spliced variant | ||||
| B6505 | A2 | Max. sequence length: 3,220-nt | Not done | Max. sequence length: 3,217-nt |
| Mixed population of wild-type and single nucleotide mutant S sequences | Mixed population of wild-type and single nucleotide mutant S sequences | |||
| Evaluation of mutation association patterns | ||||
| 1,049-nt deletion (pos. 2,447–282)—putative spliced variant | ||||
| 96-nt deletion (pos. 2,131–2,226) | ||||
| B6260 | D2 | Overlapping sequence signals suggesting at least 2 sequences coexisting | Individual wild-type sequences (3,179-nt) | Max. sequence length: 3,015-nt |
| Single nucleotide ambiguities across whole genome | Individual sequence (3,032-nt) with 123-nt (pos. 2,929–3,051) and 24-nt (pos. 501–524) deletions | Individual wild-type sequences | ||
| Individual sequences with 123-nt (pos. 2,929–3,051) deletion | ||||
| Individual sequences with 123-nt (pos. 2,929–3,051) and 24-nt (pos. 501–524) deletions | ||||
| Single nucleotide ambiguities across whole genome |
* Single nucleotide sequence ambiguities were not investigated further.
Abbreviations: gt, genotype; nt, nucleotide; pos, position.
LAST read mapping statistics for HBV sequencing runs on the MinION™ device.
| LAST | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Total reads | Pass reads | Sanger sequence length (nt) | No of mapped reads | Mean Alignt read length (nt) | Max Alignt read length (nt) | Mean Alignt Identity (min-max) | Mean Miscall rate | Mean Insertion rate | Mean Deletion rate | Mean total error rate | No of reads covering ≥ 95% of the Sanger sequence | Mean total error rate (sequences covering ≥ 95% of the Sanger sequence) | |
| 3,209 | 3,060 | 3,209 | 74.8% | 12.25% | 3.8% | 10.2% | 26.25% | 25.6% | |||||
| (59.4–91.6) | (1.9–23.1) | (0.4–11.2) | (1.9–25.9) | (8.5–42.2) | (8.5–41.3) | ||||||||
| 3,032 | 2,848 | 3,032 | 73.6% | 13.0% | 3.65% | 10.8% | 27.4% | 27.05% | |||||
| (58.9–93.6) | (0.9–26.8) | (0.0–12.1) | (2.7–26.7) | (6.4–43.2) | (8.4–43.1) | ||||||||
a version 475
No: numberAlignment length values are rounded to the nearest whole number.
Read mapping statistics for HBV sequencing runs after error correction using NanoCorrect.
| NanoCorrect + LAST | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pass reads | Sanger sequence length (nt) | No of corrected Pass reads | No of mapped reads | Mean Alignt read length (nt) | Max Alignt read length (nt) | No of reads covering ≥ 95% of the Sanger sequence | Mean Alignt Identity (min-max) (sequences covering ≥ 95% of Sanger) | Mean Miscall rate (sequences covering ≥ 95% of Sanger) | Mean insertion rate (sequences covering ≥ 95% of Sanger) | Mean Deletion rate (sequences covering ≥ 95% of Sanger) | Mean total error rate (sequences covering ≥ 95% of Sanger) | |
| 3,209 | 2,892 | 3,209 | 86.7% | 5.2% | 1.8% | 4.6% | 11.6% | |||||
| (72%) | (60.2–99.0) | (0.0–15.7) | (0.0–7.9) | (1.0–14.0) | (1.1–32.3) | |||||||
| 3,032 | 2,555 | 3,032 | 88.2% | 5.6% | 2.1% | 4.5% | 12.3% | |||||
| (57%) | (67.3–98.7) | (0.0–19.5) | (0.0–8.6) | (1.1–12.2) | (1.4–34.0) | |||||||
*Percentage calculated against the number of corrected and mapped reads
No: Number
Alignment length values are rounded to the nearest whole number.
Fig 1Schematic representation of the recombinant HBV strain B5584.
The HBV genome is represented in a circular form and the HBV open reading frames are shown. Breakpoint positions for D (red, dotted)/E (blue, striped) genomic recombination are indicated with arrows and positions are given relative to an HBV genotype A reference sequence (GenBank AM282986).
Fig 2Phasing of the S gene mutations Thr116Asn and Thr118Ser in the HBV genome.
Each bar represents one read and the dots indicate a deletion. Both single nucleotide variants were called by MinION (A) and direct Sanger sequencing (B). The number of MinION reads called for each of the single nucleotide variant patterns (C).
Fig 3Alignment of nanopore reads of B6260.
Complete and partial Nanopore sequence reads were aligned with the corresponding Sanger consensus sequence used as reference. The viral core, surface, polymerase and X proteins are indicated and positions provided according to the reference sequence AM282986. Four different types of Minion reads are shown: nearly complete HBV genome (blue); reads showing a 123-nt deletion (positions 2,968–3,090) in the PreS1 region (purple); reads with the 123-nt and an additional 24-nt deletion (positions 501–524) in the “a determinant” of the S region (green); and unclassified partial reads (yellow).