| Literature DB >> 32722480 |
Deborah M Leigh1, Christopher Schefer1, Carolina Cornejo1.
Abstract
The MinION sequencer is increasingly being used for the detection and outbreak surveillance of pathogens due to its rapid throughput. For RNA viruses, MinION's new direct RNA sequencing is the next significant development. Direct RNA sequencing studies are currently limited and comparisons of its diagnostic performance relative to different DNA sequencing approaches are lacking as a result. We sought to address this gap and sequenced six subtypes from the mycovirus CHV-1 using MinION's direct RNA sequencing and DNA sequencing based on a targeted viral amplicon. Reads from both techniques could correctly identify viral presence and species using BLAST, though direct RNA reads were more frequently misassigned to closely related CHV species. De novo consensus sequences were error prone but suitable for viral species identification. However, subtype identification was less accurate from both reads and consensus sequences. This is due to the high sequencing error rate and the limited sequence divergence between some CHV-1 subtypes. Importantly, neither RNA nor amplicon sequencing reads could be used to obtain reliable intra-host variants. Overall, both sequencing techniques were suitable for virus detection, though limitations are present due to the error rate of MinION reads.Entities:
Keywords: DNA amplicon; MinION; RNA; cDNA; diagnostics; mycovirus
Year: 2020 PMID: 32722480 PMCID: PMC7472323 DOI: 10.3390/v12080801
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
The CHV-1 subtypes sequenced as DNA amplicons or directly as RNA. Subtypes each have two IDs: their viral subtype and the WSL laboratory strain ID. Also shown is the accession number in GenBank for the reference sequences used for each subtype.
| CHV-1 Subtype | WSL Collection Number | MinION Libraries Sequenced | Reference Sequence GenBank Accession Number |
|---|---|---|---|
| G | CHV1-M7757 | RNA | MF421719.1 + |
| I | CHV1-EP721 | RNA, 3 kb DNA, 5 kb DNA | DQ861913 |
| F1 | CHV1-M1123 | RNA, 3 kb DNA, 5 kb DNA | NC_001492 |
| F2 | CHV1-M2021 | 3 kb DNA, 5 kb DNA ** | MF421718 |
| D | CHV1-M1372 | 3 kb DNA, 5 kb DNA | MF431594 |
| E | CHV1-M9291 | 3 kb DNA, 5 kb DNA ** | MF431593 |
** RNA sequencing attempted, run failed. + misidentified as F2 in GenBank.
BLAST analysis for raw sequencing reads. Shown are the number reads with the top BLAST hit to CHV-1 or other CHV species for each sequencing library. PI—mean percentage identity of the top hits. Underlined values denote an incorrect subtype assignment.
| CHV-1 Subtype. | Sequencing Library | Reads with a Top BLAST Hit to the Correct CHV-1 Subtype (PI) | CHV-1 Subtype with the Highest Proportion of Reads | Reads with a Top BLAST Hit to Any CHV-1 Subtype (PI) | Reads with Top BLAST Hit to Other CHV Species (PI) | Consensus Sequence Top BLAST Hit (PI) |
|---|---|---|---|---|---|---|
|
| RNA | 52.03% | G | 70.36% | 29.64% | G |
|
| RNA | 29.76% | I | 61.2% | 38.7% | I |
| 3 kb DNA | 99.68% | I | 99.99% | 8.08 × 10−6% | I | |
| 5 kb DNA | 99.67% | I | 99.99% | 4.4 × 10−5% | I | |
|
| RNA | 93.78% | F1 | 96.61% | 0.034% | F1 |
| 3 kb DNA | 99.93% | F1 | 99.99% | 1.8 × 10−6% | F1 | |
| 5 kb DNA | 99.92% | F1 | 99.99% | 1.3 × 10−6% | F1 | |
|
| 3 kb DNA | 99.92% | F2 | 99.99% | 2.5 × 10−6% | F2 |
| 5 kb DNA | 99.82% | F2 | 100% | 0 | F2 | |
|
| 3 kb DNA | 0.0003% |
| 100% | 0 |
|
| 5 kb DNA | 94.17% | D | 99.99% | 2.0 × 10−5% | D | |
|
| 3 kb DNA | 29.75% |
| 99.99% | 1.6 × 10−5% |
|
| 5 kb DNA | 95.7% | E | 100% | 0 | E |
Shown is the divergence (upper rows) and percentage identity (lower half) between full length reference sequences. In brackets are the estimates for the 3 and 5 kb amplicon regions only. Estimates were calculated using the sequences from NCBI listed in Table 1 and CLC.
| Sequence Divergence | |||||||
|---|---|---|---|---|---|---|---|
|
|
| - | 0.12 (0.11/0.12) | 0.10 (0.07/0.10) | 0.11 (0.10/0.12) | 0.11 (0.10/0.12) | 0.10 (0.08/0.11) |
|
| 89 (90/89) | - | 0.08 (0.09/0.05) | 0.07 (0.06/0.05) | 0.06 (0.06/0.04) | 0.10 (0.10/0.08) | |
|
| 89 (92/89) | 91 (90/94) | - | 0.05 (0.08/0.06) | 0.05 (0.07/0.05) | 0.04 (0.03/0.04) | |
|
| 88 (89/88) | 92 (92/94) | 95 (93/94) | - | 0.02 (0.01/0.02) | 0.06 (0.09/0.06) | |
|
| 88 (89/88) | 93 (92/95) | 95 (93/96) | 98 (99/98) | - | 0.06 (0.09/0.07) | |
|
| 89 (91/89) | 89 (89/91) | 96 (97/96) | 94 (91/94) | 94 (91/92) | - | |
Figure 1Shown is a maximum likelihood phylogeny estimated in MegaX. Included are the reference sequences for each subtype and CHV species, as well as the consensus sequences from Wtbdg2 for each subtype sequenced with direct RNA sequencing.
The number of polymorphic intra-host variants for each subtype identified by the two variant callers that completed analysis, iVar and Ococo. The percentages shown are the number of overlapping variants divided by the mean number called from the focal libraries.
| Number of Variants | ||||||
|---|---|---|---|---|---|---|
| Subtype | RNA | 3 kb | 5 kb | RNA/DNA Overlap | 3 kb/5 kb Overlap | |
|
| G | 1145 | NA | NA | NA | NA |
| I | 54 | 6 | 8 | 2 (9%) | 5 (71%) | |
| F1 | 87 | 4 | 8 | 0 (0%) | 1 (17%) | |
| F2 | NA | 13 | 16 | NA | 11 (76%) | |
| D | NA | 210 | 225 | NA | 144 (66%) | |
| E | NA | 17 | 25 | NA | 16 (76%) | |
|
| G | 1845 | NA | NA | NA | NA |
| I | 670 | 612 | 1267 | 18 (2%) | 397 (42%) | |
| F1 | 683 | 595 | 1209 | 17 (2%) | 387 (43%) | |
| F2 | NA | 577 | 1101 | NA | 399 (48%) | |
| D | NA | 667 | 1268 | NA | 488 (50%) | |
| E | NA | 632 | 1222 | NA | 455 (49%) | |