| Literature DB >> 35633678 |
Anja Pecman1,2, Ian Adams3, Ion Gutiérrez-Aguirre1, Adrian Fox3, Neil Boonham4, Maja Ravnikar1, Denis Kutnjak1.
Abstract
High-throughput sequencing (HTS) has become an important tool for plant virus detection and discovery. Nanopore sequencing has been rapidly developing in the recent years and offers new possibilities for fast diagnostic applications of HTS. With this in mind, a study was completed, comparing the most established HTS platform (MiSeq benchtop sequencer-Illumina), with the MinION sequencer (Oxford Nanopore Technologies) for the detection of plant viruses and viroids. Method comparisons were performed on five selected samples, containing two viroids, which were sequenced using nanopore technology for the first time and 11 plant viruses with different genome organizations. For all samples, sequencing libraries for the MiSeq were prepared from ribosomal RNA-depleted total RNA (rRNA-depleted totRNA) and for MinION sequencing, direct RNA sequencing of totRNA was used. Moreover, for one of the samples, which contained five different plant viruses and a viroid, three additional variations of sample preparation for MinION sequencing were also used: direct RNA sequencing of rRNA-depleted totRNA, cDNA-PCR sequencing of totRNA, and cDNA-PCR sequencing of rRNA-depleted totRNA. Whilst direct RNA sequencing of total RNA was the quickest of the tested approaches, it was also the least sensitive: using this approach, we failed to detect only one virus that was present in a sample at an extremely low titer. All other MinION sequencing approaches showed improved performance with outcomes similar to Illumina sequencing, with cDNA-PCR sequencing of rRNA-depleted totRNA showing the best performance amongst tested nanopore MinION sequencing approaches. Moreover, when enough sequencing data were generated, high-quality consensus viral genome sequences could be reconstructed from MinION sequencing data, with high identity to the ones generated from Illumina data. The results of this study implicate that, when an appropriate sample and library preparation are selected, nanopore MinION sequencing could be used for the detection of plant viruses and viroids with similar performance as Illumina sequencing. Taken as a balance of practicality and performance, this suggests that MinION sequencing may be an ideal tool for fast and affordable virus diagnostics.Entities:
Keywords: Illumina MiSeq sequencing; comparison; high-throughput sequencing; nanopore MinION sequencing; plant virus/viroid detection
Year: 2022 PMID: 35633678 PMCID: PMC9131090 DOI: 10.3389/fmicb.2022.883921
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 6.064
Samples included in the comparison with corresponding results from: HTS (+ virus/viroid detected using pipeline described in section Virus and Viroid Detection Workflow,–virus/viroid not detected using pipeline described in section Virus and Viroid Detection Workflow; NA, not applicable).
|
|
|
|
|
|
| ||||
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
| |||||
| I |
| TYLCV (ssDNA) | + | + | + | + | + | KY810789 | SRR17660996/SRR17660995/SRR17660994/SRR17660993/SRR17319908 |
| ToCV (ssRNA+) | + | + | + | + | + | KY810786, KY810787 | |||
| PepMV (ssRNA+) | + | + | + | + | + | KF718832.1 (Pep-MV-EU), JX866666.1 (PepMV-CH) | |||
| ToMV (ssRNA+) |
| + | + | + | + | KY810788 | |||
| STV (dsRNA) | + | + | + | + | + | KY810783 | |||
| CLVd (viroid) | + | + | + | + | + | KY810771 | |||
| II |
| CaMV (dsDNA-RT) | + | NA | NA | NA | + | KY810770 | SRR17660992/NA/NA/NA/SRR17319907 |
| CCyV1 (ssRNA-) | + | NA | NA | NA | + | KY810772 | |||
| III |
| TSWV (ssRNA-) | + | NA | NA | NA | + | OM112200, OM112201, OM112202 | SRR17660991/NA/NA/NA/SRR17319906 |
| IV |
| TASVd (viroid) | + | NA | NA | NA | + | KY810784 | SRR17660990/NA/NA/NA/SRR17319905 |
| V |
| PVeV1 (dsRNA) | + | NA | NA | NA | + | / | SRR17660989/NA/NA/NA/SRR17319904 |
| PVeV2 (dsRNA) | + | NA | NA | NA | + | OM112199 | |||
| PVeV3 (dsRNA) | + | NA | NA | NA | + | / | |||
Virus/viroid names: tomato yellow leaf curl virus (TYLCV, Begomovirus, Geminiviridae), tomato chlorosis virus (ToCV, Crinivirus, Closteroviridae), pepino mosaic virus (PepMV, Potexvirus, Alphaflexiviridae), tomato mosaic virus (ToMV, Tobamovirus, Virgaviridae), southern tomato virus (STV, Amalgavirus, Amalgaviridae), columnea latent viroid (CLVd, Pospiviroid, Pospiviroidae), cauliflower mosaic virus (CaMV, Caulimovirus, Caulimoviridae), cabbage cytorhabdovirus 1 (CCvY1, Cytorhabdovirus, Rhabdoviridae), tomato spotted wilt orthotospovirus (TSWV, Orthotospovirus, Tospoviridae), tomato apical stunt viroid (TASVd, Pospiviroid, Pospiviroidae), and phaseolus vulgaris alphaendornavirus 1, 2, 3 (PVeV1, 2, 3, Alphaendornavirus, Endornaviridae).
Figure 1An overview of the conducted study. Listed titles of chapter material and methods are connected with arrows according to the course of work (part of the figure was created with BioRender.com).
Figure 2Heatmap of the samples included in the comparison with corresponding results: percentage of mapped reads, average depth (reads), fraction of reference covered (reads), consensus sequence identity (%), and average MinION reads/contigs identity (/ no data; NA not applicable).
Figure 3Comparison of MinION direct RNA sequencing of totRNA (represented inred) and Illumina sequencing of rRNA-depleted totRNA (represented in blue) using data size-normalized subsamples. Results for each virus included in the analysis are shown along the x-axis and are grouped according to Baltimore classification. (A) Percentage of specific virus reads in trimmed and filtered complete HTS datasets. (B) Average fraction of reference covered by reads (%) at different subsample sizes. Dots represent the average value of analysis of 5 replicate subsamples. Different subsample sizes were used (10, 30, 50, 100, 200, 300, 500, 700, 900, 1,100, 1,300, 1,500 million nts–note the enlarged x-axis in the lower left part of panel 3B for a clearer view). (C) Fraction of reference covered by contigs (%) at different subsample sizes. Every bar represents the result of analysis for separate replicate subsamples. In (B,C), gray areas designate the range in which the subsamples were available for both approaches compared.
Figure 4Comparison of MinION sequencing: direct RNA sequencing of totRNA, direct RNA sequencing of rRNA-depleted totRNA, cDNA-PCR sequencing of totRNA, cDNA-PCR sequencing of rRNA-depleted totRNA and Illumina sequencing of rRNA-depleted totRNA using data size-normalized subsamples. Results for each virus included in the analysis are shown along the x-axis and are grouped according to Baltimore classification. (A) Percentage of specific virus reads in trimmed and filtered complete HTS datasets. (B) Average fraction of reference covered by reads (%) at different subsample sizes. Dots represent the average value of analysis of 5 replicated subsamples. Different subsample sizes were used (10, 30, 50, 100, 200, 300, 500, 700, 900, 1,100, 1,300, 1,500 million nts). (C) Fraction of reference covered by contigs (%) at different subsample sizes. Each bar represents the result of analysis for a separate replicate subsample. Bars with * indicate chosen subsample for additional analysis explained in section Comparison of the Performance of Nanopore and Illumina Sequencing for Detection of Viruses on Rarefied Datasets. In (B,C), red represents MinION direct RNA sequencing of totRNA, green represents direct RNA sequencing of rRNA-depleted totRNA, gray represents cDNA-PCR sequencing of totRNA, yellow represents cDNA-PCR sequencing of rRNA-depleted totRNA, and blue represents the results for Illumina sequencing of rRNA-depleted totRNA. Gray areas designate the range in which the subsamples were available for both compared approaches.
Cumulative yield of nucleotides sequenced in time.
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||||||
|
|
|
|
| ||||||
| I | TYLCV | 500 | 520 | / | / | 30 | 50 | 50 | 220 |
| ToCV | / | / | 30 | 80 | 50 | 130 | 10 | 20 | |
| PepMV | 10 | 10 | 10 | 30 | 10 | 10 | 10 | 20 | |
| ToMV | / | / | 50 | 140 | 10 | 10 | 10 | 20 | |
| STV | / | / | 30 | 80 | / | / | 100 | 600 | |
| CLVd | 700 | 710 | 30 | 80 | 30 | 50 | 10 | 20 | |
| II | CaMV | 10 | 10 | NA | NA | NA | NA | NA | NA |
| CCyV1 | / | / | NA | NA | NA | NA | NA | NA | |
| III | TSWV | 20 | 30 | NA | NA | NA | NA | NA | NA |
| IV | TASVd | 200 | 220 | NA | NA | NA | NA | NA | NA |
| V | PVeV2 | 200 | 170 | NA | NA | NA | NA | NA | NA |
Data in table represent the time when enough nucleotides would be sequenced to cover at least 50% of virus/viroid reference sequence by mapping reads. / no data: 50% coverage of virus/viroid reference by reads were not achieved. NA, not applicable.