| Literature DB >> 33036361 |
Shaoyuan Tan1, Cheryl M T Dvorak1, Michael P Murtaugh1.
Abstract
Emerging viral infectious diseases present a major threat to the global swine industry. Since 2015, Senecavirus A (SVA) has been identified as a cause of vesicular disease in different countries and is considered an emerging disease. Despite the growing concern about SVA, there is a lack of preventive and diagnostic strategies, which is also a problem for all emerging infectious diseases. Using SVA as a model, we demonstrated that Oxford Nanopore MinION sequencing could be used as a robust tool for the investigation and surveillance of emerging viral diseases. Our results identified that MinION sequencing allowed for rapid, unbiased pathogen detection at the species and strain level for clinical cases. SVA whole genome sequences were generated using both direct RNA sequencing and PCR-cDNA sequencing methods, with an optimized consensus accuracy of 94% and 99%, respectively. The advantages of direct RNA sequencing lie in its shorter turnaround time, higher analytical sensitivity and its quantitative relationship between input RNA and output sequencing reads, while PCR-cDNA sequencing excelled at creating highly accurate sequences. This study developed whole genome sequencing methods to facilitate the control of SVA and provide a reference for the timely detection and prevention of other emerging infectious diseases.Entities:
Keywords: Oxford Nanopore sequencing; Senecavirus A; bioinformatics; emerging infectious diseases; molecular diagnostic techniques; swine viral diseases
Mesh:
Year: 2020 PMID: 33036361 PMCID: PMC7600144 DOI: 10.3390/v12101136
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
A comparison between the sequencing statistics of the direct RNA and PCR-cDNA methods after 6 h sequencing *.
| Sequencing Statistics | Direct RNA Sequencing (DRS) | PCR-cDNA Sequencing (PCS) |
|---|---|---|
| Number of available pores | 403 ± 100 | 421 ± 18 |
| Total pass yield (Mb) | 68 ± 0 | 92 ± 24 |
| Senecavirus A (SVA) yield (Mb) | 4.5 ± 0.6 | 66.1 ± 16.2 |
| Number of SVA reads | 3559 ± 358 | 38,544 ± 0 |
| Mean SVA read length | 1267 ± 40 | 1721 ± 47 |
| SVA read error rate (%) | 15.14 ± 0.32 | 11.23 ± 0.23 |
* Data shown as the mean ± SD of 2 independent replicates.
Figure 1Coverage distribution of the direct RNA sequencing and PCR-cDNA sequencing. SVA reads were mapped to a reference genome using Minimap2 and analyzed by Qualimap to generate a coverage information file which was then visualized using GraphPad prism software. The dashed line represents the distribution of the direct RNA sequencing reads (DRS) and the solid black line represents the distribution of the PCR-cDNA sequencing reads (PCS).
Sequencing raw read statistics after using different read filters *.
| Groups | Sequencing Statistics | Direct RNA Sequencing (DRS) | PCR-cDNA Sequencing (PCS) |
|---|---|---|---|
| Group 1 | Yield (Mb) | 68 ± 0 | 70 ± 0 |
| Read quality > 7 | Average length (bp) | 621 ± 2 | 1506 ± 19 |
| Average quality | 8.5 ± 0.4 | 8.6 ± 0.1 | |
| Group 2 | Yield (Mb) | 7.2 ± 0.4 | 52.6 ± 0.8 |
| Read quality >7+ | Average length (bp) | 2314 ± 35 | 2977 ± 24 |
| length > 1314 bp | Average quality | 8.6 ± 0 | 8.9 ± 0 |
| Group 3 | Yield (Mb) | 4.5 ± 0.6 | 66.1 ± 0.8 |
| Read quality >7+ | Average length (bp) | 1267 ± 40 | 1726 ± 49 |
| mapped to SVA database | Average quality | 8.3 ± 0.1 | 8.6 ± 0.1 |
* Data shown as the mean ± SD of 2 independent replicates.
Performance of consensus generation using different raw read filters at different yields *.
| Group 1 | Group 2 (Length Filter) | Group 3 (SVA Mapped) | ||||||
|---|---|---|---|---|---|---|---|---|
| Sequencing | Yield | Consensus length (bp) | Accuracy (%) | Consensus length (bp) | Accuracy (%) | Consensus length (bp) | Accuracy (%) | SVA reads ^ |
| DRS | 0.7 | 5098 ± 786 | 89.3 ± 2.4 | 4881 ± 1728 | 89.8 ± 1.8 | 6057 ± 1143 | 86.2 ± 3.5 | 55 ± 2 |
| 7 | 7155 ± 4 | 90.8 ± 1.5 | 5522 ± 1229 | 91.2 ± 3 | 7091 ± 45 | 90.8 ± 1.3 | 548 ± 20 | |
| 70 | 7163 ± 18 | 94.3 ± 0.2 | 7096 ± 30 | 94.4 ± 0.4 | 7110 ± 21 | 94.4 ± 0.5 | 3559 ± 358 | |
| PCS | 0.7 | 6738 ± 424 | 99 ± 0.1 | 6316 ± 541 | 97.4 ± 2.4 | 6592 ± 489 | 98 ± 1.0 | 410 ± 6 |
| 7 | 7267 ± 132 | 98.9 ± 0.4 | 7238 ± 19 | 99.0 ± 0.1 | 7079 ± 161 | 99.0 ± 0.0 | 4092 ± 64 | |
| 70 | 3053 ± 35 | 90.5 ± 11.5 | 6596 ± 1903 | 91.6 ± 5.4 | 6761 ± 995 | 87.9 ± 2.0 | 38544 ± 10439 | |
* Data shown as the mean ± SD of 2 independent replicates; the whole genome length of this SVA strain is 7268 bp. ^ Number of reads mapped to SVA, only determined for group 3.
Analytical sensitivity of direct RNA sequencing (DRS) and PCR-cDNA sequencing (PCS).
| Species Detection | Strain Level Identification | Consensus Generation | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ct value | Viral copies | Total reads | WIMP ‡ SVA reads | GenBank | DRS | PCS | Minimap SVA reads | DRS consensus | PCS consensus | ||||||||
| DRS | PCS | DRS | PCS | Best match | Identity (%) | Best match | Identity (%) | DRS | PCS | Length ^ (bp) | Accuracy (%) | Length ^ | Accuracy (%) | ||||
| Spike-in samples | 10 | 1.0 × 107 | 94,572 | 16,515 | 5722 | 2913 | MN164664 | MN164664 | 100.0 | MN164664 | 100.0 | 42,936 | 4658 | 7189 | 94.8 | 7395 | 99.2 |
| 13 | 1.2 × 106 | 40,555 | 5775 | 236 | 1182 | MN164664 | MN164664 | 100.0 | MN164664 | 100.0 | 1961 | 1611 | 7177 | 91.0 | 6873 | 99.0 | |
| 18 | 6.5 × 104 | 185,259 | 5071 | 17 | 296 | MN164664 | MN164664 | 100.0 | MN164664 | 100.0 | 43 | 420 | 2551 | 90.6 | 6770 | 99.4 | |
| 20 | 1.2 × 104 | 219,789 | 3889 | 3 | 3 | MN164664 | MN164664 | 100.0 | MN164664 | 100.0 | 10 | 3 | 1306 | 90.3 | 916 † | 80.8 | |
| 25 | 4.7 × 102 | 110,641 | 7880 | 1 | 0 | MN164664 | MN164664 | 100.0 | NA | NA | 1 | 0 | 456 † | 85.1 | NA | NA | |
| Clinical Samples | 13 | 1.1 × 106 | 45,478 | 78,075 | 299 | 436 | MN990489 | KX019804.1 | 97.1 | KX019804.1 | 97.1 | 377 | 630 | 7157 | 97.0 | 7013 | 99.6 |
| 16 | 1.3 × 105 | 148,513 | 5256 | 83 | 7 | MN990490 | KX019804.1 | 98.2 | KU058182.1 | 97.8 | 124 | 9 | 3421 | 96.4 | 880 † | 89.0 | |
| 18 | 5.0 × 104 | 47,411 | 174,551 | 19 | 24 | MN990491 | KY618836.1 | 97.4 | KY618836.1 | 97.4 | 20 | 27 | 6534 | 97.1 | 2285 † | 95.3 | |
| 20 | 1.2 × 104 | 54,964 | 5943 | 3 | 0 | MN990492 | KU051394.1 | 97.9 | NA | 4 | 0 | 1957 | 82.4 | NA | NA | ||
| 21 | 7.6 × 103 | 39,465 | 242,239 | 3 | 4 | MN990493 | KY618835.1 | 97.7 | MK256736.1 | 97.4 | 3 | 5 | 1158 | 94.3 | 988 † | 87.0 | |
| 22 | 2.3 × 103 | 53,359 | 96,661 | 1 | 4 | MN990494 | KY618836.1 | 97.4 | KT827250.1 | 97.7 | 1 | 7 | 511 † | 88.8 | 7745 † | 85.0 | |
| 23 | 2.2 × 103 | 58,929 | 633,632 | 4 | 1 | MN990495 | KX019804.1 | 98.2 | MH634514.1 | 98.2 | 4 | 1 | 3206 | 92.1 | 1171 † | 94.5 | |
| 24 | 9.2 × 102 | 41,645 | 135,552 | 1 | 0 | MN990496 | MH490944.1 | 97.0 | NA | NA | 1 | 0 | 300 † | 80.0 | NA | NA | |
* Obtained from Sanger sequencing and deposited in Genbank, † Longest raw read, due to lack of consensus, ^ SVA whole genome is around 7.3kb, ‡ What’s in my pot.
Summary of direct RNA sequencing and PCR-cDNA sequencing.
| Direct RNA Sequencing (DRS) | PCR-cDNA Sequencing (PCS) | |
|---|---|---|
| Laboratory time (sample prep) | 3 h | 5 h |
| Sequencing time | 6 h | 6 h |
| Amount of RNA recommended for input | 500 ng * | 2 ng |
| Analytical sensitivity (viral copies) | 102 to 103 | 103 to 104 |
| Recommended consensus generation program | Racon | Canu |
| Raw read accuracy | 85% | 89% |
| Consensus accuracy | 94% | 99% |
| Consensus genome coverage ^ | 100% | 100% |
| Read distribution | Coverage biases | Even coverage |
| Key attributes | Rapid, sensitive, potential RNA structure detection | Accurate |
| Key concerns | High input RNA amount *, higher error rate | Need for amplification, longer time to results |
* Exogenous RNA can be used to increase RNA input requirements allowing for very low amounts (i.e., <0.02 ng) of desired RNA as input. This would then require bioinformatic analysis to separate the exogenous RNA reads from the desired reads. ^ Consensus genome coverage of 95% requires a minimum level of 299 SVA reads for DRS and 436 SVA reads for PCS, based on the analysis of clinical samples (Table 4).