| Literature DB >> 31817886 |
Shaoyuan Tan1, Cheryl M T Dvorak1, Michael P Murtaugh1.
Abstract
Prompt detection and effective control of porcine reproductive and respiratory syndrome virus (PRRSV) during outbreaks is important given its immense adverse impact on the swine industry. However, the diagnostic process can be challenging due to the high genetic diversity and high mutation rate of PRRSV. A diagnostic method that can provide more detailed genetic information about pathogens is urgently needed. In this study, we evaluated the ability of Oxford Nanopore MinION direct RNA sequencing to generate a PRRSV whole genome sequence and detect and discriminate virus at the strain-level. A nearly full length PRRSV genome was successfully generated from raw sequence reads, achieving an accuracy of 96% after consensus genome generation. Direct RNA sequencing reliably detected the PRRSV strain present with an accuracy of 99.9% using as few as 5 raw sequencing reads and successfully differentiated multiple co-infecting strains present in a sample. In addition, PRRSV strain information was obtained from clinical samples containing 104 to 106 viral copies or more within 6 hours of sequencing. Overall, direct viral RNA sequencing followed by bioinformatic analysis proves to be a promising approach for identification of the viral strain or strains involved in clinical infections, allowing for more precise prevention and control strategies during PRRSV outbreaks.Entities:
Keywords: MinION; PRRSV; analytical sensitivity; bioinformatics; co-infection; direct RNA sequencing; quantitation; strain level detection; whole genome sequencing
Year: 2019 PMID: 31817886 PMCID: PMC6950593 DOI: 10.3390/v11121132
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Assessment of raw reads from direct RNA sequencing.
| Sequencing Statistic | Run #1 | Run #2 |
|---|---|---|
| Available pores (group 1) | 474 | 495 |
| Sequencing time | 1 hour | 1 hour |
| Total pass bases | 20,351,741 | 27,167,775 |
| Total pass reads | 14,963 | 23,547 |
| Mean read length (bp) | 1360 | 1154 |
| Mean read quality | 8.2 | 8.5 |
| Mappable reads/percentage | 13,284/88.8% | 19,549/83.0% |
| Longest read (bp)/accuracy | 15,026/86.3% | 15,060/86.7% |
| Consensus length (bp)/accuracy | 15,140/95.5% | 15,055/95.3% |
Figure 1Analysis of direct RNA sequencing errors. To obtain raw error rates and error patterns, raw reads were mapped to the VR2332 reference sequence, followed by evaluation of the mapping. (a) The percent of each error type is shown as well as the total error rate. (b) The error patterns of insertions (first row with darker pink indicating higher errors), deletions (first column with darker orange indicating higher errors), and mismatches (center matrix with darker red indicating higher error). The U bases in the query sequence were adjusted to T automatically by the minimap program in order to map to the reference sequence which was DNA.
Figure 2Depth of coverage and consensus accuracy across the porcine reproductive and respiratory syndrome virus (PRRSV) whole genome. Raw reads were mapped to the longest raw read which served as a scaffold to generate a consensus genome. The coverage distribution (left y-axis, black closed circles) was evaluated by Qualimap. The consensus accuracy (right y-axis, grey open square, dashed line) was generated using a Geneious pair-wise alignment. Both the average coverage and accuracy were evaluated using a window size of 1000 bp and visualized using GraphPad prism software.
Detection power of MinION direct RNA sequencing at different sequence yields a.
| Sequencing Yield (kb) | Number of Total Reads | Longest Read | Consensus Sequence | Coverage | |||
|---|---|---|---|---|---|---|---|
| Length (bp) | Accuracy | Length (bp) | Accuracy | Breadth b | Depth c | ||
| 3 | 5 | 1606 | 84.2% | - | - | - | - |
| 15 | 109 | 1899 | 88.1% | 1861 | 92.10% | 12.3% | 1 |
| 75 | 443 | 4036 | 84.1% | 4081 | 91.95% | 26.9% | 5 |
| 150 | 790 | 4496 | 83.4% | 4548 | 92.45% | 30.0% | 10 |
| 750 | 3500 | 7533 | 84.9% | 7609 | 92.55% | 50.1% | 49 |
| 1500 | 6857 | 8382 | 85.8% | 8435 | 94.20% | 55.6% | 99 |
| 7500 | 32,571 | 12,990 | 86.6% | 12,988 | 95.20% | 85.5% | 494 |
| 15,000 | 64,860 | 15,043 | 86.7% | 15,101 | 95.20% | 99.5% | 988 |
| 30,000 | 127,411 | 15,081 | 86.0% | 15,171 | 95.50% | 99.9% | 1976 |
a average of two sequence runs; b consensus genome length (bp)/reference genome length (bp); c sequencing yield (bp)/reference genome size (bp).
Analytical sensitivity of direct PRRSV RNA sequencing.
| Sample Type | Viral Copies/ Reaction | # of Total Reads | Top BLAST Match | Identity to ORF5/Whole Genome % | # of PRRSV Reads | Consensus/Longest Read | |
|---|---|---|---|---|---|---|---|
| Length (bp) | Accuracy % | ||||||
| Cell supernatant | 5.9 × 109 | 19,198 | KC469618.1 | 100.0/99.9 | 1247 | 8282 | 94.5 |
| 2.0 × 109 | 23,068 | KC469618.1 | 100.0/99.9 | 949 | 7167 | 93.8 | |
| 1.7 × 109 | 83,192 | KC469618.1 | 100.0/99.9 | 831 | 9187 | 93.2 | |
| 6.8 × 108 | 116,698 | KC469618.1 | 100.0/99.9 | 699 | 5975 | 93.1 | |
| 3.7 × 108 | 118,879 | KC469618.1 | 100.0/99.9 | 422 | 6028 | 93.7 | |
| Spike-in | 1.5 × 109 | 322,778 | KC469618.1 | 100.0/99.9 | 1589 | 15,021 | 93.0 |
| 9.4 × 106 | 300,143 | KC469618.1 | 100.0/99.9 | 45 | 3743 | 90.5 | |
| 3.4 × 104 | 161,569 | CS484777.1 | 99.0/99.4 | 3 | 905 * | 82.1 | |
| Clinical | 1.4 × 108 | 77,468 | MF327000.1 | 96.8/- | 42 | 1984 | 90.5 |
| 2.4 × 107 | 266,120 | KX192112.1 | 97.0/- | 16 | 2431 | 88.4 | |
| 3.8 × 106 | 286,680 | KT581982.1 | 94.4/- | 6 | 940 * | 83.7 | |
| 2.3 × 105 | 201,887 | ND | - | 0 | - | - | |
| 6.5 × 104 | 240,944 | ND | - | 0 | - | - | |
| 3.2 × 104 | 307,822 | ND | - | 0 | - | - | |
ND: not detected; * Longest raw read was used.
Mapping status of direct RNA sequencing on samples containing multiple viral strains.
| Groups | Total Yield (kb) | # of Total Reads | # of PRRSV Reads | PRRSV Reads /Total Reads | First Match | Second Match | Third Match | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Top BLAST Match (Identity %) | # of Matching Reads | % of PRRSV Reads | Top BLAST Match (identity %) | # of Matching Reads | % of PRRSV Reads | Top BLAST Match (identity %) | # of Matching Reads | % of PRRSV Reads | |||||
| Control | 30 | 25 | 20 | 80% | KC469618.1 (99.9) | 20 | 100% | ND | |||||
| 300 | 245 | 210 | 86% | KC469618.1 (99.9) | 208 | 99% | ND | ||||||
| 3000 | 2451 | 2079 | 85% | KC469618.1 (99.9) | 2044 | 98% | ND | ||||||
| 30,000 | 24,512 | 20,819 | 85% | KC469618.1 (99.9) | 20,472 | 98% | ND | ||||||
| VR2332 + SDEU mixed sample | 30 | 38 | 23 | 61% | KC469618.1 (99.9) | 19 | 83% | ND | |||||
| 300 | 375 | 234 | 62% | KC469618.1 (99.9) | 194 | 83% | CS421743.1 (99.8) | 35 | 15% | ND | |||
| 3000 | 3748 | 2281 | 61% | KC469618.1 (99.9) | 1722 | 75% | SDEU (100.0) | 514 | 23% | ND | |||
| 30,000 | 37,478 | 23,004 | 61% | KC469618.1 (99.9) | 17,610 | 77% | SDEU (100.0) | 4879 | 21% | ND | |||
| VR2332 + 1-7-4 mixed sample | 30 | 34 | 26 | 76% | JA894280.1 (100.0) | 18 | 69% | ND | |||||
| 300 | 335 | 272 | 81% | KC469618.1 (99.9) | 224 | 82% | 1-7-4 (100.0) | 38 | 14% | ND | |||
| 3000 | 3351 | 2699 | 81% | KC469618.1 (99.9) | 2287 | 85% | SDEU (100.0) | 14 | 1% | 1-7-4 (100.0) | 348 | 13% | |
| 30,000 | 33,512 | 26,917 | 80% | KC469618.1 (99.9) | 22,668 | 84% | SDEU (100.0) | 136 | 1% | 1-7-4 (100.0) | 3633 | 13% | |
ND: not detected.