| Literature DB >> 31837435 |
Sander van Boheemen1, Anneloes L van Rijn2, Nikos Pappas3, Ellen C Carbo1, Ruben H P Vorderman3, Igor Sidorov1, Peter J van T Hof3, Hailiang Mei3, Eric C J Claas1, Aloys C M Kroes1, Jutte J C de Vries1.
Abstract
Viruses are the main cause of respiratory tract infections. Metagenomic next-generation sequencing (mNGS) enables unbiased detection of all potential pathogens. To apply mNGS in viral diagnostics, sensitive and simultaneous detection of RNA and DNA viruses is needed. Herein, were studied the performance of an in-house mNGS protocol for routine diagnostics of viral respiratory infections with potential for automated pan-pathogen detection. The sequencing protocol and bioinformatics analysis were designed and optimized, including exogenous internal controls. Subsequently, the protocol was retrospectively validated using 25 clinical respiratory samples. The developed protocol using Illumina NextSeq 500 sequencing showed high repeatability. Use of the National Center for Biotechnology Information's RefSeq database as opposed to the National Center for Biotechnology Information's nucleotide database led to enhanced specificity of classification of viral pathogens. A correlation was established between read counts and PCR cycle threshold value. Sensitivity of mNGS, compared with PCR, varied up to 83%, with specificity of 94%, dependent on the cutoff for defining positive mNGS results. Viral pathogens only detected by mNGS, not present in the routine diagnostic workflow, were influenza C, KI polyomavirus, cytomegalovirus, and enterovirus. Sensitivity and analytical specificity of this mNGS protocol were comparable to PCR and higher when considering off-PCR target viral pathogens. One single test detected all potential viral pathogens and simultaneously obtained detailed information on detected viruses.Entities:
Mesh:
Year: 2019 PMID: 31837435 PMCID: PMC7106021 DOI: 10.1016/j.jmoldx.2019.10.007
Source DB: PubMed Journal: J Mol Diagn ISSN: 1525-1578 Impact factor: 5.568
Figure 1The bioinformatic workflow of the metagenomic next-generation sequencing protocol studied. NCBI, National Center for Biotechnology Information.
Internal Controls EAV/PhHV-1: Serial Dilutions against a Clinical Sample Background and Within-Run Precision (INFA)
| Sample EAV/PhHV-1 dilution | Cq value | Centrifuge reads (log) | ||||
|---|---|---|---|---|---|---|
| INFA | EAV | PhHV-1 | INFA | EAV | PhHV-1 | |
| 1:100 | 24.52 | 21.59 | 23.52 | 4438 (3.6) | 12,925 (4.1) | 347 (2.5) |
| 1:1000 | 24.67 | 24.91 | 26.83 | 3742 (3.6) | 1202 (3.1) | 49 (1.7) |
| 1:10,000 | 24.76 | 28.45 | 30.33 | 4628 (3.7) | 95 (2.0) | 14 (1.1) |
| 1:100,000 | 24.79 | 30.85 | 32.55 | 4093 (3.6) | 18 (1.3) | 14 (1.1) |
Cq, quantification cycle; EAV, equine arteritis virus; INFA, influenza A virus; PhHV-1, phocine herpesvirus 1.
Figure 2Comparison of fragmentation methods on target reads (species level, log scale). Asterisks indicate not tested with Bioruptor setting high intensity. ADV, adenovirus; HBOV, human bocavirus; INFC, influenza C virus; NL63, coronavirus NL63; PIV, parainfluenza virus; RSV, respiratory syncytial virus.
Figure 3Serial dilutions of an influenza A–positive clinical sample. Cq, quantification cycle.
Figure 4Analysis of in silico simulated equine arteritis virus (EAV) reads with the different bioinformatic settings of the Centrifuge pipeline.
Figure 5Spurious Lassa virus reads detected using the National Center for Biotechnology Information’s (NCBI's) nucleotide database (top), versus NCBI's RefSeq database (bottom). Black arrow points to the spurious Lassa virus reads. dsDNA, double-stranded DNA; ssRNA, single-stranded RNA.
Comparison of Analysis of 1 Million versus 10 Million Reads
| Virus | Virus family | Cq value | 10 million reads | 1 million reads | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Total reads | Virus family reads | % of total | % of viral | Total reads | Virus family reads | % of total | % of viral | |||
| RV | 37.7 | 8,203,894 | 8941 | 0.06 | 84.37 | 822,218 | 889 | 0.07 | 86.11 | |
| PIV4 | 24.9 | 10,886,798 | 2136 | 0.04 | 41.90 | 1,088,067 | 199 | 0.08 | 40.73 | |
| CMV | 34.5 | 15,889,428 | 22 | 00.01 | 10.88 | 1,588,922 | 2 | 0.04 | 11.87 | |
| ADV | 30.2 | 11,146,488 | 0 | 0 | 0 | 1,115,135 | 0 | 0.03 | 0 | |
| RSV | 27.3 | 10,191,995 | 1477 | 0.02 | 53.29 | 1,019,415 | 163 | 0.04 | 59.25 | |
| INFB | 30 | 8,535,672 | 652 | 0.01 | 48.67 | 853,149 | 61 | 0.02 | 46.58 | |
| NL63 | 36.2 | 10,386,928 | 0 | 0 | 0 | 1,038,469 | 0 | 0.02 | 0 | |
| INFA | 27.5 | 10,981,601 | 8403 | 0.11 | 70.28 | 1,097,872 | 855 | 0.17 | 69.84 | |
| MPV | 34.1 | 12,972,626 | 2 | 0 | 0.10 | 1,297,151 | 0 | 0.02 | 0 | |
| HBOV | 32.2 | 11,819,805 | 0 | 0 | 0 | 1,181,738 | 0 | 0 | 0 | |
| RV | 23.1 | 11,819,805 | 58,695 | 0.42 | 84.27 | 1,183,738 | 5754 | 0.49 | 84.25 | |
% of total, percentage of total reads; % of viral, percentage of all viral reads; ADV, adenovirus; CMV, cytomegalovirus; Cq, quantification cycle; HBOV, human bocavirus; INFA, influenza A virus; INFB, influenza B virus; MPV, metapneumovirus; NL63, coronavirus NL63; PIV4, parainfluenza virus 4; RSV, respiratory syncytial virus; RV, rhinovirus.
Detection of qPCR Virus Positive Respiratory Samples with mNGS
| Material | Routine diagnostics | mNGS | ||||
|---|---|---|---|---|---|---|
| PCR positive | Cq value | Virus genus | Genus reads | Virus species | Species reads | |
| NP wash | RV | 30.7 | 0 | 0 | ||
| PIV1 | 17.1 | 58,619 | 56,407 | |||
| ADV | 33.6 | 0 | 0 | |||
| NP wash | MPV | 24 | 127 | 123 | ||
| BAL | NL63 | 24.4 | 1999 | 2176 | ||
| HKU1 | 28.2 | 1 | 1 | |||
| Sputum | RV | 32 | 2326 | 2204 | ||
| NP wash | INFA | 22.2 | 1490 | 1490 | ||
| NP wash | MPV | 33.4 | 1 | 3 | ||
| ADV | 19.3 | 125 | 123 | |||
| Sputum | PIV4 | 21 | 7729 | 6798 | ||
| NP wash | HBOV | 22.3 | 7 | Human bocavirus | 7 | |
| NP wash | MPV | 22.2 | 139 | 312 | ||
| NP wash | INFB | 16.5 | 4971 | 4971 | ||
| NP wash | RV | 25.4 | 8 | 6 | ||
| RSV | 30.7 | 32 | 32 | |||
| NP wash | INFB | 21.4 | 2686 | 2686 | ||
| NP wash | RSV | 17.8 | 29,900 | 22,483 | ||
| NP wash | RV | 34.4 | 0 | 0 | ||
| INFB | 22.6 | 68,972 | 68,972 | |||
| BAL | INFB | 34.8 | 0 | 0 | ||
| HBOV | 34.1 | 0 | Human bocavirus | 0 | ||
| NP wash | HKU1 | 24.3 | 534 | 535 | ||
| NP wash | RV | 16.8 | 3877 | 1721 | ||
| NP wash | RV | 27.4 | 1 | 2 | ||
| HBOV | 19 | 1014 | Human bocavirus | 1064 | ||
| NP wash | INFA | 22.1 | 657 | 657 | ||
| NP wash | RSV | 17.2 | 31,179 | 72 | ||
| NP wash | RV | 17.7 | 50,642 | 29,293 | ||
ADV, adenovirus; BAL, bronchoalveolar lavage; Cq, quantification cycle; HBOV, human bocavirus; HKU1, coronavirus HKU1; INFA, influenza A virus; INFB, influenza B virus; mNGS, metagenomic next-generation sequencing; MPV, metapneumovirus; NL63, coronavirus NL63; NP, nasopharyngeal; PIV, parainfluenza virus; qPCR, real-time quantitative PCR; RSV, respiratory syncytial virus; RV, rhinovirus.
Number of reads assigned to the genus or species of the target virus.
Sensitivity and Specificity of the mNGS Protocol Tested, Based on PCR Target Viruses, with Different Sequence Read Cutoff Levels for Defining a Positive Result
| Variable | All reads | ≥15 sequence reads | ≥50 sequence reads |
|---|---|---|---|
| Sensitivity | 83 (24/29) | 66 (19/29) | 62 (18/29) |
| Specificity | 94 (325/346) | 100 (345/346) | 100 (346/346) |
Data are given as percentage (number/total).
mNGS, metagenomic next-generation sequencing.
Figure 6Receiver-operating characteristic curve for metagenomic next-generation sequencing detection of PCR target viruses, depending on the cutoff level of the number of mapped sequence reads for defining a positive result.
Figure 7Semiquantification of the metagenomic next-generation sequencing assay for target virus detection in clinical samples with real-time quantitative PCR confirms human respiratory viruses. Cq, quantification cycle.