| Literature DB >> 32866812 |
Ellen C Carbo1, Igor A Sidorov2, Jessika C Zevenhoven-Dobbe2, Eric J Snijder2, Eric C Claas2, Jeroen F J Laros3, Aloys C M Kroes2, Jutte J C de Vries2.
Abstract
INTRODUCTION: The SARS-CoV-2 pandemic of 2020 is a prime example of the omnipresent threat of emerging viruses that can infect humans. A protocol for the identification of novel coronaviruses by viral metagenomic sequencing in diagnostic laboratories may contribute to pandemic preparedness. AIM: The aim of this study is to validate a metagenomic virus discovery protocol as a tool for coronavirus pandemic preparedness.Entities:
Keywords: COVID-19; SARS-CoV-2; bioinformatics; metagenomics; virus discovery
Mesh:
Year: 2020 PMID: 32866812 PMCID: PMC7441049 DOI: 10.1016/j.jcv.2020.104594
Source DB: PubMed Journal: J Clin Virol ISSN: 1386-6532 Impact factor: 3.168
Fig. 1Centrifuge classification results of viral reads of SARS-CoV-2, SARS-CoV, and MERS-positive samples, using viral metagenomic databases created before the emergence of these viruses. A) SARS-CoV-2, B) SARS-CoV, C) MERS.
Classification of SARS-CoV-2, SARS-CoV, and MERS sequence reads using reference databases created before their emergence, using metagenomic classifier Centrifuge.
| Sample | Untargeted mNGS, or viral enrichment by capture probes | Total number of non-human reads | Number of reads classified as | |
|---|---|---|---|---|
| SARS-CoV-2 | Untargeted | 3,488,842 | 2,166 (0.06) | SARS-CoV |
| Viral capture | 9,582,942 | 3,518,798 (36.72) | SARS-CoV | |
| SARS-CoV-2 | Untargeted | 919,930 | 604 (0.07) | SARS-CoV |
| Viral capture | 9,894,246 | 572,061 (5.78) | SARS-CoV | |
| SARS-CoV Frankfurt-1 | Untargeted | 6,936,399 | 436 (0.006) | Bovine coronavirus |
| MERS-CoV EMC/2012 | Untargeted | 8,201,535 | 8,748 (0.1) | Bat coronavirus BM48-31/BGR/2008 |
Enrichment by capture probes targeting vertebrate viruses designed in 2015
Classification of SARS-CoV-2, SARS-CoV, and MERS de novo assembled contigs using BLAST.
| Sample | Untargeted mNGS, or viral enrichment by capture probes | Total contigs | Viral contigs | Length of the longest | BLAST alignment length, bp | BLAST identity match, % | Subject taxonomy name | Release year of sequence of the species | Release year of sequence of the subject found | |
|---|---|---|---|---|---|---|---|---|---|---|
| SARS-CoV-2 | Untargeted | 8,606 | 15 | 3 | 19,654 | 12,069 | 87.141 | Bat SARS SL CoVZC45 | 2003 | 2018 |
| Viral capture | 8,232 | 51 | 31 | 5,811 | 5,820 | 90.567 | Bat SARS SL CoVZC45 | 2003 | 2018 | |
| SARS-CoV-2 | Untargeted | 2,815 | 31 | 16 | 2,503 | 2,456 | 91.450 | Bat SARS SL CoVZXC21 | 2003 | 2018 |
| Viral capture | 2,110 | 39 | 13 | 4,866 | 4,856 | 92.360 | Bat SARS SL CoVZC45 | 2003 | 2018 | |
| SARS-CoV Frankfurt-1 | Untargeted | 3,836 | 10 | 1 | 29,692 | 1,236 | 72.411 | Bovine coronavirus isolate 4-17-03 | 2001 | 2018 |
| MERS-CoV EMC/2012 | Untargeted | 4,074 | 9 | 1 | 30,097 | 14,856 | 77.248 | Bat coronavirus HKU4-1 | 2006 | 2006 |
Table showing the total number of built contigs with a length > = 500bp, the number of these contigs where the hit with the lowest E-value would be a hit to viruses, the number of contigs where the hit with the lowest E-value would be a hit to Coronaviridae and of this last group the length of the longest contig, the alignment length, identity match, taxonomic name of BLAST result and the release years of sequences belonging to the species and subjects found by BLAST.
Enrichment by capture probes targeting vertebrate viruses designed in 2015
Fig. 2Discovery performance using metagenomic sequencing (A) without and (B) with enrichment by capture probes targeting vertebrate viruses, designed in 2015. Genome Detective classification of reads, coverage and aligment against the genome of Severe acute respiratory syndrome-related coronavirus are shown.
Fig. 3Coverage map of alignment against SARS-CoV-2 reference sequence NC_004718.2, without (left) and with (right) viral capture probes designed in 2015 after metagenomic sequencing of patient samples with respectively Cq 20 (upper graphs) and Cq 30 (lower graphs).