| Literature DB >> 34903237 |
Shiyang Song1, Liangxiao Ma2, Xintian Xu1, Han Shi3, Xuan Li3, Yuanhua Liu4, Pei Hao5.
Abstract
BACKGROUND: Virus screening and viral genome reconstruction are urgent and crucial for the rapid identification of viral pathogens, i.e., tracing the source and understanding the pathogenesis when a viral outbreak occurs. Next-generation sequencing (NGS) provides an efficient and unbiased way to identify viral pathogens in host-associated and environmental samples without prior knowledge. Despite the availability of software, data analysis still requires human operations. A mature pipeline is urgently needed when thousands of viral pathogen and viral genome reconstruction samples need to be rapidly identified.Entities:
Keywords: Epidemic; Metagenomics data; Pathogen screening; SARS-CoV-2; Viral genome assembly
Mesh:
Year: 2021 PMID: 34903237 PMCID: PMC8668262 DOI: 10.1186/s12920-021-01138-z
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Viral pathogens Identification Workflow. VIW is consisted of four modules, (1) data preprocessing; (2) virus detection; (3) viral genome assembly and (4) report generation
Fig. 2Runtimes needed for the assembling of samples with different sizes by three different methods. The grey line shows the time required for assembly with the bam file contains all the non-host sequences, called it BWA method; the blud line shows the time required for assembly with the sam file contains only the virus sequences detected by FastViromeExplorer, called it FastViromeExplorer + BWA method; and the orange line shows the time required for directly extracted the alignments from the sam output of FastViromeExplorer, called it FastViromeExplorer method. The x-axis is the number of virus reads detected by FastViromeExplorer. And the y-axis is the time used. As the figure shows, when the file size increase, the time used by Bwa method grows polynomially, but the time used by FastViromExplorer + BWA and FastViromeExplorer methods grow linearly
Overview of virus proportion from Wuhan patients
| Sample ID | Host % | Virus % | FastViromeExplorer + BWA (n = 3) | FastViromeExplorer (n = 3) | BWA (n = 3) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Time | SD | cov. % | #. SNV | Time | SD | cov. % | #. SNV | Time | SD | cov. % | #. SNV | |||
| 1 | 51.18 | 0.97 | 1m7s | 0m29s | 99.90 | 0 | 0m47s | 0m3s | 99.93 | 1 | 30m20s | 2m41s | 100.00 | 0 |
| 2 | 29.29 | 58.83 | 1m42s | 0m19s | 99.89 | 2 | 0m28s | 0m4s | 100.00 | 1 | 1m53s | 0m28s | 99.92 | 2 |
| 3 | 25.68 | 60.98 | 2m12s | 0m6s | 99.98 | 1 | 0m45s | 0m5s | 100.00 | 0 | 2m35s | 0m7s | 100.00 | 0 |
| 4 | 12.22 | 3.4 | 0m34s | 0m9s | 98.94 | 35 | 0m15s | 0m1s | 99.31 | 1 | 0m56s | 0m5s | 98.98 | 35 |
| 5 | 0.8 | 0.36 | 0m10s | 0m1s | 56.92 | 25 | 0m18s | 0m2s | 66.82 | 0 | 4m39s | 0m21s | 59.37 | 25 |
| 6 | 43.42 | 8.82 | 0m32s | 0m10s | 87.91 | 48 | 0m38s | 0m3s | 93.74 | 982 | 1m20s | 0m18s | 87,93 | 48 |
| 7 | 46.94 | 20.15 | 2m17s | 0m18s | 99.98 | 0 | 2m7s | 0m5s | 100.00 | 2 | 15m52s | 2m36s | 100.00 | 2 |
| 8 | 10.42 | 9.56 | 0m47s | 0m12s | 92.44 | 41 | 0m52s | 0m4s | 99.48 | 287 | 17m26s | 2m51s | 99.36 | 40 |
| 9 | 19.01 | 14.65 | 8m51s | 0m54s | 99.98 | 3 | 6m25s | 1m7s | 100.00 | 1 | 26m55s | 3m6s | 99.98 | 2 |
| 10 | 2.02 | 86.87 | 71m23s | 7m19s | 100.00 | 0 | 43m33s | 1m29s | 100.00 | 0 | 91m52s | 10m44s | 100.00 | 0 |
Overview of virus MERS NC_019843.3 in MERS infected MRC5 cells
| Sample ID | Host % | Virus % | FastViromeExplorer + BWA (n = 3) | FastViromeExplorer (n = 3) | BWA (n = 3) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Time | SD | cov. % | #. SNV | Time | SD | cov. % | #. SNV | Time | SD | cov. % | #. SNV | |||
| 11 | 32.68 | 65.73 | 111m12s | 3m15s | 100.00 | 5 | 16m15s | 2m7s | 100.00 | 4 | 166m38s | 9m0s | 100.00 | 6 |
| 12 | 40.07 | 58.10 | 121m17s | 17m3s | 99.90 | 5 | 16m16s | 2m50s | 100.00 | 5 | 163m10s | 17m3s | 100.00 | 5 |
| 13 | 38.45 | 59.85 | 144m15s | 12m53s | 100.00 | 5 | 25m36s | 3m38s | 100.00 | 4 | 228m17s | 17m14s | 100.00 | 5 |
| 14 | 40.14 | 57.96 | 123m19s | 16m10s | 98.86 | 5 | 25m12s | 4m13s | 100.00 | 4 | 234m6s | 13m53s | 100.00 | 5 |
| 15 | 32.23 | 66.20 | 193m33s | 7m25s | 100.00 | 4 | 31m40s | 4m1s | 100.00 | 4 | 540m25s | 54m25s | 100.00 | 5 |
Overview of virus MERS NC_038294.1 in MERS-CoV infected MRC5 cells
| Sample ID | Host % | Virus % | FastViromeExplorer + BWA (n = 3) | FastViromeExplorer (n = 3) | BWA (n = 3) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Time | SD | cov. % | #. SNV | Time | SD | cov. % | #. SNV | Time | SD | cov. % | #. SNV | |||
| 11 | 32.68 | 65.73 | 111m12s | 3m15s | 99.99 | 106 | 16m15s | 2m7s | 100 | 31 | 166m38s | 9m0s | 100 | 99 |
| 12 | 40.07 | 58.1 | 121m17s | 17m3s | 100 | 106 | 16m16s | 2m50s | 100 | 21 | 163m10s | 17m3s | 100 | 96 |
| 13 | 38.45 | 59.85 | 144m15s | 12m53s | 100 | 105 | 25m36s | 3m38s | 100 | 24 | 228m17s | 17m14s | 100 | 99 |
| 14 | 40.14 | 57.96 | 123m19s | 16m10s | 98.83 | 99 | 25m12s | 4m13s | 100 | 19 | 234m6s | 13m53s | 100 | 96 |
| 15 | 32.23 | 66.2 | 193m33s | 7m25s | 100 | 105 | 31m40s | 4m1s | 100 | 31 | 540m25s | 54m25s | 100 | 99 |
Overview of Sendai virus in pangolins’ lung samples
| Sample ID | Host % | Virus % | FastViromeExplorer + BWA (n = 3) | FastViromeExplorer (n = 3) | BWA (n = 3) | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Time | SD | cov. % | #. SNV | Time | SD | cov. % | #. SNV | Time | SD | cov. % | #. SNV | |||
| Lung01 | 8.09 | 1.49 | 0m21s | 0m16s | 0 | 0 | 0m6s | 0m0.2 s | 53.66 | 222 | 16m8s | 3m28s | 1.94 | 0 |
| Lung02 | 19.25 | 3.67 | 0m21s | 0m5s | 6.37 | 0 | 0m21s | 0m1s | 72.2 | 861 | 43m57s | 10m30s | 10.47 | 4 |
| Lung03 | 8.32 | 2.16 | 0m34s | 0m17s | 0 | 0 | 0m7s | 0m1s | 22.19 | 23 | 16m25s | 1m14s | 0 | 0 |
| Lung04 | 12.29 | 2.23 | 0m18s | 0m20s | 8.66 | 6 | 0m5s | 0m0.1 s | 75.27 | 915 | 9m31s | 1m2s | 9.91 | 6 |
| Lung07 | 9.09 | 0.63 | 1m21s | 0m30s | 5.67 | 0 | 0m22s | 0m3s | 61.76 | 469 | 56m48s | 7m15s | 5.88 | 9 |
| Lung08 | 11.07 | 0.67 | 0m43s | 0m26s | 5.16 | 0 | 0m19s | 0m2s | 72.07 | 743 | 25m46s | 3m38s | 7.36 | 0 |
| Lung09 | 8.03 | 0.31 | 0m22s | 0m4s | 9.06 | 0 | 0m5s | 0m0.2 s | 74.32 | 957 | 30m9s | 2m41s | 9.33 | 0 |
| Lung11 | 9.32 | 0.69 | 0m47s | 0m39s | 0.97 | 0 | 0m5s | 0m1s | 18.65 | 31 | 23m5s | 3m11s | 0.97 | 0 |
| Lung12 | 6.82 | 1.03 | 0m1s | 0m0.1 s | NA | NA | 0m2s | 0m1s | NA | NA | 0m0.1 s | 0m0.05 s | NA | NA |
| Lung13 | 21.76 | 4.47 | 0m21s | 0m19s | NA | NA | 0m13s | 0m1s | NA | NA | 10m22s | 1m39s | NA | NA |
| Lung19 | 7.47 | 3.82 | 0m23s | 0m2s | 13.23 | 7 | 0m14s | 0m1s | 87.86 | 1013 | 47m59s | 4m1s | 14.5 | 7 |
Overview of SARS-CoV-2 virus in pangolins’ lung samples
| Sample ID | Host % | Virus % | FastViromeExplorer + BWA (n = 3) | FastViromeExplorer (n = 3) | BWA (n = 3) | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Time | cov. % | #. SNV | Time | cov. % | #. SNV | Time | cov. % | #. SNV | |||
| Lung07 | 1.32 | 0.62 | 1m21s | 7 | 43 | 0m22s | 24.88 | 469 | 56m48s | 8.86 | 41 |
| Lung08 | 2.05 | 0.65 | 0m43s | 11.53 | 37 | 0m19s | 51.58 | 743 | 25m46s | 13.87 | 37 |