| Literature DB >> 28424887 |
Yu-Nong Gong1,2, Shu-Li Yang1,3, Guang-Wu Chen1,2,4, Yu-Wen Chen5, Yhu-Chering Huang5,6, Hsiao-Chen Ning1,3, Kuo-Chien Tsao7,8,9.
Abstract
Metagenomic approaches to detect viral genomes and variants in clinical samples have various challenges, including low viral titers and bacterial and human genome contamination. To address these limitations, we examined a next-generation sequencing (NGS) and iterative mapping approach for virus detection in clinical samples. We analyzed 40 clinical specimens from hospitalized children diagnosed with acute bronchiolitis, croup, or respiratory tract infections in which virus identification by viral culture or polymerase chain reaction (PCR) was unsuccessful. For our NGS data analysis pipeline, clinical samples were pooled into two NGS groups to reduce sequencing costs, and the depth and coverage of assembled contigs were effectively increased using an iterative mapping approach. PCR was individually performed for each specimen according to the NGS-predicted viral type. We successfully detected previously unidentified respiratory viruses in 26 of 40 specimens using our proposed NGS pipeline. Two dominant populations within the detected viruses were human rhinoviruses (HRVs; n = 14) and human coronavirus NL63 (n = 8), followed by human parainfluenza virus (HPIV), human parechovirus, influenza A virus, respiratory syncytial virus (RSV), and human metapneumovirus. This is the first study reporting the complete genome sequences of HRV-A101, HRV-C3, HPIV-4a, and RSV, as well as an analysis of their genetic variants, in Taiwan. These results demonstrate that this NGS pipeline allows to detect viruses which were not identified by routine diagnostic assays, directly from clinical samples.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28424887 PMCID: PMC7087367 DOI: 10.1007/s00705-017-3367-4
Source DB: PubMed Journal: Arch Virol ISSN: 0304-8608 Impact factor: 2.574
Fig. 1Read classification by BLASTN searches
Genomic depths and coverages of viral contigs obtained by NGS using an iterative mapping approach. Sixteen viruses were identified in two NGS pools using the proposed NGS pipeline. Genomic depths of assembled contigs for each virus type were averaged. Genomic coverages were estimated as the covered regions of contig(s) divided by the length of the coding sequence (CDS). Covered regions of assembled contigs were defined as the sum of aligned regions of contigs in BLASTN searches. The CDS lengths for HCoV-NL63, HPIV-4a, and RSV were approximately 27,000, 14,450, and 13,600 bp, respectively, and for HRV and HPeV they were approximately 6450 bp. The CDS lengths for the PB2, PB1, PA, HA, NP, NA, MP, and NS segments were approximately 2280, 2274, 2151, 1701, 1497, 692, 409, and 614 bp, respectively. In iterative mapping, genomics depths were examined by the average of position-specific read counts, and genomic coverages were defined as the sum of aligned regions of read mapping
| NGS pool |
| Iterative mapping | |||||
|---|---|---|---|---|---|---|---|
| NGS-predicted candidate | Contig count | Genomic coverage (%) | Average depth | Iterations | Genomic coverage (%) | Average depth | |
| 1 | HCoV-NL63 | 123 | 100 | 570.0 | 1 | 100 | 7001.1 |
| HPeV-1 | 3 | 97.0 | 376.9 | 3 | 100 | 644.0 | |
| HPIV-4a | 17 | 48.4 | 17.1 | 6 | 99.8 | 65.7 | |
| HRV-A101 | 5 | 95.0 | 46.2 | 3 | 100 | 81.1 | |
| HRV-C3 | 3 | 96.0 | 174.0 | 3 | 99.9 | 275.2 | |
| HRV-C4 | 9 | 56.3 | 10.3 | 3 | 95.2 | 16.9 | |
| HRV-C40 | 1 | 100 | 690.0 | 3 | 100 | 743.7 | |
| IAV* | 19 | 66.3, 69.1, 100, 99.5, 87.3, 100, 100, and 85.8 | 5.8, 21.1, 13.6, 11.6, 16.4, 6.4, 6.5, and 13.7 | 4 | 95.1, 77.5, 100, 98.9, 88.2, 84.7, 100, and 86.4 | 11.4, 19.3, 18.3, 18.5, 27.5, 7.1, 8.3, and 10.0 | |
| 2 | HCoV-NL63 | 59 | 22.0 | 149.0 | 1 | 100 | 773.8 |
| HPeV-1 | 1 | 4.0 | 2.2 | 2 | 19.4 | 1.6 | |
| HPIV-4a | 22 | 44.8 | 167.0 | 2 | 100 | 341.6 | |
| HRV-B92 | 4 | 64.3 | 23.5 | 5 | 83.1 | 42.0 | |
| HRV-C6 | 5 | 43.0 | 29.5 | 5 | 42.1 | 34.6 | |
| HRV-C40 | 6 | 44.3 | 28.5 | 5 | 85.0 | 38.1 | |
| IAV | 1 | 17.4 (PA) | 6.1 (PA) | 1 | 3.5 (PB1), 37.2 (PA) | 1.0 (PB1), 4.0 (PA) | |
| RSV | 33 | 33.6 | 45.6 | 6 | 100 | 180.2 | |
* For IAV, eight depth or coverage estimates for the PB2, PB1, PA, HA, NP, NA, MP, and NS genes are presented in order
Fig. 2Read distributions of viral genomes. Read distributions of the viral genomes revealed by read mapping: (A) HCoV-NL63, (B) HPeV-1, (C) HPIV-4a, (D-K) from PB2 to NS gene of IAV, (L) HRV-A101, (M) HRV-C3, (N) HRV-C4, and (O) HRV-C40 in NGS 1; (P) HCoV-NL63, (Q) HPeV-1, (R) HPIV-4a, (S) RSV, (T) HRV-B92, (U) HRV-C6, (V) HRV-C40, and (W) PA gene of IAV in NGS 2. Read distributions in NGS 1 showed >77.0% coverage and average depths from 7.1 to 7001.1, and in NGS 2 showed >83.0% coverage and average depths from 38.1 to 773.8, except for HPeV-1, HRV-C6, and IAV. The read distribution of IAV PB1 gene in NGS 2 was not shown, due to only one read mapping to this gene
PCR confirmation of the 40 clinical samples. Virus-specific PCR assays for each of the 40 isolates were performed to confirm NGS-predicted viruses and to validate their presence. PCR results agreed with NGS predictions in NGS 1; however, hMPV, HRV-C3, and an unclassified type of HRV-C detected by PCR and HPeV-1 detected by NGS were inconsistent in NGS 2 and are marked in bold
| NGS pool: prediction | Sample ID | PCR-validated candidates |
|---|---|---|
| 1: HCoV-NL63, HPeV-1, HPIV-4a, HRV-A101, -C3, -C4, and -C40, and IAV | 1 | IAV |
| 2 | HRV-C3 | |
| 3 | HCoV-NL63 | |
| 4 | HRV-C4 | |
| 5 | HCoV-NL63 | |
| 6 | HRV-A101 | |
| 7 | HPIV-4a | |
| 8 | — | |
| 9 | HRV-C40 | |
| 10 | HCoV-NL63, HPeV-1 | |
| 11 | HRV-C40 | |
| 12 | — | |
| 13 | HCoV-NL63 | |
| 14 | — | |
| 15 | HCoV-NL63 | |
| 16 | — | |
| 17 | — | |
| 18 | HRV-C40 | |
| 19 |
| |
| 20 | HCoV-NL63 | |
| 2: HCoV-NL63, | 21 | HRV-C40 |
| 22 |
| |
| 23 | HRV-C6 | |
| 24 | — | |
| 25 | HCoV-NL63 | |
| 26 | RSV | |
| 27 | HRV-B92 | |
| 28 | — | |
| 29 | IAV, HPIV-4a | |
| 30 | — | |
| 31 | — | |
| 32 | — | |
| 33 | HRV-C40 | |
| 34 | HRV-C6 | |
| 35 |
| |
| 36 | HRV-B92 | |
| 37 | — | |
| 38 | — | |
| 39 | — | |
| 40 | HCoV-NL63, |
Genetic variants in heterogeneous populations identified by read mapping
| NGS pool | Sample ID | Viral type | Gene | Nonsynonymous mutation | Variant nucleotide position (total depth: nucleotide composition/frequency) |
|---|---|---|---|---|---|
| 1 | 2 | HRV-C3 | VP2 | L307I | 919 (135: T/100, A/27, C/7, G/1) |
| VP1 | D622Y | 1864 (79: G/58, T/15, C/5, A/1) | |||
| N623K | 1869 (69: C/50, A/19) | ||||
| 2C | Q1486K | 4456 (234: C/164, A/46, T/19, G/5) | |||
| P1487Q/T/K | 4459 (194: C/120, A/59, T/15); 4460 (186: C/131, A/49, T/5 G/1) | ||||
| V1489I | 4465 (133: G/93, A/26, C/7, T/7) | ||||
| 3D | I1976N/K/L/H/Q | 5926 (372: A/240, C/90, T/29, G/13); 5927 (328: T/201, A/68, C/52, G/7); 5928 (271: T/131, A/63, C/48, G/29) | |||
| F1977I | 5929 (218: T/129 A/58 C/26 G/5) | ||||
| I1980N/K | 5939 (82: T/58, A/19, G/5); 5940 (80: T/56, A/14, G/7, C/3) | ||||
| I1981L | 5941 (76: A/55, T/10, G/9, C/2) | ||||
| 6 | HRV-A101 | VP4 | T15K | 44 (90: C/66, A/24) | |
| 7 | HPIV-4a | M | C201R | 604 (88: T/44, A/40, G/4) | |
| L202M | 609 (117: A/81, G/21, T/15) | ||||
| L | R3595G | 754 (36: A/24, G/12) | |||
| 10 | HPeV-1 | VP1 | N589K | 1767 (1054: T/766, A/161, G/103, C/24) | |
| 2A | K918N | 2754 (21: A/15, C/3, G/2, T/1) | |||
| G919V | 2756 (21: G/15, T/4, A/2) | ||||
| 3D | Y1770N | 5308 (302: T/221, A/53, C/28) | |||
| F2126L | 6376 (430: T/320, C/75, A/34, G/1) | ||||
| 2 | 26 | RSV | NS1 | S99P | 295 (740: T/530, C/124, A/76, G/10) |
| NS2 | D2N | 4 (75: G/53, A/21, C/1) | |||
| N | M257L | 769 (94: A/70, T/14, G/6, C/4) | |||
| L258F | 774 (106: A/75, T/22, G/6, C/3) | ||||
| E388D | 1164 (21: G/10, T/10, C/1) | ||||
| F | R49G | 145 (50: A/36, G/7, T/6, C/1) | |||
| T50S | 148 (66: A/46, T/15, G/5) | ||||
| M2 | I173T | 518 (71: T/50, C/16, A/5); 519 (68: C/49, T/15, A/4) | |||
| N174H | 520 (62: A/45, C/15, T/2) | ||||
| L | L196S | 587 (30: T/16, C/14) | |||
| Q198R | 593 (39: A/26, G/9, T/4) | ||||
| Y724S | 2171 (26: A /17, C/7, T/2) | ||||
| S767G | 2299 (52: A/34, G/13, T/3, C/2) | ||||
| S910T | 2729 (54: G/38, C/15, T/1) | ||||
| N969D | 2905 (35: A/20, G/15) | ||||
| N991D | 2971 (57: A/42, G/8, C/7) | ||||
| R1256G | 3766 (52: A/29, G/22, T/1) | ||||
| 29 | HPIV-4a | NP | R216S | 648 (25: A/18, C/5, G/2) | |
| A268E | 803 (128: C/92, A/32, T/3, G/1) | ||||
| P345L | 1034 (1801: C/1265, T/451, A/59, G/26) |