| Literature DB >> 30158979 |
Andrey A Ayginin1,2, Ekaterina V Pimkina1, Alina D Matsvay1,2, Anna S Speranskaya1,3, Marina V Safonova1, Ekaterina A Blinova1, Ilya V Artyushin3, Vladimir G Dedkov1,4, German A Shipulin1, Kamil Khafizov1,2.
Abstract
Advances in the next generation sequencing (NGS) technologies have significantly increased our ability to detect new viral pathogens and systematically determine the spectrum of viruses prevalent in various biological samples. In addition, this approach has also helped in establishing the associations of viromes with many diseases. However, unlike the metagenomic studies using 16S rRNA for the detection of bacteria, it is impossible to create universal oligonucleotides to target all known and novel viruses, owing to their genomic diversity and variability. On the other hand, sequencing the entire genome is still expensive and has relatively low sensitivity for such applications. The existing approaches for the design of oligonucleotides for targeted enrichment are usually involved in the development of primers for the PCR-based detection of particular viral species or genera, but not for families or higher taxonomic orders. In this study, we have developed a computational pipeline for designing the oligonucleotides capable of covering a significant number of known viruses within various taxonomic orders, as well as their novel variants. We have subsequently designed a genus-specific oligonucleotide panel for targeted enrichment of viral nucleic acids in biological material and demonstrated the possibility of its application for virus detection in bird samples. We have tested our panel using a number of collected samples and have observed superior efficiency in the detection and identification of viral pathogens. Since a reliable, bioinformatics-based analytical method for the rapid identification of the sequences was crucial, an NGS-based data analysis module was developed in this study, and its functionality in the detection of novel viruses and analysis of virome diversity was demonstrated.Entities:
Year: 2018 PMID: 30158979 PMCID: PMC6109506 DOI: 10.1155/2018/3248285
Source DB: PubMed Journal: Adv Virol ISSN: 1687-8639
Viral families and genera covered by the primer panel and control samples of viral RNA used.
|
|
|
|
|
|
|---|---|---|---|---|
| 1 | Filoviridae |
| Zaire ebolavirus | ZEBOV |
| 2 |
| Marburg virus | MARV | |
| 3 |
| N/A | ||
|
| ||||
| 4 | Arenoviridae |
| Lassa virus | LASV |
|
| ||||
| 5 | Rhabdoviridae |
| Rabies virus | RABV |
| 6 |
| N/A | ||
|
| ||||
| 7 | Coronaviridae |
| N/A | |
| 8 |
| Middle East respiratory syndrome coronavirus | MERS CoV | |
| 9 |
| N/A | ||
|
| ||||
| 10 | Reoviridae |
| Kemerovo virus | KEMV |
| 11 |
| N/A | ||
| 12 |
| N/A | ||
|
| ||||
| 13 | Paramyxoviridae |
| N/A | |
|
| ||||
| 14 | Phenuiviridae |
| Uukuniemi virus N/A | |
|
| ||||
| 15 | Hantaviridae |
| Dobrava virus | DOBV |
|
| ||||
| 16 | Nairoviridae |
| Crimean-Congo virus hemorrhagic fever virus, Paramushir virus | CCHFV, PRMV |
|
| ||||
| 17 | Peribunyaviridae |
| Inkoo virus | INKV |
|
| ||||
| 18 | Orthomyxoviridae |
| N/A | |
|
| ||||
| 19 | Picornaviridae |
| N/A | |
| 20 |
| N/A | ||
|
| ||||
| 21 | Flaviviridae |
| Tick-borne encephalitis virus, Yellow fever virus, Japanese encephalitis virus | TBEV, YFV, JEV |
|
| ||||
| 22 | Togaviridae |
| N/A | |
Primers used in the panel along with their structures.
|
|
|
|
|---|---|---|
| 1 | ebola_f | GCAATGTTCAAACACTTTGTGARGC |
| ebola_r | CTTAACACCATAGCAACGGTTR | |
|
| ||
| 2 | marburg_f | TGGACGATAGGAAATCGAGCAC |
| marburg_r | TGAACTATRTTGCCTGAGTAGTGWG | |
|
| ||
| 3 | cueva_f | GTGCCAGAACAGTTTGAACTCA |
| cueva_r | CCGAATTCTCTGGGTAACACAA | |
|
| ||
| 4 | mammarena_f | CAATMCTTGAYATGGGWCARGG |
| mammarena_r | WGATTTRAACTCTGCAACAAAYCTR | |
|
| ||
| 5 | lyssa_f | CTKGAYTATGARAARTGGAACA |
| lyssa_r | TATGTCGGRCAYARAACCTGRT | |
|
| ||
| 6 | ledante_f | AAYAATACATGGCCCACWCC |
| ledante_r | ARTATTCTCTMARMKCCCARGACAT | |
|
| ||
| 7 | alphacorona_f | GGYACHACHTCWGGTGATGS |
| alphacorona_r | GYTTACGYARRTAACCAWAAWABTC | |
|
| ||
| 8 | betacorona_f | GTGCWAAGAATAGAGCYCGCAC |
| betacorona_r | RTCACAYTTWGGRTAATCCCAACCC | |
|
| ||
| 9 | gammacorona_f | CCACATCTGCTAATGTTGCR |
| gammacorona_r | CAGAAATRTCWGCTACAAGACCYTG | |
|
| ||
| 10 | orbi_f1 | TACCGCARGATMGWATGATGAT |
| orbi_f2 | TATGTTCCWCARGATCGRATGATG | |
| orbi_r | TGCGCTCCAWAVCCATTCCA | |
|
| ||
| 11 | orthoreo_f | GTYTCGGCGCCYCAYACDYT |
| orthoreo_r | GCAGTRTGCTCAGTDGARGT | |
|
| ||
| 12 | seadorna_f | CCRCATGAYGTHATGGCYCC |
| seadorna_r | TCACCWGACTTAACWCCWGM | |
|
| ||
| 13 | henipa_f | GGTCAGARACWYTGGTGGAYGA |
| henipa_r | ARTAYGGATCACTRGCCCARTC | |
|
| ||
| 14 | phlebo_f | GATTYAATCTSTKSARRGCY |
| phlebo_r | YTATYWGYTCCAYCCAGTYYTC | |
|
| ||
| 15 | hanta_f | GCWGATGCAACWAARTGGTC |
| hanta_r | YARRTTYCCYTGYARCCART | |
|
| ||
| 16 | nairo_f | CCTTCTTTTSHGGYATGATGCA |
| nairo_r | GAAGTTAACACTGNCGAWGTWGCATG | |
|
| ||
| 17 | orthobunya_f | CWGAWGARATGATWWSTGARCCWGG |
| orthobunya_r | GCACTCCATTTWGACATRTCWG | |
|
| ||
| 18 | thogoto_f | ATCAARGAYMRRCTGAARAANA |
| thogoto_r | TCGATSYGMGGCTTTATDGM | |
|
| ||
| 19 | cardio_f | MRGGYATGGAYCCMATGGAV |
| cardio_r | AAGTTRGARTARTCYACATCRTAGA | |
|
| ||
| 20 | parecho_f | GGRATYAACCCATAYAARGAYTGGC |
| parecho_r | GAYCCTGATGGCATACCRCC | |
|
| ||
| 21 | flavi_f | CTSCTKTGTGACATMGGDGA |
| flavi_r | TACATCTCRTGYGTGGARTTBC | |
|
| ||
| 22 | alpha_f | ACWCTGTTTGTSAACACWGTVRTYA |
| alpha_r | CTYTTYARRGGGTCTGCSACHC | |
Figure 1Individual primer pairs testing. (1, 8, 18, 19, 28, 33, 36) DNA Ladder (100 bp, 200 bp, 300 bp, 400 bp, 500 bp, 800 bp); (2) ZEBOV, primers ebola_f/r; (3) negative control for primers ebola_f/r; (4) MARV, primers marburg_f/r, (5) negative control for primers marburg_f/r; (6) LASV, primers mammarena_f/r, (7) negative control for primers mammarena_f/r; (9) CCHFV, primers nairo_f/r; (10) PRMV, primers nairo_f/r; (11) negative control for primers nairo_f/r; (12) INKV, primers orthobunya_f/r; (13) negative control for primers orthobunya_f/r; (14) DOBV, primers hanta_f/r; (15) negative control for primers hanta_f/r; (16) KEMV, primers orbi_f1/f2/r; (17) negative control for primers orbi_f1/f2/r; (20) MERS CoV, primers alphacorona_f/r; (21) negative control for primers alphacorona_f/r; (22) MERS (Betacoronavirus), primers betacorona_1_f/r; (23) negative control for primers betacorona_1_f/r; (24) MERS (Betacoronavirus), primers betacorona_2_f/r; (25) negative control for primers betacorona_2_f/r; (26) MERS (Betacoronavirus), primers gammacorona_f/r; (27) negative control for primers gammacorona_f/r; (29) TBEV, primers flavi_f/r; (30) YFV, primers flavi_f/r; (31) JEV, primers flavi_f/r; (32) negative control for primers flavi_f/r; (34) RABV, primers lyssa_f/r; (35) negative control for primers lyssa_f/r.
Figure 2Reamplification of the multiplex PCR product; ß-C1: Betacoronavirus-specific primers; Colti: coltivirus-specific primers; Orthoreo: Orthoreovirus-specific primers; Sead.: Seadornavirus-specific primers; Orbi: orbivirus-specific primers.
Figure 3A schematic picture of the bioinformatics pipeline developed for the analysis of the NGS data in this study. Specific third-party tools that were employed are shown in parentheses.
Results of the bird samples sequencing with prior enrichment using the primers panel. The number of viral reads and total number of reads per corresponding sample are shown. For three samples (B11, B23, and B24, highlighted in bold) there was a specific primer pair in the panel present for the amplification of the detected genus. Closest viral homologs names and their GenBank IDs are shown.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| |||||
|
|
|
|
|
|
|
|
| |||||
|
|
|
|
|
|
|
|
| |||||
|
| Sanxia water strider virus 16 | Temminck's Stint | YP_009337377.1 | 4,062 | 103,488 |
|
| |||||
|
| Fowl aviadenovirus C∣Turkey aviadenovirus B | Temminck's Stint | ACL68145.1, ANB27700.1, ALY06332.1, ALY06333.1 | 54 | 103,488 |
|
| |||||
|
| Cimodo virus | Temminck's Stint | YP_009059075.1 | 160 | 103,488 |
|
| |||||
|
| Watercress white vein virus ∣ Turnip yellow mosaic virus | Taiga Bean Goose | AFC95826.1, AMH40128.1 | 636 | 28,468 |
|
| |||||
|
| Circovirus | Taiga Bean Goose | AEL87792.1 | 180 | 149,230 |
|
| |||||
|
| Lake Sarah-associated circular virus-32 | Taiga Bean Goose | ALE29729.1 | 10 | 42,348 |
|
| |||||
|
| Duck aviadenovirus B | Fieldfare | YP_009047166.1 | 198 | 159,604 |
|
| |||||
|
| Duck adenovirus A | Common tern | AGS11269.1, NP_044717.1 | 95,872 | 256,549 |
|
| |||||
|
| Tunis virus | Taiga Bean Goose | AMT75434.1 | 16 | 126,648 |
Figure 4A graphical representation of the percentage of the detected viral reads (grey) with respect to the total number of sequencing reads obtained for the samples listed in Table 3.
Comparison of the sequencing results of two samples with and without preenrichment by the primer's panel. In the latter case, the total number of reads was ~4.5 million per sample.
|
|
| |||
|---|---|---|---|---|
| With enrichment | Total RNA | With enrichment | Total RNA | |
|
| 43.1% | 0.2% | - | - |
|
| - | 0.2% | 0.1% | 0.05% |