Literature DB >> 35136930

RdRp-based sensitive taxonomic classification of RNA viruses for metagenomic data.

Xubo Tang1, Jiayu Shang1, Yanni Sun1.   

Abstract

With advances in library construction protocols and next-generation sequencing technologies, viral metagenomic sequencing has become the major source for novel virus discovery. Conducting taxonomic classification for metagenomic data is an important means to characterize the viral composition in the underlying samples. However, RNA viruses are abundant and highly diverse, jeopardizing the sensitivity of comparison-based classification methods. To improve the sensitivity of read-level taxonomic classification, we developed an RNA-dependent RNA polymerase (RdRp) gene-based read classification tool RdRpBin. It combines alignment-based strategy with machine learning models in order to fully exploit the sequence properties of RdRp. We tested our method and compared its performance with the state-of-the-art tools on the simulated and real sequencing data. RdRpBin competes favorably with all. In particular, when the query RNA viruses share low sequence similarity with the known viruses ($\sim 0.4$), our tool can still maintain a higher F-score than the state-of-the-art tools. The experimental results on real data also showed that RdRpBin can classify more RNA viral reads with a relatively low false-positive rate. Thus, RdRpBin can be utilized to classify novel and diverged RNA viruses.
© The Author(s) 2022. Published by Oxford University Press.

Entities:  

Keywords:  Graph Neural Network; Probabilistic Relational Neighbor Classifier; RNA virus; RNA-dependent RNA polymerase

Mesh:

Substances:

Year:  2022        PMID: 35136930      PMCID: PMC8921650          DOI: 10.1093/bib/bbac011

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  29 in total

1.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

Authors:  Weizhong Li; Adam Godzik
Journal:  Bioinformatics       Date:  2006-05-26       Impact factor: 6.937

2.  Fitting a mixture model by expectation maximization to discover motifs in biopolymers.

Authors:  T L Bailey; C Elkan
Journal:  Proc Int Conf Intell Syst Mol Biol       Date:  1994

3.  Detecting contamination in viromes using ViromeQC.

Authors:  Moreno Zolfo; Federica Pinto; Francesco Asnicar; Paolo Manghi; Adrian Tett; Frederic D Bushman; Nicola Segata
Journal:  Nat Biotechnol       Date:  2019-12       Impact factor: 54.908

4.  Sensitive protein alignments at tree-of-life scale using DIAMOND.

Authors:  Benjamin Buchfink; Klaus Reuter; Hajk-Georg Drost
Journal:  Nat Methods       Date:  2021-04-07       Impact factor: 28.547

5.  drVM: a new tool for efficient genome assembly of known eukaryotic viruses from metagenomes.

Authors:  Hsin-Hung Lin; Yu-Chieh Liao
Journal:  Gigascience       Date:  2017-02-01       Impact factor: 6.524

Review 6.  RNA Dependent RNA Polymerases: Insights from Structure, Function and Evolution.

Authors:  Sangita Venkataraman; Burra V L S Prasad; Ramasamy Selvarajan
Journal:  Viruses       Date:  2018-02-10       Impact factor: 5.048

7.  fastp: an ultra-fast all-in-one FASTQ preprocessor.

Authors:  Shifu Chen; Yanqing Zhou; Yaru Chen; Jia Gu
Journal:  Bioinformatics       Date:  2018-09-01       Impact factor: 6.937

8.  Endangered wild salmon infected by newly discovered viruses.

Authors:  Gideon J Mordecai; Kristina M Miller; Emiliano Di Cicco; Angela D Schulze; Karia H Kaukinen; Tobi J Ming; Shaorong Li; Amy Tabata; Amy Teffer; David A Patterson; Hugh W Ferguson; Curtis A Suttle
Journal:  Elife       Date:  2019-09-03       Impact factor: 8.140

9.  Fast and sensitive taxonomic classification for metagenomics with Kaiju.

Authors:  Peter Menzel; Kim Lee Ng; Anders Krogh
Journal:  Nat Commun       Date:  2016-04-13       Impact factor: 14.919

10.  Global trends in emerging infectious diseases.

Authors:  Kate E Jones; Nikkita G Patel; Marc A Levy; Adam Storeygard; Deborah Balk; John L Gittleman; Peter Daszak
Journal:  Nature       Date:  2008-02-21       Impact factor: 49.962

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.