Literature DB >> 33441133

Prokaryotic virus host predictor: a Gaussian model for host prediction of prokaryotic viruses in metagenomics.

Congyu Lu1, Zheng Zhang1, Zena Cai1, Zhaozhong Zhu1, Ye Qiu1, Aiping Wu2,3, Taijiao Jiang2,3, Heping Zheng1, Yousong Peng4.   

Abstract

BACKGROUND: Viruses are ubiquitous biological entities, estimated to be the largest reservoirs of unexplored genetic diversity on Earth. Full functional characterization and annotation of newly discovered viruses requires tools to enable taxonomic assignment, the range of hosts, and biological properties of the virus. Here we focus on prokaryotic viruses, which include phages and archaeal viruses, and for which identifying the viral host is an essential step in characterizing the virus, as the virus relies on the host for survival. Currently, the method for determining the viral host is either to culture the virus, which is low-throughput, time-consuming, and expensive, or to computationally predict the viral hosts, which needs improvements at both accuracy and usability. Here we develop a Gaussian model to predict hosts for prokaryotic viruses with better performances than previous computational methods.
RESULTS: We present here Prokaryotic virus Host Predictor (PHP), a software tool using a Gaussian model, to predict hosts for prokaryotic viruses using the differences of k-mer frequencies between viral and host genomic sequences as features. PHP gave a host prediction accuracy of 34% (genus level) on the VirHostMatcher benchmark dataset and a host prediction accuracy of 35% (genus level) on a new dataset containing 671 viruses and 60,105 prokaryotic genomes. The prediction accuracy exceeded that of two alignment-free methods (VirHostMatcher and WIsH, 28-34%, genus level). PHP also outperformed these two alignment-free methods much (24-38% vs 18-20%, genus level) when predicting hosts for prokaryotic viruses which cannot be predicted by the BLAST-based or the CRISPR-spacer-based methods alone. Requiring a minimal score for making predictions (thresholding) and taking the consensus of the top 30 predictions further improved the host prediction accuracy of PHP.
CONCLUSIONS: The Prokaryotic virus Host Predictor software tool provides an intuitive and user-friendly API for the Gaussian model described herein. This work will facilitate the rapid identification of hosts for newly identified prokaryotic viruses in metagenomic studies.

Entities:  

Keywords:  Bioinformatics; Gaussian model; Host prediction; Metagenomics; Prokaryotic viruses; Virome

Year:  2021        PMID: 33441133      PMCID: PMC7807511          DOI: 10.1186/s12915-020-00938-6

Source DB:  PubMed          Journal:  BMC Biol        ISSN: 1741-7007            Impact factor:   7.431


  26 in total

Review 1.  Viral metagenomics.

Authors:  Robert A Edwards; Forest Rohwer
Journal:  Nat Rev Microbiol       Date:  2005-06       Impact factor: 60.633

2.  Marine DNA Viral Macro- and Microdiversity from Pole to Pole.

Authors:  Ann C Gregory; Ahmed A Zayed; Nádia Conceição-Neto; Ben Temperton; Ben Bolduc; Adriana Alberti; Mathieu Ardyna; Ksenia Arkhipova; Margaux Carmichael; Corinne Cruaud; Céline Dimier; Guillermo Domínguez-Huerta; Joannie Ferland; Stefanie Kandels; Yunxiao Liu; Claudie Marec; Stéphane Pesant; Marc Picheral; Sergey Pisarev; Julie Poulain; Jean-Éric Tremblay; Dean Vik; Marcel Babin; Chris Bowler; Alexander I Culley; Colomban de Vargas; Bas E Dutilh; Daniele Iudicone; Lee Karp-Boss; Simon Roux; Shinichi Sunagawa; Patrick Wincker; Matthew B Sullivan
Journal:  Cell       Date:  2019-04-25       Impact factor: 41.582

Review 3.  Metagenomics for the discovery of novel human viruses.

Authors:  Patrick Tang; Charles Chiu
Journal:  Future Microbiol       Date:  2010-02       Impact factor: 3.165

Review 4.  Molecular and Evolutionary Determinants of Bacteriophage Host Range.

Authors:  Patrick A de Jonge; Franklin L Nobrega; Stan J J Brouns; Bas E Dutilh
Journal:  Trends Microbiol       Date:  2018-09-01       Impact factor: 17.079

5.  HostPhinder: A Phage Host Prediction Tool.

Authors:  Julia Villarroel; Kortine Annina Kleinheinz; Vanessa Isabell Jurtz; Henrike Zschach; Ole Lund; Morten Nielsen; Mette Voldby Larsen
Journal:  Viruses       Date:  2016-05-04       Impact factor: 5.048

Review 6.  A Review of Phage Therapy against Bacterial Pathogens of Aquatic and Terrestrial Organisms.

Authors:  Janis Doss; Kayla Culbertson; Delilah Hahn; Joanna Camacho; Nazir Barekzi
Journal:  Viruses       Date:  2017-03-18       Impact factor: 5.048

7.  Database resources of the National Center for Biotechnology Information.

Authors:  Eric W Sayers; Richa Agarwala; Evan E Bolton; J Rodney Brister; Kathi Canese; Karen Clark; Ryan Connor; Nicolas Fiorini; Kathryn Funk; Timothy Hefferon; J Bradley Holmes; Sunghwan Kim; Avi Kimchi; Paul A Kitts; Stacy Lathrop; Zhiyong Lu; Thomas L Madden; Aron Marchler-Bauer; Lon Phan; Valerie A Schneider; Conrad L Schoch; Kim D Pruitt; James Ostell
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

8.  CRISPR recognition tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats.

Authors:  Charles Bland; Teresa L Ramsey; Fareedah Sabree; Micheal Lowe; Kyndall Brown; Nikos C Kyrpides; Philip Hugenholtz
Journal:  BMC Bioinformatics       Date:  2007-06-18       Impact factor: 3.169

Review 9.  Understanding bacteriophage specificity in natural microbial communities.

Authors:  Britt Koskella; Sean Meaden
Journal:  Viruses       Date:  2013-03-11       Impact factor: 5.048

Review 10.  Computational approaches to predict bacteriophage-host relationships.

Authors:  Robert A Edwards; Katelyn McNair; Karoline Faust; Jeroen Raes; Bas E Dutilh
Journal:  FEMS Microbiol Rev       Date:  2015-12-09       Impact factor: 16.408

View more
  10 in total

1.  Accurate identification of bacteriophages from metagenomic data using Transformer.

Authors:  Jiayu Shang; Xubo Tang; Ruocheng Guo; Yanni Sun
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

2.  Phage-bacterial contig association prediction with a convolutional neural network.

Authors:  Tianqi Tang; Shengwei Hou; Jed A Fuhrman; Fengzhu Sun
Journal:  Bioinformatics       Date:  2022-06-24       Impact factor: 6.931

3.  Predicting the hosts of prokaryotic viruses using GCN-based semi-supervised learning.

Authors:  Jiayu Shang; Yanni Sun
Journal:  BMC Biol       Date:  2021-11-24       Impact factor: 7.431

4.  Taxonomy-aware, sequence similarity ranking reliably predicts phage-host relationships.

Authors:  Andrzej Zielezinski; Jakub Barylski; Wojciech M Karlowski
Journal:  BMC Biol       Date:  2021-10-08       Impact factor: 7.431

5.  Prediction of coronavirus 3C-like protease cleavage sites using machine-learning algorithms.

Authors:  Huiting Chen; Zhaozhong Zhu; Ye Qiu; Xingyi Ge; Heping Zheng; Yousong Peng
Journal:  Virol Sin       Date:  2022-05-02       Impact factor: 6.947

6.  Structural characterization of a soil viral auxiliary metabolic gene product - a functional chitosanase.

Authors:  Ruonan Wu; Clyde A Smith; Garry W Buchko; Ian K Blaby; David Paez-Espino; Nikos C Kyrpides; Yasuo Yoshikuni; Jason E McDermott; Kirsten S Hofmockel; John R Cort; Janet K Jansson
Journal:  Nat Commun       Date:  2022-09-19       Impact factor: 17.694

7.  Computational Viromics: Applications of the Computational Biology in Viromics Studies.

Authors:  Congyu Lu; Yousong Peng
Journal:  Virol Sin       Date:  2021-05-31       Impact factor: 4.327

8.  HoPhage: an ab initio tool for identifying hosts of phage fragments from metaviromes.

Authors:  Jie Tan; Zhencheng Fang; Shufang Wu; Qian Guo; Xiaoqing Jiang; Huaiqiu Zhu
Journal:  Bioinformatics       Date:  2021-08-12       Impact factor: 6.937

Review 9.  Computational Prediction of Bacteriophage Host Ranges.

Authors:  Cyril J Versoza; Susanne P Pfeifer
Journal:  Microorganisms       Date:  2022-01-12

Review 10.  Phages in the Gut Ecosystem.

Authors:  Michele Zuppi; Heather L Hendrickson; Justin M O'Sullivan; Tommi Vatanen
Journal:  Front Cell Infect Microbiol       Date:  2022-01-04       Impact factor: 5.293

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.