Literature DB >> 33613485

VirionFinder: Identification of Complete and Partial Prokaryote Virus Virion Protein From Virome Data Using the Sequence and Biochemical Properties of Amino Acids.

Zhencheng Fang1,2, Hongwei Zhou1,3.   

Abstract

Viruses are some of the most abundant biological entities on Earth, and prokaryote virus are the dominant members of the viral community. Because of the diversity of prokaryote virus, functional annotation cannot be performed on a large number of genes from newly discovered prokaryote virus by searching the current database; therefore, the development of an alignment-free algorithm for functional annotation of prokaryote virus proteins is important to understand the viral community. The identification of prokaryote virus proteins (PVVPs) is a critical step for many viral analyses, such as species classification, phylogenetic analysis and the exploration of how prokaryote virus interact with their hosts. Although a series of PVVP prediction tools have been developed, the performance of these tools is still not satisfactory. Moreover, viral metagenomic data contains fragmented sequences, leading to the existence of some incomplete genes. Therefore, a tool that can identify partial prokaryote virus proteins is also needed. In this work, we present a novel algorithm, called VirionFinder, to identify the complete and partial PVVPs from non-prokaryote virus virion proteins (non-PVVPs). VirionFinder uses the sequence and biochemical properties of 20 amino acids as the mathematical model to encode the protein sequences and uses a deep learning technique to identify whether a given protein is a PVVP. Compared with the state-of-the-art tools using artificial benchmark datasets, the results show that under the same specificity (Sp), the sensitivity (Sn) of VirionFinder is approximately 10-34% much higher than the Sn of these tools on both complete and partial proteins. When evaluating related tools using real virome data, the recognition rate of PVVP-like sequences of VirionFinder is also much higher than that of the other tools. We expect that VirionFinder will be a powerful tool for identifying novel virion proteins from both complete prokaryote virus genomes and viral metagenomic data. VirionFinder is freely available at https://github.com/zhenchengfang/VirionFinder.
Copyright © 2021 Fang and Zhou.

Entities:  

Keywords:  deep learning; gene function annotation; metagenome; prokaryote virus virion protein; virome

Year:  2021        PMID: 33613485      PMCID: PMC7894196          DOI: 10.3389/fmicb.2021.615711

Source DB:  PubMed          Journal:  Front Microbiol        ISSN: 1664-302X            Impact factor:   5.640


  30 in total

1.  Exploring the contribution of bacteriophages to antibiotic resistance.

Authors:  Itziar Lekunberri; Jèssica Subirats; Carles M Borrego; José Luis Balcázar
Journal:  Environ Pollut       Date:  2016-11-24       Impact factor: 8.071

2.  Viral metagenomics reveal blooms of anelloviruses in the respiratory tract of lung transplant recipients.

Authors:  J C Young; C Chehoud; K Bittinger; A Bailey; J M Diamond; E Cantu; A R Haas; A Abbas; L Frye; J D Christie; F D Bushman; R G Collman
Journal:  Am J Transplant       Date:  2014-11-17       Impact factor: 8.086

3.  Disease-specific alterations in the enteric virome in inflammatory bowel disease.

Authors:  Jason M Norman; Scott A Handley; Megan T Baldridge; Lindsay Droit; Catherine Y Liu; Brian C Keller; Amal Kambal; Cynthia L Monaco; Guoyan Zhao; Phillip Fleshner; Thaddeus S Stappenbeck; Dermot P B McGovern; Ali Keshavarzian; Ece A Mutlu; Jenny Sauk; Dirk Gevers; Ramnik J Xavier; David Wang; Miles Parkes; Herbert W Virgin
Journal:  Cell       Date:  2015-01-22       Impact factor: 41.582

4.  cBar: a computer program to distinguish plasmid-derived from chromosome-derived sequence fragments in metagenomics data.

Authors:  Fengfeng Zhou; Ying Xu
Journal:  Bioinformatics       Date:  2010-06-10       Impact factor: 6.937

5.  GeneMark: web software for gene finding in prokaryotes, eukaryotes and viruses.

Authors:  John Besemer; Mark Borodovsky
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

6.  VirSorter: mining viral signal from microbial genomic data.

Authors:  Simon Roux; Francois Enault; Bonnie L Hurwitz; Matthew B Sullivan
Journal:  PeerJ       Date:  2015-05-28       Impact factor: 2.984

7.  An Ensemble Method to Distinguish Bacteriophage Virion from Non-Virion Proteins Based on Protein Sequence Characteristics.

Authors:  Lina Zhang; Chengjin Zhang; Rui Gao; Runtao Yang
Journal:  Int J Mol Sci       Date:  2015-09-09       Impact factor: 5.923

8.  Choice of assembly software has a critical impact on virome characterisation.

Authors:  Thomas D S Sutton; Adam G Clooney; Feargal J Ryan; R Paul Ross; Colin Hill
Journal:  Microbiome       Date:  2019-01-28       Impact factor: 14.650

9.  PVPred-SCM: Improved Prediction and Analysis of Phage Virion Proteins Using a Scoring Card Method.

Authors:  Phasit Charoenkwan; Sakawrat Kanthawong; Nalini Schaduangrat; Janchai Yana; Watshara Shoombuatong
Journal:  Cells       Date:  2020-02-03       Impact factor: 6.600

10.  PlasGUN: gene prediction in plasmid metagenomic short reads using deep learning.

Authors:  Zhencheng Fang; Jie Tan; Shufang Wu; Mo Li; Chunhui Wang; Yongchu Liu; Huaiqiu Zhu
Journal:  Bioinformatics       Date:  2020-05-01       Impact factor: 6.937

View more
  5 in total

1.  DeePVP: Identification and classification of phage virion proteins using deep learning.

Authors:  Zhencheng Fang; Tao Feng; Hongwei Zhou; Muxuan Chen
Journal:  Gigascience       Date:  2022-08-11       Impact factor: 7.658

Review 2.  Computational Tools for the Analysis of Uncultivated Phage Genomes.

Authors:  Juan Sebastián Andrade-Martínez; Laura Carolina Camelo Valera; Luis Alberto Chica Cárdenas; Laura Forero-Junco; Gamaliel López-Leal; J Leonardo Moreno-Gallego; Guillermo Rangel-Pineros; Alejandro Reyes
Journal:  Microbiol Mol Biol Rev       Date:  2022-03-21       Impact factor: 13.044

Review 3.  Large-scale comparative review and assessment of computational methods for phage virion proteins identification.

Authors:  Muhammad Kabir; Chanin Nantasenamat; Sakawrat Kanthawong; Phasit Charoenkwan; Watshara Shoombuatong
Journal:  EXCLI J       Date:  2022-01-03       Impact factor: 4.068

4.  SCORPION is a stacking-based ensemble learning framework for accurate prediction of phage virion proteins.

Authors:  Saeed Ahmad; Phasit Charoenkwan; Julian M W Quinn; Mohammad Ali Moni; Md Mehedi Hasan; Pietro Lio'; Watshara Shoombuatong
Journal:  Sci Rep       Date:  2022-03-08       Impact factor: 4.379

5.  Diversity of Pseudomonas aeruginosa Temperate Phages.

Authors:  Genevieve Johnson; Swarnali Banerjee; Catherine Putonti
Journal:  mSphere       Date:  2022-02-23       Impact factor: 4.389

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.