Literature DB >> 35950840

DeePVP: Identification and classification of phage virion proteins using deep learning.

Zhencheng Fang1, Tao Feng1, Hongwei Zhou1, Muxuan Chen1.   

Abstract

BACKGROUND: Many biological properties of phages are determined by phage virion proteins (PVPs), and the poor annotation of PVPs is a bottleneck for many areas of viral research, such as viral phylogenetic analysis, viral host identification, and antibacterial drug design. Because of the high diversity of PVP sequences, the PVP annotation of a phage genome remains a particularly challenging bioinformatic task.
FINDINGS: Based on deep learning, we developed DeePVP. The main module of DeePVP aims to discriminate PVPs from non-PVPs within a phage genome, while the extended module of DeePVP can further classify predicted PVPs into the 10 major classes of PVPs. Compared with the present state-of-the-art tools, the main module of DeePVP performs better, with a 9.05% higher F1-score in the PVP identification task. Moreover, the overall accuracy of the extended module of DeePVP in the PVP classification task is approximately 3.72% higher than that of PhANNs. Two application cases show that the predictions of DeePVP are more reliable and can better reveal the compact PVP-enriched region than the current state-of-the-art tools. Particularly, in the Escherichia phage phiEC1 genome, a novel PVP-enriched region that is conserved in many other Escherichia phage genomes was identified, indicating that DeePVP will be a useful tool for the analysis of phage genomic structures.
CONCLUSIONS: DeePVP outperforms state-of-the-art tools. The program is optimized in both a virtual machine with graphical user interface and a docker so that the tool can be easily run by noncomputer professionals. DeePVP is freely available at https://github.com/fangzcbio/DeePVP/.
© The Author(s) 2022. Published by Oxford University Press GigaScience.

Entities:  

Keywords:  deep learning; phage virion protein; protein annotation

Mesh:

Year:  2022        PMID: 35950840      PMCID: PMC9366990          DOI: 10.1093/gigascience/giac076

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   7.658


  36 in total

Review 1.  Viral metagenomics.

Authors:  Robert A Edwards; Forest Rohwer
Journal:  Nat Rev Microbiol       Date:  2005-06       Impact factor: 60.633

2.  PPR-Meta: a tool for identifying phages and plasmids from metagenomic fragments using deep learning.

Authors:  Zhencheng Fang; Jie Tan; Shufang Wu; Mo Li; Congmin Xu; Zhongjie Xie; Huaiqiu Zhu
Journal:  Gigascience       Date:  2019-06-01       Impact factor: 6.524

3.  Exploring the contribution of bacteriophages to antibiotic resistance.

Authors:  Itziar Lekunberri; Jèssica Subirats; Carles M Borrego; José Luis Balcázar
Journal:  Environ Pollut       Date:  2016-11-24       Impact factor: 8.071

4.  Identification of bacteriophage virion proteins by the ANOVA feature selection and analysis.

Authors:  Hui Ding; Peng-Mian Feng; Wei Chen; Hao Lin
Journal:  Mol Biosyst       Date:  2014-08

Review 5.  Phage diversity, genomics and phylogeny.

Authors:  Moïra B Dion; Frank Oechslin; Sylvain Moineau
Journal:  Nat Rev Microbiol       Date:  2020-02-03       Impact factor: 60.633

6.  An Ensemble Method to Distinguish Bacteriophage Virion from Non-Virion Proteins Based on Protein Sequence Characteristics.

Authors:  Lina Zhang; Chengjin Zhang; Rui Gao; Runtao Yang
Journal:  Int J Mol Sci       Date:  2015-09-09       Impact factor: 5.923

7.  Basset: learning the regulatory code of the accessible genome with deep convolutional neural networks.

Authors:  David R Kelley; Jasper Snoek; John L Rinn
Journal:  Genome Res       Date:  2016-05-03       Impact factor: 9.043

8.  PhANNs, a fast and accurate tool and web server to classify phage structural proteins.

Authors:  Vito Adrian Cantu; Peter Salamon; Victor Seguritan; Jackson Redfield; David Salamon; Robert A Edwards; Anca M Segall
Journal:  PLoS Comput Biol       Date:  2020-11-02       Impact factor: 4.475

9.  Artificial neural networks trained to detect viral and phage structural proteins.

Authors:  Victor Seguritan; Nelson Alves; Michael Arnoult; Amy Raymond; Don Lorimer; Alex B Burgin; Peter Salamon; Anca M Segall
Journal:  PLoS Comput Biol       Date:  2012-08-23       Impact factor: 4.475

10.  Identifying Phage Virion Proteins by Using Two-Step Feature Selection Methods.

Authors:  Jiu-Xin Tan; Fu-Ying Dao; Hao Lv; Peng-Mian Feng; Hui Ding
Journal:  Molecules       Date:  2018-08-10       Impact factor: 4.411

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.