Literature DB >> 24931825

Identification of bacteriophage virion proteins by the ANOVA feature selection and analysis.

Hui Ding1, Peng-Mian Feng, Wei Chen, Hao Lin.   

Abstract

The bacteriophage virion proteins play extremely important roles in the fate of host bacterial cells. Accurate identification of bacteriophage virion proteins is very important for understanding their functions and clarifying the lysis mechanism of bacterial cells. In this study, a new sequence-based method was developed to identify phage virion proteins. In the new method, the protein sequences were initially formulated by the g-gap dipeptide compositions. Subsequently, the analysis of variance (ANOVA) with incremental feature selection (IFS) was used to search for the optimal feature set. It was observed that, in jackknife cross-validation, the optimal feature set including 160 optimized features can produce the maximum accuracy of 85.02%. By performing feature analysis, we found that the correlation between two amino acids with one gap was more important than other correlations for phage virion protein prediction and that some of the 1-gap dipeptides were important and mainly contributed to the virion protein prediction. This analysis will provide novel insights into the function of phage virion proteins. On the basis of the proposed method, an online web-server, PVPred, was established and can be freely accessed from the website (http://lin.uestc.edu.cn/server/PVPred). We believe that the PVPred will become a powerful tool to study phage virion proteins and to guide the related experimental validations.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24931825     DOI: 10.1039/c4mb00316k

Source DB:  PubMed          Journal:  Mol Biosyst        ISSN: 1742-2051


  46 in total

1.  Sequence-based predictive modeling to identify cancerlectins.

Authors:  Hong-Yan Lai; Xin-Xin Chen; Wei Chen; Hua Tang; Hao Lin
Journal:  Oncotarget       Date:  2017-04-25

2.  PHYPred: a tool for identifying bacteriophage enzymes and hydrolases.

Authors:  Hui Ding; Wuritu Yang; Hua Tang; Peng-Mian Feng; Jian Huang; Wei Chen; Hao Lin
Journal:  Virol Sin       Date:  2016-08       Impact factor: 4.327

3.  MathFeature: feature extraction package for DNA, RNA and protein sequences based on mathematical descriptors.

Authors:  Robson P Bonidia; Douglas S Domingues; Danilo S Sanches; André C P L F de Carvalho
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

4.  DeePVP: Identification and classification of phage virion proteins using deep learning.

Authors:  Zhencheng Fang; Tao Feng; Hongwei Zhou; Muxuan Chen
Journal:  Gigascience       Date:  2022-08-11       Impact factor: 7.658

5.  Identifying DNA-binding proteins by combining support vector machine and PSSM distance transformation.

Authors:  Ruifeng Xu; Jiyun Zhou; Hongpeng Wang; Yulan He; Xiaolong Wang; Bin Liu
Journal:  BMC Syst Biol       Date:  2015-02-06

6.  Prediction of MicroRNA-Disease Associations Based on Social Network Analysis Methods.

Authors:  Quan Zou; Jinjin Li; Qingqi Hong; Ziyu Lin; Yun Wu; Hua Shi; Ying Ju
Journal:  Biomed Res Int       Date:  2015-07-26       Impact factor: 3.411

7.  An Ensemble Method to Distinguish Bacteriophage Virion from Non-Virion Proteins Based on Protein Sequence Characteristics.

Authors:  Lina Zhang; Chengjin Zhang; Rui Gao; Runtao Yang
Journal:  Int J Mol Sci       Date:  2015-09-09       Impact factor: 5.923

Review 8.  Survey of Natural Language Processing Techniques in Bioinformatics.

Authors:  Zhiqiang Zeng; Hua Shi; Yun Wu; Zhiling Hong
Journal:  Comput Math Methods Med       Date:  2015-10-07       Impact factor: 2.238

9.  Predicting cancerlectins by the optimal g-gap dipeptides.

Authors:  Hao Lin; Wei-Xin Liu; Jiao He; Xin-Hui Liu; Hui Ding; Wei Chen
Journal:  Sci Rep       Date:  2015-12-09       Impact factor: 4.379

Review 10.  Application of machine learning in bacteriophage research.

Authors:  Yousef Nami; Nazila Imeni; Bahman Panahi
Journal:  BMC Microbiol       Date:  2021-06-26       Impact factor: 3.605

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.