Literature DB >> 21860064

Identifying bacterial virulent proteins by fusing a set of classifiers based on variants of Chou's pseudo amino acid composition and on evolutionary information.

Loris Nanni1, Alessandra Lumini, Dinesh Gupta, Aarti Garg.   

Abstract

The availability of a reliable prediction method for prediction of bacterial virulent proteins has several important applications in research efforts targeted aimed at finding novel drug targets, vaccine candidates, and understanding virulence mechanisms in pathogens. In this work, we have studied several feature extraction approaches for representing proteins and propose a novel bacterial virulent protein prediction method, based on an ensemble of classifiers where the features are extracted directly from the amino acid sequence and from the evolutionary information of a given protein. We have evaluated and compared several ensembles obtained by combining six feature extraction methods and several classification approaches based on two general purpose classifiers (i.e., Support Vector Machine and a variant of input decimated ensemble) and their random subspace version. An extensive evaluation was performed according to a blind testing protocol, where the parameters of the system are optimized using the training set and the system is validated in three different independent data sets, allowing selection of the most performing system and demonstrating the validity of the proposed method. Based on the results obtained using the blind test protocol, it is interesting to note that even if in each independent data set the most performing stand-alone method is not always the same, the fusion of different methods enhances prediction efficiency in all the tested independent data sets.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21860064     DOI: 10.1109/TCBB.2011.117

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  38 in total

1.  iRSpot-GAEnsC: identifing recombination spots via ensemble classifier and extending the concept of Chou's PseAAC to formulate DNA samples.

Authors:  Muhammad Kabir; Maqsood Hayat
Journal:  Mol Genet Genomics       Date:  2015-08-30       Impact factor: 3.291

2.  DP-BINDER: machine learning model for prediction of DNA-binding proteins by fusing evolutionary and physicochemical information.

Authors:  Farman Ali; Saeed Ahmed; Zar Nawab Khan Swati; Shahid Akbar
Journal:  J Comput Aided Mol Des       Date:  2019-05-23       Impact factor: 3.686

3.  EuLoc: a web-server for accurately predict protein subcellular localization in eukaryotes by incorporating various features of sequence segments into the general form of Chou's PseAAC.

Authors:  Tzu-Hao Chang; Li-Ching Wu; Tzong-Yi Lee; Shu-Pin Chen; Hsien-Da Huang; Jorng-Tzong Horng
Journal:  J Comput Aided Mol Des       Date:  2013-01-03       Impact factor: 3.686

4.  Protein remote homology detection by combining Chou's distance-pair pseudo amino acid composition and principal component analysis.

Authors:  Bin Liu; Junjie Chen; Xiaolong Wang
Journal:  Mol Genet Genomics       Date:  2015-04-21       Impact factor: 3.291

Review 5.  Some illuminating remarks on molecular genetics and genomics as well as drug development.

Authors:  Kuo-Chen Chou
Journal:  Mol Genet Genomics       Date:  2020-01-01       Impact factor: 3.291

6.  A multilabel model based on Chou's pseudo-amino acid composition for identifying membrane proteins with both single and multiple functional types.

Authors:  Chao Huang; Jing-Qi Yuan
Journal:  J Membr Biol       Date:  2013-04-02       Impact factor: 1.843

7.  Comprehensive comparative analysis and identification of RNA-binding protein domains: multi-class classification and feature selection.

Authors:  Samad Jahandideh; Vinodh Srinivasasainagendra; Degui Zhi
Journal:  J Theor Biol       Date:  2012-08-03       Impact factor: 2.691

8.  Analysis of protein determinants of host-specific infection properties of polyomaviruses using machine learning.

Authors:  Myeongji Cho; Hayeon Kim; Hyeon S Son
Journal:  Genes Genomics       Date:  2021-03-01       Impact factor: 1.839

9.  Naïve Bayes classifier with feature selection to identify phage virion proteins.

Authors:  Peng-Mian Feng; Hui Ding; Wei Chen; Hao Lin
Journal:  Comput Math Methods Med       Date:  2013-05-15       Impact factor: 2.238

10.  Predicting secretory proteins of malaria parasite by incorporating sequence evolution information into pseudo amino acid composition via grey system model.

Authors:  Wei-Zhong Lin; Jian-An Fang; Xuan Xiao; Kuo-Chen Chou
Journal:  PLoS One       Date:  2012-11-26       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.