Literature DB >> 18175049

Combing ontologies and dipeptide composition for predicting DNA-binding proteins.

Loris Nanni1, Alessandra Lumini.   

Abstract

Given a novel protein it is very important to know if it is a DNA-binding protein, because DNA-binding proteins participate in the fundamental role to regulate gene expression. In this work, we propose a parallel fusion between a classifier trained using the features extracted from the gene ontology database and a classifier trained using the dipeptide composition of the protein. As classifiers the support vector machine (SVM) and the 1-nearest neighbour are used. Matthews's correlation coefficient obtained by our fusion method is approximately 0.97 when the jackknife cross-validation is used; this result outperforms the best performance obtained in the literature (0.924) using the same dataset where the SVM is trained using only the Chou's pseudo amino acid based features. In this work also the area under the ROC-curve (AUC) is reported and our results show that the fusion permits to obtain a very interesting 0.995 AUC. In particular we want to stress that our fusion obtains a 5% false negative with a 0% of false positive. Matthews's correlation coefficient obtained using the single best GO-number is only 0.7211 and hence it is not possible to use the gene ontology database as a simple lookup table. Finally, we test the complementarity of the two tested feature extraction methods using the Q-statistic. We obtain the very interesting result of 0.58, which means that the features extracted from the gene ontology database and the features extracted from the amino acid sequence are partially independent and that their parallel fusion should be studied more.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18175049     DOI: 10.1007/s00726-007-0016-3

Source DB:  PubMed          Journal:  Amino Acids        ISSN: 0939-4451            Impact factor:   3.520


  10 in total

1.  DNAPred_Prot: Identification of DNA-Binding Proteins Using Composition- and Position-Based Features.

Authors:  Omar Barukab; Yaser Daanial Khan; Sher Afzal Khan; Kuo-Chen Chou
Journal:  Appl Bionics Biomech       Date:  2022-04-13       Impact factor: 1.664

2.  iDNA-Prot: identification of DNA binding proteins using random forest with grey model.

Authors:  Wei-Zhong Lin; Jian-An Fang; Xuan Xiao; Kuo-Chen Chou
Journal:  PLoS One       Date:  2011-09-15       Impact factor: 3.240

3.  Identifying DNA-binding proteins by combining support vector machine and PSSM distance transformation.

Authors:  Ruifeng Xu; Jiyun Zhou; Hongpeng Wang; Yulan He; Xiaolong Wang; Bin Liu
Journal:  BMC Syst Biol       Date:  2015-02-06

4.  PredDBP-Stack: Prediction of DNA-Binding Proteins from HMM Profiles using a Stacked Ensemble Method.

Authors:  Jun Wang; Huiwen Zheng; Yang Yang; Wanyue Xiao; Taigang Liu
Journal:  Biomed Res Int       Date:  2020-04-13       Impact factor: 3.411

5.  An improved sequence based prediction protocol for DNA-binding proteins using SVM and comprehensive feature analysis.

Authors:  Chuanxin Zou; Jiayu Gong; Honglin Li
Journal:  BMC Bioinformatics       Date:  2013-03-09       Impact factor: 3.169

6.  Sequence based prediction of DNA-binding proteins based on hybrid feature selection using random forest and Gaussian naïve Bayes.

Authors:  Wangchao Lou; Xiaoqing Wang; Fan Chen; Yixiao Chen; Bo Jiang; Hua Zhang
Journal:  PLoS One       Date:  2014-01-24       Impact factor: 3.240

7.  iEzy-drug: a web server for identifying the interaction between enzymes and drugs in cellular networking.

Authors:  Jian-Liang Min; Xuan Xiao; Kuo-Chen Chou
Journal:  Biomed Res Int       Date:  2013-11-26       Impact factor: 3.411

8.  Improved detection of DNA-binding proteins via compression technology on PSSM information.

Authors:  Yubo Wang; Yijie Ding; Fei Guo; Leyi Wei; Jijun Tang
Journal:  PLoS One       Date:  2017-09-29       Impact factor: 3.240

9.  Prediction of RNA- and DNA-Binding Proteins Using Various Machine Learning Classifiers.

Authors:  Mehdi Poursheikhali Asghari; Parviz Abdolmaleki
Journal:  Avicenna J Med Biotechnol       Date:  2019 Jan-Mar

10.  HMMPred: Accurate Prediction of DNA-Binding Proteins Based on HMM Profiles and XGBoost Feature Selection.

Authors:  Xiuzhi Sang; Wanyue Xiao; Huiwen Zheng; Yang Yang; Taigang Liu
Journal:  Comput Math Methods Med       Date:  2020-03-28       Impact factor: 2.238

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.