Literature DB >> 22868682

Sequence-based prediction of DNA-binding residues in proteins with conservation and correlation information.

Xin Ma1, Jing Guo, Hong-De Liu, Jian-Ming Xie, Xiao Sun.   

Abstract

The recognition of DNA-binding residues in proteins is critical to our understanding of the mechanisms of DNA-protein interactions, gene expression, and for guiding drug design. Therefore, a prediction method DNABR (DNA Binding Residues) is proposed for predicting DNA-binding residues in protein sequences using the random forest (RF) classifier with sequence-based features. Two types of novel sequence features are proposed in this study, which reflect the information about the conservation of physicochemical properties of the amino acids, and the correlation of amino acids between different sequence positions in terms of physicochemical properties. The first type of feature uses the evolutionary information combined with the conservation of physicochemical properties of the amino acids while the second reflects the dependency effect of amino acids with regards to polarity charge and hydrophobic properties in the protein sequences. Those two features and an orthogonal binary vector which reflect the characteristics of 20 types of amino acids are used to build the DNABR, a model to predict DNA-binding residues in proteins. The DNABR model achieves a value of 0.6586 for Matthew’s correlation coefficient (MCC) and 93.04 percent overall accuracy (ACC) with a68.47 percent sensitivity (SE) and 98.16 percent specificity (SP), respectively. The comparisons with each feature demonstrate that these two novel features contribute most to the improvement in predictive ability. Furthermore, performance comparisons with other approaches clearly show that DNABR has an excellent prediction performance for detecting binding residues in putative DNA-binding protein. The DNABR web-server system is freely available at http://www.cbi.seu.edu.cn/DNABR/.

Mesh:

Substances:

Year:  2012        PMID: 22868682     DOI: 10.1109/TCBB.2012.106

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  13 in total

1.  ATPbind: Accurate Protein-ATP Binding Site Prediction by Combining Sequence-Profiling and Structure-Based Comparisons.

Authors:  Jun Hu; Yang Li; Yang Zhang; Dong-Jun Yu
Journal:  J Chem Inf Model       Date:  2018-02-08       Impact factor: 4.956

2.  DRNApred, fast sequence-based method that accurately predicts and discriminates DNA- and RNA-binding residues.

Authors:  Jing Yan; Lukasz Kurgan
Journal:  Nucleic Acids Res       Date:  2017-06-02       Impact factor: 16.971

3.  Identification of DNA-binding proteins using support vector machine with sequence information.

Authors:  Xin Ma; Jiansheng Wu; Xiaoyun Xue
Journal:  Comput Math Methods Med       Date:  2013-09-16       Impact factor: 2.238

Review 4.  An overview of the prediction of protein DNA-binding sites.

Authors:  Jingna Si; Rui Zhao; Rongling Wu
Journal:  Int J Mol Sci       Date:  2015-03-06       Impact factor: 5.923

5.  Enhancing protein-vitamin binding residues prediction by multiple heterogeneous subspace SVMs ensemble.

Authors:  Dong-Jun Yu; Jun Hu; Hui Yan; Xi-Bei Yang; Jing-Yu Yang; Hong-Bin Shen
Journal:  BMC Bioinformatics       Date:  2014-09-05       Impact factor: 3.169

6.  High-throughput prediction of RNA, DNA and protein binding regions mediated by intrinsic disorder.

Authors:  Zhenling Peng; Lukasz Kurgan
Journal:  Nucleic Acids Res       Date:  2015-06-24       Impact factor: 16.971

7.  DNABP: Identification of DNA-Binding Proteins Based on Feature Selection Using a Random Forest and Predicting Binding Residues.

Authors:  Xin Ma; Jing Guo; Xiao Sun
Journal:  PLoS One       Date:  2016-12-01       Impact factor: 3.240

8.  nDNA-Prot: identification of DNA-binding proteins based on unbalanced classification.

Authors:  Li Song; Dapeng Li; Xiangxiang Zeng; Yunfeng Wu; Li Guo; Quan Zou
Journal:  BMC Bioinformatics       Date:  2014-09-08       Impact factor: 3.169

9.  PDNAsite: Identification of DNA-binding Site from Protein Sequence by Incorporating Spatial and Sequence Context.

Authors:  Jiyun Zhou; Ruifeng Xu; Yulan He; Qin Lu; Hongpeng Wang; Bing Kong
Journal:  Sci Rep       Date:  2016-06-10       Impact factor: 4.379

10.  EL_PSSM-RT: DNA-binding residue prediction by integrating ensemble learning with PSSM Relation Transformation.

Authors:  Jiyun Zhou; Qin Lu; Ruifeng Xu; Yulan He; Hongpeng Wang
Journal:  BMC Bioinformatics       Date:  2017-08-29       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.