Literature DB >> 23737141

DNABind: a hybrid algorithm for structure-based prediction of DNA-binding residues by combining machine learning- and template-based approaches.

Rong Liu1, Jianjun Hu.   

Abstract

Accurate prediction of DNA-binding residues has become a problem of increasing importance in structural bioinformatics. Here, we presented DNABind, a novel hybrid algorithm for identifying these crucial residues by exploiting the complementarity between machine learning- and template-based methods. Our machine learning-based method was based on the probabilistic combination of a structure-based and a sequence-based predictor, both of which were implemented using support vector machines algorithms. The former included our well-designed structural features, such as solvent accessibility, local geometry, topological features, and relative positions, which can effectively quantify the difference between DNA-binding and nonbinding residues. The latter combined evolutionary conservation features with three other sequence attributes. Our template-based method depended on structural alignment and utilized the template structure from known protein-DNA complexes to infer DNA-binding residues. We showed that the template method had excellent performance when reliable templates were found for the query proteins but tended to be strongly influenced by the template quality as well as the conformational changes upon DNA binding. In contrast, the machine learning approach yielded better performance when high-quality templates were not available (about 1/3 cases in our dataset) or the query protein was subject to intensive transformation changes upon DNA binding. Our extensive experiments indicated that the hybrid approach can distinctly improve the performance of the individual methods for both bound and unbound structures. DNABind also significantly outperformed the state-of-art algorithms by around 10% in terms of Matthews's correlation coefficient. The proposed methodology could also have wide application in various protein functional site annotations. DNABind is freely available at http://mleg.cse.sc.edu/DNABind/.
Copyright © 2013 Wiley Periodicals, Inc.

Keywords:  DNA-binding residue; conformational change; machine learning; protein-DNA interaction; structural analysis; template

Mesh:

Substances:

Year:  2013        PMID: 23737141     DOI: 10.1002/prot.24330

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  12 in total

1.  Individually double minimum-distance definition of protein-RNA binding residues and application to structure-based prediction.

Authors:  Wen Hu; Liu Qin; Menglong Li; Xuemei Pu; Yanzhi Guo
Journal:  J Comput Aided Mol Des       Date:  2018-11-26       Impact factor: 3.686

2.  The choice of sequence homologs included in multiple sequence alignments has a dramatic impact on evolutionary conservation analysis.

Authors:  Nelson Gil; Andras Fiser
Journal:  Bioinformatics       Date:  2019-01-01       Impact factor: 6.937

3.  Structural changes in DNA-binding proteins on complexation.

Authors:  Sayan Poddar; Devlina Chakravarty; Pinak Chakrabarti
Journal:  Nucleic Acids Res       Date:  2018-04-20       Impact factor: 16.971

Review 4.  Template-based prediction of protein function.

Authors:  Donald Petrey; T Scott Chen; Lei Deng; Jose Ignacio Garzon; Howook Hwang; Gorka Lasso; Hunjoong Lee; Antonina Silkov; Barry Honig
Journal:  Curr Opin Struct Biol       Date:  2015-02-10       Impact factor: 6.809

5.  Dissecting and predicting different types of binding sites in nucleic acids based on structural information.

Authors:  Zheng Jiang; Si-Rui Xiao; Rong Liu
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

6.  SNBRFinder: A Sequence-Based Hybrid Algorithm for Enhanced Prediction of Nucleic Acid-Binding Residues.

Authors:  Xiaoxia Yang; Jia Wang; Jun Sun; Rong Liu
Journal:  PLoS One       Date:  2015-07-15       Impact factor: 3.240

Review 7.  An overview of the prediction of protein DNA-binding sites.

Authors:  Jingna Si; Rui Zhao; Rongling Wu
Journal:  Int J Mol Sci       Date:  2015-03-06       Impact factor: 5.923

8.  A Large-Scale Assessment of Nucleic Acids Binding Site Prediction Programs.

Authors:  Zhichao Miao; Eric Westhof
Journal:  PLoS Comput Biol       Date:  2015-12-17       Impact factor: 4.475

9.  PDNAsite: Identification of DNA-binding Site from Protein Sequence by Incorporating Spatial and Sequence Context.

Authors:  Jiyun Zhou; Ruifeng Xu; Yulan He; Qin Lu; Hongpeng Wang; Bing Kong
Journal:  Sci Rep       Date:  2016-06-10       Impact factor: 4.379

10.  CRHunter: integrating multifaceted information to predict catalytic residues in enzymes.

Authors:  Jun Sun; Jia Wang; Dan Xiong; Jian Hu; Rong Liu
Journal:  Sci Rep       Date:  2016-09-26       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.