Literature DB >> 16568445

Using evolutionary and structural information to predict DNA-binding sites on DNA-binding proteins.

Igor B Kuznetsov1, Zhenkun Gou, Run Li, Seungwoo Hwang.   

Abstract

Proteins that interact with DNA are involved in a number of fundamental biological activities such as DNA replication, transcription, and repair. A reliable identification of DNA-binding sites in DNA-binding proteins is important for functional annotation, site-directed mutagenesis, and modeling protein-DNA interactions. We apply Support Vector Machine (SVM), a supervised pattern recognition method, to predict DNA-binding sites in DNA-binding proteins using the following features: amino acid sequence, profile of evolutionary conservation of sequence positions, and low-resolution structural information. We use a rigorous statistical approach to study the performance of predictors that utilize different combinations of features and how this performance is affected by structural and sequence properties of proteins. Our results indicate that an SVM predictor based on a properly scaled profile of evolutionary conservation in the form of a position specific scoring matrix (PSSM) significantly outperforms a PSSM-based neural network predictor. The highest accuracy is achieved by SVM predictor that combines the profile of evolutionary conservation with low-resolution structural information. Our results also show that knowledge-based predictors of DNA-binding sites perform significantly better on proteins from mainly-alpha structural class and that the performance of these predictors is significantly correlated with certain structural and sequence properties of proteins. These observations suggest that it may be possible to assign a reliability index to the overall accuracy of the prediction of DNA-binding sites in any given protein using its sequence and structural properties. A web-server implementation of the predictors is freely available online at http://lcg.rit.albany.edu/dp-bind/. (c) 2006 Wiley-Liss, Inc.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16568445     DOI: 10.1002/prot.20977

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  59 in total

1.  Learning to translate sequence and structure to function: identifying DNA binding and membrane binding proteins.

Authors:  Robert E Langlois; Matthew B Carson; Nitin Bhardwaj; Hui Lu
Journal:  Ann Biomed Eng       Date:  2007-04-13       Impact factor: 3.934

2.  Residue-level prediction of DNA-binding sites and its application on DNA-binding protein predictions.

Authors:  Nitin Bhardwaj; Hui Lu
Journal:  FEBS Lett       Date:  2007-02-07       Impact factor: 4.124

3.  FINDSITE-metal: integrating evolutionary information and machine learning for structure-based metal-binding site prediction at the proteome level.

Authors:  Michal Brylinski; Jeffrey Skolnick
Journal:  Proteins       Date:  2010-12-06

4.  Leishmania actin binds and nicks kDNA as well as inhibits decatenation activity of type II topoisomerase.

Authors:  Prabodh Kapoor; Ashutosh Kumar; Rangeetha Naik; Munia Ganguli; Mohammad I Siddiqi; Amogh A Sahasrabuddhe; Chhitar M Gupta
Journal:  Nucleic Acids Res       Date:  2010-02-10       Impact factor: 16.971

5.  BindN+ for accurate prediction of DNA and RNA-binding residues from protein sequence features.

Authors:  Liangjiang Wang; Caiyan Huang; Mary Qu Yang; Jack Y Yang
Journal:  BMC Syst Biol       Date:  2010-05-28

6.  Predicting protein ligand binding sites by combining evolutionary sequence conservation and 3D structure.

Authors:  John A Capra; Roman A Laskowski; Janet M Thornton; Mona Singh; Thomas A Funkhouser
Journal:  PLoS Comput Biol       Date:  2009-12-04       Impact factor: 4.475

7.  From nonspecific DNA-protein encounter complexes to the prediction of DNA-protein interactions.

Authors:  Mu Gao; Jeffrey Skolnick
Journal:  PLoS Comput Biol       Date:  2009-04-03       Impact factor: 4.475

8.  Common physical basis of macromolecule-binding sites in proteins.

Authors:  Yao Chi Chen; Carmay Lim
Journal:  Nucleic Acids Res       Date:  2008-11-06       Impact factor: 16.971

9.  Modular prediction of protein structural classes from sequences of twilight-zone identity with predicting sequences.

Authors:  Marcin J Mizianty; Lukasz Kurgan
Journal:  BMC Bioinformatics       Date:  2009-12-13       Impact factor: 3.169

10.  A threading-based method for the prediction of DNA-binding proteins with application to the human genome.

Authors:  Mu Gao; Jeffrey Skolnick
Journal:  PLoS Comput Biol       Date:  2009-11-13       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.