Literature DB >> 24448651

Predicting DNA-binding sites of proteins based on sequential and 3D structural information.

Bi-Qing Li1, Kai-Yan Feng, Juan Ding, Yu-Dong Cai.   

Abstract

Protein-DNA interactions play important roles in many biological processes. To understand the molecular mechanisms of protein-DNA interaction, it is necessary to identify the DNA-binding sites in DNA-binding proteins. In the last decade, computational approaches have been developed to predict protein-DNA-binding sites based solely on protein sequences. In this study, we developed a novel predictor based on support vector machine algorithm coupled with the maximum relevance minimum redundancy method followed by incremental feature selection. We incorporated not only features of physicochemical/biochemical properties, sequence conservation, residual disorder, secondary structure, solvent accessibility, but also five three-dimensional (3D) structural features calculated from PDB data to predict the protein-DNA interaction sites. Feature analysis showed that 3D structural features indeed contributed to the prediction of DNA-binding site and it was demonstrated that the prediction performance was better with 3D structural features than without them. It was also shown via analysis of features from each site that the features of DNA-binding site itself contribute the most to the prediction. Our prediction method may become a useful tool for identifying the DNA-binding sites and the feature analysis described in this paper may provide useful insights for in-depth investigations into the mechanisms of protein-DNA interaction.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24448651     DOI: 10.1007/s00438-014-0812-x

Source DB:  PubMed          Journal:  Mol Genet Genomics        ISSN: 1617-4623            Impact factor:   3.291


  51 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 2.  Intrinsically unstructured proteins: re-assessing the protein structure-function paradigm.

Authors:  P E Wright; H J Dyson
Journal:  J Mol Biol       Date:  1999-10-22       Impact factor: 5.469

3.  Conserved cysteine residue in the DNA-binding domain of the bovine papillomavirus type 1 E2 protein confers redox regulation of the DNA-binding activity in vitro.

Authors:  A A McBride; R D Klausner; P M Howley
Journal:  Proc Natl Acad Sci U S A       Date:  1992-08-15       Impact factor: 11.205

4.  Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information.

Authors:  Shandar Ahmad; M Michael Gromiha; Akinori Sarai
Journal:  Bioinformatics       Date:  2004-01-22       Impact factor: 6.937

5.  Solving the protein sequence metric problem.

Authors:  William R Atchley; Jieping Zhao; Andrew D Fernandes; Tanja Drüke
Journal:  Proc Natl Acad Sci U S A       Date:  2005-04-25       Impact factor: 11.205

6.  Prediction of protein B-factor profiles.

Authors:  Zheng Yuan; Timothy L Bailey; Rohan D Teasdale
Journal:  Proteins       Date:  2005-03-01

7.  Analysis of protein-protein interaction sites using surface patches.

Authors:  S Jones; J M Thornton
Journal:  J Mol Biol       Date:  1997-09-12       Impact factor: 5.469

8.  PreDNA: accurate prediction of DNA-binding sites in proteins by integrating sequence and geometric structure information.

Authors:  Tao Li; Qian-Zhong Li; Shuai Liu; Guo-Liang Fan; Yong-Chun Zuo; Yong Peng
Journal:  Bioinformatics       Date:  2013-01-17       Impact factor: 6.937

9.  Predicting DNA-binding sites of proteins from amino acid sequence.

Authors:  Changhui Yan; Michael Terribilini; Feihong Wu; Robert L Jernigan; Drena Dobbs; Vasant Honavar
Journal:  BMC Bioinformatics       Date:  2006-05-19       Impact factor: 3.169

10.  PSAIA - protein structure and interaction analyzer.

Authors:  Josip Mihel; Mile Sikić; Sanja Tomić; Branko Jeren; Kristian Vlahovicek
Journal:  BMC Struct Biol       Date:  2008-04-09
View more
  8 in total

1.  iTTCA-MFF: identifying tumor T cell antigens based on multiple feature fusion.

Authors:  Hongliang Zou; Fan Yang; Zhijian Yin
Journal:  Immunogenetics       Date:  2022-03-05       Impact factor: 3.330

Review 2.  An overview of the prediction of protein DNA-binding sites.

Authors:  Jingna Si; Rui Zhao; Rongling Wu
Journal:  Int J Mol Sci       Date:  2015-03-06       Impact factor: 5.923

3.  A Large-Scale Assessment of Nucleic Acids Binding Site Prediction Programs.

Authors:  Zhichao Miao; Eric Westhof
Journal:  PLoS Comput Biol       Date:  2015-12-17       Impact factor: 4.475

4.  Prediction of protein-protein interaction sites by means of ensemble learning and weighted feature descriptor.

Authors:  Xiuquan Du; Shiwei Sun; Changlin Hu; Xinrui Li; Junfeng Xia
Journal:  J Biol Res (Thessalon)       Date:  2016-07-04       Impact factor: 1.889

5.  RBscore&NBench: a high-level web server for nucleic acid binding residues prediction with a large-scale benchmarking database.

Authors:  Zhichao Miao; Eric Westhof
Journal:  Nucleic Acids Res       Date:  2016-04-15       Impact factor: 16.971

6.  Identification of DNA-protein Binding Sites through Multi-Scale Local Average Blocks on Sequence Information.

Authors:  Cong Shen; Yijie Ding; Jijun Tang; Jian Song; Fei Guo
Journal:  Molecules       Date:  2017-11-28       Impact factor: 4.411

7.  SXGBsite: Prediction of Protein-Ligand Binding Sites Using Sequence Information and Extreme Gradient Boosting.

Authors:  Ziqi Zhao; Yonghong Xu; Yong Zhao
Journal:  Genes (Basel)       Date:  2019-11-22       Impact factor: 4.096

8.  iProDNA-CapsNet: identifying protein-DNA binding residues using capsule neural networks.

Authors:  Binh P Nguyen; Quang H Nguyen; Giang-Nam Doan-Ngoc; Thanh-Hoang Nguyen-Vo; Susanto Rahardja
Journal:  BMC Bioinformatics       Date:  2019-12-27       Impact factor: 3.169

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.