Literature DB >> 19233205

Identification of DNA-binding proteins using structural, electrostatic and evolutionary features.

Guy Nimrod1, András Szilágyi, Christina Leslie, Nir Ben-Tal.   

Abstract

DNA-binding proteins (DBPs) participate in various crucial processes in the life-cycle of the cells, and the identification and characterization of these proteins is of great importance. We present here a random forests classifier for identifying DBPs among proteins with known 3D structures. First, clusters of evolutionarily conserved regions (patches) on the surface of proteins were detected using the PatchFinder algorithm; earlier studies showed that these regions are typically the functionally important regions of proteins. Next, we trained a classifier using features like the electrostatic potential, cluster-based amino acid conservation patterns and the secondary structure content of the patches, as well as features of the whole protein, including its dipole moment. Using 10-fold cross-validation on a dataset of 138 DBPs and 110 proteins that do not bind DNA, the classifier achieved a sensitivity and a specificity of 0.90, which is overall better than the performance of published methods. Furthermore, when we tested five different methods on 11 new DBPs that did not appear in the original dataset, only our method annotated all correctly. The resulting classifier was applied to a collection of 757 proteins of known structure and unknown function. Of these proteins, 218 were predicted to bind DNA, and we anticipate that some of them interact with DNA using new structural motifs. The use of complementary computational tools supports the notion that at least some of them do bind DNA.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19233205      PMCID: PMC2726711          DOI: 10.1016/j.jmb.2009.02.023

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  73 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  An integrated approach to the analysis and modeling of protein sequences and structures. I. Protein structural alignment and a quantitative measure for protein structural distance.

Authors:  A S Yang; B Honig
Journal:  J Mol Biol       Date:  2000-08-18       Impact factor: 5.469

3.  Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information.

Authors:  Shandar Ahmad; M Michael Gromiha; Akinori Sarai
Journal:  Bioinformatics       Date:  2004-01-22       Impact factor: 6.937

Review 4.  Facilitated target location in biological systems.

Authors:  P H von Hippel; O G Berg
Journal:  J Biol Chem       Date:  1989-01-15       Impact factor: 5.157

Review 5.  Type II restriction endonucleases: structure and mechanism.

Authors:  A Pingoud; M Fuxreiter; V Pingoud; W Wende
Journal:  Cell Mol Life Sci       Date:  2005-03       Impact factor: 9.261

Review 6.  Archaeal chromatin proteins: different structures but common function?

Authors:  Kathleen Sandman; John N Reeve
Journal:  Curr Opin Microbiol       Date:  2005-10-26       Impact factor: 7.934

7.  Protein function prediction using local 3D templates.

Authors:  Roman A Laskowski; James D Watson; Janet M Thornton
Journal:  J Mol Biol       Date:  2005-08-19       Impact factor: 5.469

8.  HTHquery: a method for detecting DNA-binding proteins with a helix-turn-helix structural motif.

Authors:  C Ferrer-Costa; H P Shanahan; S Jones; J M Thornton
Journal:  Bioinformatics       Date:  2005-07-19       Impact factor: 6.937

9.  Kernel-based machine learning protocol for predicting DNA-binding proteins.

Authors:  Nitin Bhardwaj; Robert E Langlois; Guijun Zhao; Hui Lu
Journal:  Nucleic Acids Res       Date:  2005-11-10       Impact factor: 16.971

10.  ConSurf 2005: the projection of evolutionary conservation scores of residues on protein structures.

Authors:  Meytal Landau; Itay Mayrose; Yossi Rosenberg; Fabian Glaser; Eric Martz; Tal Pupko; Nir Ben-Tal
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

View more
  34 in total

1.  Influence of pK(a) shifts on the calculated dipole moments of proteins.

Authors:  Brett L Mellor; Shiul Khadka; David D Busath; Brian A Mazzeo
Journal:  Protein J       Date:  2011-10       Impact factor: 2.371

Review 2.  DNA-protein interaction: identification, prediction and data analysis.

Authors:  Abbasali Emamjomeh; Darush Choobineh; Behzad Hajieghrari; Nafiseh MahdiNezhad; Amir Khodavirdipour
Journal:  Mol Biol Rep       Date:  2019-03-26       Impact factor: 2.316

3.  Speeding up the drug discovery process: structural similarity searches using molecular surfaces.

Authors:  Dimitrios Vlachakis; Georgia Tsiliki; Dimosthenis Tsagkrasoulis; Carla Sofia Carvalho; Vasileios Megalooikonomou; Sofia Kossida
Journal:  EMBnet J       Date:  2012

4.  Predicting Hot Spot Residues at Protein-DNA Binding Interfaces Based on Sequence Information.

Authors:  Lingsong Yao; Huadong Wang; Yannan Bin
Journal:  Interdiscip Sci       Date:  2020-10-17       Impact factor: 2.233

5.  Classification and interaction in random forests.

Authors:  Danielle Denisko; Michael M Hoffman
Journal:  Proc Natl Acad Sci U S A       Date:  2018-02-12       Impact factor: 11.205

6.  Features of CPB: a Poisson-Boltzmann solver that uses an adaptive Cartesian grid.

Authors:  Marcia O Fenley; Robert C Harris; Travis Mackoy; Alexander H Boschitsch
Journal:  J Comput Chem       Date:  2014-11-27       Impact factor: 3.376

7.  ConSurf 2010: calculating evolutionary conservation in sequence and structure of proteins and nucleic acids.

Authors:  Haim Ashkenazy; Elana Erez; Eric Martz; Tal Pupko; Nir Ben-Tal
Journal:  Nucleic Acids Res       Date:  2010-05-16       Impact factor: 16.971

8.  DNABINDPROT: fluctuation-based predictor of DNA-binding residues within a network of interacting residues.

Authors:  Pemra Ozbek; Seren Soner; Burak Erman; Turkan Haliloglu
Journal:  Nucleic Acids Res       Date:  2010-05-16       Impact factor: 16.971

9.  iDBPs: a web server for the identification of DNA binding proteins.

Authors:  Guy Nimrod; Maya Schushan; András Szilágyi; Christina Leslie; Nir Ben-Tal
Journal:  Bioinformatics       Date:  2010-01-19       Impact factor: 6.937

10.  Boosting the prediction and understanding of DNA-binding domains from sequence.

Authors:  Robert E Langlois; Hui Lu
Journal:  Nucleic Acids Res       Date:  2010-02-15       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.