Literature DB >> 12386003

Building an automated classification of DNA-binding protein domains.

Julia V Ponomarenko1, Philip E Bourne, Ilya N Shindyalov.   

Abstract

Intensive growth in 3D structure data on DNA-protein complexes as reflected in the Protein Data Bank (PDB) demands new approaches to the annotation and characterization of these data and will lead to a new understanding of critical biological processes involving these data. These data and those from other protein structure classifications will become increasingly important for the modeling of complete proteomes. We propose a fully automated classification of DNA-binding protein domains based on existing 3D-structures from the PDB. The classification, by domain, relies on the Protein Domain Parser (PDP) and the Combinatorial Extension (CE) algorithm for structural alignment. The approach involves the analysis of 3D-interaction patterns in DNA-protein interfaces, assignment of structural domains interacting with DNA, clustering of domains based on structural similarity and DNA-interacting patterns. Comparison with existing resources on describing structural and functional classifications of DNA-binding proteins was used to validate and improve the approach proposed here. In the course of our study we defined a set of criteria and heuristics allowing us to automatically build a biologically meaningful classification and define classes of functionally related protein domains. It was shown that taking into consideration interactions between protein domains and DNA considerably improves the classification accuracy. Our approach provides a high-throughput and up-to-date annotation of DNA-binding protein families which can be found at http://spdc.sdsc.edu.

Mesh:

Substances:

Year:  2002        PMID: 12386003     DOI: 10.1093/bioinformatics/18.suppl_2.s192

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

Review 1.  Genomic repertoires of DNA-binding transcription factors across the tree of life.

Authors:  Varodom Charoensawan; Derek Wilson; Sarah A Teichmann
Journal:  Nucleic Acids Res       Date:  2010-07-30       Impact factor: 16.971

2.  Re-visiting protein-centric two-tier classification of existing DNA-protein complexes.

Authors:  Sony Malhotra; Ramanathan Sowdhamini
Journal:  BMC Bioinformatics       Date:  2012-07-16       Impact factor: 3.169

3.  Computational technique for improvement of the position-weight matrices for the DNA/protein binding sites.

Authors:  Naum I Gershenzon; Gary D Stormo; Ilya P Ioshikhes
Journal:  Nucleic Acids Res       Date:  2005-04-22       Impact factor: 16.971

4.  Bioinformatic analysis of the protein/DNA interface.

Authors:  Bohdan Schneider; Jirí Cerný; Daniel Svozil; Petr Cech; Jean-Christophe Gelly; Alexandre G de Brevern
Journal:  Nucleic Acids Res       Date:  2013-12-11       Impact factor: 16.971

5.  An updated version of NPIDB includes new classifications of DNA-protein complexes and their families.

Authors:  Olga Zanegina; Dmitriy Kirsanov; Eugene Baulin; Anna Karyagina; Andrei Alexeevski; Sergey Spirin
Journal:  Nucleic Acids Res       Date:  2015-12-09       Impact factor: 16.971

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.