Literature DB >> 12706730

Exhaustive enumeration of protein domain families.

Andreas Heger1, Liisa Holm.   

Abstract

Domains are considered as the basic units of protein folding, evolution, and function. Decomposing each protein into modular domains is thus a basic prerequisite for accurate functional classification of biological molecules. Here, we present ADDA, an automatic algorithm for domain decomposition and clustering of all protein domain families. We use alignments derived from an all-on-all sequence comparison to define domains within protein sequences based on a global maximum likelihood model. In all, 90% of domain boundaries are predicted within 10% of domain size when compared with the manual domain definitions given in the SCOP database. A representative database of 249,264 protein sequences were decomposed into 450,462 domains. These domains were clustered on the basis of sequence similarities into 33,879 domain families containing at least two members with less than 40% sequence identity. Validation against family definitions in the manually curated databases SCOP and PFAM indicates almost perfect unification of various large domain families while contamination by unrelated sequences remains at a low level. The global survey of protein-domain space by ADDA confirms that most large and universal domain families are already described in PFAM and/or SMART. However, a survey of the complete set of mobile modules leads to the identification of 1479 new interesting domain families which shuffle around in multi-domain proteins. The data are publicly available at ftp://ftp.ebi.ac.uk/pub/contrib/heger/adda.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 12706730     DOI: 10.1016/s0022-2836(03)00269-9

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  57 in total

1.  More for less in structural genomics.

Authors:  A Heger; L Holm
Journal:  J Struct Funct Genomics       Date:  2003

2.  SMART 4.0: towards genomic data integration.

Authors:  Ivica Letunic; Richard R Copley; Steffen Schmidt; Francesca D Ciccarelli; Tobias Doerks; Jörg Schultz; Chris P Ponting; Peer Bork
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  An assessment of substitution scores for protein profile-profile comparison.

Authors:  Xugang Ye; Guoli Wang; Stephen F Altschul
Journal:  Bioinformatics       Date:  2011-10-13       Impact factor: 6.937

4.  Selection of recombinant anti-SH3 domain antibodies by high-throughput phage display.

Authors:  Haiming Huang; Nicolas O Economopoulos; Bernard A Liu; Andrea Uetrecht; Jun Gu; Nick Jarvik; Vincent Nadeem; Tony Pawson; Jason Moffat; Shane Miersch; Sachdev S Sidhu
Journal:  Protein Sci       Date:  2015-09-16       Impact factor: 6.725

5.  Detecting remotely related proteins by their interactions and sequence similarity.

Authors:  Jordi Espadaler; Ramón Aragüés; Narayanan Eswar; Marc A Marti-Renom; Enrique Querol; Francesc X Avilés; Andrej Sali; Baldomero Oliva
Journal:  Proc Natl Acad Sci U S A       Date:  2005-05-09       Impact factor: 11.205

6.  Structural characterization of GntR/HutC family signaling domain.

Authors:  Marina Gorelik; Vladimir V Lunin; Tatiana Skarina; Alexei Savchenko
Journal:  Protein Sci       Date:  2006-05-02       Impact factor: 6.725

7.  Graph theoretical insights into evolution of multidomain proteins.

Authors:  Teresa Przytycka; George Davis; Nan Song; Dannie Durand
Journal:  J Comput Biol       Date:  2006-03       Impact factor: 1.479

8.  DDOMAIN: Dividing structures into domains using a normalized domain-domain interaction profile.

Authors:  Hongyi Zhou; Bin Xue; Yaoqi Zhou
Journal:  Protein Sci       Date:  2007-05       Impact factor: 6.725

9.  Identification of correct regions in protein models using structural, alignment, and consensus information.

Authors:  Björn Wallner; Arne Elofsson
Journal:  Protein Sci       Date:  2006-03-07       Impact factor: 6.725

10.  Identification of a chemoreceptor zinc-binding domain common to cytoplasmic bacterial chemoreceptors.

Authors:  Jenny Draper; Kevin Karplus; Karen M Ottemann
Journal:  J Bacteriol       Date:  2011-07-01       Impact factor: 3.490

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.