Literature DB >> 17166515

Modeling the evolution of protein domain architectures using maximum parsimony.

Jessica H Fong1, Lewis Y Geer, Anna R Panchenko, Stephen H Bryant.   

Abstract

Domains are basic evolutionary units of proteins and most proteins have more than one domain. Advances in domain modeling and collection are making it possible to annotate a large fraction of known protein sequences by a linear ordering of their domains, yielding their architecture. Protein domain architectures link evolutionarily related proteins and underscore their shared functions. Here, we attempt to better understand this association by identifying the evolutionary pathways by which extant architectures may have evolved. We propose a model of evolution in which architectures arise through rearrangements of inferred precursor architectures and acquisition of new domains. These pathways are ranked using a parsimony principle, whereby scenarios requiring the fewest number of independent recombination events, namely fission and fusion operations, are assumed to be more likely. Using a data set of domain architectures present in 159 proteomes that represent all three major branches of the tree of life allows us to estimate the history of over 85% of all architectures in the sequence database. We find that the distribution of rearrangement classes is robust with respect to alternative parsimony rules for inferring the presence of precursor architectures in ancestral species. Analyzing the most parsimonious pathways, we find 87% of architectures to gain complexity over time through simple changes, among which fusion events account for 5.6 times as many architectures as fission. Our results may be used to compute domain architecture similarities, for example, based on the number of historical recombination events separating them. Domain architecture "neighbors" identified in this way may lead to new insights about the evolution of protein function.

Mesh:

Year:  2006        PMID: 17166515      PMCID: PMC1858635          DOI: 10.1016/j.jmb.2006.11.017

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  35 in total

1.  Genome evolution. Gene fusion versus gene fission.

Authors:  B Snel; P Bork; M Huynen
Journal:  Trends Genet       Date:  2000-01       Impact factor: 11.639

2.  Relating whole-genome expression data with protein-protein interactions.

Authors:  Ronald Jansen; Dov Greenbaum; Mark Gerstein
Journal:  Genome Res       Date:  2002-01       Impact factor: 9.043

3.  The geometry of domain combination in proteins.

Authors:  Matthew Bashton; Cyrus Chothia
Journal:  J Mol Biol       Date:  2002-01-25       Impact factor: 5.469

4.  An insight into domain combinations.

Authors:  G Apic; J Gough; S A Teichmann
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

5.  CDD: a database of conserved domain alignments with links to domain three-dimensional structure.

Authors:  Aron Marchler-Bauer; Anna R Panchenko; Benjamin A Shoemaker; Paul A Thiessen; Lewis Y Geer; Stephen H Bryant
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

6.  SUPERFAMILY: HMMs representing all proteins of known structure. SCOP sequence searches, alignments and genome assignments.

Authors:  Julian Gough; Cyrus Chothia
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

7.  CDART: protein homology by domain architecture.

Authors:  Lewis Y Geer; Michael Domrachev; David J Lipman; Stephen H Bryant
Journal:  Genome Res       Date:  2002-10       Impact factor: 9.043

8.  Recent improvements to the SMART domain-based sequence annotation resource.

Authors:  Ivica Letunic; Leo Goodstadt; Nicholas J Dickens; Tobias Doerks; Joerg Schultz; Richard Mott; Francesca Ciccarelli; Richard R Copley; Chris P Ponting; Peer Bork
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

9.  The Pfam protein families database.

Authors:  Alex Bateman; Ewan Birney; Lorenzo Cerruti; Richard Durbin; Laurence Etwiller; Sean R Eddy; Sam Griffiths-Jones; Kevin L Howe; Mhairi Marshall; Erik L L Sonnhammer
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

10.  Functional associations of proteins in entire genomes by means of exhaustive detection of gene fusions.

Authors:  A J Enright; C A Ouzounis
Journal:  Genome Biol       Date:  2001       Impact factor: 13.583

View more
  45 in total

1.  Domain III of the T. thermophilus 23S rRNA folds independently to a near-native state.

Authors:  Shreyas S Athavale; J Jared Gossett; Chiaolong Hsiao; Jessica C Bowman; Eric O'Neill; Eli Hershkovitz; Thanawadee Preeprem; Nicholas V Hud; Roger M Wartell; Stephen C Harvey; Loren Dean Williams
Journal:  RNA       Date:  2012-02-14       Impact factor: 4.942

2.  Evolution of protein domain promiscuity in eukaryotes.

Authors:  Malay Kumar Basu; Liran Carmel; Igor B Rogozin; Eugene V Koonin
Journal:  Genome Res       Date:  2008-01-29       Impact factor: 9.043

Review 3.  Nothing about protein structure classification makes sense except in the light of evolution.

Authors:  Ruben E Valas; Song Yang; Philip E Bourne
Journal:  Curr Opin Struct Biol       Date:  2009-04-24       Impact factor: 6.809

4.  CanProVar: a human cancer proteome variation database.

Authors:  Jing Li; Dexter T Duncan; Bing Zhang
Journal:  Hum Mutat       Date:  2010-03       Impact factor: 4.878

5.  Nature of the protein universe.

Authors:  Michael Levitt
Journal:  Proc Natl Acad Sci U S A       Date:  2009-06-18       Impact factor: 11.205

6.  Detecting evolutionary relationships across existing fold space, using sequence order-independent profile-profile alignments.

Authors:  Lei Xie; Philip E Bourne
Journal:  Proc Natl Acad Sci U S A       Date:  2008-04-02       Impact factor: 11.205

7.  Processes of fungal proteome evolution and gain of function: gene duplication and domain rearrangement.

Authors:  Inbar Cohen-Gihon; Roded Sharan; Ruth Nussinov
Journal:  Phys Biol       Date:  2011-05-13       Impact factor: 2.583

8.  Modular architecture and evolution of the map-1 gene family in the root-knot nematode Meloidogyne incognita.

Authors:  Philippe Castagnone-Sereno; Jean-Philippe Semblat; Chantal Castagnone
Journal:  Mol Genet Genomics       Date:  2009-09-29       Impact factor: 3.291

9.  Protein comparison at the domain architecture level.

Authors:  Byungwook Lee; Doheon Lee
Journal:  BMC Bioinformatics       Date:  2009-12-03       Impact factor: 3.169

10.  The evolutionary history of protein domains viewed by species phylogeny.

Authors:  Song Yang; Philip E Bourne
Journal:  PLoS One       Date:  2009-12-21       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.