Literature DB >> 12784216

Scaling law in sizes of protein sequence families: from super-families to orphan genes.

Ron Unger1, Shai Uliel, Shlomo Havlin.   

Abstract

It has been observed that the size of protein sequence families is unevenly distributed, with few super families with a large number of members and many "orphan" proteins that do not belong to any family. Here it is shown that the distribution of sizes of protein families in different databases and classifications (Protomap, Prodom, Cog) follows a power-law behavior with similar scaling exponents, which is characteristic of self-organizing systems. Since large databases are used in this study, a more detailed analysis of the data than in previous studies was possible. Hence, it is shown that the size distribution is governed by two exponents, different for the super families and the orphan proteins. A simple model of protein evolution is proposed, in which proteins are dynamically generated and clustered into families. The model yields a scaling behavior very similar to the distribution observed in the actual sequence databases, including the two distinct regimes for the large and small families, and thus suggests that the existence of "super families" of proteins and "orphan" proteins are two manifestations of the same evolutionary process. Copyright 2003 Wiley-Liss, Inc.

Mesh:

Substances:

Year:  2003        PMID: 12784216     DOI: 10.1002/prot.10347

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  16 in total

1.  Evolution of protein families: is it possible to distinguish between domains of life?

Authors:  Marta Sales-Pardo; Albert O B Chan; Luís A N Amaral; Roger Guimerà
Journal:  Gene       Date:  2007-08-14       Impact factor: 3.688

2.  Nature of the protein universe.

Authors:  Michael Levitt
Journal:  Proc Natl Acad Sci U S A       Date:  2009-06-18       Impact factor: 11.205

3.  Models of gene gain and gene loss for probabilistic reconstruction of gene content in the last universal common ancestor of life.

Authors:  Lavanya Kannan; Hua Li; Boris Rubinstein; Arcady Mushegian
Journal:  Biol Direct       Date:  2013-12-19       Impact factor: 4.540

4.  Comparative modeling and protein-like features of hydrophobic-polar models on a two-dimensional lattice.

Authors:  Sergio Moreno-Hernández; Michael Levitt
Journal:  Proteins       Date:  2012-04-13

5.  The Sorcerer II Global Ocean Sampling expedition: expanding the universe of protein families.

Authors:  Shibu Yooseph; Granger Sutton; Douglas B Rusch; Aaron L Halpern; Shannon J Williamson; Karin Remington; Jonathan A Eisen; Karla B Heidelberg; Gerard Manning; Weizhong Li; Lukasz Jaroszewski; Piotr Cieplak; Christopher S Miller; Huiying Li; Susan T Mashiyama; Marcin P Joachimiak; Christopher van Belle; John-Marc Chandonia; David A Soergel; Yufeng Zhai; Kannan Natarajan; Shaun Lee; Benjamin J Raphael; Vineet Bafna; Robert Friedman; Steven E Brenner; Adam Godzik; David Eisenberg; Jack E Dixon; Susan S Taylor; Robert L Strausberg; Marvin Frazier; J Craig Venter
Journal:  PLoS Biol       Date:  2007-03       Impact factor: 8.029

Review 6.  The impact of structural genomics: the first quindecennial.

Authors:  Marek Grabowski; Ewa Niedzialkowska; Matthew D Zimmerman; Wladek Minor
Journal:  J Struct Funct Genomics       Date:  2016-03-02

7.  Scaling properties of protein family phylogenies.

Authors:  Alejandro Herrada; Víctor M Eguíluz; Emilio Hernández-García; Carlos M Duarte
Journal:  BMC Evol Biol       Date:  2011-06-06       Impact factor: 3.260

8.  Evolutionary origins of genomic repertoires in bacteria.

Authors:  Emmanuelle Lerat; Vincent Daubin; Howard Ochman; Nancy A Moran
Journal:  PLoS Biol       Date:  2005-04-05       Impact factor: 8.029

9.  Unravelling the ORFan Puzzle.

Authors:  Naomi Siew; Daniel Fischer
Journal:  Comp Funct Genomics       Date:  2003

10.  Systematic and searchable classification of cytochrome P450 proteins encoded by fungal and oomycete genomes.

Authors:  Venkatesh Moktali; Jongsun Park; Natalie D Fedorova-Abrams; Bongsoo Park; Jaeyoung Choi; Yong-Hwan Lee; Seogchan Kang
Journal:  BMC Genomics       Date:  2012-10-04       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.