Literature DB >> 9580988

The frequency distribution of gene family sizes in complete genomes.

M A Huynen1, E van Nimwegen.   

Abstract

We compare the frequency distribution of gene family sizes in the complete genomes of six bacteria (Escherichia coli, Haemophilus influenzae, Helicobacter pylori, Mycoplasma genitalium, Mycoplasma pneumoniae, and Synechocystis sp. PCC6803), two Archaea (Methanococcus jannaschii and Methanobacterium thermoautotrophicum), one eukaryote (Saccharomyces cerevisiae), the vaccinia virus, and the bacteriophage T4. The sizes of the gene families versus their frequencies show power-law distributions that tend to become flatter (have a larger exponent) as the number of genes in the genome increases. Power-law distributions generally occur as the limit distribution of a multiplicative stochastic process with a boundary constraint. We discuss various models that can account for a multiplicative process determining the sizes of gene families in the genome. In particular, we argue that, in order to explain the observed distributions, gene families have to behave in a coherent fashion within the genome; i.e., the probabilities of duplications of genes within a gene family are not independent of each other. Likewise, the probabilities of deletions of genes within a gene family are not independent of each other.

Entities:  

Mesh:

Year:  1998        PMID: 9580988     DOI: 10.1093/oxfordjournals.molbev.a025959

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


  71 in total

1.  Lineage-specific gene expansions in bacterial and archaeal genomes.

Authors:  I K Jordan; K S Makarova; J L Spouge; Y I Wolf; E V Koonin
Journal:  Genome Res       Date:  2001-04       Impact factor: 9.043

2.  Genes linked by fusion events are generally of the same functional category: a systematic analysis of 30 microbial genomes.

Authors:  I Yanai; A Derti; C DeLisi
Journal:  Proc Natl Acad Sci U S A       Date:  2001-07-03       Impact factor: 11.205

3.  GenomeHistory: a software tool and its application to fully sequenced genomes.

Authors:  Gavin C Conant; Andreas Wagner
Journal:  Nucleic Acids Res       Date:  2002-08-01       Impact factor: 16.971

4.  Discriminating self from nonself with short peptides from large proteomes.

Authors:  Nigel J Burroughs; Rob J de Boer; Can Keşmir
Journal:  Immunogenetics       Date:  2004-07-30       Impact factor: 2.846

5.  Protein structure and evolutionary history determine sequence space topology.

Authors:  Boris E Shakhnovich; Eric Deeds; Charles Delisi; Eugene Shakhnovich
Journal:  Genome Res       Date:  2005-03       Impact factor: 9.043

6.  Estimating the tempo and mode of gene family evolution from comparative genomic data.

Authors:  Matthew W Hahn; Tijl De Bie; Jason E Stajich; Chi Nguyen; Nello Cristianini
Journal:  Genome Res       Date:  2005-08       Impact factor: 9.043

7.  Universal sharing patterns in proteomes and evolution of protein fold architecture and life.

Authors:  Gustavo Caetano-Anollés; Derek Caetano-Anollés
Journal:  J Mol Evol       Date:  2005-04       Impact factor: 2.395

8.  A model for the evolution of paralog families in genomes.

Authors:  Ryszard Rudnicki; Jerzy Tiuryn; Damian Wójtowicz
Journal:  J Math Biol       Date:  2006-09-19       Impact factor: 2.259

9.  Evolution of protein families: is it possible to distinguish between domains of life?

Authors:  Marta Sales-Pardo; Albert O B Chan; Luís A N Amaral; Roger Guimerà
Journal:  Gene       Date:  2007-08-14       Impact factor: 3.688

10.  The mode and tempo of genome size evolution in eukaryotes.

Authors:  Matthew J Oliver; Dmitri Petrov; David Ackerly; Paul Falkowski; Oscar M Schofield
Journal:  Genome Res       Date:  2007-04-09       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.