| Literature DB >> 20333219 |
Michel C Milinkovitch, Raphaël Helaers, Athanasia C Tzika.
Abstract
Recent analyses indicated that genes with larger effect of knockout or mutation and with larger probability to revert to single copy after whole genome duplication are expressed earlier in development. Here, we further investigate whether tissue specificity of gene expression is constrained by the age of origin of the corresponding genes. We use 38 metazoan genomes and a comparative genomic application system to integrate inference of gene duplication with expression data from 17,503 human genes into a strictly phylogenetic framework. We show that the number of anatomical systems in which genes are expressed decreases steadily with decreased age of the genes' first appearance in the phylogeny: the oldest genes are expressed, on average, in twice as many anatomical systems than the genes gained recently in evolution. These results are robust to different sources of expression data, to different levels of the anatomical system hierarchy, and to the use of gene families rather than duplication events. Finally, we show that the rate of increase in gene tissue specificity correlates with the relative rate of increase in the maximum number of cell types in the corresponding taxa. Although subfunctionalization and increase in cell type number throughout evolution could constitute, respectively, the proximal and ultimate causes of this correlation, the two phenomena are intermingled. Our analyses identify a striking historical constraint in gene expression: the number of cell types in existence at the time of a gene appearance (through duplication or de novo origination) tends to determine its level of tissue specificity for tens or hundreds of millions of years.Entities:
Keywords: duplication; expression; gene gain; genome content; metazoa; phylogeny
Year: 2009 PMID: 20333219 PMCID: PMC2839353 DOI: 10.1093/gbe/evp052
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FMean number (±standard error) of first-level (a) and last-level (b) anatomical systems in which human genes are expressed (16,943 genes with available eGenetics expression data) as a function of their first appearance in the phylogeny.
FMean number (±standard error) of first-level (a) and last-level (b) anatomical systems in which human genes are expressed (6,585 genes with available HMDEG expression data) as a function of their first appearance in the phylogeny.
FThe number of cell types in existence at the time of appearance of a gene seems to constrain its level of tissue specificity for hundreds of millions of years. Red line (and primary vertical axis): mean number of first-level anatomical systems in which members of human gene families are expressed (16,943 genes, corresponding to 10,302 families, with available eGenetics expression data) as a function of the family's first appearance in the phylogeny. Blue line (and secondary vertical axis): estimated maximum number of cell types of primitive members of metazoa taxa (indicated with vertical dotted lines). The dashed blue line indicates the gap in available estimates of cell type numbers between early Amniotes and Hominidae. Note that values on the secondary vertical axis are in reverse order.