| Literature DB >> 20360214 |
Svetlana A Shabalina, Aleksey Y Ogurtsov, Alexey N Spiridonov, Pavel S Novichkov, Nikolay A Spiridonov, Eugene V Koonin.
Abstract
Comparison of expression levels and breadth and evolutionary rates of intronless and intron-containing mammalian genes shows that intronless genes are expressed at lower levels, tend to be tissue specific, and evolve significantly faster than spliced genes. By contrast, monomorphic spliced genes that are not subject to detectable alternative splicing and polymorphic alternatively spliced genes show similar statistically indistinguishable patterns of expression and evolution. Alternative splicing is most common in ancient genes, whereas intronless genes appear to have relatively recent origins. These results imply tight coupling between different stages of gene expression, in particular, transcription, splicing, and nucleocytosolic transport of transcripts, and suggest that formation of intronless genes is an important route of evolution of novel tissue-specific functions in animals.Entities:
Mesh:
Year: 2010 PMID: 20360214 PMCID: PMC2908711 DOI: 10.1093/molbev/msq086
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Intronless, monomorphic, and polymorphic genes in different age classes of human genes.
| Number of Genes | Evolutionary Age of Genes | ||||||
| Gene Type | Cellular | Eukaryota | Metazoa | Chordata | Mammalia | Primates | |
| Polymorphic | 11,082 | 3,111 (0.609) | 3,310 (0.617) | 2,474 (0.560) | 1,623 (0.481) | 505 (0.389) | 59 (0.249) |
| Monomorphic | 7,651 | 1,903 (0.372) | 1,874 (0.349) | 1,745 (0.395) | 1,325 (0.393) | 645 (0.497) | 159 (0.671) |
| Intronless | 1,072 | 96 (0.019) | 184 (0.034) | 199 (0.045) | 426 (0.126) | 148 (0.114) | 19 (0.080) |
| Mono/Poly | 0.883 | 0.826 | 1.022 | 1.185 | 1.826 | 4.2 | |
| Intronless/Poly | 0.317 | 0.541 | 0.834 | 2.719 | 3 | 3.4 | |
Protein CDSs of 22,849 human genes were classified according to the ancestral taxa where the proteins first appeared. Ancestral homologues were identified in RefSeq sequences from organisms of different taxonomic levels by BLASTP search with an expectation cutoff value of 10−6 using a procedure described in Supplementary Material online. Frequencies of gene types in evolutionary age classes are shown in parentheses. Mono/poly and intronless/poly ratios were normalized taking into account numbers of genes in the groups.
FMeans of functional domain lengths in mature transcripts of intronless, monomorphic, and polymorphic genes. Gene structure and lengths of functional domains were determined from genomic coordinates of the human genome (March 2006 assembly) that were downloaded from the UCSC genome server (http://genome.ucsc.edu). Monomorphic or polymorphic gene status was defined from the number of annotated transcript isoforms, as described in Supplementary Material online.
FMeans of expression levels and breadth in human intronless (Int0), monomorphic (Mono), and polymorphic (Poly) genes with different numbers of isoforms (from 2 to 4 and more). Gene expression levels and breadths were evaluated by tallying the numbers of gene-specific EST sequences from normal human tissues in GenBank (Ogurtsov et al. 2008) and from GenAtlas expression data, as described in Supplementary Material online.
FMeans of evolutionary rates in intronless, monomorphic, and polymorphic (Poly) genes with different numbers of isoforms (from 2 to 4 and more). Rates of synonymous (dS) and non-synonymous (dN) substitutions in the protein CDSs and evolutionary rates in 5′UTRs (K5) and 3′UTRs (K3) were estimated from sequence alignments of approximately 9,000 human and macaque orthologous genes, as described previously (Ogurtsov et al. 2008).