| Literature DB >> 20889654 |
Svetlana A Shabalina1, Alexey N Spiridonov, Nikolay A Spiridonov, Eugene V Koonin.
Abstract
The majority of mammalian genes produce multiple transcripts resulting from alternative splicing (AS) and/or alternative transcription initiation (ATI) and alternative transcription termination (ATT). Comparative analysis of the number of alternative nucleotides, isoforms, and introns per locus in genes with different types of alternative events suggests that ATI and ATT contribute to the diversity of human and mouse transcriptome even more than AS. There is a strong negative correlation between AS and ATI in 5' untranslated regions (UTRs) and AS in coding sequences (CDSs) but an even stronger positive correlation between AS in CDSs and ATT in 3' UTRs. These observations could reflect preferential regulation of distinct, large groups of genes by different mechanisms: 1) regulation at the level of transcription initiation and initiation of translation resulting from ATI and AS in 5' UTRs and 2) posttranslational regulation by different protein isoforms. The tight linkage between AS in CDSs and ATT in 3' UTRs suggests that variability of 3' UTRs mediates differential translational regulation of alternative protein forms. Together, the results imply coordinate evolution of AS and alternative transcription, processes that occur concomitantly within gene expression factories.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20889654 PMCID: PMC2975443 DOI: 10.1093/gbe/evq058
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FAnatomy of mammalian transcripts: functional domains, constitutive and alternative nucleotides, and AEs. TI, transcription initiation site; TS, translation initiation site; TT, transcription termination site; *, translation termination site. Protein-coding regions are filled. Frequent AEs and common combinations of AEs are shown in red. Rare AEs and avoided combinations of AEs are shown in blue.
Numbers of Human Gene Loci Producing Transcript Isoforms with Alternative Events in Different Functional Domains (UCSC Database)
| Transcripts with AEs in Functional Domains | ||||||
| Transcript Group | 5′ UTR | 5′ grey | CDS | 3′ grey | 3′ UTR | AT Gene Loci |
| Total | 7,662 | 5,796 | 7,951 | 4,529 | 4,755 | 11,589 |
| AS | 468 | 608 | 1,669 | 191 | 32 | 2,358 |
| (ATT + AS) | 10 | 18 | 156 | 136 | 190 | 226 |
| ATT | 130 | 166 | 1,778 | 1,680 | 1,833 | 1,833 |
| (ATI + AS) | 2,266 | 1,263 | 837 | 77 | 7 | 2,327 |
| ATI | 2,112 | 1,905 | 1,225 | 99 | 9 | 2,112 |
| (ATI + AS) + (ATT + AS) | 98 | 67 | 69 | 71 | 78 | 109 |
| (ATI + AS) + ATT | 1,208 | 705 | 1,000 | 1,081 | 1,254 | 1,254 |
| ATI + (ATT + AS) | 133 | 103 | 99 | 89 | 115 | 133 |
| ATI + ATT | 1,237 | 961 | 1,118 | 1,105 | 1,237 | 1,237 |
NOTE.—The table includes all possible combinations of AEs in transcripts: Total, all types of AEs; (ATI + AS), ATI and AS in 5′ UTRs; (ATT + AS), ATT and AS in 3′ UTRs; ATI + (ATT + AS), ATT and AS in 3′ UTRs accompanied by ATI; (ATI + AS) + ATT, ATI and AS in 5′ UTRs accompanied by ATT; (ATI + AS) + (ATT + AS), ATI and AS in 5′ UTRs accompanied by ATT and AS in 3′ UTRs; ATI + ATT, ATI accompanied by ATT. The table shows the number and the fraction (italic) of each type of AEs in each of the functional domains of human transcripts. The fraction of gene loci for each subset patterns of AEs is shown in “italic and bold.”
Distribution of Human Genes/Transcripts with Alternative Events in Different Functional Domains
| Number of Genes/Transcripts with AEs in | ||||
| Gene/Transcript Group | Number (and Fraction) of Genes/Transcripts | 5′ UTRs | CDSs | 3′ UTRs |
| RefSeq database | ||||
| Non-ALT genes | 13,945 (0.77) | |||
| ALT genes | 4,131 (0.23) | 2,096 (0.51) | 2,781 (0.67) | 1,097 (0.26) |
| Non-ALT transcripts | 13,945 (0.56) | |||
| ALT transcripts | 10,959 (0.44) | 5,903 (0.54) | 7,562 (0.69) | 3,068 (0.28) |
| UCSC database | ||||
| Non-ALT genes | 10,210 (0.47) | |||
| ALT genes | 11,567 (0.53) | 7,722 (0.67) | 7,953 (0.69) | 4,839 (0.42) |
| Non-ALT transcripts | 10,210 (0.22) | |||
| ALT transcripts | 35,265 (0.78) | 25,497 (0.72) | 25,348 (0.72) | 15,953 (0.45) |
NOTE.—The table shows numbers and fractions (in parentheses) of genes and transcripts with and without alternative nucleotides (ALT and non-ALT) in the RefSeq and UCSC databases and distribution of alternative nucleotides in different functional domains of mammalian transcripts.
FDistribution of number of alternative nucleotides (A), introns (B), and isoforms (C) per gene locus for sets of human genes with different types of AEs. AS, gene loci with AS alone, where all transcripts from the gene locus begin and end at the same positions; ATI + ATT, gene loci with both ATI and ATT.
F.Relationships between AEs within and between the functional domains of mammalian transcripts. The diameters of the circles are roughly proportional to the prevalence of the respective AEs in the given transcript domain; (+) denotes a significant positive correlation and (−) denotes a significant negative correlation; the strongest correlations are shown in red.