| Literature DB >> 19404386 |
Shiqin Song1, Qianli Huang, Jiaming Guo, Jesse Li-Ling, Xueping Chen, Fei Ma.
Abstract
Transcriptional isoforms are not just random combinations of exons. What has caused exons to be differentially spliced and whether exons with different splicing frequencies are subjected to divergent regulation by potential elements or splicing signals? Beyond the conventional classification for alternatively spliced exons (ASEs) and constitutively spliced exons (CSEs), we have classified exons from alternatively spliced human genes and their mouse orthologs (12,314 and 5,464, respectively) into four types based on their splicing frequencies. Analysis has indicated that different groups of exons presented divergent compositional and regulatory properties. Interestingly, with the decrease of splicing frequency, exons tend to have greater lengths, higher GC content, and contain more splicing elements and repetitive elements, which seem to imply that the splicing frequency is influenced by such factors. Comparison of non-alternatively spliced (NAS) mouse genes with alternatively spliced human orthologs also suggested that exons with lower splicing frequencies may be newly evolved ones which gained functions with splicing frequencies altered through the evolution. Our findings have revealed for the first time that certain factors may have critical influence on the splicing frequency, suggesting that exons with lower splicing frequencies may originate from old repetitive sequences, with splicing sites altered by mutation, gaining novel functions and become more frequently spliced.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19404386 PMCID: PMC2671145 DOI: 10.1371/journal.pone.0005387
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Average lengths of exons spliced with different frequencies in human and mouse.
| Number of Exon | Minimum Length | Maximum Length | Average Length (±SD) | |||||||||||||
| G1 | G2 | G3 | G4 | G1 | G2 | G3 | G4 | G1 | G2 | G3 | G4 | G1 | G2 | G3 | G4 | |
|
| 5477 | 1730 | 1935 | 3172 | 11 | 11 | 15 | 12 | 11710 | 11846 | 5073 | 4381 | 369.36 (685.5) | 249.61 (570.92) | 191.18 (341.08) | 154.34 (205.11) |
|
| 1971 | 826 | 1359 | 1308 | 11 | 12 | 11 | 12 | 15963 | 5527 | 5970 | 5398 | 428.27 (866.08) | 226.41 (438.71) | 218.52 (445.97) | 159.42 (246.01) |
Figure 1Distribution of exon lengths of four groups for alternatively spliced human genes (A) and mouse orthologs (B).
Figure 2Distribution of GC content of the four types of exons for alternatively spliced human genes (A) and mouse orthologs (B).
Figure 3Average numbers of ESE elements of different exonic groups for alternatively spliced human genes (A) and mouse homologs (B).
Statistical significance for differences between alternatively spliced human genes and mouse orthologs grouped by exonic and ESE types.
| Human | Mouse | |||
| Chi-Square | Pr>ChiSq | Chi-Square | Pr>ChiSq | |
|
| 149621.8 | <.0001 | 37258.42 | <.0001 |
|
| 191658.2 | <.0001 | 120152 | <.0001 |
|
| 5488.71 | <.0001 | 2952.42 | <.0001 |
Proportion of exons that contain repetitive elements in human and orthologous mouse genes.
| Number of exons with repetitive elements | Total number of exons | Proportion of exons with repetitive elements (%) | ||||||||||
| G1 | G2 | G3 | G4 | G1 | G2 | G3 | G4 | G1 | G2 | G3 | G4 | |
|
| 788 | 98 | 72 | 65 | 5477 | 1730 | 1935 | 3172 | 14.39 | 5.67 | 3.72 | 2.05 |
|
| 256 | 40 | 60 | 22 | 1971 | 826 | 1359 | 1308 | 12.99 | 4.84 | 4.42 | 2.12 |
Repetitive elements found in different groups of exons from human and mouse.
| Species | Exonic group | |||||||
| G1 | G2 | G3 | G4 | |||||
| Human | Mouse | Human | Mouse | Human | Mouse | Human | Mouse | |
|
| ||||||||
| DNA | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| DNA/AcHobo | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| DNA/hAT | 2 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| DNA/Mariner | 2 | 1 | 0 | 0 | 0 | 1 | 0 | 0 |
| DNA/MER1_type | 55 | 5 | 4 | 0 | 3 | 1 | 0 | 0 |
| DNA/MER2_type | 15 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| DNA/Tc2 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| DNA/TcMar | 2 | 0 | 0 | 1 | 2 | 0 | 0 | 0 |
| DNA/TcMar? | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| DNA/Tigger | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| DNA/Tip100 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| ||||||||
| LINE/CR1 | 6 | 2 | 1 | 0 | 0 | 0 | 1 | 0 |
| LINE/L1 | 67 | 13 | 5 | 5 | 2 | 3 | 0 | 0 |
| LINE/L2 | 50 | 5 | 8 | 1 | 4 | 0 | 4 | 0 |
| LINE/RTE | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
|
| ||||||||
| LTR/ERV1 | 17 | 1 | 1 | 0 | 2 | 1 | 1 | 0 |
| LTR/ERVK | 4 | 8 | 0 | 1 | 0 | 0 | 0 | 0 |
| LTR/ERVL | 12 | 1 | 1 | 0 | 2 | 0 | 0 | 0 |
| LTR/MaLR | 31 | 12 | 1 | 0 | 0 | 7 | 0 | 0 |
|
| ||||||||
| SINE | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| SINE/Alu | 241 | 48 | 18 | 2 | 14 | 12 | 7 | 2 |
| SINE/MIR | 105 | 9 | 11 | 1 | 3 | 1 | 1 | 0 |
| SINE/tRNA | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| SINE/B2 | 0 | 13 | 0 | 0 | 0 | 3 | 0 | 0 |
| SINE/B4 | 0 | 20 | 0 | 1 | 0 | 5 | 0 | 0 |
| SINE/ID | 0 | 2 | 0 | 0 | 0 | 5 | 0 | 0 |
|
| ||||||||
| scRNA | 0 | 3 | 0 | 0 | 0 | 2 | 0 | 0 |
| srpRNA | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| snRNA | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| tRNA | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| RNA | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Other | 2 | 4 | 0 | 0 | 0 | 0 | 0 | 0 |
| LCRs | 397 | 156 | 51 | 30 | 61 | 36 | 38 | 9 |
| Simple_repeat | 264 | 181 | 52 | 24 | 29 | 41 | 36 | 15 |
Exonic lengths for mouse NAS genes and alternatively spliced human orthologs.
| Number of Exons | Minimum length | Maximum length | Average length (±SD) | |
|
| 1633 | 13 | 6453 | 231.63 (430.14) |
|
| 2338 | 13 | 9453 | 275.05 (502.39) |
|
| 1197 | 13 | 9453 | 359.54 (627.9) |
|
| 353 | 16 | 5454 | 224.29 (415.9 ) |
|
| 342 | 23 | 3050 | 185.43 (287.4) |
|
| 446 | 23 | 1793 | 157.18 (156.27) |
GC contents for mouse NAS genes and alternatively spliced human orthologs.
| Species | Number of Exons | Minimum GC | Maximum GC | Average GC(±SD) |
|
| 1633 | 25 % | 80.8 % | 51.48 % (8.64 %) |
|
| 2338 | 16.7 % | 90 % | 53.05 % (11.9 %) |
|
| 1197 | 16.7 % | 90 % | 53.87 % (11.99 %) |
|
| 353 | 28 % | 80 % | 53.14 % (10.23 %) |
|
| 342 | 28.1 % | 80 % | 53.41 % (10.5 %) |
|
| 446 | 27.9 % | 82.4 % | 50.49 % (9.16 %) |
Average numbers of ESE elements per exons in NAS mouse genes and human orthologs.
| ESE Type | Exonic group | |||||
| Mouse NAS | Human AS | human G1 | human G2 | human G3 | human G4 | |
|
| 21.70 | 26.88 | 34.65 | 21.80 | 20.13 | 15.22 |
|
| 9.84 | 11.76 | 15.23 | 9.50 | 8.58 | 6.65 |
|
| 15.91 | 18.01 | 23.46 | 14.82 | 12.48 | 10.18 |
|
| 24.64 | 28.68 | 37.61 | 23.08 | 19.10 | 16.48 |
|
| 1.58 | 2.10 | 2.92 | 1.67 | 1.32 | 0.83 |
|
| 0.75 | 0.99 | 1.41 | 0.76 | 0.512 | 0.40 |
|
| 74.42 | 88.42 | 115.28 | 71.63 | 62.13 | 49.76 |
Total numbers of repetitive elements in NAS mouse genes and alternatively spliced human orthologs.
| Repeats Type | Mouse NAS | Human AS | Human G1 | Human G2 | Human G3 | Human G4 |
|
| 6 | 45 | 42 | 3 | 0 | 0 |
|
| 9 | 41 | 38 | 2 | 1 | 0 |
|
| 51 | 99 | 88 | 5 | 5 | 1 |
|
| 5 | 22 | 22 | 0 | 0 | 0 |
|
| 47 | 91 | 60 | 9 | 14 | 8 |
|
| 46 | 67 | 52 | 5 | 8 | 2 |
|
| 164 | 365 | 302 | 24 | 28 | 11 |