| Literature DB >> 19664208 |
Abstract
BACKGROUND: Eukaryotic pre-mRNA gene transcripts are processed by the spliceosome to remove portions of the transcript, called spliceosomal introns. The spliceosome recognizes intron boundaries by the presence of sequence signals (motifs) contained in the actual transcript, thus sequence changes in the genome that affect existing splicing signals or create new signals may lead to changes in transcript splicing patterns. Such changes may lead to previously excluded (intronic) transcript regions being included (exonic) or vice versa. Such changes can affect the encoded protein sequence and/or post-transcriptional regulation, and are thus a potentially important source of genomic and phenotypic novelty. Two recent papers suggest that such changes may be a major force in remodeling of eukaryotic gene structures, however the rate of occurrence of such changes has not been assessed at the genomic level.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19664208 PMCID: PMC2740785 DOI: 10.1186/1471-2148-9-192
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Species studied, with estimated branch length, in average changes per synonymous site (dS), from reference [28].
A summary of the 40 observed exceptions
| Conserved alternative boundary | |||||||||
| CNA00590 | 1 | 5' | -6 (GTGAGT) | +/+ | +/+ | -/+ | -/+ | Yes | cDNA supporting alternative boundary |
| CNA03150 | 3 | 3' | -9 (CAG) | +/+ | -/+ | -/+ | -/+ | Yes | |
| CNC00370 | 9 | 5' | +3 (GTGAGT) | +/+ | -/+ | -/+ | -/+ | Yes | cDNA supporting intron ret. |
| CNC01260 | 8 | 3' | +6 (TAG) | +/+ | +/+ | -/+ | -/+ | Yes | cDNA supporting alt boundary, intron ret. |
| CND02500 | 1 | 3' | +6 (TAG) | +/+ | +/+ | -/+ | -/+ | Yes | cDNA supporting out-of-frame boundary |
| CND03170 | 8 | 5' | +6 (GTAAGT) | +/+ | +/+ | -/+ | +/+ | Yes | |
| CND03550 | 1 | 3' | +6 (CAG) | +/+ | -/+ | +/+ | +/+ | Yes | Fig 2A |
| CND03880 | 7 | 3' | -3 (CAG) | +/+ | +/+ | -/+ | -/+ | Yes | |
| CND04350 | 1 | 3' | +9 (C/TAG) | +/+ | -/+ | -/+ | -/+ | Yes | |
| CNE00380 | 3 | 3' | +3 (C/AAG) | +/+ | -/+ | -/+ | -/+ | No | cDNA supporting alternative boundary |
| CNE01370 | 3 | 3' | +9 (CAG) | +/+ | +/+ | +/+ | -/+ | Yes | |
| CNF02810 | 9 | 3' | +9 (C/TAG) | +/+ | +/+ | -/+ | -/+ | Yes | |
| CNH03260 | 8 | 3' | +6 (TAG) | +/+ | -/+ | -/+ | -/+ | Yes | |
| CNK01580 | 2 | 3' | +6 (C/TAG) | +/+ | +/+ | -/+ | -/+ | No | cDNAs supporting alternative boundary |
| Non-conserved alternative boundaries | |||||||||
| CNA01650 | 1 | 3' | -3 (TAG) | +/- | -/+ | -/+ | -/+ | Yes | |
| CNB04950 | 6 | 5' | +3 (GTTCAC) | +/- | +/- | -/+ | -/+ | No | |
| CNC04760 | 1 | 3' | +3 (TAG) | +/- | +/- | +/+ | -/+ | Yes | Fig 2B |
| CND00770 | 1 | 3' | -3 (CAG) | +/- | +/- | +/- | -/+ | Yes | |
| CNH03490 | 7 | 3' | -12 (CAG) | +/- | +/- | -/+ | -/+ | Yes | |
| CNI02860 | 6 | 5' | +3 (GTTCAT) | +/- | +/- | +/- | -/+ | Yes | |
| CNK00690 | 2 | 3' | +6 (CAG) | +/+ | -/+ | +/- | +/+ | Yes | |
| CNL04670 | 4 | 3' | +9 (C/TAG) | +/- | +/+ | -/+ | -/+ | Yes | |
| CNL06040 | 1 | 3' | -3 (GAG) | +/- | +/- | +/+ | -/+ | Yes | |
| CNM02250 | 2 | 3' | -9 (AAG) | +/- | +/- | -/+ | -/+ | No | |
| Intronization/de-intronization | |||||||||
| CND01950 | 3 | 3' | No | +/ | -/ | -/ | -/ | No | Intronization Fig 3B (1/0 cDNAs spliced/unspliced) |
| CNF02770 | 6 | Both | No | +/ | -/ | -/ | -/ | No | Intronization Fig 3E (1/7 cDNAs spliced/unspliced) |
| CNG00990 | 3 | Both | No | +/ | -/ | -/ | -/ | No | Intronization Fig 3A (1/4 cDNAs spliced/unspliced) |
| CNL03830 | 4 | 3' | No | +/ | -/ | -/ | -/ | No | Intronization Fig 3D (1/7 cDNAs spliced/unspliced) |
| CNI02050 | 4 | Both | No | +/ | -/ | -/ | -/ | No | Intronization Fig 3C (2/4 cDNAs spliced/unspliced) |
| CNL04380 | 1 | 3' | No | +/ | +/ | -/ | -/ | Yes | Direction unclear Fig 4A (1/0 cDNAs spliced/unspliced) |
| CND00470 | 1 | 5' | No | +/ | -/ | -/ | +/ | No | Convergent Fig 4B (2/0 cDNAs spliced/unspliced) |
| More complex cases | |||||||||
| CNB05120 | 2 | 5' | No | +/ | +/ | -/ | +/ | Yes | Unclear (53 bp) |
| CNF04220 | 1 | Both | No | +/ | +/ | -/ | -/ | Yes | Unclear (58 bp) |
| CNI04240 | 1 | 3' | No | +/ | -/ | +/ | +/ | Yes | Unclear (64 bp) |
| CNJ02980 | 2 | 5' | No | +/ | +/ | -/ | -/ | Yes | Unclear (52/53 bp) |
| CNK02560 | 2 | 3' | No | +/ | -/ | -/ | -/ | Yes | Unclear (136/137/143 bp) |
| CNK02690 | 2 | 3' | No | +/ | +/ | -/ | -/ | Yes | Unclear (57/58 bp) |
| CNL06330 | 4 | 3' | No | +/ | -/ | +/ | +/ | Yes | Unclear (62 bp) |
| CNM00170 | 5 | Both | No | +/ | -/ | -/ | -/ | Yes | Unclear (Fig 4C), unspliced cDNA (61/64 bp) |
| CNN00940 | 2 | 3' | No | +/ | -/ | +/ | +/ | Yes | Unclear (84 bp, inframe stop) |
Figure 2Most cases of non-conserved boundaries have nearby in frame candidate alternative boundaries. Lower/upper case sequence indicates intronic/exonic sequence as defined for JEC21. A. Intron 1 of gene CND03550. The JEC21 TAG 3' intron boundary is mutated to TGG in H99 (single underline). H99 may use the downstream CAG instead (double underline). B. Intron 1 of gene CNC04760. The JEC21 CAG 3' intron boundary is mutated to CAA in WM276 (single underline). WM276 may use the downstream TAG instead (double underline). In all cases, intron numbers are given in reference to the coding sequence.
Figure 3Apparent cases of intronization of exonic sequence. In each case, nucleotides conforming to extended consensus sequences (GTRAGT and YAG) are underlined, and branch-point like sites are double underlined. A. Intron 3 of CNG00990; B. Intron 3 of CND01950; C. Intron 5 of CNI02050; D. Intron 4 of CNL03830; E. Intron 6 of CNF02770.
Figure 4Complex cases of intron boundary evolution. A. Intron 5 of CNM00170. Both boundaries of the intron in JEC21 are non-conserved, however the presence of gaps inducing frameshifts suggests that the region is intronic across all species. B. Intron 1 of CND00470. Only WM276 shares the JEC21 5' intron boundary, suggesting either convergent intronization in JEC21 and WM276 or convergent loss of splicing in the other two species. C. Intron 5 of CNM00170. The 5' intron boundary is not conserved, and there is no nearby alternative candidate boundary, however the presence of coding frame-interrupting indels suggests that the sequence is ancestrally intronic.