| Literature DB >> 35140743 |
Niall P Keegan1,2, Steve D Wilton1,2, Sue Fletcher1,2.
Abstract
Understanding pre-mRNA splicing is crucial to accurately diagnosing and treating genetic diseases. However, mutations that alter splicing can exert highly diverse effects. Of all the known types of splicing mutations, perhaps the rarest and most difficult to predict are those that activate pseudoexons, sometimes also called cryptic exons. Unlike other splicing mutations that either destroy or redirect existing splice events, pseudoexon mutations appear to create entirely new exons within introns. Since exon definition in vertebrates requires coordinated arrangements of numerous RNA motifs, one might expect that pseudoexons would only arise when rearrangements of intronic DNA create novel exons by chance. Surprisingly, although such mutations do occur, a far more common cause of pseudoexons is deep-intronic single nucleotide variants, raising the question of why these latent exon-like tracts near the mutation sites have not already been purged from the genome by the evolutionary advantage of more efficient splicing. Possible answers may lie in deep intronic splicing processes such as recursive splicing or poison exon splicing. Because these processes utilize intronic motifs that benignly engage with the spliceosome, the regions involved may be more susceptible to exonization than other intronic regions would be. We speculated that a comprehensive study of reported pseudoexons might detect alignments with known deep intronic splice sites and could also permit the characterisation of novel pseudoexon categories. In this report, we present and analyse a catalogue of over 400 published pseudoexon splice events. In addition to confirming prior observations of the most common pseudoexon mutation types, the size of this catalogue also enabled us to suggest new categories for some of the rarer types of pseudoexon mutation. By comparing our catalogue against published datasets of non-canonical splice events, we also found that 15.7% of pseudoexons exhibit some splicing activity at one or both of their splice sites in non-mutant cells. Importantly, this included seven examples of experimentally confirmed recursive splice sites, confirming for the first time a long-suspected link between these two splicing phenomena. These findings have the potential to improve the fidelity of genetic diagnostics and reveal new targets for splice-modulating therapies.Entities:
Keywords: cryptic splicing; genetic disease; poison exons; pseudoexons; recursive splicing; splicing mutations
Year: 2022 PMID: 35140743 PMCID: PMC8819188 DOI: 10.3389/fgene.2021.806946
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
FIGURE 1Positional frequencies of 255 pseudoexon splice-motif SNVs. Stacked columns are categorised by the identity of the reference nucleotides, which in 249 of the 255 cases were mutated to the most frequently observed nucleotide at that position of the motifs, as shown on the X-axis. Exceptions (6) are categorised as “Other.”
FIGURE 2Relative locations of 14 mutations that instigate pseudoexons via enhancement or creation of branch point motifs.
Pseudoexon mutations that enhance pseudoexon branch point motifs.
| Gene | Intron | # | Mutation | AGEZ | Max BPS | Pyr SNV? | References |
|---|---|---|---|---|---|---|---|
|
| 6 | 1 | c.769-784C>T | 17 | 0.22 |
|
|
|
| 2 | 1 | c.49-763G>T | 24 | −0.15 |
|
|
|
| 29 | 1 | c.2395+1275C>G; c.2395+1292G>T |
|
|
|
|
|
| 26 | 2 | c.3603+820G>T |
|
|
|
|
| 37 | 2 | c.5325+1740_5325+1757del | 17 |
| — |
| |
|
| 13 | 2a | c.2113+461_2113+473del | 74 -> 61 |
| — |
|
| 18 | 2b | c.5999-798G>A |
| 0.23 | No |
| |
|
| 6 | 1 | c.262+132_262+133del |
| 0.14 | — |
|
|
| 8 | 3 | c.889-941G>T |
| −0.73 |
|
|
|
| 5 | 1 | c.516+232G>A |
|
| No |
|
|
| 11 | 1 | c.1199+502A>T | 34 |
| Yes |
|
|
| 56 | 1 | c.8798-459C>A | 46 |
| No |
|
|
| 2 | 1b | c.163+696del55 |
|
| — |
|
|
| 12 | 1a | c.1468-263G>C |
| −0.56 |
|
|
“AGEZ,” “Max BPS,” and “Pyr SNV?” changes that are predicted to enhance branch point definition are in bold text. Cases where deletion mutations moved a branch point site closer to the AGEZ/acceptor site are noted in the “Max BPS” column as “(MC).”
Pseudoexons associated with seven confirmed intronic recursive splice sites.
| Gene | Intron | # | Start | End | Size | Pseudoexon mutation(s) | ME-A | ME-D | PE RNA source | PE references | RSS RNA source | RSS references |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 27 | 1b | 108287410 | 108287521 | 112 | c.3994-159A>G (A+32) | 7.71 -> 8.12 | 8.49 | LCLs |
| HBECs |
|
| 1a |
| 29 | c.3994-193C>T (A−3) |
| LCLs; peripheral blood |
| ||||||
|
| 6 | 1 | 108570649 |
| 147 | c.385-719G>A (A+46) | 5.25 |
| Hair bulb |
| HBECs |
|
|
| 9 | 1 |
| 148411238 | 159 | c.1093+532C>G (D+59) |
| 6.57 | Whole blood, lung tissue |
| HBECs |
|
|
| 3 | 1 | 101401233 |
| 115 | c.547+395G>C (D−5) |
| 7.82 | Whole blood |
| Cerebellum, K562 cells |
|
|
| 10 | 1 |
| 71636167 | 64 | c.1054G>A (e11 D-19) |
| 3.24 | Emetine-treated fibroblasts |
| HBECs |
|
|
| 3 | 1 |
| 132718117 | 163 | c.671-996C>G (D+50) | 6.50 |
| Leukocytes |
| HBECs |
|
|
| 4 | 1 |
| 129553301 | 66 | c.239-4023A>G (D+1) |
| 2.68 -> 10.86 | Skin fibroblasts |
| PA1 cells |
|
Genomic coordinates of the recursive splice sites and their maximum entropy scores are in bold text. “ME-A” and “ME-D” refer to the Maximum Entropy scores for the acceptor and donor splice sites, respectively.
FIGURE 3Shared features of two terminal pseudoexons (tPEs). (A) ARHGEF9 tPE (ARHGEF9-6-2) and internal PE (ARHGEF9-6-1) arising from within a translocated region of chromosome 18. (B) F8 tPE (F8-25-1) arising within intragenic sequence of a transposed 3.8 Mb tract of the X chromosome. (C) Relative locations, to scale, of the affected genes on the X chromosome q-arm, and genomic origin of transposition in patient B. Arrows indicate reading directions of each gene.