| Literature DB >> 35188075 |
Alexieva D1, Long Y1, Sarkar R1, Dhayan H1, Bruet E1, Winston Rm1, Vorechovsky I2, Castellano L3,4, Dibb N J1.
Abstract
Mutations of splice sites, auxiliary splicing elements and the splicing machinery cause a wide range of genetic disease. Here we report that many of the complex effects of splicing mutations can be predicted from background splicing information, with emphasis on BRCA1, BRCA2 and DMD. Background splicing arises from very low level splicing between rarely used background splice sites and from low-level exon skipping between intron splice sites. We show how this information can be downloaded from the Snaptron database of spliced RNA, which we then compared with databases of human splice site mutations. We report that inactivating mutations of intron splice sites typically caused the non-mutated partner splice site to splice to a known background splice site in over 90% of cases and to the strongest background splice site in the large majority of cases. Consequently, background splicing information can usefully predict the effects of splice site mutations, which include cryptic splice activation and single or multiple exon skipping. In addition, de novo splice sites and splice sites involved in pseudoexon formation, recursive splicing and aberrant splicing in cancer show a 90% match to background splice sites, so establishing that the enhancement of background splicing causes a wide range of splicing aberrations. We also discuss how background splicing information can identify cryptic splice sites that might be usefully targeted by antisense oligonucleotides (ASOs) and how it might indicate possible multiple exon skipping side effects of ASOs designed to induce single exon skipping.Entities:
Keywords: Cryptic splice site; background splicing; cancer; exon skipping; pseudoexons; recursive splicing
Mesh:
Substances:
Year: 2021 PMID: 35188075 PMCID: PMC8865296 DOI: 10.1080/15476286.2021.2024031
Source DB: PubMed Journal: RNA Biol ISSN: 1547-6286 Impact factor: 4.652
Figure 1.A. Aberrant splicing events that are commonly activated by mutations of the 5ʹ or 3’ss of introns. The brackets reflect that most activated css lie within 1000 bases of the ss mutation. B,C. Splice site mutations activate background ss (bss), particularly those with the most reads. B. The Snaptron data is arranged to show all splicing events involving the 3’ss 41,219,713 of wild type BRCA1 (see text). Blue shading shows normal splicing, yellow shading shows background exon skipping, red shading shows the mutated 5’ss partner 41,222,944 and grey shading shows the reads of the bss prior to their further activation by this mutation. C. All splicing events involving the 5’ss 41,209,068, blue and yellow shading as above, red and grey shading indicates the effect of mutation of the normal partner 3’ss 41,203,135.
Summary of matches between 5ʹ and 3ʹ css from DBASS, BRCA1, BRCA2 and DMD (columns 1,2) with background splice sites from Snaptron (columns 3 to 5). Data summarized from the Tables listed in source column 6
| No. of css | No that match | Top match | Poor match | Source | |
| | analysed | snaptron bss | | | |
| DBASS5 | 237 | 201 (85%) | 150 (75%) | 9 | Table S2 |
| DBASS5w | 14 | 11 | 10 | 2 | Table S2 |
| BRCA1 | 10 | 9 | 8 | 1 | |
| BRCA2 | 5 | 4 | 2 | 1 | Table S2 |
| DMD | 13 | 4 | 1 | 5 | Table S2 |
| DBASS3 | 110 | 97 (88%) | 62 (64%) | 2 | Table S2 |
| DBASS3w | 39 | 38 | 31 | 0 | Table S2 |
| BRCA1 | 7 | 6 | 4 | 1 | |
| BRCA2 | 13 | 10 | 6 | 2 | Table S2 |
| DMD | 9 | 6 | 2 | 1 | Table S2 |
Css activation versus exon skipping. The experimental results listed in column A are summarized from the cryptic splice site database DBASS (Table S2) and an exon skip database (Table S3) and they show the numbers of reports of css activation only, exon skipping only or both in response to 5ʹ or 3’ss mutations. Columns B and C are from Snaptron and show how the samples divide with respect to the relative number of reads for single exon skipping versus the number of reads for the bss that matches the css. For examples that do not report a css or more rarely report a css that does not match a bss we used the read numbers of the top bss (bss with the most reads within 1000 bp of the mutated ss). Columns D and E show the total css and single exon skip read count (Tables S2, S3). Shaded examples are discussed (Tables S2, S3, see text)
| A | B | C | D | E | |
|---|---|---|---|---|---|
| Experimental results | Snaptron data | ||||
| skip>css | css>skip | Total css | Total skip | ||
| reads | reads | reads | reads | ||
| 1 | 70 css only | 11 | 59 | 105,757 | 6884 |
| 2 | 36 css + skip | 24 | 12 | 10,112 | 143,955 |
| 3 | 18 css only | 2 | 16 | 26,786 | 2659 |
| 4 | 22 css + skip | 11 | 11 | 15,839 | 31,527 |
| 5 | 79 skip only | 71 | 8 | 5978 | 217,587 |
| 6 | 3 skip + css | 1 | 2 | 9395 | 2852 |
| 7 | 64 skip only | 54 | 10 | 17,346 | 349,439 |
| 8 | 4 skip + css | 2 | 2 | 1939 | 3185 |
Comparison of the experimental effect of splice site mutations of BRCA1 with snaptron splicing data. Column A lists the mutated intron 5ʹ or 3ʹ splice sites (chr17hg19). Column B shows whether the mutation caused css activation, exon skipping or both, the position of the css relative to the mutated intron ss is also indicated. Columns C-G compare RNA sequencing data from Snaptron. Column C indicates whether the experimentally identified css (from column B) exactly matches a background splice site in Snaptron. Column D shows the css rank, for example 1(4) for row 3 of this column means that Snaptron identified four background splice sites within 1000 bp (upstream or downstream) of the 5’ss 41,258,472 and that the site that matched the experimentally identified css had the most reads. Column E lists the reads for the highest scoring bss within 1000 bases of the mutated splice site. For rows 5, 24, 31 and 35 the reads for the bss that matches the css is given in brackets. Columns F and G list background reads for single and double exon skipping. The shaded boxes indicate the RNA splicing reads that are not a good fit to the experimental data, these are discussed in Table S1, which also lists references
| A | B | C | D | E | F | G | |
|---|---|---|---|---|---|---|---|
| Mutated splice site | Experimental | css | snaptron | snaptron | snaptron | snaptron | |
| 5ʹ ss | summary | match in | css rank | bss reads | single skip | double skip | |
| | | | snaptron | | | reads | reads |
| 1 | 41,276,033 (exon 2) | skip | 11 | 898 | 0 | ||
| 2 | 41,267,742 (exon 3) | skip | 24 | 2622 | 104 | ||
| 3 | 41,258,472 (exon 5) | Css(−22) & skip | Yes | 1(4) | 7375 | 3629 | 178 |
| 4 | 41,256,884 (exon 6) | Css (−9) | Yes | 1(5) | 327 | 5 | 6 |
| 5 | 41,256,138 (exon 7) | Css (−62) | No | 0(1) | (0)1 | 3 | 2 |
| 6 | 41,251,791 (exon 8) | skip | 0 | 83 | 0 | ||
| 7 | 41,249,260 (exon 9) | skip | 31 | 2193 | 3 | ||
| 8 | 41,243,451 (exon 11) | skip (and alt ss enhancement) | 2 | 346 | 44 | ||
| 9 | 41,242,960 (exon 12) | skip | 115 | 1 | 18 | ||
| 10 | 41,234,420 (exon 13) | skip | 0 | 164 | 0 | ||
| 11 | 41,228,504 (exon 14) | single and double skip (weak) | 1 | 971 | 58 | ||
| 12 | 41,226,347 (exon 15) | Css (−11), single and double skip | Yes | 1(1) | 494 | 476 | 356 |
| 13 | 41,222,944 (exon 16) | Css (65,69) | Yes, yes | 1(5), 2(5) | 243, 78 | 0 | 5 |
| 14 | 41,219,624 (exon 17) | Css (153, weak) & skip | Yes | 1(3) | 36 | 1030 | 1 |
| 15 | 41,215,890 (exon 18) | skip | 179 | 140 | 17 | ||
| 16 | 41,215,349 (exon 19) | skip | 23 | 1 | 56 | ||
| 17 | 41,209,068 (exon 20) | Css (87) & skip | Yes | 1(1) | 19 | 52 | 0 |
| 18 | 41,203,079 (exon 21) | skip | 4 | 1298 | 0 | ||
| 19 | 41,201,137 (exon 22) | Css (156,weak) & skip | Yes | 1(1) | 6 | 2732 | 26 |
| 20 | 41,199,659 (exon 23) | Css (5, weak) & skip | Yes | 1(2) | 247 | 300 | 132 |
| 3’ss | |||||||
| 21 | 41,267,797 (exon 3) | Css (7) | Yes | 1(3) | 5 | 2622 | 178 |
| 22 | 41,258,551 (exon 5) | skip | 18 | 3629 | 6 | ||
| 23 | 41,256,974 (exon 6) | Css (−59) | Yes | 1(4) | 15 | 5 | 2 |
| 24 | 41,256,279 (exon 7) | Css (−10) | No | 0(1) | (0)81 | 3 | 0 |
| 25 | 41,251,898 (exon 8) | Css (−69) | Yes | 1(1) | 4 | 83 | 3 |
| 26 | 41,247,940 (exon 10) | skip | 339 | 840 | 44 | ||
| 27 | 41,246,878 (exon 11) | skip & alt skip | 3 | 206,364 | 18 | ||
| 28 | 41,219,713 (exon 17) | skip | 12 | 1030 | 17 | ||
| 29 | 41,215,969 (exon 18) | skip | 11 | 140 | 56 | ||
| 30 | 41,215,391 (exon 19) | skip | 1 | 1 | 0 | ||
| 31 | 41,209,153 (exon 20) | Css (13,weak) & strong skip | Yes | 2(2) | (1)3 | 52 | 0 |
| 32 | 41,203,135 (exon 21) | Css (8,weak) & skip | Yes | 1(5) | 206 | 1298 | 26 |
| 33 | 41,201,212 (exon 22) | skip | 1147 | 2732 | 132 | ||
| 34 | 41,199,721 (exon 23) | skip | 26 | 300 | 0 | ||
| 35 | 41,197,820 (exon 24) | Css (11) | Yes | 2(4) | (5)26 | 0 | 0 |
Multi-exon skipping events. Experimental reports of mutations that cause multi-exon skipping compared to background splicing predictions. Genes are listed in column B and the experimental results are listed in column C and also column F. Snaptron data is compared in columns D, E and G to I. For shading see text
| A | B | C | D | E | F | G | H | I |
|---|---|---|---|---|---|---|---|---|
| Splice site mutations that cause multiple exon skipping | Snaptron data | |||||||
| Gene | Experimental effect | Single skipreads | Double skipreads | css | css | css | Source | |
| 1 | LAMP2A | Single and double exon skip (similar ratio). | 64 | 80 | no | Appendix 1 | ||
| 2 | LAMP2B | weak css and strong single exon skip, no double skip | 8 | 0 | yes | × | 0 | Appendix 1 |
| 3 | LAMP2C | weak single exon skip and strong double exon skip | 1 | 25 | no | Appendix 1 | ||
| 4 | p67-PHOX | css, single and double skip, relative ratios not given | 26 | 1 | yes | √ | 3 | Table S2 5ʹcss |
| 5 | PKLR | css, single (major event) and double skips | 0 | 0 | yes | × | 0 | Table S2 5ʹcss |
| 6 | ATP7A | css, single (major event) and double skips | 340 | 84 | yes | √ | 24 | Table S2 5ʹcss |
| 7 | COL5A1 | css (x2, weakest), exon skip, double exon skip (major) | 1 | 60 | yes | √√ | 10,6 | Table S2 3ʹcss |
| 8 | HPRT1 | css (20%), exon skip (60%), double skip (20%) | 26 | 914 | yes | √ | 410 | Table S2 3ʹcss |
| 9 | ALDH3A2 | Single and double exon skip (strongest) | 748 | 4364 | no | TableS3 5’skip | ||
| 10 | ATM | Single skip (90%) and double skip (10%) | 734 | 56 | no | TableS3 5’skip | ||
| 11 | CAPN3 | Double exon skip reported (single skip unclear) | 8 | 2 | no | TableS3 5’skip | ||
| 12 | ECHA | Single (major) and double exon skip (minor) | 22 | 285 | no | TableS3 5’skip | ||
| 13 | NTRK1 | Single (stronger) and double exon skip. | 9 | 42 | no | TableS3 5’skip | ||
| 14 | SEDL | single and double exon skip (ratio not clear). | 194 | 135 | no | TableS3 5’skip | ||
| 15 | WT1 | Single and double exon skip (similar amounts). | 0 | 6 | no | TableS3 5’skip | ||
| 16 | ALDH3A2 | Single and double exon skip. Ratio not given | 6111 | 1536 | no | TableS3 3’skip | ||
| 17 | ATM | Single and double exon skip. Ratio not given. | 1052 | 683 | no | TableS3 3’skip | ||
| 18 | BTK | Triple exon skip and css only | 1 | 11, 27 (triple) | yes | √ | 66 | TableS3 3’skip |
| 19 | KCNQ1 | Double exon skip only. | 0 | 295 | no | TableS3 3’skip | ||
| 20 | BRCA1 | Single and double skip (weak) | 971 | 58 | no | |||
| 21 | BRCA1 | css, single (major events) and double skip (minor) | 476 | 356 | yes | √ | 494 | |
| 22 | BRCA2 | Single and double skip (major effect for 1 of 2 reports) | 4 | 62 | no | Table S4 | ||
| 23 | BRCA2 | Single and quadruple skip | 56 | 22, 144, 194(quad) | no | Table S4 | ||
| 24 | BRCA2 | Single and double skip (major effect) | 14 | 64 | no | Table S4 | ||
| 25 | BRCA2 | Single and double skip (minor) | 37 | 8 | no | Table S4 | ||
| 26 | DMD | double skip only | 7 | 83 | no | Table S5 | ||
| 27 | DMD | skip and double skip (ratio not given) | 4 | 67 | no | Table S5 | ||
| 28 | DMD | skip and double skip (ratio not given) | 19 | 0 | no | Table S5 | ||
| 29 | DMD | css, skip (strongest) and double skip (weakest) | 92 | 1 | yes | √ | 12 | Table S5 |
| 30 | DMD | skip, double skip, triple skip (ratio not clear) | 0 | 21, 6 (triple) | no | Table S5 | ||
| 31 | DMD | css, skip, double skip (ratio not clear) | 13 | 1 | yes | √ | 1 | Table S5 |
| 32 | SLC35A1 | css (major) single skip, double skip (weakest) | 2919 | 650 | yes | √ | 26,028 | Table S6 5ʹcss |
| 33 | FGA | multiple css reported but no single or double exon skipping | 1 | 11 | yes | × | 1,0,3 | Table S3 5ʹcss |
| 34 | COL5A1 | Single and double exon skips not reported | 10 | 27 | yes | × | 0 | Table S3 5ʹcss |
| 35 | STK11 | only css reported | 0 | 2058 | yes | √ | 79,56 | Table S3 3ʹcss |
| 36 | COL7A1 | only a css reported | 895 | 494 | yes | × | 151 | Table S3 3ʹcss |
| 37 | FBN1 | Single exon skip only | 9 | 55 | no | Table S3 3’ss | ||
| 38 | BRCA1 | css at −62 reported but not single or double exon skipping | 3 | 2 | yes | √ | 0 | |
| 39 | BRCA2 | css and a single exon skip reported but not a double skip | 15 | 390 | yes | √ | 0,23 | Table S4 |
| 40 | DMD | single exon skip reported | 0 | 8 | no | Table S5 | ||
| 41 | DMD | single exon skip reported but not a triple skip | 3 | 0, 10(triple) | no | Table S5 | ||
| 42 | DMD | css and single exon skip but not a double skip reported | 11 | 21 | yes | √ | 1 | Table S5 |
Figure 2.A-C. The three most common ways of generating a pseudoexon [9]. A) A 3ʹ de novo mutation activates a downstream 5ʹ pseudo splice site. B) A 5ʹ de novo mutation activates an upstream 3ʹ pseudo splice C) Mutations other than de novo splice site mutations can enhance pseudoexon usage, of these the most common mutations occur within the pseudoexon. D) Match between background splices sites (bss) with de novo splice sites, pseudoexon ss (pss), recursive ss (RS) and aberrant ss in cancer (Tables S4, S5). Columns 1: 47/50 match between bss and ‘enhanced’ de novo ss. 2, 3: 63/71 and 14/22 match between bss and the 3ʹ or 5ʹ pss of pseudoexons type I (Figure 2(A,B)). 4: 71/77 bss that match the pseudo ss of type I pseudoexons are nearest to the causative de novo mutation. 5: 50/52 match of bss to the 3ʹ or 5ʹ ss of pseudoexons type II (Figure 2(C)). 6: 48/50 type II pss match intron bss with top 3 reads. 7: 20/20 match between bss and 3ʹ recursive splice sites identified in a genome screen [42] 8: 124/148 match between bss and 3ʹRS and 5ʹRS of DMD43 9:72/72 match between bss and aberrant ss activated by mutations of the spliceosome.