| Literature DB >> 32176650 |
Abstract
The DMD gene is the largest in the human genome, with a total intron content exceeding 2.2Mb. In the decades since DMD was discovered there have been numerous reported cases of pseudoexons (PEs) arising in the mature DMD transcripts of some individuals, either as the result of mutations or as low-frequency errors of the spliceosome. In this review, I collate from the literature 58 examples of DMD PEs and examine the diversity and commonalities of their features. In particular, I note the high frequency of PEs that arise from deep intronic SNVs and discuss a possible link between PEs induced by distal mutations and the regulation of recursive splicing.Entities:
Keywords: Cryptic Splice Sites; DNA Mutational Analysis; Duchenne; Muscular Dystrophy; RNA Splicing
Year: 2020 PMID: 32176650 PMCID: PMC7175933 DOI: 10.3233/JND-190431
Source DB: PubMed Journal: J Neuromuscul Dis
Details of 58 known pseudoexons (PEs) arising from mutations within the human DMD gene. Bold, upper-case sequence denotes the PE itself, while regular lower-case sequence indicates flanking intron. Inverted sequence is italicized. Duplications and insertions are underlined. Single base transitions are shown in the form of “X > Y,” where X is the reference nucleotide and Y is the mutant. Shapiro-Senapathy splice scores are also shown, before and after mutation where appropriate (see supplementary materials for splice score calculator). “RNA Source” lists the cell or tissue types where the PE was detected. Genomic co-ordinates pertain to human genome assembly accession GCA_000001405.27, chromosome X, except where otherwise stated. Transcript co-ordinates pertain to DMD reference sequence NG_012232.1(DMD_v001)
| PE # | Intron | Sequence | Genomic Co-ordinates (chrX.) | Size (bp) | Acceptor Score | Donor Score | Mutation(s) | Orientation of Mutation to PE and Splice Sites | ORF | RNA Source | Reference |
| 1 | 33174185-33174333 | 149 | 71.37 ->87.94 | 72.72 | c.31+36947G>A | Proximal | Frameshift | Skeletal muscle | [ | ||
| 1 | 33088244-33088348 | 105 | 62.92 | 81.59 | c.exon3_6del | Distal (3′) | Premature stop codon | Heart, skeletal muscle | [ | ||
| 1 | 33085901-33085927 | 27 | 58.89 | 78.81 | c.exon3_6del | Distal (3′) | Premature stop codon | Heart, skeletal muscle | [ | ||
| 1 | 33078186-33078347 | 162 | 96.38 | 88.39 | Multiple mutations, also observedin normalcells (see main text) | Distal (3′) | Premature stop codon | Lymphocytes | [ | ||
| 2 | 33014502-33014547 | 46 | 68.17 ->84.74 | 82.07 | c.93+5590T>A | Proximal | Frameshift | Lymphocytes | [ | ||
| 2 | 33014416-33014547 | 132 | 68.17 ->84.74 | 77.7 | c.93+5590T>A | Proximal | ORF preserved | Lymphocytes, skeletal muscle | [ | ||
| 2 | 32960208-32960347 | 140 | 100 | 77.84 | Multiple deletions and duplications [ | Distal (3′) | Frameshift | Skeletal muscle | [ | ||
| 2 | 32878883-32878980 | 98 | 72.84 | 76.65 | c.exon2dup; also observed in normal cells | Distal (3′) | Frameshift | Multiple tissue types | [ | ||
| 2 | 32863759-32863915 | 157 | 70.85 | 73.68 | c.exon8_11dup | Distal (3′) | Frameshift | Lymphocytes | [ | ||
| 2 | 32863759-32863911 | 154 | 79.35 | 73.68 | c.exon8_11dup | Distal (3′) | Frameshift | Lymphocytes | [ | ||
| 3 | 32846622-32846978 | 357 | 81.06 | 79.88 | c.exon8_11dup; also observed in normal cells | Distal (3′) | Premature stop codon | Lymphocytes | [ | ||
| 3 | 32846622-32846683 | 62 | 72.62 | 79.88 | c.280Adel [ | Distal (3′) | Frameshift | Lymphocytes | [ | ||
| 4 | 32823851-32823982 | 132 | 90.56 | 74.59 ->91.82 | c.265-463A>G | Proximal | Premature stop codon | Skeletal muscle | [ | ||
| 7 | 32738792-32738868 | 77 | 79.91 ->78.04 | 56.78 ->74.00 | c.650-39575A>C; c.650-39498A>G | Proximal | Frameshift | Skeletal muscle | [ | ||
| 9 | 32650985-32651074 | 90 | 83.55 ->93.28 | 70.04 | c.961-5925A>C | Proximal | Premature stop codon | Skeletal muscle | [ | ||
| 9 | 32650985-32651074 | 90 | 83.55 | 70.28 ->87.27 | c.961-5831C>T | Proximal | Premature stop codon | Skeletal muscle | [ | ||
| 11 | 32641751-32641799; 32630010-32630119 | 159 | 73.92 | 79.54 | c.1331+2382_1331+14010del | Proximal | Premature stop codon | Multiple tissue types | [ | ||
| 11 | 32638732-32638888 | 157 | 79.76 | 87.83 | c.1336_1337del | Distal (3′) | Frameshift | Lymphocytes | [ | ||
| 11 | 32626363-32626441 | 79 | 90.08 | 56.56 ->73.78 | c.1332-11909C>G | Proximal | Frameshift | Skeletal muscle | [ | ||
| 18 | 32514091-32514222 | 132 | 68.21 | 67.88 | c.2622+1G>A | Distal (3′) | Premature stop codon | Lymphocytes | [ | ||
| 21 | 32480880-32480945 | 66 | 80.03 | 78.22 | None | N/A | ORF preserved | Skeletal muscle | [ | ||
| 25 | 32461308-32461402 | 95 | 79.15 ->95.72 | 70.75 | c.3432+2036A>G | Proximal | Frameshift | Skeletal muscle | [ | ||
| 25 | 32461200-32461401 | 202 | 76.71 | 70.14 ->87.36 | c.3432+2240A>G | Proximal | Frameshift | Skeletal muscle | [ | ||
| 25 | 32461200-32461371 | 172 | 82.72 | 70.14 ->87.36 | c.3432+2240A>G | Proximal | Frameshift | Skeletal muscle | [ | ||
| 26 | 32452550-32452629 | 80 | 86.2 | 77.21 | c.3603+2053G>C | Proximal | Frameshift | Skeletal muscle | [ | ||
| 27 | 32442160-32442278 | 119 | 87.46 | 80.58 ->90.39 | c.3787-843C>A | Proximal | Frameshift | Lymphocytes or skeletal muscle (unspecified) | [ | ||
| 29 | 32412019-32412063 | 45 | 90.17 | 83.09 | c.3613delG | Distal (5′) | Premature stop codon | Lymphocytes | [ | ||
| 34/42 | 32372085-32372098; 32293942-32294069 | 167 | 83.07 | 88.06 | c.4846-6885_6118-6369delinsATACAATA;c.4846-6900_4846-6899ins17 | Proximal | Frameshift | Unspecified | [ | ||
| 37 | 32360933-32361009 | 77 | 77.72 | 74.58 | c.5325+1740_5325+1757del | Distal (5′) | Frameshift | MyoD transformed fibroblasts | [ | ||
| 37 | 32348692-32348742 | 51 | 71.85 ->88.42 | 77.01 | c.5326-215T>G | Proximal | Premature stop codon | Skeletal muscle | [ | ||
| 43 | 32256577-32256704 | 128 | 86.94 | 65.72 ->82.94 | c.6290+30954C>T | Proximal | Frameshift | Skeletal muscle | [ | ||
| 43 | chr4.182041664-182041743 | 80 | 90.26 | 99.52 | Complex insertion/deletion (see cited work) | Encompassing | Frameshift | Skeletal muscle | [ | ||
| 43 | 32235147-32235204 | 58 | 76.18 | 86.71 | c.6291-21015_6438+98743dupinsA; c.6291-21008_6291-21007insCTCCCCTGAACATGG | Distal (5′ and 3′) | Frameshift | Unspecified | [ | ||
| 44 | 32074802-32075188 | 387 | 75.04 | 90.97 | Complex inversion (see cited work) | Encompassing | Premature stop codon | Skeletal muscle | [ | ||
| 44 | 32024402-32024483 | 82 | 85.52 | 62.43 ->78.86 | c.6439-55921_6912+26400del | Proximal | Frameshift | MyoD transformed fibroblasts | [ | ||
| 45 | 31965031-31965167 | 137 | 73.52 | 71.16 ->88.39 | c.6614+3310G>T | Proximal | Frameshift | Skeletal muscle | [ | ||
| 47 | 31879338-31879409 | 72 | 61.61 ->78.18 | 74.53 | c.6913-4037T>G | Proximal | Premature stop codon | Skeletal muscle | [ | ||
| 48 | 31850637-31850744 | 108 | 81.97 | 70.41 | Complex inversion (see cited work) | Encompassing | Premature stop codon | Cultured myogenic cells | [ | ||
| 48 | 31845152-31845276 | 125 | 75.76 | 78.66 | Complex inversion (see cited work) | Encompassing | Frameshift | Cultured myogenic cells | [ | ||
| 49 | 31833539-31833718 | 180 | 93.4 | 87.02 | Complex inversion (see cited work) | Encompassing | Premature stop codon | Cultured myogenic cells | [ | ||
| 49 | 31832500-31832659 | 160 | 87.27 | 75.66 | Complex inversion (see cited work) | Encompassing | Frameshift | Cultured myogenic cells | [ | ||
| 49 | 31831990-31832138 | 149 | 88.15 | 88.06 | Complex inversion (see cited work) | Encompassing | Frameshift | Cultured myogenic cells | [ | ||
| 49 | 31831041-31831138 | 98 | 73.62 | 87.58 | Complex inversion (see cited work) | Encompassing | Frameshift | Cultured myogenic cells | [ | ||
| 51 | 31752441-31752524 | 84 | 96.74 | 76.2 | None | N/A | ORF preserved | Skeletal muscle | [ | ||
| 51 | 31765009-31765013 | 103 | 79.68 | 68.7 | c.7542+8951_7542+8952ins6091 | Proximal | Frameshift | Skeletal muscle | [ | ||
| 53 | 31678558-31678630 | 73 | 64.29 | 91.82 | Complex inversion (see cited work) | Encompassing | Frameshift | Skeletal muscle | [ | ||
| 55 | 31609571-31609620 | 50 | 79.61 ->96.18 | 87.36 | c.8217+18052A>G | Proximal | Frameshift | Skeletal muscle | [ | ||
| 55 | 31595572-31595644 | 73 | 90.26 | 78.52 ->95.74 | c.8217+32103G>T | Proximal | Frameshift | Skeletal muscle | [ | ||
| 56 | 31497079-31497244 | 166 | 94.32 | 87.36 | c.8391-1419_8391-828del; c.8391-102_8391-76del | Distal (5′ and 3′) | Frameshift | Skeletal muscle | [ | ||
| 60 | 31364155-31364243 | 89 | 88.65 | 76.64 ->93.87 | c.9085-15519G>T | Proximal | Frameshift | Skeletal muscle | [ | ||
| 62 | 31261663-31261729 | 67 | 96.61 | 75.91 ->88.06 | c.9225-647A>G | Proximal | Frameshift | Skeletal muscle | [ | ||
| 62 | 31261306-31261363 | 58 | 81.83 | 77.76 ->89.70 | c.9225-285A>G | Proximal | Frameshift | Skeletal muscle [ | [ | ||
| 63 | 31247694-31247768 | 75 | 76.44 | 74.39 | Unknown | Unknown | Premature stop codon | Lymphocytes | [ | ||
| 65 | 31208284-31208430 | 147 | 96.18 | 77.17 ->94.40 | c.9563+1215A>G | Proximal | Premature stop codon | Skeletal muscle | [ | ||
| 65 | 31207099-31207151 | 53 | 83.88 | 79.28 ->91.45 | c.9564-427T>G | Proximal | Frameshift | Skeletal muscle | [ | ||
| 67 | 31203344-31203394 | 51 | 81.16 | 63.87 | Unknown | Unknown | ORF preserved | Lymphocytes | [ | ||
| 67 | 31201249-31201369 | 121 | 96.18 | 67.75 ->84.98 | c.9807+2714C>T | Proximal | Frameshift | Lymphocytes or skeletal muscle (unspecified) | [ | ||
| 77 | 31132483-31132633 | 151 | 76.72 | 87.1 | Unknown | Unknown | Frameshift | Lymphocytes | [ |
Fig. 1Locations of pseudoexon-initiating single-nucleotide variations in the DMD gene, relative to acceptor and donor splice site consensus sequences. Numbers above each nucleotide indicate the exemplar pseudoexons. Lower-case letters indicate intron sequence, upper-case letters indicate exon sequence. Dash-line boxes highlight the essential “ag” and “gt” of the acceptor and donor site motifs respectively.
DMD pseudoexon splice sites coinciding with recursive splicing ratchet points predicted by Gazzoli et al. (2016). Co-ordinates listed are for genomic reference sequence NC_000023.10, as used by the cited authors. Dotted-line boxes enclose pairs of split reads that match to the same pseudoexon splice site. Asterisks (*) indicate the six coinciding splice sites previously noted by Bouge et al. [14]
Fig. 2Suggested model of the two most common modes of pseudoexon initiation observed in the DMD gene. (A) Proximal mutations at non-RS sites. (i) In the absence of mutation, a putative pseudoexon presents a weak exon-like profile to the spliceosome and is predominantly excluded from mature transcripts. (ii) The presence of a mutation, usually a splice-site-creating SNV, increases the exon-like profile of the putative pseudoexon, resulting in its inclusion in a much higher proportion of transcripts. (B) Mutations 3′ distal to RS sites. (i) In the absence of mutation, the exon n donor site and RS-acceptor site are used to excise the 5′ segment of an intron. Silencing elements distal (usually 3′) to the RS-exon prevent spliceosome recognition of its donor-like motif, and the RS-exon is subsequently removed from the transcript along with the rest of the 3′ intron segment. (ii) When mutations to the distal silencing elements impair their function, the intron segment 5′ to the RS-exon is spliced as normal, but the RS-exon donor-like site escapes silencing and is more readily recognised by the spliceosome, leading to a much higher frequency of inclusion of the RS-exon in the mature transcript population.