| Literature DB >> 34708937 |
Niall Patrick Keegan1,2,3, Sue Fletcher1,2,4.
Abstract
BACKGROUND: Cryptic exons are typically characterised as deleterious splicing aberrations caused by deep intronic mutations. However, low-level splicing of cryptic exons is sometimes observed in the absence of any pathogenic mutation. Five recent reports have described how low-level splicing of cryptic exons can be modulated by common single-nucleotide polymorphisms (SNPs), resulting in phenotypic differences amongst different genotypes.Entities:
Keywords: RNA splicing; cryptic exon; genome-wide association study; single-nucleotide polymorphism
Mesh:
Substances:
Year: 2021 PMID: 34708937 PMCID: PMC8801146 DOI: 10.1002/mgg3.1840
Source DB: PubMed Journal: Mol Genet Genomic Med ISSN: 2324-9269 Impact factor: 2.183
FIGURE 1A general model of SNPtic exon splicing. A cryptic exon, or CE (dashed‐line box) is included in mature transcripts at frequencies that vary depending on the genotype of the carrier. Because the CE encodes a premature stop codon more than 55 nt from the final splice junction, mature transcripts that include the CE are targeted for nonsense‐mediated decay (NMD, grey circle) and are not translated. If a patient carries an SNV (C>G) that greatly increases CE inclusion, NMD predominates and little protein is translated, resulting in a rare but distinct disease phenotype. Conversely, through similar mechanisms a common SNP (A>T) with a weak effect on CE splicing leads to a common but indistinct phenotype, which may only be measurable with a sufficiently powered genome‐wide association study
Sixteen putative SNPtic exons and their associated phenotypes
| SNPtic exon | SNPs | Expected effect | SNP phenotype | Exon high‐inclusion phenotype |
|---|---|---|---|---|
|
(Known) |
(NC_000011.10: g.108287407T>C) | Less inclusion | Lower cancer risk | Ataxia telangiectasia (poor coordination, prominent eye blood vessels and high cancer risk) |
|
(Known) |
(NC_000023.11: g.154947237_154947249del) | More inclusion | Mild haemophilia type A | Mild haemophilia type A |
|
(Known) |
(NC_000015.10: g.81308110T>C) | More inclusion (no NMD) | Higher interleukin‐16 levels in blood | Unknown |
|
(Known) |
(NC_000002.12: g.48721568A>C) | Splice‐switch (S > L) | Higher testosterone levels and higher androgen sensitivity index | Male pseudohermaphroditism |
|
(Known) (rs116086311) |
(NC_000012.12: g.112910849C>T),
(NC_000012.12: g.112910856C>T) | More inclusion | Higher risk of encephalitis and paralysis if infected with West Nile virus (rs34137742) | Unknown. Other |
|
(Known) |
(NC_000012.12: g.57783654C>G) | More inclusion | PREDICTED: Higher risk of multiple sclerosis | Unknown. Other |
|
(Probable) |
(NC_000005.10: g.112822734G>A) | More inclusion | Higher colorectal cancer risk | Adenomatous polyposis (colon cancer) |
|
(Probable) |
(NC_000004.12: g.154565229C>T) | Less inclusion | Higher blood fibrinogen levels | Afibrinogenemia (Persistent cerebral transient ischemic attacks, blood clots and 1/50th normal fibrinogen levels) |
|
(Probable) |
(NC_000003.12: g.10287125T>C) | Splice‐switch (L > S) | Decreases cancer risk and increases bulimia risk | Unknown; other |
|
(Probable) |
(NC_000011.10: g.47345820C>A) | Less inclusion | Slightly higher cognitive performance | Hypertrophic cardiomyopathy |
|
(Probable) |
(NC_000023.11: g.38412940T>A) | Less inclusion | Increased risk of bipolar disorder | Hyperammonemia leading to brain damage and death |
|
(Possible) |
(NC_000005.10: g.78884913T>C) | More inclusion | PREDICTED: Shorter stature and higher risk profile for other symptoms. | Mucopolysaccharidosis Type VI (Skeletal abnormalities, hearing and vision loss and heart disease) |
|
(Possible) |
(NC_000005.10: g.150060771G>A) | More inclusion | PREDICTED: Shorter stature and increased susceptibility to cognitive decline. | Early onset HDLS, skeletal dysplasia (dwarfism) and brain malformation |
|
(Possible) |
(NC_000023.11: g.32863915T>C) | Splice‐switch (S > L) | PREDICTED: Asymptomatically lower dystrophin levels. May compound an existing BMD phenotype. | Duchenne muscular dystrophy, primarily due to |
|
(Possible) |
(NC_000017.11: g.31324211C>T) | More inclusion | PREDICTED: Higher cancer risk | Unknown; other |
|
(Possible) |
(NC_000012.12: g.89461145del) | More inclusion | PREDICTED: Lower visual acuity | Reduced visual acuity and contrast, photophobia |
Citations are shown in main text. GenBank IDs of studied genes: APC, NG_008481.4; ARSB, NG_007089.1; ATM, NG_009830.1; CSF1R, NG_012303.2; DMD, NG_012232.1; F8, NG_011403.2; FGB, NG_008833.1; GHRL, NG_011560.1; IL16, NG_029933.1; LHCGR, NG_008193.2; MYBPC3, NG_007667.1; NF1, NG_009018.1; OAS1, NG_011530.2; OTC, NG_008471.1; POC1B, NG_041783.1; TSFM, NG_016971.1.
FIGURE 2Cryptic exons APC‐11a, LHCGR‐6a and POC1B‐9a exhibit high sequence conservation. Images were captured as screenshots from the UCSC Genome Browser (Kent et al., 2002). In descending order, displayed tracks are: Base position, dbSNP 153, input sequence, ‘GENCODE V37’ (aligned transcript variants) and ‘Cons 30 Primates’. ‘The Cons 30 Primates’ track, which is erroneously labelled as ‘Cons 30 Mammals’ in the browser, displays sequence conservation data from 30 non‐human primate species
SNPtic exons caused by common SNPs (≥1% haploid frequency) as reported by Sakaguchi and Suyama (2021)
| Chr. | Gene | Start | End | SNP position | rsID | Varnomen |
|---|---|---|---|---|---|---|
| chr1‐ |
| 882,137 | 882,244 | 882,250 | rs111463901 | NC_000001.11:g.946870C>A |
| chr1+ |
| 95,702,899 | 95,703,016 | 95,702,898 | rs80241359 | NC_000001.11:g.95237342A>G |
| chr5‐ |
| 77,026,223 | 77,026,280 | 77,026,221 | rs75503375 | NC_000005.10:g.77730396C>A |
| chr5‐ |
| 139,932,741 | 139,932,889 | 139,932,740 | rs112703681 | NC_000005.10:g.140553155T>C |
| chr6+ |
| 139,354,886 | 139,354,992 | 139,354,992 | rs62441851 | NC_000006.12:g.139033855A>G |
| chr7‐ |
| 43,695,632 | 43,695,752 | 43,695,628 | rs1859877 | NC_000007.14:g.43656029C>T |
| chr10+ |
| 38,654,838 | 38,654,939 | 38,654,940 | rs2804645 | NC_000010.11:g.38366012T>A |
| chr11‐ |
| 71,157,568 | 71,157,656 | 71,157,567 | rs75686975 | NC_000011.10:g.71446521G>A |
| chr12+ |
| 16,503,692 | 16,503,788 | 16,503,789 | rs9332891 | NC_000012.12:g.16350855T>G |
|
|
|
|
|
|
|
|
| chr14+ |
| 105,954,227 | 105,954,364 | 105,954,368 | rs112661676 | NC_000014.9:g.105488031G>A |
|
|
|
|
|
|
|
|
| chr16‐ |
| 58,662,843 | 58,663,002 | 58,662,841 | rs28644182 | NC_000016.10:g.58628937G>A |
| chr16‐ |
| 89,829,046 | 89,829,201 | 89,829,201 | rs9806894 | NC_000016.10:g.89762793G>A |
| chr17+ |
| 40,440,948 | 40,441,015 | 40,441,014 | rs74875201 | NC_000017.11:g.42288996G>A |
| chr19+ |
| 8,312,329 | 8,312,446 | 8,312,447 | rs12977774 | NC_000019.10:g.8247563A>G |
| chr21‐ |
| 26,758,995 | 26,759,072 | 26,758,994 | rs13049048 | NC_000021.9:g.25386681T>A |
| chr21‐ |
| 33,980,707 | 33,980,799 | 33,980,705 | rs111323620 | NC_000021.9:g.32608395G>A |
| chr21+ |
| 44,326,950 | 44,327,012 | 44,327,013 | rs73905782 | NC_000021.9:g.42906903A>G |
| chr22+ |
| 39,419,690 | 39,419,852 | 39,419,853 | rs6001388 | NC_000022.11:g.39023848T>G |
‘Start’ and ‘End’ coordinates refer to human genome assembly hg19, as per cited work. SNPtic exons OAS1‐2a and IL16‐6a are indicated with bold text. The APOBEC3D SNP is not shown in the cited work but is required for splicing in addition to the published variant (Narumi Sakaguchi 2021, Pers. Comm).