| Literature DB >> 32123317 |
Htoo A Wai1, Jenny Lord1, Andrew G L Douglas1,2, Diana Baralle3,4, Matthew Lyon5, Adam Gunning6, Hugh Kelly1, Penelope Cibin1, Eleanor G Seaby1,7, Kerry Spiers-Fitzgerald1, Jed Lye1, Sian Ellard6, N Simon Thomas1,5, David J Bunyan1,5.
Abstract
PURPOSE: Diagnosis of genetic disorders is hampered by large numbers of variants of uncertain significance (VUSs) identified through next-generation sequencing. Many such variants may disrupt normal RNA splicing. We examined effects on splicing of a large cohort of clinically identified variants and compared performance of bioinformatic splicing prediction tools commonly used in diagnostic laboratories.Entities:
Keywords: RNA splicing; RNA-seq; genetic diagnosis; genomic medicine; variant interpretation
Mesh:
Substances:
Year: 2020 PMID: 32123317 PMCID: PMC7272326 DOI: 10.1038/s41436-020-0766-9
Source DB: PubMed Journal: Genet Med ISSN: 1098-3600 Impact factor: 8.822
Fig. 1Variant locations and effects on splicing.
(a) Plot of the numbers of single-nucleotide variants (SNVs) in this cohort (multinucleotide variants not included) present at each donor (D-3 to D+8) and acceptor (A-8 to A+3) splice region position, along with the numbers of these found to affect splicing. (b, c) Position–weight matrices of nucleotide sequence across the splice donor (b) and acceptor (c) regions as determined for the specific exon–intron junctions analyzed in this study. In this representation, the donor splice site +1 position correlates to position 12 in (b), while the acceptor splice site −1 position correlates to position 25 in (c). (d) Abnormal splicing effects plotted by SNV location. Sequence ontology defines the donor splice region as extending from the third last nucleotide of the exon (D-3) to the eighth nucleotide of the intron (D+8) and the acceptor splice region as extending form the eighth last nucleotide of the intron (A-8) to the third nucleotide of the exon (A+3).[22] (e) Overall proportion of all variants affecting splicing in this cohort. (f) Proportions of different abnormal splicing events identified in this cohort. A3SS alternative 3´ splice site; A5SS alternative 5´ splice site, IR intron retention, SE skipped exon.
Fig. 2Illustrative examples of variant splicing analysis.
DKC1 c.915+10G>A could not be identified by reverse transcription polymerase chain reaction (RT-PCR) and Sanger sequencing but alternative donor splice site usage was identified by RNA-seq. P3H1 (LEPRE1) c.1224-80G>A causes at least three abnormal splicing events using alternative splice donor and acceptor sites, as well as increasing levels of intron retention. DCTN1 c.414+1G>A appears to alter a canonical splice donor site but exons 5–7, although annotated, are never expressed and are constitutively spliced out. SF3B4 c.417C>T is a synonymous coding variant but causes formation of a 125-nt “exitron,” an intronic region within an exon. A3SS alternative 3´splice site, A5SS alternative 5´ splice site, Ctrl control, Pt patient.
Performance assessment of in silico prediction tools on experimentally validated variants (n = 257).
| Scoring metric | Sensitivity | Specificity | Accuracy | PPV | NPV | |
|---|---|---|---|---|---|---|
| HSF (2%) | 28 | 0.8941 | 0.3958 | 0.5808 | 0.4663 | 0.8636 |
| SpliceAI (0.2) | 11 | 0.8987 | 0.9162 | 0.9106 | 0.8353 | 0.9503 |
| Alamut SSF (5%) | 5 | 0.7317 | 0.9294 | 0.8651 | 0.8333 | 0.8778 |
| Alamut MES (10%) | 1 | 0.7381 | 0.9070 | 0.8516 | 0.7949 | 0.8764 |
| Alamut NNSplice (5%) | 11 | 0.6923 | 0.8631 | 0.8089 | 0.7013 | 0.8580 |
| Alamut 2/3 | 14a | 0.7237 | 0.9162 | 0.8560 | 0.7971 | 0.8793 |
Values have been calculated omitting the missing scores for each tool.
HSF Human Splicing Finder, MES MaxEntScan, NPV negative predictive value, PPV positive predictive value, SSF Splice Site Finder.
a11 variants missing one score, three variants missing two scores.
Fig. 3Bioinformatic tools for predicting abnormal splicing.
Receiver operating characteristic (ROC) curves and area under the curve (AUC) comparing in silico methods for predicting splice disruption in overlapping set of experimentally validated variants scored by all measures (136 non–splice disrupting, 70 splice disrupting). HSF human splicing finder, MES MaxEntScan (Alamut), NN NNSplice (Alamut), SSF SpliceSiteFinder (Alamut). Ala23 = number of Alamut tools exceeding specified thresholds.
Fig. 4A potential model of splicing disruption.
Where an upstream splicing event is complete, a splice donor or acceptor site variant may lead to intron retention. Where a preceding splicing event remains incomplete, a splice donor variant may cause skipping of the upstream exon. Similarly, if a splice acceptor site variant causes an upstream splice donor site to remain unused then this may cause skipping of the exon downstream of the acceptor site variant. Exonic or intronic variants that create or strengthen cryptic splice sites can lead to use of alternative splice donor or acceptor sites.