Literature DB >> 32331533

Clinical impact of splicing in neurodevelopmental disorders.

Stephan J Sanders1, Grace B Schwartz2, Kyle Kai-How Farh3.   

Abstract

Clinical exome sequencing is frequently used to identify gene-disrupting variants in individuals with neurodevelopmental disorders. While splice-disrupting variants are known to contribute to these disorders, clinical interpretation of cryptic splice variants outside of the canonical splice site has been challenging. Here, we discuss papers that improve such detection.

Entities:  

Keywords:  Antisense oligonucleotide; Autism spectrum disorder; Canonical splice site; Clinical exome sequencing; Cryptic splice site; Developmental delay; Gene splicing; Isoform; Polypyrimidine tract; SpliceAI

Mesh:

Year:  2020        PMID: 32331533      PMCID: PMC7183108          DOI: 10.1186/s13073-020-00737-2

Source DB:  PubMed          Journal:  Genome Med        ISSN: 1756-994X            Impact factor:   11.117


Splicing disruption in human disorders

Gene-disrupting genetic variants frequently lead to neurodevelopmental disorders, including developmental delay and autism spectrum disorder (ASD), when they occur in one of the several hundred genes associated with these disorders [1, 2]. Many of these variants are de novo, observed in the affected child, but not in either parent, and capable of mediating substantial risk for neurodevelopmental disorders. Such variants alter the quantity or quality of the encoded proteins, through deletions, premature stop codons, or missense variants. In this commentary, we consider the impact of an additional class of gene-disrupting variants that act by altering gene splicing. Three papers outline improvements in detecting splice-disrupting variants [3-5], and applying these methods predicts cryptic splicing variants in genes associated with neurodevelopmental disorders in about 0.5% of cases and no controls [1, 2].

Splicing motifs and mechanisms

Splicing is a key process in eukaryotic cells. After transcription, a nascent pre-mRNA must be converted into a mature mRNA that can serve as a template for protein translation. This involves the removal of introns from the pre-mRNA, usually by the major spliceosome, through splicing (Fig. 1a). Critical to this process are the two-nucleotide “essential” or “canonical” splice sites (CSS) at either side of exons: an “AG” motif upstream of the acceptor site (A, also called the 3′ splice site), at positions A-1 and A-2, and a “GT” motif downstream of the donor site (D, also called the 5′ splice site), at positions D+1 and D+2 (Fig. 1b).
Fig. 1

Overview of the splicing region. a The spliceosome attaches to pre-mRNA as it is transcribed from DNA, removing introns and leaving an exon junction complex upstream. The mature mRNA can migrate out of the nucleus for translation. b Motifs of the polypyrimidine tract, acceptor, and donor, calculated from all protein-coding exons. c Odds ratios of observed and expected variant frequencies around the splice site based on ExAC exome sequencing data in Zhang et al. [4]. Lord et al. [3] use the same ExAC data to calculate the mutability-adjusted proportion of singletons (MAPS) across splicing regions, which is higher at nucleotides intolerant of variation. Jaganathan et al. [5] developed SpliceAI, a neural network for predicting the impact of variants on splicing across the genome; the number of potential variants with a Δ score ≥ 0.1 is shown across splicing regions. Abbreviations: TSS, transcription start site; UTR, untranslated region; A, acceptor; D, donor; Pol II, polymerase II; Ter, termination codon; Poly(A), polyadenylation

Overview of the splicing region. a The spliceosome attaches to pre-mRNA as it is transcribed from DNA, removing introns and leaving an exon junction complex upstream. The mature mRNA can migrate out of the nucleus for translation. b Motifs of the polypyrimidine tract, acceptor, and donor, calculated from all protein-coding exons. c Odds ratios of observed and expected variant frequencies around the splice site based on ExAC exome sequencing data in Zhang et al. [4]. Lord et al. [3] use the same ExAC data to calculate the mutability-adjusted proportion of singletons (MAPS) across splicing regions, which is higher at nucleotides intolerant of variation. Jaganathan et al. [5] developed SpliceAI, a neural network for predicting the impact of variants on splicing across the genome; the number of potential variants with a Δ score ≥ 0.1 is shown across splicing regions. Abbreviations: TSS, transcription start site; UTR, untranslated region; A, acceptor; D, donor; Pol II, polymerase II; Ter, termination codon; Poly(A), polyadenylation Along with the CSS, other DNA features are known to determine splicing behavior, including several motifs representing binding targets of the small nuclear ribonucleoproteins (snRNPs) that make up the major spliceosome. Motif analysis across exons (Fig. 1b) has identified broader “CAG” and “AGGTAAGT” motifs at the acceptor and donor, respectively, as well as the polypyrimidine tract, characterized by enrichment of thymine and cytosine upstream of the acceptor (A-5 to A-40). Upstream of the polypyrimidine tract is the branch point (A-10 to A-50, median A-25) with a “TNA” motif (Fig. 1b). In the major spliceosome, U1 snRNPs bind to the donor site, U2 snRNPs bind to the branch point, and the U2AF protein binds to the polypyrimidine tract and acceptor site [6].

Variation at canonical splice sites

Noncoding genetic variants that disrupt the CSS of critical genes are a known cause of human genetic diseases, including neurodevelopmental disorders [1, 2]. Improper splicing can lead to exon skipping or novel splice sites, both of which can alter the reading frame of protein-coding genes. Alternatively, intron retention incorporates noncoding DNA, which often contains stop codons, into the mature RNA. Consequently, identifying CSS variants in genes with known disease associations is a routine practice in clinical exome sequencing, in which they are treated as protein-truncating variants (PTVs) along with premature stop codons or frameshift variants [7]. Out of 1863 de novo variants identified in individuals with neurodevelopmental disorders [2], the 296 CSS variants account for 16% of PTVs, 637 premature stop codons account for 34%, and 930 frameshift insertions or deletions account for 50%. To quantify the contribution of these de novo variants to disorders, we consider the frequency of variants in 3230 protein-coding genes that are predicted to be “PTV-intolerant,” based on fewer than expected PTVs in whole-exome sequencing data from over 60,000 individuals (expressed statistically as a probability loss-of-function intolerant (pLI) score ≥ 0.9) [8]. Comparing differences in the rate of de novo PTVs in these PTV-intolerant genes between cases and controls [1, 2], we estimate that de novo PTVs contribute to 5% of ASD cases and 16% of developmental delay cases. Since 16% of PTVs are in the CSS, this equates to 0.8% of ASD cases and 2.6% of developmental delay cases due to splicing disruption at the CSS.

Variation at cryptic splice sites

Exonic or intronic splice-disrupting variants outside of the CSS are commonly referred to as cryptic splice variants, due to the challenge of identifying them. The below articles focus on improving clinical interpretation of these cryptic splice sites in neurodevelopmental disorders by leveraging exome sequencing data from population samples [3, 4] or deep learning methods [5]. Zhang et al. [4] use exome sequencing data from over 60,000 population samples from the Exome Aggregation Consortium (ExAC) [8] to assess the observed versus expected number of variants in the 10 nucleotides flanking the acceptor and donor sites in PTV-intolerant genes. They highlight six non-CSS nucleotides that are intolerant of variation (Fig. 1c) and validate splicing dysregulation in four of these (D-1, D+4, D+5, D+6) using paired whole-genome and RNA sequencing from GTEx [9]. Such cryptic splice de novo variants were observed in 0.2% of ASD cases and 0.2% of developmental delay cases. Lord et al. also use ExAC data [3] to highlight two nucleotides that are intolerant of variation (D-1, D+5, Fig. 1c). The D+5 site is also enriched for de novo variants in cases of developmental delay in genes associated with this disorder, as was the polypyrimidine tract when all nucleotides (A-5 to A-25) were considered together. By integrating phenotype data, they identified 18 likely diagnostic de novo variants in 7833 cases (0.2%). Functional assessment of splicing using a minigene assay validated six of the seven likely diagnostic variants that were tested (86%). Jaganathan et al., which includes the authors of this commentary [5], describe the SpliceAI algorithm, a neural network that predicts the impact of cryptic splice variants based on a pre-mRNA sequence. The network, trained on 10,000 nucleotides of human genomic sequence around 260,000 known splice sites from GENCODE, is used to calculate the SpliceAI Δ score by considering the difference in predicted splicing between reference and variant sequence. Scores range from 0 to 1 with high scores more likely to alter splicing (Fig. 1c). Assessing performance in the paired whole-genome and RNA sequencing data from GTEx [9] identifies splicing disruption proportional to the Δ score (i.e., 20% at 0.2; 80% at 0.8) with higher sensitivity and specificity than prior algorithms [5]. High depth RNA-seq of ASD patient-derived lymphoblastoid cell lines validated 21 of 28 (75%) de novo variants predicted to alter splicing (Δ score, 0.10–0.99; median 0.58), including variants in the ASD-associated genes TCF4 and KDM6B [2]. Of note, analysis of GTEx also revealed widespread tissue-specific splicing, which may lead such validation to underestimate the true accuracy. An excess of de novo variants predicted to alter splicing (Δ score ≥ 0.1) was observed in both developmental delay and ASD, compared to controls. Considering only genes previously associated with neurodevelopmental disorders, de novo variants at cryptic splice sites were observed in 23 out of 3953 ASD cases (0.6%), 21 out of 4293 developmental delay cases (0.5%), and none of the 2073 controls [1, 2]. Overall, SpliceAI predicts about 7-fold more cryptic splice site variants than the other two approaches because it is not limited to specific nucleotides (e.g. D+5), includes splice sites further from the exons, and evaluates each splice site individually. Considering variants assessed consistently between these three methods, SpliceAI predicts all four “likely diagnostic” variants in Lord et al. and 10 of the 18 variants (56%) highlighted by Zhang et al. With these improvements in detection [3-5], we propose that de novo variants at cryptic splice sites identified in exome or genome sequencing of individuals with neurodevelopmental disorders should undergo clinical evaluation in a manner similar to deleterious missense variants. Such evaluation would incorporate evidence from gene association studies, pLI scores, and consistency of phenotype [7].

Prevalence of splicing disruption in neurodevelopmental disorders and therapeutic potential

Using the SpliceAI estimates, splicing disruption by de novo variants in PTV-intolerant genes underlies at least 1.4% of ASD cases (0.8% CSS and 0.6% cryptic, see estimates above) and 3.1% of developmental delay cases (2.6% CSS and 0.5% cryptic, see estimates above). These estimates are equivalent to about 20,000 ASD cases 18 years-of-age or below in the USA and 21,000 equivalent developmental delay cases. Inclusion of more genes (PTV-tolerant, noncoding), whole-genome sequencing to identify deep intronic variants missed by exome sequencing, and consideration of homozygous and heterozygous inherited variation will only increase these estimates. While splicing variants contribute to thousands of cases of neurodevelopmental disorders, they may offer opportunities for novel therapeutic targets. The success of the FDA-approved antisense oligonucleotide (ASO) Nusinersen to modify splicing behavior, resulting in life-saving clinical improvement in patients with spinal muscular atrophy [10], sets a precedent for treating central nervous system disorders via splicing mechanisms. Such a therapy would need to be developed specifically for each splicing variant in most neurodevelopmental disorders [11]. Key research milestones will include assessing the fraction of splicing variation that can be rescued by ASOs, efficient methods to design and test ASOs, and assessment of the extent of rescue in vivo. These approaches may provide the first insights into whether gene therapy can modify the symptoms of ASD and developmental delay, potentially providing a route to treatment for thousands of individuals with splicing variants and de-risking more complicated approaches to gene therapy that could be applicable in larger populations.
  11 in total

1.  Nusinersen versus Sham Control in Infantile-Onset Spinal Muscular Atrophy.

Authors:  Richard S Finkel; Eugenio Mercuri; Basil T Darras; Anne M Connolly; Nancy L Kuntz; Janbernd Kirschner; Claudia A Chiriboga; Kayoko Saito; Laurent Servais; Eduardo Tizzano; Haluk Topaloglu; Már Tulinius; Jacqueline Montes; Allan M Glanzman; Kathie Bishop; Z John Zhong; Sarah Gheuens; C Frank Bennett; Eugene Schneider; Wildon Farwell; Darryl C De Vivo
Journal:  N Engl J Med       Date:  2017-11-02       Impact factor: 91.245

2.  Patient-Customized Oligonucleotide Therapy for a Rare Genetic Disease.

Authors:  Jinkuk Kim; Chunguang Hu; Christelle Moufawad El Achkar; Lauren E Black; Julie Douville; Austin Larson; Mary K Pendergast; Sara F Goldkind; Eunjung A Lee; Ashley Kuniholm; Aubrie Soucy; Jai Vaze; Nandkishore R Belur; Kristina Fredriksen; Iva Stojkovska; Alla Tsytsykova; Myriam Armant; Renata L DiDonato; Jaejoon Choi; Laura Cornelissen; Luis M Pereira; Erika F Augustine; Casie A Genetti; Kira Dies; Brenda Barton; Lucinda Williams; Benjamin D Goodlett; Bobbie L Riley; Amy Pasternak; Emily R Berry; Kelly A Pflock; Stephen Chu; Chantal Reed; Kimberly Tyndall; Pankaj B Agrawal; Alan H Beggs; P Ellen Grant; David K Urion; Richard O Snyder; Susan E Waisbren; Annapurna Poduri; Peter J Park; Al Patterson; Alessandra Biffi; Joseph R Mazzulli; Olaf Bodamer; Charles B Berde; Timothy W Yu
Journal:  N Engl J Med       Date:  2019-10-09       Impact factor: 91.245

3.  Clinical exome sequencing for genetic identification of rare Mendelian disorders.

Authors:  Hane Lee; Joshua L Deignan; Naghmeh Dorrani; Samuel P Strom; Sibel Kantarci; Fabiola Quintero-Rivera; Kingshuk Das; Traci Toy; Bret Harry; Michael Yourshaw; Michelle Fox; Brent L Fogel; Julian A Martinez-Agosto; Derek A Wong; Vivian Y Chang; Perry B Shieh; Christina G S Palmer; Katrina M Dipple; Wayne W Grody; Eric Vilain; Stanley F Nelson
Journal:  JAMA       Date:  2014-11-12       Impact factor: 56.272

4.  Predicting Splicing from Primary Sequence with Deep Learning.

Authors:  Kishore Jaganathan; Sofia Kyriazopoulou Panagiotopoulou; Jeremy F McRae; Siavash Fazel Darbandi; David Knowles; Yang I Li; Jack A Kosmicki; Juan Arbelaez; Wenwu Cui; Grace B Schwartz; Eric D Chow; Efstathios Kanterakis; Hong Gao; Amirali Kia; Serafim Batzoglou; Stephan J Sanders; Kyle Kai-How Farh
Journal:  Cell       Date:  2019-01-17       Impact factor: 41.582

5.  Analysis of protein-coding genetic variation in 60,706 humans.

Authors:  Monkol Lek; Konrad J Karczewski; Eric V Minikel; Kaitlin E Samocha; Eric Banks; Timothy Fennell; Anne H O'Donnell-Luria; James S Ware; Andrew J Hill; Beryl B Cummings; Taru Tukiainen; Daniel P Birnbaum; Jack A Kosmicki; Laramie E Duncan; Karol Estrada; Fengmei Zhao; James Zou; Emma Pierce-Hoffman; Joanne Berghout; David N Cooper; Nicole Deflaux; Mark DePristo; Ron Do; Jason Flannick; Menachem Fromer; Laura Gauthier; Jackie Goldstein; Namrata Gupta; Daniel Howrigan; Adam Kiezun; Mitja I Kurki; Ami Levy Moonshine; Pradeep Natarajan; Lorena Orozco; Gina M Peloso; Ryan Poplin; Manuel A Rivas; Valentin Ruano-Rubio; Samuel A Rose; Douglas M Ruderfer; Khalid Shakir; Peter D Stenson; Christine Stevens; Brett P Thomas; Grace Tiao; Maria T Tusie-Luna; Ben Weisburd; Hong-Hee Won; Dongmei Yu; David M Altshuler; Diego Ardissino; Michael Boehnke; John Danesh; Stacey Donnelly; Roberto Elosua; Jose C Florez; Stacey B Gabriel; Gad Getz; Stephen J Glatt; Christina M Hultman; Sekar Kathiresan; Markku Laakso; Steven McCarroll; Mark I McCarthy; Dermot McGovern; Ruth McPherson; Benjamin M Neale; Aarno Palotie; Shaun M Purcell; Danish Saleheen; Jeremiah M Scharf; Pamela Sklar; Patrick F Sullivan; Jaakko Tuomilehto; Ming T Tsuang; Hugh C Watkins; James G Wilson; Mark J Daly; Daniel G MacArthur
Journal:  Nature       Date:  2016-08-18       Impact factor: 49.962

6.  Base-specific mutational intolerance near splice sites clarifies the role of nonessential splice nucleotides.

Authors:  Sidi Zhang; Kaitlin E Samocha; Manuel A Rivas; Konrad J Karczewski; Emma Daly; Ben Schmandt; Benjamin M Neale; Daniel G MacArthur; Mark J Daly
Journal:  Genome Res       Date:  2018-06-01       Impact factor: 9.438

7.  Prevalence and architecture of de novo mutations in developmental disorders.

Authors: 
Journal:  Nature       Date:  2017-01-25       Impact factor: 49.962

8.  Pathogenicity and selective constraint on variation near splice sites.

Authors:  Jenny Lord; Giuseppe Gallone; Patrick J Short; Jeremy F McRae; Holly Ironfield; Elizabeth H Wynn; Sebastian S Gerety; Liu He; Bronwyn Kerr; Diana S Johnson; Emma McCann; Esther Kinning; Frances Flinter; I Karen Temple; Jill Clayton-Smith; Meriel McEntagart; Sally Ann Lynch; Shelagh Joss; Sofia Douzgou; Tabib Dabir; Virginia Clowes; Vivienne P M McConnell; Wayne Lam; Caroline F Wright; David R FitzPatrick; Helen V Firth; Jeffrey C Barrett; Matthew E Hurles
Journal:  Genome Res       Date:  2018-12-26       Impact factor: 9.043

Review 9.  Cryo-electron microscopy snapshots of the spliceosome: structural insights into a dynamic ribonucleoprotein machine.

Authors:  Sebastian M Fica; Kiyoshi Nagai
Journal:  Nat Struct Mol Biol       Date:  2017-10-05       Impact factor: 15.369

10.  Genetic effects on gene expression across human tissues.

Authors:  Alexis Battle; Christopher D Brown; Barbara E Engelhardt; Stephen B Montgomery
Journal:  Nature       Date:  2017-10-11       Impact factor: 49.962

View more
  8 in total

Review 1.  A matter of space and time: Emerging roles of disease-associated proteins in neural development.

Authors:  Georgia Panagiotakos; Sergiu P Pasca
Journal:  Neuron       Date:  2021-11-29       Impact factor: 17.173

Review 2.  Transcriptome analysis provides critical answers to the "variants of uncertain significance" conundrum.

Authors:  Mackenzie D Postel; Julie O Culver; Charité Ricker; David W Craig
Journal:  Hum Mutat       Date:  2022-05-18       Impact factor: 4.700

3.  Cell-type and fetal-sex-specific targets of prenatal alcohol exposure in developing mouse cerebral cortex.

Authors:  Nihal A Salem; Amanda H Mahnke; Kranti Konganti; Andrew E Hillhouse; Rajesh C Miranda
Journal:  iScience       Date:  2021-04-20

Review 4.  RNA Modifications and RNA Metabolism in Neurological Disease Pathogenesis.

Authors:  Biswanath Chatterjee; Che-Kun James Shen; Pritha Majumder
Journal:  Int J Mol Sci       Date:  2021-11-01       Impact factor: 5.923

5.  Elevated Small Nuclear Ribonucleoprotein Polypeptide an Expression Correlated With Poor Prognosis and Immune Infiltrates in Patients With Hepatocellular Carcinoma.

Authors:  Youfu Zhang; Xuyang Wang; Huaxiang Wang; Yi Jiang; Zhidan Xu; Laibang Luo
Journal:  Front Oncol       Date:  2022-07-04       Impact factor: 5.738

6.  A systematic analysis of splicing variants identifies new diagnoses in the 100,000 Genomes Project.

Authors:  Diana Baralle; Jenny Lord; Alexander J M Blakes; Htoo A Wai; Ian Davies; Hassan E Moledina; April Ruiz; Tessy Thomas; David Bunyan; N Simon Thomas; Christine P Burren; Lynn Greenhalgh; Melissa Lees; Amanda Pichini; Sarah F Smithson; Ana Lisa Taylor Tavares; Peter O'Donovan; Andrew G L Douglas; Nicola Whiffin
Journal:  Genome Med       Date:  2022-07-26       Impact factor: 15.266

7.  Performance evaluation of differential splicing analysis methods and splicing analytics platform construction.

Authors:  Kuokuo Li; Tengfei Luo; Yan Zhu; Yuanfeng Huang; An Wang; Di Zhang; Lijie Dong; Yujian Wang; Rui Wang; Dongdong Tang; Zhen Yu; Qunshan Shen; Mingrong Lv; Zhengbao Ling; Zhenghuan Fang; Jing Yuan; Bin Li; Kun Xia; Xiaojin He; Jinchen Li; Guihu Zhao
Journal:  Nucleic Acids Res       Date:  2022-08-22       Impact factor: 19.160

8.  Unraveling synonymous and deep intronic variants causing aberrant splicing in two genetically undiagnosed epilepsy families.

Authors:  Qiang Li; Yiting Wang; Yijun Pan; Jia Wang; Weishi Yu; Xiaodong Wang
Journal:  BMC Med Genomics       Date:  2021-06-09       Impact factor: 3.063

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.