Literature DB >> 34289339

Interpretable prioritization of splice variants in diagnostic next-generation sequencing.

Daniel Danis1, Julius O B Jacobsen2, Leigh C Carmody1, Michael A Gargano1, Julie A McMurry3, Ayushi Hegde1, Melissa A Haendel3, Giorgio Valentini4, Damian Smedley2, Peter N Robinson5.   

Abstract

A critical challenge in genetic diagnostics is the computational assessment of candidate splice variants, specifically the interpretation of nucleotide changes located outside of the highly conserved dinucleotide sequences at the 5' and 3' ends of introns. To address this gap, we developed the Super Quick Information-content Random-forest Learning of Splice variants (SQUIRLS) algorithm. SQUIRLS generates a small set of interpretable features for machine learning by calculating the information-content of wild-type and variant sequences of canonical and cryptic splice sites, assessing changes in candidate splicing regulatory sequences, and incorporating characteristics of the sequence such as exon length, disruptions of the AG exclusion zone, and conservation. We curated a comprehensive collection of disease-associated splice-altering variants at positions outside of the highly conserved AG/GT dinucleotides at the termini of introns. SQUIRLS trains two random-forest classifiers for the donor and for the acceptor and combines their outputs by logistic regression to yield a final score. We show that SQUIRLS transcends previous state-of-the-art accuracy in classifying splice variants as assessed by rank analysis in simulated exomes, and is significantly faster than competing methods. SQUIRLS provides tabular output files for incorporation into diagnostic pipelines for exome and genome analysis, as well as visualizations that contextualize predicted effects of variants on splicing to make it easier to interpret splice variants in diagnostic settings.
Copyright © 2021 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Mendelian genetics; bioinformatics; cryptic splicing; exome sequencing; genome sequencing; machine learning; random forest; sequence logo; splice mutation; splice variant; splicing

Mesh:

Substances:

Year:  2021        PMID: 34289339      PMCID: PMC8456162          DOI: 10.1016/j.ajhg.2021.06.014

Source DB:  PubMed          Journal:  Am J Hum Genet        ISSN: 0002-9297            Impact factor:   11.025


  49 in total

1.  S-CAP extends pathogenicity prediction to genetic variants that affect RNA splicing.

Authors:  Karthik A Jagadeesh; Joseph M Paggi; James S Ye; Peter D Stenson; David N Cooper; Jonathan A Bernstein; Gill Bejerano
Journal:  Nat Genet       Date:  2019-02-25       Impact factor: 38.330

2.  Spectrum of splicing errors caused by CHRNE mutations affecting introns and intron/exon boundaries.

Authors:  K Ohno; A Tsujino; X-M Shen; M Milone; A G Engel
Journal:  J Med Genet       Date:  2005-08       Impact factor: 6.318

3.  Quantitative evaluation of all hexamers as exonic splicing elements.

Authors:  Shengdong Ke; Shulian Shang; Sergey M Kalachikov; Irina Morozova; Lin Yu; James J Russo; Jingyue Ju; Lawrence A Chasin
Journal:  Genome Res       Date:  2011-06-09       Impact factor: 9.043

4.  Diagnostic Impact and Cost-effectiveness of Whole-Exome Sequencing for Ambulant Children With Suspected Monogenic Conditions.

Authors:  Tiong Yang Tan; Oliver James Dillon; Zornitza Stark; Deborah Schofield; Khurshid Alam; Rupendra Shrestha; Belinda Chong; Dean Phelan; Gemma R Brett; Emma Creed; Anna Jarmolowicz; Patrick Yap; Maie Walsh; Lilian Downie; David J Amor; Ravi Savarirayan; George McGillivray; Alison Yeung; Heidi Peters; Susan J Robertson; Aaron J Robinson; Ivan Macciocca; Simon Sadedin; Katrina Bell; Alicia Oshlack; Peter Georgeson; Natalie Thorne; Clara Gaff; Susan M White
Journal:  JAMA Pediatr       Date:  2017-09-01       Impact factor: 16.193

5.  Microsatellite instability and the PTEN1 gene mutation in a subset of early onset gliomas carrying germline mutation or promoter methylation of the hMLH1 gene.

Authors:  M Kanamori; H Kon; T Nobukuni; S Nomura; K Sugano; S Mashiyama; T Kumabe; T Yoshimoto; M Meuth; T Sekiya; Y Murakami
Journal:  Oncogene       Date:  2000-03-16       Impact factor: 9.867

6.  Information content of individual genetic sequences.

Authors:  T D Schneider
Journal:  J Theor Biol       Date:  1997-12-21       Impact factor: 2.691

7.  Mutations affecting mRNA splicing are the most common molecular defects in patients with neurofibromatosis type 1.

Authors:  E Ars; E Serra; J García; H Kruyer; A Gaona; C Lázaro; X Estivill
Journal:  Hum Mol Genet       Date:  2000-01-22       Impact factor: 6.150

8.  Molecular findings among patients referred for clinical whole-exome sequencing.

Authors:  Yaping Yang; Donna M Muzny; Fan Xia; Zhiyv Niu; Richard Person; Yan Ding; Patricia Ward; Alicia Braxton; Min Wang; Christian Buhay; Narayanan Veeraraghavan; Alicia Hawes; Theodore Chiang; Magalie Leduc; Joke Beuten; Jing Zhang; Weimin He; Jennifer Scull; Alecia Willis; Megan Landsverk; William J Craigen; Mir Reza Bekheirnia; Asbjorg Stray-Pedersen; Pengfei Liu; Shu Wen; Wendy Alcaraz; Hong Cui; Magdalena Walkiewicz; Jeffrey Reid; Matthew Bainbridge; Ankita Patel; Eric Boerwinkle; Arthur L Beaudet; James R Lupski; Sharon E Plon; Richard A Gibbs; Christine M Eng
Journal:  JAMA       Date:  2014-11-12       Impact factor: 56.272

9.  Expanding the Boundaries of RNA Sequencing as a Diagnostic Tool for Rare Mendelian Disease.

Authors:  Hernan D Gonorazky; Sergey Naumenko; Arun K Ramani; Viswateja Nelakuditi; Pouria Mashouri; Peiqui Wang; Dennis Kao; Krish Ohri; Senthuri Viththiyapaskaran; Mark A Tarnopolsky; Katherine D Mathews; Steven A Moore; Andres N Osorio; David Villanova; Dwi U Kemaladewi; Ronald D Cohn; Michael Brudno; James J Dowling
Journal:  Am J Hum Genet       Date:  2019-02-28       Impact factor: 11.025

10.  Human Splicing Finder: an online bioinformatics tool to predict splicing signals.

Authors:  François-Olivier Desmet; Dalil Hamroun; Marine Lalande; Gwenaëlle Collod-Béroud; Mireille Claustres; Christophe Béroud
Journal:  Nucleic Acids Res       Date:  2009-04-01       Impact factor: 16.971

View more
  8 in total

1.  Identification and in-silico characterization of splice-site variants from a large cardiogenetic national registry.

Authors:  Kaveh Rayani; Brianna Davies; Matthew Cheung; Drake Comber; Jason D Roberts; Rafik Tadros; Martin S Green; Jeffrey S Healey; Christopher S Simpson; Shubhayan Sanatani; Christian Steinberg; Ciorsti MacIntyre; Paul Angaran; Henry Duff; Robert Hamilton; Laura Arbour; Richard Leather; Colette Seifer; Anne Fournier; Joseph Atallah; Shane Kimber; Bhavanesh Makanjee; Wael Alqarawi; Julia Cadrin-Tourigny; Jacqueline Joza; Martin Gardner; Mario Talajic; Richard D Bagnall; Andrew D Krahn; Zachary W M Laksman
Journal:  Eur J Hum Genet       Date:  2022-09-22       Impact factor: 5.351

2.  CI-SpliceAI-Improving machine learning predictions of disease causing splicing variants using curated alternative splice sites.

Authors:  Yaron Strauch; Jenny Lord; Mahesan Niranjan; Diana Baralle
Journal:  PLoS One       Date:  2022-06-03       Impact factor: 3.752

3.  Case Review: Whole-Exome Sequencing Analyses Identify Carriers of a Known Likely Pathogenic Intronic BRCA1 Variant in Ovarian Cancer Cases Clinically Negative for Pathogenic BRCA1 and BRCA2 Variants.

Authors:  Wejdan M Alenezi; Caitlin T Fierheller; Timothée Revil; Corinne Serruya; Anne-Marie Mes-Masson; William D Foulkes; Diane Provencher; Zaki El Haffaf; Jiannis Ragoussis; Patricia N Tonin
Journal:  Genes (Basel)       Date:  2022-04-15       Impact factor: 4.141

Review 4.  Phenotype-driven approaches to enhance variant prioritization and diagnosis of rare disease.

Authors:  Julius O B Jacobsen; Catherine Kelly; Valentina Cipriani; Genomics England Research Consortium; Christopher J Mungall; Justin Reese; Daniel Danis; Peter N Robinson; Damian Smedley
Journal:  Hum Mutat       Date:  2022-04-27       Impact factor: 4.700

5.  Comparison of in silico strategies to prioritize rare genomic variants impacting RNA splicing for the diagnosis of genomic disorders.

Authors:  Charlie Rowlands; Huw B Thomas; Jenny Lord; Htoo A Wai; Gavin Arno; Glenda Beaman; Panagiotis Sergouniotis; Beatriz Gomes-Silva; Christopher Campbell; Nicole Gossan; Claire Hardcastle; Kevin Webb; Christopher O'Callaghan; Robert A Hirst; Simon Ramsden; Elizabeth Jones; Jill Clayton-Smith; Andrew R Webster; Andrew G L Douglas; Raymond T O'Keefe; William G Newman; Diana Baralle; Graeme C M Black; Jamie M Ellingford
Journal:  Sci Rep       Date:  2021-10-18       Impact factor: 4.379

6.  Feasibility of Follow-Up Studies and Reclassification in Spinocerebellar Ataxia Gene Variants of Unknown Significance.

Authors:  Fatemeh Ghorbani; Mohamed Z Alimohamed; Juliana F Vilacha; Krista K Van Dijk; Jelkje De Boer-Bergsma; Michiel R Fokkens; Henny Lemmink; Rolf H Sijmons; Birgit Sikkema-Raddatz; Matthew R Groves; Corien C Verschuuren-Bemelmans; Dineke S Verbeek; Cleo C Van Diemen; Helga Westers
Journal:  Front Genet       Date:  2022-03-25       Impact factor: 4.599

7.  Performance evaluation of differential splicing analysis methods and splicing analytics platform construction.

Authors:  Kuokuo Li; Tengfei Luo; Yan Zhu; Yuanfeng Huang; An Wang; Di Zhang; Lijie Dong; Yujian Wang; Rui Wang; Dongdong Tang; Zhen Yu; Qunshan Shen; Mingrong Lv; Zhengbao Ling; Zhenghuan Fang; Jing Yuan; Bin Li; Kun Xia; Xiaojin He; Jinchen Li; Guihu Zhao
Journal:  Nucleic Acids Res       Date:  2022-08-22       Impact factor: 19.160

8.  The RD-Connect Genome-Phenome Analysis Platform: Accelerating diagnosis, research, and gene discovery for rare diseases.

Authors:  Steven Laurie; Davide Piscia; Leslie Matalonga; Alberto Corvó; Marcos Fernández-Callejo; Carles Garcia-Linares; Carles Hernandez-Ferrer; Cristina Luengo; Inés Martínez; Anastasios Papakonstantinou; Daniel Picó-Amador; Joan Protasio; Rachel Thompson; Raul Tonda; Mònica Bayés; Gemma Bullich; Jordi Camps-Puchadas; Ida Paramonov; Jean-Rémi Trotta; Angel Alonso; Marcella Attimonelli; Christophe Béroud; Virginie Bros-Facer; Orion J Buske; Andrés Cañada-Pallarés; José M Fernández; Mats G Hansson; Rita Horvath; Julius O B Jacobsen; Rajaram Kaliyaperumal; Séverine Lair-Préterre; Luana Licata; Pedro Lopes; Estrella López-Martín; Deborah Mascalzoni; Lucia Monaco; Luis A Pérez-Jurado; Manuel Posada de la Paz; Jordi Rambla; Ana Rath; Olaf Riess; Peter N Robinson; David Salgado; Damian Smedley; Dylan Spalding; Peter A C 't Hoen; Ana Töpf; Irina Zaharieva; Holm Graessner; Ivo G Gut; Hanns Lochmüller; Sergi Beltran
Journal:  Hum Mutat       Date:  2022-06       Impact factor: 4.700

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.