Bethany Signal1,2, Brian S Gloss1,2, Marcel E Dinger1,2, Tim R Mercer1,2,3. 1. Genomics and Epigenetics, Garvan Institute of Medical Research, Sydney, NSW 2010, Australia. 2. St Vincent's Clinical School, University of New South Wales, Sydney, NSW 2052, Australia. 3. Altius Institute for Biomedical Sciences, Seattle, WA 98121, USA.
Abstract
Motivation: The branchpoint element is required for the first lariat-forming reaction in splicing. However current catalogues of human branchpoints remain incomplete due to the difficulty in experimentally identifying these splicing elements. To address this limitation, we have developed a machine-learning algorithm-branchpointer-to identify branchpoint elements solely from gene annotations and genomic sequence. Results: Using branchpointer, we annotate branchpoint elements in 85% of human gene introns with sensitivity (61.8%) and specificity (97.8%). In addition to annotation, branchpointer can evaluate the impact of SNPs on branchpoint architecture to inform functional interpretation of genetic variants. Branchpointer identifies all published deleterious branchpoint mutations annotated in clinical variant databases, and finds thousands of additional clinical and common genetic variants with similar predicted effects. This genome-wide annotation of branchpoints provides a reference for the genetic analysis of splicing, and the interpretation of noncoding variation. Availability and implementation: Branchpointer is written and implemented in the statistical programming language R and is freely available under a BSD license as a package through Bioconductor. Contact: b.signal@garvan.org.au or t.mercer@garvan.org. Supplementary information: Supplementary data are available at Bioinformatics online.
Motivation: The branchpoint element is required for the first lariat-forming reaction in splicing. However current catalogues of human branchpoints remain incomplete due to the difficulty in experimentally identifying these splicing elements. To address this limitation, we have developed a machine-learning algorithm-branchpointer-to identify branchpoint elements solely from gene annotations and genomic sequence. Results: Using branchpointer, we annotate branchpoint elements in 85% of human gene introns with sensitivity (61.8%) and specificity (97.8%). In addition to annotation, branchpointer can evaluate the impact of SNPs on branchpoint architecture to inform functional interpretation of genetic variants. Branchpointer identifies all published deleterious branchpoint mutations annotated in clinical variant databases, and finds thousands of additional clinical and common genetic variants with similar predicted effects. This genome-wide annotation of branchpoints provides a reference for the genetic analysis of splicing, and the interpretation of noncoding variation. Availability and implementation: Branchpointer is written and implemented in the statistical programming language R and is freely available under a BSD license as a package through Bioconductor. Contact: b.signal@garvan.org.au or t.mercer@garvan.org. Supplementary information: Supplementary data are available at Bioinformatics online.
Authors: Claudia Calabrese; Natalie R Davidson; Deniz Demircioğlu; Nuno A Fonseca; Yao He; André Kahles; Kjong-Van Lehmann; Fenglin Liu; Yuichi Shiraishi; Cameron M Soulette; Lara Urban; Liliana Greger; Siliang Li; Dongbing Liu; Marc D Perry; Qian Xiang; Fan Zhang; Junjun Zhang; Peter Bailey; Serap Erkek; Katherine A Hoadley; Yong Hou; Matthew R Huska; Helena Kilpinen; Jan O Korbel; Maximillian G Marin; Julia Markowski; Tannistha Nandi; Qiang Pan-Hammarström; Chandra Sekhar Pedamallu; Reiner Siebert; Stefan G Stark; Hong Su; Patrick Tan; Sebastian M Waszak; Christina Yung; Shida Zhu; Philip Awadalla; Chad J Creighton; Matthew Meyerson; B F Francis Ouellette; Kui Wu; Huanming Yang; Alvis Brazma; Angela N Brooks; Jonathan Göke; Gunnar Rätsch; Roland F Schwarz; Oliver Stegle; Zemin Zhang Journal: Nature Date: 2020-02-05 Impact factor: 49.962
Authors: Patrick G Gallagher; Yelena Maksimova; Kimberly Lezon-Geyda; Peter E Newburger; Desiree Medeiros; Robin D Hanson; Jennifer Rothman; Sara Israels; Donna A Wall; Robert F Sidonio; Colin Sieff; L Kate Gowans; Nupur Mittal; Roland Rivera-Santiago; David W Speicher; Susan J Baserga; Vincent P Schulz Journal: J Clin Invest Date: 2019-04-30 Impact factor: 14.808
Authors: Marie Wong; Chelsea Mayoh; Loretta M S Lau; David S Ziegler; Paul G Ekert; Mark J Cowley; Dong-Anh Khuong-Quang; Mark Pinese; Amit Kumar; Paulette Barahona; Emilie E Wilkie; Patricia Sullivan; Rachel Bowen-James; Mustafa Syed; Iñigo Martincorena; Federico Abascal; Alexandra Sherstyuk; Noemi A Bolanos; Jonathan Baber; Peter Priestley; M Emmy M Dolman; Emmy D G Fleuren; Marie-Emilie Gauthier; Emily V A Mould; Velimir Gayevskiy; Andrew J Gifford; Dylan Grebert-Wade; Patrick A Strong; Elodie Manouvrier; Meera Warby; David M Thomas; Judy Kirk; Katherine Tucker; Tracey O'Brien; Frank Alvaro; Geoffry B McCowage; Luciano Dalla-Pozza; Nicholas G Gottardo; Heather Tapp; Paul Wood; Seong-Lin Khaw; Jordan R Hansford; Andrew S Moore; Murray D Norris; Toby N Trahair; Richard B Lock; Vanessa Tyrrell; Michelle Haber; Glenn M Marshall Journal: Nat Med Date: 2020-10-05 Impact factor: 53.440
Authors: Amandine Duchemin; Tina O'Grady; Sarah Hanache; Agnès Mereau; Marc Thiry; Ludivine Wacheul; Catherine Michaux; Eric Perpète; Eric Hervouet; Paul Peixoto; Felix G M Ernst; Yann Audic; Franck Dequiedt; Denis L J Lafontaine; Denis Mottet Journal: Nat Commun Date: 2021-11-17 Impact factor: 14.919