Literature DB >> 29701825

Predicting gene structure changes resulting from genetic variants via exon definition features.

William H Majoros1,2, Carson Holt3,4, Michael S Campbell5, Doreen Ware5,6, Mark Yandell3,4, Timothy E Reddy1,2,7.   

Abstract

Motivation: Genetic variation that disrupts gene function by altering gene splicing between individuals can substantially influence traits and disease. In those cases, accurately predicting the effects of genetic variation on splicing can be highly valuable for investigating the mechanisms underlying those traits and diseases. While methods have been developed to generate high quality computational predictions of gene structures in reference genomes, the same methods perform poorly when used to predict the potentially deleterious effects of genetic changes that alter gene splicing between individuals. Underlying that discrepancy in predictive ability are the common assumptions by reference gene finding algorithms that genes are conserved, well-formed and produce functional proteins.
Results: We describe a probabilistic approach for predicting recent changes to gene structure that may or may not conserve function. The model is applicable to both coding and non-coding genes, and can be trained on existing gene annotations without requiring curated examples of aberrant splicing. We apply this model to the problem of predicting altered splicing patterns in the genomes of individual humans, and we demonstrate that performing gene-structure prediction without relying on conserved coding features is feasible. The model predicts an unexpected abundance of variants that create de novo splice sites, an observation supported by both simulations and empirical data from RNA-seq experiments. While these de novo splice variants are commonly misinterpreted by other tools as coding or non-coding variants of little or no effect, we find that in some cases they can have large effects on splicing activity and protein products and we propose that they may commonly act as cryptic factors in disease. Availability and implementation: The software is available from geneprediction.org/SGRF. Supplementary information: Supplementary information is available at Bioinformatics online.

Entities:  

Mesh:

Year:  2018        PMID: 29701825      PMCID: PMC6198862          DOI: 10.1093/bioinformatics/bty324

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  53 in total

1.  Exon definition complexes contain the tri-snRNP and can be directly converted into B-like precatalytic splicing complexes.

Authors:  Marc Schneider; Cindy L Will; Maria Anokhina; Jamal Tazi; Henning Urlaub; Reinhard Lührmann
Journal:  Mol Cell       Date:  2010-04-23       Impact factor: 17.970

Review 2.  The SR protein family of splicing factors: master regulators of gene expression.

Authors:  Jennifer C Long; Javier F Caceres
Journal:  Biochem J       Date:  2009-01-01       Impact factor: 3.857

3.  Efficient implementation of a generalized pair hidden Markov model for comparative gene finding.

Authors:  W H Majoros; M Pertea; S L Salzberg
Journal:  Bioinformatics       Date:  2005-02-02       Impact factor: 6.937

4.  Human genomics. The human transcriptome across tissues and individuals.

Authors:  Marta Melé; Pedro G Ferreira; Ferran Reverter; David S DeLuca; Jean Monlong; Michael Sammeth; Taylor R Young; Jakob M Goldmann; Dmitri D Pervouchine; Timothy J Sullivan; Rory Johnson; Ayellet V Segrè; Sarah Djebali; Anastasia Niarchou; Fred A Wright; Tuuli Lappalainen; Miquel Calvo; Gad Getz; Emmanouil T Dermitzakis; Kristin G Ardlie; Roderic Guigó
Journal:  Science       Date:  2015-05-08       Impact factor: 47.728

5.  Biased exon/intron distribution of cryptic and de novo 3' splice sites.

Authors:  Jana Královicová; Mikkel B Christensen; Igor Vorechovský
Journal:  Nucleic Acids Res       Date:  2005-09-01       Impact factor: 16.971

6.  MutPred Splice: machine learning-based prediction of exonic variants that disrupt splicing.

Authors:  Matthew Mort; Timothy Sterne-Weiler; Biao Li; Edward V Ball; David N Cooper; Predrag Radivojac; Jeremy R Sanford; Sean D Mooney
Journal:  Genome Biol       Date:  2014-01-13       Impact factor: 13.583

7.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

8.  Gene prediction in eukaryotes with a generalized hidden Markov model that uses hints from external sources.

Authors:  Mario Stanke; Oliver Schöffmann; Burkhard Morgenstern; Stephan Waack
Journal:  BMC Bioinformatics       Date:  2006-02-09       Impact factor: 3.169

9.  Transcriptome and genome sequencing uncovers functional variation in humans.

Authors:  Tuuli Lappalainen; Michael Sammeth; Marc R Friedländer; Peter A C 't Hoen; Jean Monlong; Manuel A Rivas; Mar Gonzàlez-Porta; Natalja Kurbatova; Thasso Griebel; Pedro G Ferreira; Matthias Barann; Thomas Wieland; Liliana Greger; Maarten van Iterson; Jonas Almlöf; Paolo Ribeca; Irina Pulyakhina; Daniela Esser; Thomas Giger; Andrew Tikhonov; Marc Sultan; Gabrielle Bertier; Daniel G MacArthur; Monkol Lek; Esther Lizano; Henk P J Buermans; Ismael Padioleau; Thomas Schwarzmayr; Olof Karlberg; Halit Ongen; Helena Kilpinen; Sergi Beltran; Marta Gut; Katja Kahlem; Vyacheslav Amstislavskiy; Oliver Stegle; Matti Pirinen; Stephen B Montgomery; Peter Donnelly; Mark I McCarthy; Paul Flicek; Tim M Strom; Hans Lehrach; Stefan Schreiber; Ralf Sudbrak; Angel Carracedo; Stylianos E Antonarakis; Robert Häsler; Ann-Christine Syvänen; Gert-Jan van Ommen; Alvis Brazma; Thomas Meitinger; Philip Rosenstiel; Roderic Guigó; Ivo G Gut; Xavier Estivill; Emmanouil T Dermitzakis
Journal:  Nature       Date:  2013-09-15       Impact factor: 49.962

10.  Widespread alternative and aberrant splicing revealed by lariat sequencing.

Authors:  Nicholas Stepankiw; Madhura Raghavan; Elizabeth A Fogarty; Andrew Grimson; Jeffrey A Pleiss
Journal:  Nucleic Acids Res       Date:  2015-08-10       Impact factor: 16.971

View more
  3 in total

1.  Bayesian estimation of genetic regulatory effects in high-throughput reporter assays.

Authors:  William H Majoros; Young-Sook Kim; Alejandro Barrera; Fan Li; Xingyan Wang; Sarah J Cunningham; Graham D Johnson; Cong Guo; William L Lowe; Denise M Scholtens; M Geoffrey Hayes; Timothy E Reddy; Andrew S Allen
Journal:  Bioinformatics       Date:  2020-01-15       Impact factor: 6.937

2.  Assessing cell-specific effects of genetic variations using tRNA microarrays.

Authors:  Christine Polte; Daniel Wedemeyer; Kathryn E Oliver; Johannes Wagner; Marcel J C Bijvelds; John Mahoney; Hugo R de Jonge; Eric J Sorscher; Zoya Ignatova
Journal:  BMC Genomics       Date:  2019-07-16       Impact factor: 3.969

3.  Genome-Wide Identification of the LHC Gene Family in Kiwifruit and Regulatory Role of AcLhcb3.1/3.2 for Chlorophyll a Content.

Authors:  Juan Luo; Muhammad Abid; Jing Tu; Puxing Gao; Zupeng Wang; Hongwen Huang
Journal:  Int J Mol Sci       Date:  2022-06-10       Impact factor: 6.208

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.