Literature DB >> 31178116

A Deep Neural Network for Predicting and Engineering Alternative Polyadenylation.

Nicholas Bogard1, Johannes Linder2, Alexander B Rosenberg1, Georg Seelig3.   

Abstract

Alternative polyadenylation (APA) is a major driver of transcriptome diversity in human cells. Here, we use deep learning to predict APA from DNA sequence alone. We trained our model (APARENT, APA REgression NeT) on isoform expression data from over 3 million APA reporters. APARENT's predictions are highly accurate when tasked with inferring APA in synthetic and human 3'UTRs. Visualizing features learned across all network layers reveals that APARENT recognizes sequence motifs known to recruit APA regulators, discovers previously unknown sequence determinants of 3' end processing, and integrates these features into a comprehensive, interpretable, cis-regulatory code. We apply APARENT to forward engineer functional polyadenylation signals with precisely defined cleavage position and isoform usage and validate predictions experimentally. Finally, we use APARENT to quantify the impact of genetic variants on APA. Our approach detects pathogenic variants in a wide range of disease contexts, expanding our understanding of the genetic origins of disease.
Copyright © 2019 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  MPRA; SNV; alternative polyadenylation; cis-regulation; deep learning; generative model; mRNA processing; machine learning; massively parallel reporter assay; single nucleotide variant; synthetic biology

Mesh:

Substances:

Year:  2019        PMID: 31178116      PMCID: PMC6599575          DOI: 10.1016/j.cell.2019.04.046

Source DB:  PubMed          Journal:  Cell        ISSN: 0092-8674            Impact factor:   41.582


  77 in total

1.  Increased efficiency of mRNA 3' end formation: a new genetic mechanism contributing to hereditary thrombophilia.

Authors:  N H Gehring; U Frede; G Neu-Yilik; P Hundsdoerfer; B Vetter; M W Hentze; A E Kulozik
Journal:  Nat Genet       Date:  2001-08       Impact factor: 38.330

2.  A novel point mutation in the 3' region of the prothrombin gene at position 20221 in a Lebanese/Syrian family.

Authors:  M Wylenzek; C Geisen; L Stapenhorst; K Wielckens; K R Klingler
Journal:  Thromb Haemost       Date:  2001-05       Impact factor: 5.249

3.  Recognition of GU-rich polyadenylation regulatory elements by human CstF-64 protein.

Authors:  José Manuel Pérez Cañadillas; Gabriele Varani
Journal:  EMBO J       Date:  2003-06-02       Impact factor: 11.598

4.  The prothrombin 3'end formation signal reveals a unique architecture that is sensitive to thrombophilic gain-of-function mutations.

Authors:  Sven Danckwardt; Niels H Gehring; Gabriele Neu-Yilik; Patrick Hundsdoerfer; Margit Pforsich; Ute Frede; Matthias W Hentze; Andreas E Kulozik
Journal:  Blood       Date:  2004-04-01       Impact factor: 22.113

5.  Bioinformatic identification of candidate cis-regulatory elements involved in human mRNA polyadenylation.

Authors:  Jun Hu; Carol S Lutz; Jeffrey Wilusz; Bin Tian
Journal:  RNA       Date:  2005-08-30       Impact factor: 4.942

6.  Distribution and intensity of constraint in mammalian genomic sequence.

Authors:  Gregory M Cooper; Eric A Stone; George Asimenos; Eric D Green; Serafim Batzoglou; Arend Sidow
Journal:  Genome Res       Date:  2005-06-17       Impact factor: 9.043

7.  A rare polyadenylation signal mutation of the FOXP3 gene (AAUAAA-->AAUGAA) leads to the IPEX syndrome.

Authors:  C L Bennett; M E Brunkow; F Ramsdell; K C O'Briant; Q Zhu; R L Fuleihan; A O Shigeoka; H D Ochs; P F Chance
Journal:  Immunogenetics       Date:  2001-08       Impact factor: 2.846

8.  Molecular characterisation of the defective alpha 1-antitrypsin alleles PI Mwurzburg (Pro369Ser), Mheerlen (Pro369Leu), and Q0lisbon (Thr68Ile).

Authors:  W Poller; F Merklein; S Schneider-Rasp; A Haack; H Fechner; H Wang; I Anagnostopoulos; S Weidinger
Journal:  Eur J Hum Genet       Date:  1999-04       Impact factor: 4.246

9.  Human Fip1 is a subunit of CPSF that binds to U-rich RNA elements and stimulates poly(A) polymerase.

Authors:  Isabelle Kaufmann; Georges Martin; Arno Friedlein; Hanno Langen; Walter Keller
Journal:  EMBO J       Date:  2004-01-29       Impact factor: 11.598

10.  A comparison of RNA folding measures.

Authors:  Eva Freyhult; Paul P Gardner; Vincent Moulton
Journal:  BMC Bioinformatics       Date:  2005-10-03       Impact factor: 3.169

View more
  36 in total

1.  Recent molecular insights into canonical pre-mRNA 3'-end processing.

Authors:  Yadong Sun; Keith Hamilton; Liang Tong
Journal:  Transcription       Date:  2020-06-11

2.  Fine gene expression regulation by minor sequence variations downstream of the polyadenylation signal.

Authors:  Lidiya V Boldyreva; Lyubov A Yarinich; Elena N Kozhevnikova; Anton V Ivankin; Mikhail O Lebedev; Alexey V Pindyurin
Journal:  Mol Biol Rep       Date:  2021-01-31       Impact factor: 2.316

3.  Improving representations of genomic sequence motifs in convolutional networks with exponential activations.

Authors:  Peter K Koo; Matt Ploenzke
Journal:  Nat Mach Intell       Date:  2021-02-08

4.  Deep learning of immune cell differentiation.

Authors:  Alexandra Maslova; Ricardo N Ramirez; Ke Ma; Hugo Schmutz; Chendi Wang; Curtis Fox; Bernard Ng; Christophe Benoist; Sara Mostafavi
Journal:  Proc Natl Acad Sci U S A       Date:  2020-09-25       Impact factor: 11.205

5.  Deep learning for inferring transcription factor binding sites.

Authors:  Peter K Koo; Matt Ploenzke
Journal:  Curr Opin Syst Biol       Date:  2020-06-11

6.  Large-scale design and refinement of stable proteins using sequence-only models.

Authors:  Jedediah M Singer; Scott Novotney; Devin Strickland; Hugh K Haddox; Nicholas Leiby; Gabriel J Rocklin; Cameron M Chow; Anindya Roy; Asim K Bera; Francis C Motta; Longxing Cao; Eva-Maria Strauch; Tamuka M Chidyausiku; Alex Ford; Ethan Ho; Alexander Zaitzeff; Craig O Mackenzie; Hamed Eramian; Frank DiMaio; Gevorg Grigoryan; Matthew Vaughn; Lance J Stewart; David Baker; Eric Klavins
Journal:  PLoS One       Date:  2022-03-14       Impact factor: 3.240

7.  Interpreting Neural Networks for Biological Sequences by Learning Stochastic Masks.

Authors:  Johannes Linder; Alyssa La Fleur; Zibo Chen; Ajasja Ljubeti; David Baker; Sreeram Kannan; Georg Seelig
Journal:  Nat Mach Intell       Date:  2022-01-25

8.  An atlas of alternative polyadenylation quantitative trait loci contributing to complex trait and disease heritability.

Authors:  Lei Li; Kai-Lieh Huang; Yipeng Gao; Ya Cui; Gao Wang; Nathan D Elrod; Yumei Li; Yiling Elaine Chen; Ping Ji; Fanglue Peng; William K Russell; Eric J Wagner; Wei Li
Journal:  Nat Genet       Date:  2021-05-13       Impact factor: 38.330

9.  CROTON: an automated and variant-aware deep learning framework for predicting CRISPR/Cas9 editing outcomes.

Authors:  Victoria R Li; Zijun Zhang; Olga G Troyanskaya
Journal:  Bioinformatics       Date:  2021-07-12       Impact factor: 6.937

Review 10.  Learning the Regulatory Code of Gene Expression.

Authors:  Jan Zrimec; Filip Buric; Mariia Kokina; Victor Garcia; Aleksej Zelezniak
Journal:  Front Mol Biosci       Date:  2021-06-10
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.