Literature DB >> 33526657

A machine learning-based framework for modeling transcription elongation.

Peiyuan Feng1, An Xiao1, Meng Fang2, Fangping Wan1, Shuya Li1, Peng Lang1, Dan Zhao3, Jianyang Zeng3,4.   

Abstract

RNA polymerase II (Pol II) generally pauses at certain positions along gene bodies, thereby interrupting the transcription elongation process, which is often coupled with various important biological functions, such as precursor mRNA splicing and gene expression regulation. Characterizing the transcriptional elongation dynamics can thus help us understand many essential biological processes in eukaryotic cells. However, experimentally measuring Pol II elongation rates is generally time and resource consuming. We developed PEPMAN (polymerase II elongation pausing modeling through attention-based deep neural network), a deep learning-based model that accurately predicts Pol II pausing sites based on the native elongating transcript sequencing (NET-seq) data. Through fully taking advantage of the attention mechanism, PEPMAN is able to decipher important sequence features underlying Pol II pausing. More importantly, we demonstrated that the analyses of the PEPMAN-predicted results around various types of alternative splicing sites can provide useful clues into understanding the cotranscriptional splicing events. In addition, associating the PEPMAN prediction results with different epigenetic features can help reveal important factors related to the transcription elongation process. All these results demonstrated that PEPMAN can provide a useful and effective tool for modeling transcription elongation and understanding the related biological factors from available high-throughput sequencing data.

Keywords:  Pol II pausing; alternative splicing; deep learning

Year:  2021        PMID: 33526657      PMCID: PMC8017690          DOI: 10.1073/pnas.2007450118

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  58 in total

1.  Native elongating transcript sequencing (NET-seq).

Authors:  L Stirling Churchman; Jonathan S Weissman
Journal:  Curr Protoc Mol Biol       Date:  2012-04

2.  ChIP-Seq: a method for global identification of regulatory elements in the genome.

Authors:  Debasish Raha; Miyoung Hong; Michael Snyder
Journal:  Curr Protoc Mol Biol       Date:  2010-07

Review 3.  The role of chromatin during transcription.

Authors:  Bing Li; Michael Carey; Jerry L Workman
Journal:  Cell       Date:  2007-02-23       Impact factor: 41.582

4.  Global analysis of nascent RNA reveals transcriptional pausing in terminal exons.

Authors:  Fernando Carrillo Oesterreich; Stephan Preibisch; Karla M Neugebauer
Journal:  Mol Cell       Date:  2010-11-24       Impact factor: 17.970

5.  Acetylation on histone H3 lysine 9 mediates a switch from transcription initiation to elongation.

Authors:  Leah A Gates; Jiejun Shi; Aarti D Rohira; Qin Feng; Bokai Zhu; Mark T Bedford; Cari A Sagum; Sung Yun Jung; Jun Qin; Ming-Jer Tsai; Sophia Y Tsai; Wei Li; Charles E Foulds; Bert W O'Malley
Journal:  J Biol Chem       Date:  2017-07-17       Impact factor: 5.157

6.  Effects of RNA secondary structure on alternative splicing of pre-mRNA: is folding limited to a region behind the transcribing RNA polymerase?

Authors:  L P Eperon; I R Graham; A D Griffiths; I C Eperon
Journal:  Cell       Date:  1988-07-29       Impact factor: 41.582

7.  EWS, but not EWS-FLI-1, is associated with both TFIID and RNA polymerase II: interactions between two members of the TET family, EWS and hTAFII68, and subunits of TFIID and RNA polymerase II complexes.

Authors:  A Bertolotti; T Melot; J Acker; M Vigneron; O Delattre; L Tora
Journal:  Mol Cell Biol       Date:  1998-03       Impact factor: 4.272

8.  Applied force provides insight into transcriptional pausing and its modulation by transcription factor NusA.

Authors:  Jing Zhou; Kook Sun Ha; Arthur La Porta; Robert Landick; Steven M Block
Journal:  Mol Cell       Date:  2011-11-18       Impact factor: 17.970

9.  Human transcription elongation factor NELF: identification of novel subunits and reconstitution of the functionally active complex.

Authors:  Takashi Narita; Yuki Yamaguchi; Keiichi Yano; Seiji Sugimoto; Sittinan Chanarat; Tadashi Wada; Dong-ki Kim; Jun Hasegawa; Masashi Omori; Naoto Inukai; Masaki Endoh; Tomoko Yamada; Hiroshi Handa
Journal:  Mol Cell Biol       Date:  2003-03       Impact factor: 4.272

10.  DanQ: a hybrid convolutional and recurrent deep neural network for quantifying the function of DNA sequences.

Authors:  Daniel Quang; Xiaohui Xie
Journal:  Nucleic Acids Res       Date:  2016-04-15       Impact factor: 16.971

View more
  4 in total

1.  Predicting CRISPR/Cas9 Repair Outcomes by Attention-Based Deep Learning Framework.

Authors:  Xiuqin Liu; Shuya Wang; Dongmei Ai
Journal:  Cells       Date:  2022-06-05       Impact factor: 7.666

Review 2.  The histone chaperone FACT: a guardian of chromatin structure integrity.

Authors:  Célia Jeronimo; François Robert
Journal:  Transcription       Date:  2022-04-29

3.  Roles of Physicochemical and Structural Properties of RNA-Binding Proteins in Predicting the Activities of Trans-Acting Splicing Factors with Machine Learning.

Authors:  Lin Zhu; Wenjin Li
Journal:  Int J Mol Sci       Date:  2022-04-17       Impact factor: 6.208

Review 4.  Virtual Gene Concept and a Corresponding Pragmatic Research Program in Genetical Data Science.

Authors:  Łukasz Huminiecki
Journal:  Entropy (Basel)       Date:  2021-12-23       Impact factor: 2.524

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.