Literature DB >> 31081512

DeepPASTA: deep neural network based polyadenylation site analysis.

Ashraful Arefeen1, Xinshu Xiao2, Tao Jiang1,3,4.   

Abstract

MOTIVATION: Alternative polyadenylation (polyA) sites near the 3' end of a pre-mRNA create multiple mRNA transcripts with different 3' untranslated regions (3' UTRs). The sequence elements of a 3' UTR are essential for many biological activities such as mRNA stability, sub-cellular localization, protein translation, protein binding and translation efficiency. Moreover, numerous studies in the literature have reported the correlation between diseases and the shortening (or lengthening) of 3' UTRs. As alternative polyA sites are common in mammalian genes, several machine learning tools have been published for predicting polyA sites from sequence data. These tools either consider limited sequence features or use relatively old algorithms for polyA site prediction. Moreover, none of the previous tools consider RNA secondary structures as a feature to predict polyA sites.
RESULTS: In this paper, we propose a new deep learning model, called DeepPASTA, for predicting polyA sites from both sequence and RNA secondary structure data. The model is then extended to predict tissue-specific polyA sites. Moreover, the tool can predict the most dominant (i.e. frequently used) polyA site of a gene in a specific tissue and relative dominance when two polyA sites of the same gene are given. Our extensive experiments demonstrate that DeepPASTA signisficantly outperforms the existing tools for polyA site prediction and tissue-specific relative and absolute dominant polyA site prediction.
AVAILABILITY AND IMPLEMENTATION: https://github.com/arefeen/DeepPASTA. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31081512      PMCID: PMC6853695          DOI: 10.1093/bioinformatics/btz283

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  41 in total

1.  Secondary structure as a functional feature in the downstream region of mammalian polyadenylation signals.

Authors:  Chunxiao Wu; James C Alwine
Journal:  Mol Cell Biol       Date:  2004-04       Impact factor: 4.272

2.  Bioinformatic identification of candidate cis-regulatory elements involved in human mRNA polyadenylation.

Authors:  Jun Hu; Carol S Lutz; Jeffrey Wilusz; Bin Tian
Journal:  RNA       Date:  2005-08-30       Impact factor: 4.942

3.  An RNA secondary structure juxtaposes two remote genetic signals for human T-cell leukemia virus type I RNA 3'-end processing.

Authors:  A Bar-Shira; A Panet; A Honigman
Journal:  J Virol       Date:  1991-10       Impact factor: 5.103

Review 4.  Alternative polyadenylation of mRNA precursors.

Authors:  Bin Tian; James L Manley
Journal:  Nat Rev Mol Cell Biol       Date:  2016-09-28       Impact factor: 94.444

5.  An in-depth map of polyadenylation sites in cancer.

Authors:  Yuefeng Lin; Zhihua Li; Fatih Ozsolak; Sang Woo Kim; Gustavo Arango-Argoty; Teresa T Liu; Scott A Tenenbaum; Timothy Bailey; A Paula Monaghan; Patrice M Milos; Bino John
Journal:  Nucleic Acids Res       Date:  2012-06-29       Impact factor: 16.971

6.  POLYAR, a new computer program for prediction of poly(A) sites in human sequences.

Authors:  Malik Nadeem Akhtar; Syed Abbas Bukhari; Zeeshan Fazal; Raheel Qamar; Ilham A Shahmuradov
Journal:  BMC Genomics       Date:  2010-11-19       Impact factor: 3.969

7.  GraphProt: modeling binding preferences of RNA-binding proteins.

Authors:  Daniel Maticzka; Sita J Lange; Fabrizio Costa; Rolf Backofen
Journal:  Genome Biol       Date:  2014-01-22       Impact factor: 13.583

8.  Poly(A) code analyses reveal key determinants for tissue-specific mRNA alternative polyadenylation.

Authors:  Lingjie Weng; Yi Li; Xiaohui Xie; Yongsheng Shi
Journal:  RNA       Date:  2016-04-19       Impact factor: 4.942

9.  Genome-wide identification and predictive modeling of tissue-specific alternative polyadenylation.

Authors:  Dina Hafez; Ting Ni; Sayan Mukherjee; Jun Zhu; Uwe Ohler
Journal:  Bioinformatics       Date:  2013-07-01       Impact factor: 6.937

10.  TITER: predicting translation initiation sites by deep learning.

Authors:  Sai Zhang; Hailin Hu; Tao Jiang; Lei Zhang; Jianyang Zeng
Journal:  Bioinformatics       Date:  2017-07-15       Impact factor: 6.937

View more
  7 in total

1.  Interpreting Neural Networks for Biological Sequences by Learning Stochastic Masks.

Authors:  Johannes Linder; Alyssa La Fleur; Zibo Chen; Ajasja Ljubeti; David Baker; Sreeram Kannan; Georg Seelig
Journal:  Nat Mach Intell       Date:  2022-01-25

2.  Leveraging omic features with F3UTER enables identification of unannotated 3'UTRs for synaptic genes.

Authors:  Mina Ryten; Harpreet Saini; Juan A Botia; Siddharth Sethi; David Zhang; Sebastian Guelfi; Zhongbo Chen; Sonia Garcia-Ruiz; Emmanuel O Olagbaju
Journal:  Nat Commun       Date:  2022-04-27       Impact factor: 17.694

Review 3.  Alternative polyadenylation: methods, mechanism, function, and role in cancer.

Authors:  Yi Zhang; Lian Liu; Qiongzi Qiu; Qing Zhou; Jinwang Ding; Yan Lu; Pengyuan Liu
Journal:  J Exp Clin Cancer Res       Date:  2021-02-01

4.  Aptardi predicts polyadenylation sites in sample-specific transcriptomes using high-throughput RNA sequencing and DNA sequence.

Authors:  Ryan Lusk; Evan Stene; Farnoush Banaei-Kashani; Boris Tabakoff; Katerina Kechris; Laura M Saba
Journal:  Nat Commun       Date:  2021-03-12       Impact factor: 14.919

5.  SCAPTURE: a deep learning-embedded pipeline that captures polyadenylation information from 3' tag-based RNA-seq of single cells.

Authors:  Guo-Wei Li; Fang Nan; Guo-Hua Yuan; Chu-Xiao Liu; Xindong Liu; Ling-Ling Chen; Bin Tian; Li Yang
Journal:  Genome Biol       Date:  2021-08-10       Impact factor: 13.583

6.  Genomics enters the deep learning era.

Authors:  Etienne Routhier; Julien Mozziconacci
Journal:  PeerJ       Date:  2022-06-24       Impact factor: 3.061

7.  Analysis of Polyadenylation Signal Usage with Full-Length Transcriptome in Spodoptera frugiperda (Lepidoptera: Noctuidae).

Authors:  Liying Fang; Lina Guo; Min Zhang; Xianchun Li; Zhongyuan Deng
Journal:  Insects       Date:  2022-09-02       Impact factor: 3.139

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.