Literature DB >> 35109787

RNA secondary structure prediction with convolutional neural networks.

Mehdi Saman Booy1, Alexander Ilin2, Pekka Orponen2.   

Abstract

BACKGROUND: Predicting the secondary, i.e. base-pairing structure of a folded RNA strand is an important problem in synthetic and computational biology. First-principle algorithmic approaches to this task are challenging because existing models of the folding process are inaccurate, and even if a perfect model existed, finding an optimal solution would be in general NP-complete.
RESULTS: In this paper, we propose a simple, yet effective data-driven approach. We represent RNA sequences in the form of three-dimensional tensors in which we encode possible relations between all pairs of bases in a given sequence. We then use a convolutional neural network to predict a two-dimensional map which represents the correct pairings between the bases. Our model achieves significant accuracy improvements over existing methods on two standard datasets, RNAStrAlign and ArchiveII, for 10 RNA families, where our experiments show excellent performance of the model across a wide range of sequence lengths. Since our matrix representation and post-processing approaches do not require the structures to be pseudoknot-free, we get similar good performance also for pseudoknotted structures.
CONCLUSION: We show how to use an artificial neural network design to predict the structure for a given RNA sequence with high accuracy only by learning from samples whose native structures have been experimentally characterized, independent of any energy model.
© 2022. The Author(s).

Entities:  

Keywords:  Deep learning; Pseudoknotted structures; RNA structure prediction

Mesh:

Substances:

Year:  2022        PMID: 35109787      PMCID: PMC8812003          DOI: 10.1186/s12859-021-04540-7

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  18 in total

1.  ProbKnot: fast prediction of RNA secondary structure including pseudoknots.

Authors:  Stanislav Bellaousov; David H Mathews
Journal:  RNA       Date:  2010-08-10       Impact factor: 4.942

2.  Thermodynamic parameters for an expanded nearest-neighbor model for formation of RNA duplexes with Watson-Crick base pairs.

Authors:  T Xia; J SantaLucia; M E Burkard; R Kierzek; S J Schroeder; X Jiao; C Cox; D H Turner
Journal:  Biochemistry       Date:  1998-10-20       Impact factor: 3.162

3.  Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information.

Authors:  M Zuker; P Stiegler
Journal:  Nucleic Acids Res       Date:  1981-01-10       Impact factor: 16.971

4.  TurboFold II: RNA structural alignment and secondary structure prediction informed by multiple homologs.

Authors:  Zhen Tan; Yinghan Fu; Gaurav Sharma; David H Mathews
Journal:  Nucleic Acids Res       Date:  2017-11-16       Impact factor: 16.971

5.  RNAstructure: Web servers for RNA secondary structure prediction and analysis.

Authors:  Stanislav Bellaousov; Jessica S Reuter; Matthew G Seetin; David H Mathews
Journal:  Nucleic Acids Res       Date:  2013-04-24       Impact factor: 16.971

6.  IPknot: fast and accurate prediction of RNA secondary structures with pseudoknots using integer programming.

Authors:  Kengo Sato; Yuki Kato; Michiaki Hamada; Tatsuya Akutsu; Kiyoshi Asai
Journal:  Bioinformatics       Date:  2011-07-01       Impact factor: 6.937

7.  ViennaRNA Package 2.0.

Authors:  Ronny Lorenz; Stephan H Bernhart; Christian Höner Zu Siederdissen; Hakim Tafer; Christoph Flamm; Peter F Stadler; Ivo L Hofacker
Journal:  Algorithms Mol Biol       Date:  2011-11-24       Impact factor: 1.405

8.  Exact calculation of loop formation probability identifies folding motifs in RNA secondary structures.

Authors:  Michael F Sloma; David H Mathews
Journal:  RNA       Date:  2016-10-19       Impact factor: 4.942

9.  Solving the RNA design problem with reinforcement learning.

Authors:  Peter Eastman; Jade Shi; Bharath Ramsundar; Vijay S Pande
Journal:  PLoS Comput Biol       Date:  2018-06-21       Impact factor: 4.475

10.  RNA secondary structure prediction using an ensemble of two-dimensional deep neural networks and transfer learning.

Authors:  Jaswinder Singh; Jack Hanson; Kuldip Paliwal; Yaoqi Zhou
Journal:  Nat Commun       Date:  2019-11-27       Impact factor: 14.919

View more
  1 in total

1.  Inverse folding based pre-training for the reliable identification of intrinsic transcription terminators.

Authors:  Vivian B Brandenburg; Franz Narberhaus; Axel Mosig
Journal:  PLoS Comput Biol       Date:  2022-07-07       Impact factor: 4.779

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.