Literature DB >> 33384721

ATTfold: RNA Secondary Structure Prediction With Pseudoknots Based on Attention Mechanism.

Yili Wang1,2, Yuanning Liu2,3, Shuo Wang2,3, Zhen Liu2,3,4, Yubing Gao2,3, Hao Zhang2,3, Liyan Dong2,3.   

Abstract

Accurate RNA secondary structure information is the cornerstone of gene function research and RNA tertiary structure prediction. However, most traditional RNA secondary structure prediction algorithms are based on the dynamic programming (DP) algorithm, according to the minimum free energy theory, with both hard and soft constraints. The accuracy is particularly dependent on the accuracy of soft constraints (from experimental data like chemical and enzyme detection). With the elongation of the RNA sequence, the time complexity of DP-based algorithms will increase geometrically, as a result, they are not good at coping with relatively long sequences. Furthermore, due to the complexity of the pseudoknots structure, the secondary structure prediction method, based on traditional algorithms, has great defects which cannot predict the secondary structure with pseudoknots well. Therefore, few algorithms have been available for pseudoknots prediction in the past. The ATTfold algorithm proposed in this article is a deep learning algorithm based on an attention mechanism. It analyzes the global information of the RNA sequence via the characteristics of the attention mechanism, focuses on the correlation between paired bases, and solves the problem of long sequence prediction. Moreover, this algorithm also extracts the effective multi-dimensional features from a great number of RNA sequences and structure information, by combining the exclusive hard constraints of RNA secondary structure. Hence, it accurately determines the pairing position of each base, and obtains the real and effective RNA secondary structure, including pseudoknots. Finally, after training the ATTfold algorithm model through tens of thousands of RNA sequences and their real secondary structures, this algorithm was compared with four classic RNA secondary structure prediction algorithms. The results show that our algorithm significantly outperforms others and more accurately showed the secondary structure of RNA. As the data in RNA sequence databases increase, our deep learning-based algorithm will have superior performance. In the future, this kind of algorithm will be more indispensable.
Copyright © 2020 Wang, Liu, Wang, Liu, Gao, Zhang and Dong.

Entities:  

Keywords:  RNA; attention mechanism; deep learning; hard constraints; pseudoknot; secondary structure prediction

Year:  2020        PMID: 33384721      PMCID: PMC7770172          DOI: 10.3389/fgene.2020.612086

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


  22 in total

Review 1.  The G x U wobble base pair. A fundamental building block of RNA structure crucial to RNA function in diverse biological systems.

Authors:  G Varani; W H McClain
Journal:  EMBO Rep       Date:  2000-07       Impact factor: 8.807

2.  ProbKnot: fast prediction of RNA secondary structure including pseudoknots.

Authors:  Stanislav Bellaousov; David H Mathews
Journal:  RNA       Date:  2010-08-10       Impact factor: 4.942

3.  The centrality of RNA.

Authors:  Phillip A Sharp
Journal:  Cell       Date:  2009-02-20       Impact factor: 41.582

4.  In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features.

Authors:  Yiliang Ding; Yin Tang; Chun Kit Kwok; Yu Zhang; Philip C Bevilacqua; Sarah M Assmann
Journal:  Nature       Date:  2013-11-24       Impact factor: 49.962

5.  Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information.

Authors:  M Zuker; P Stiegler
Journal:  Nucleic Acids Res       Date:  1981-01-10       Impact factor: 16.971

6.  RNAstructure: software for RNA secondary structure prediction and analysis.

Authors:  Jessica S Reuter; David H Mathews
Journal:  BMC Bioinformatics       Date:  2010-03-15       Impact factor: 3.169

7.  Exact calculation of loop formation probability identifies folding motifs in RNA secondary structures.

Authors:  Michael F Sloma; David H Mathews
Journal:  RNA       Date:  2016-10-19       Impact factor: 4.942

Review 8.  Sizing up long non-coding RNAs: do lncRNAs have secondary and tertiary structure?

Authors:  Irina V Novikova; Scott P Hennelly; Karissa Y Sanbonmatsu
Journal:  Bioarchitecture       Date:  2012 Nov-Dec

9.  Design, implementation and evaluation of a practical pseudoknot folding algorithm based on thermodynamics.

Authors:  Jens Reeder; Robert Giegerich
Journal:  BMC Bioinformatics       Date:  2004-08-04       Impact factor: 3.169

Review 10.  Viral RNA pseudoknots: versatile motifs in gene expression and replication.

Authors:  Ian Brierley; Simon Pennell; Robert J C Gilbert
Journal:  Nat Rev Microbiol       Date:  2007-08       Impact factor: 60.633

View more
  3 in total

Review 1.  Non-Coding RNAs: New Dawn for Diabetes Mellitus Induced Erectile Dysfunction.

Authors:  Wenchao Xu; Hongyang Jiang; Jihong Liu; Hao Li
Journal:  Front Mol Biosci       Date:  2022-06-22

2.  Inverse folding based pre-training for the reliable identification of intrinsic transcription terminators.

Authors:  Vivian B Brandenburg; Franz Narberhaus; Axel Mosig
Journal:  PLoS Comput Biol       Date:  2022-07-07       Impact factor: 4.779

3.  Knotify: An Efficient Parallel Platform for RNA Pseudoknot Prediction Using Syntactic Pattern Recognition.

Authors:  Christos Andrikos; Evangelos Makris; Angelos Kolaitis; Georgios Rassias; Christos Pavlatos; Panayiotis Tsanakas
Journal:  Methods Protoc       Date:  2022-02-02
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.