Literature DB >> 33106872

Single-sequence and profile-based prediction of RNA solvent accessibility using dilated convolutional neural network.

Anil Kumar Hanumanthappa1, Jaswinder Singh1, Kuldip Paliwal1, Jaspreet Singh1, Yaoqi Zhou2.   

Abstract

MOTIVATION: RNA solvent accessibility, similar to protein solvent accessibility, reflects the structural regions that are accessible to solvents or other functional biomolecules, and plays an important role for structural and functional characterization. Unlike protein solvent accessibility, only a few tools are available for predicting RNA solvent accessibility despite the fact that millions of RNA transcripts have unknown structures and functions. Also, these tools have limited accuracy. Here, we have developed RNAsnap2 that uses a dilated convolutional neural network with a new feature, based on predicted base-pairing probabilities from LinearPartition.
RESULTS: Using the same training set from the recent predictor RNAsol, RNAsnap2 provides an 11% improvement in median Pearson Correlation Coefficient (PCC) and 9% improvement in mean absolute errors for the same test set of 45 RNA chains. A larger improvement (22% in median PCC) is observed for 31 newly deposited RNA chains that are non-redundant and independent from the training and the test sets. A single-sequence version of RNAsnap2 (i.e. without using sequence profiles generated from homology search by Infernal) has achieved comparable performance to the profile-based RNAsol. In addition, RNAsnap2 has achieved comparable performance for protein-bound and protein-free RNAs. Both RNAsnap2 and RNAsnap2 (SingleSeq) are expected to be useful for searching structural signatures and locating functional regions of non-coding RNAs.
AVAILABILITY AND IMPLEMENTATION: Standalone-versions of RNAsnap2 and RNAsnap2 (SingleSeq) are available at https://github.com/jaswindersingh2/RNAsnap2. Direct prediction can also be made at https://sparks-lab.org/server/rnasnap2. The datasets used in this research can also be downloaded from the GITHUB and the webserver mentioned above. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Year:  2021        PMID: 33106872     DOI: 10.1093/bioinformatics/btaa652

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

1.  Dissecting and predicting different types of binding sites in nucleic acids based on structural information.

Authors:  Zheng Jiang; Si-Rui Xiao; Rong Liu
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

Review 2.  Probing RNA structures and functions by solvent accessibility: an overview from experimental and computational perspectives.

Authors:  Md Solayman; Thomas Litfin; Jaswinder Singh; Kuldip Paliwal; Yaoqi Zhou; Jian Zhan
Journal:  Brief Bioinform       Date:  2022-05-13       Impact factor: 13.994

3.  CSSR: assignment of secondary structure to coarse-grained RNA tertiary structures.

Authors:  Chengxin Zhang; Anna Marie Pyle
Journal:  Acta Crystallogr D Struct Biol       Date:  2022-03-11       Impact factor: 7.652

4.  Predicting RNA distance-based contact maps by integrated deep learning on physics-inferred secondary structure and evolutionary-derived mutational coupling.

Authors:  Jaswinder Singh; Kuldip Paliwal; Thomas Litfin; Jaspreet Singh; Yaoqi Zhou
Journal:  Bioinformatics       Date:  2022-06-25       Impact factor: 6.931

5.  Increasing the Accuracy of Single Sequence Prediction Methods Using a Deep Semi-Supervised Learning Framework.

Authors:  Lewis Moffat; David T Jones
Journal:  Bioinformatics       Date:  2021-07-02       Impact factor: 6.937

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.