Literature DB >> 29722865

Predicting RNA-protein binding sites and motifs through combining local and global deep convolutional neural networks.

Xiaoyong Pan1, Hong-Bin Shen2.   

Abstract

Motivation: RNA-binding proteins (RBPs) take over 5-10% of the eukaryotic proteome and play key roles in many biological processes, e.g. gene regulation. Experimental detection of RBP binding sites is still time-intensive and high-costly. Instead, computational prediction of the RBP binding sites using patterns learned from existing annotation knowledge is a fast approach. From the biological point of view, the local structure context derived from local sequences will be recognized by specific RBPs. However, in computational modeling using deep learning, to our best knowledge, only global representations of entire RNA sequences are employed. So far, the local sequence information is ignored in the deep model construction process.
Results: In this study, we present a computational method iDeepE to predict RNA-protein binding sites from RNA sequences by combining global and local convolutional neural networks (CNNs). For the global CNN, we pad the RNA sequences into the same length. For the local CNN, we split a RNA sequence into multiple overlapping fixed-length subsequences, where each subsequence is a signal channel of the whole sequence. Next, we train deep CNNs for multiple subsequences and the padded sequences to learn high-level features, respectively. Finally, the outputs from local and global CNNs are combined to improve the prediction. iDeepE demonstrates a better performance over state-of-the-art methods on two large-scale datasets derived from CLIP-seq. We also find that the local CNN runs 1.8 times faster than the global CNN with comparable performance when using GPUs. Our results show that iDeepE has captured experimentally verified binding motifs. Availability and implementation: https://github.com/xypan1232/iDeepE. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29722865     DOI: 10.1093/bioinformatics/bty364

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  46 in total

1.  Accurate prediction of boundaries of high resolution topologically associated domains (TADs) in fruit flies using deep learning.

Authors:  John Henderson; Vi Ly; Shawn Olichwier; Pranik Chainani; Yu Liu; Benjamin Soibam
Journal:  Nucleic Acids Res       Date:  2019-07-26       Impact factor: 16.971

2.  Protein-RNA interaction prediction with deep learning: structure matters.

Authors:  Junkang Wei; Siyuan Chen; Licheng Zong; Xin Gao; Yu Li
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

3.  DeepPN: a deep parallel neural network based on convolutional neural network and graph convolutional network for predicting RNA-protein binding sites.

Authors:  Jidong Zhang; Bo Liu; Zhihan Wang; Klaus Lehnert; Mark Gahegan
Journal:  BMC Bioinformatics       Date:  2022-06-29       Impact factor: 3.307

Review 4.  Decoding disease: from genomes to networks to phenotypes.

Authors:  Aaron K Wong; Rachel S G Sealfon; Chandra L Theesfeld; Olga G Troyanskaya
Journal:  Nat Rev Genet       Date:  2021-08-02       Impact factor: 53.242

5.  DeepImmuno: deep learning-empowered prediction and generation of immunogenic peptides for T-cell immunity.

Authors:  Guangyuan Li; Balaji Iyer; V B Surya Prasath; Yizhao Ni; Nathan Salomonis
Journal:  Brief Bioinform       Date:  2021-05-03       Impact factor: 11.622

6.  Assessing deep learning methods in cis-regulatory motif finding based on genomic sequencing data.

Authors:  Shuangquan Zhang; Anjun Ma; Jing Zhao; Dong Xu; Qin Ma; Yan Wang
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 13.994

7.  Recognizing Ion Ligand-Binding Residues by Random Forest Algorithm Based on Optimized Dihedral Angle.

Authors:  Liu Liu; Xiuzhen Hu; Zhenxing Feng; Shan Wang; Kai Sun; Shuang Xu
Journal:  Front Bioeng Biotechnol       Date:  2020-06-12

8.  A Hybrid Prediction Method for Plant lncRNA-Protein Interaction.

Authors:  Jael Sanyanda Wekesa; Yushi Luan; Ming Chen; Jun Meng
Journal:  Cells       Date:  2019-05-30       Impact factor: 6.600

9.  Thermodynamic modeling reveals widespread multivalent binding by RNA-binding proteins.

Authors:  Salma Sohrabi-Jahromi; Johannes Söding
Journal:  Bioinformatics       Date:  2021-07-12       Impact factor: 6.937

10.  Graph neural representational learning of RNA secondary structures for predicting RNA-protein interactions.

Authors:  Zichao Yan; William L Hamilton; Mathieu Blanchette
Journal:  Bioinformatics       Date:  2020-07-01       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.