Literature DB >> 32426818

PASSION: an ensemble neural network approach for identifying the binding sites of RBPs on circRNAs.

Cangzhi Jia1, Yue Bi1, Jinxiang Chen2,3, André Leier4,5, Fuyi Li2,3, Jiangning Song2,3,6.   

Abstract

MOTIVATION: Different from traditional linear RNAs (containing 5' and 3' ends), circular RNAs (circRNAs) are a special type of RNAs that have a closed ring structure. Accumulating evidence has indicated that circRNAs can directly bind proteins and participate in a myriad of different biological processes.
RESULTS: For identifying the interaction of circRNAs with 37 different types of circRNA-binding proteins (RBPs), we develop an ensemble neural network, termed PASSION, which is based on the concatenated artificial neural network (ANN) and hybrid deep neural network frameworks. Specifically, the input of the ANN is the optimal feature subset for each RBP, which has been selected from six types of feature encoding schemes through incremental feature selection and application of the XGBoost algorithm. In turn, the input of the hybrid deep neural network is a stacked codon-based scheme. Benchmarking experiments indicate that the ensemble neural network reaches the average best area under the curve (AUC) of 0.883 across the 37 circRNA datasets when compared with XGBoost, k-nearest neighbor, support vector machine, random forest, logistic regression and Naive Bayes. Moreover, each of the 37 RBP models is extensively tested by performing independent tests, with the varying sequence similarity thresholds of 0.8, 0.7, 0.6 and 0.5, respectively. The corresponding average AUC obtained are 0.883, 0.876, 0.868 and 0.883, respectively, highlighting the effectiveness and robustness of PASSION. Extensive benchmarking experiments demonstrate that PASSION achieves a competitive performance for identifying binding sites between circRNA and RBPs, when compared with several state-of-the-art methods.
AVAILABILITY AND IMPLEMENTATION: A user-friendly web server of PASSION is publicly accessible at http://flagship.erc.monash.edu/PASSION/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Year:  2020        PMID: 32426818     DOI: 10.1093/bioinformatics/btaa522

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  12 in total

1.  Computational prediction of species-specific yeast DNA replication origin via iterative feature representation.

Authors:  Balachandran Manavalan; Shaherin Basith; Tae Hwan Shin; Gwang Lee
Journal:  Brief Bioinform       Date:  2021-07-20       Impact factor: 11.622

2.  iLearnPlus: a comprehensive and automated machine-learning platform for nucleic acid and protein sequence analysis, prediction and visualization.

Authors:  Zhen Chen; Pei Zhao; Chen Li; Fuyi Li; Dongxu Xiang; Yong-Zi Chen; Tatsuya Akutsu; Roger J Daly; Geoffrey I Webb; Quanzhi Zhao; Lukasz Kurgan; Jiangning Song
Journal:  Nucleic Acids Res       Date:  2021-06-04       Impact factor: 16.971

Review 3.  Large-scale comparative review and assessment of computational methods for anti-cancer peptide identification.

Authors:  Xiao Liang; Fuyi Li; Jinxiang Chen; Junlong Li; Hao Wu; Shuqin Li; Jiangning Song; Quanzhong Liu
Journal:  Brief Bioinform       Date:  2021-07-20       Impact factor: 11.622

4.  DeCban: Prediction of circRNA-RBP Interaction Sites by Using Double Embeddings and Cross-Branch Attention Networks.

Authors:  Liangliang Yuan; Yang Yang
Journal:  Front Genet       Date:  2021-01-22       Impact factor: 4.599

Review 5.  A Review of Approaches for Predicting Drug-Drug Interactions Based on Machine Learning.

Authors:  Ke Han; Peigang Cao; Yu Wang; Fang Xie; Jiaqi Ma; Mengyao Yu; Jianchun Wang; Yaoqun Xu; Yu Zhang; Jie Wan
Journal:  Front Pharmacol       Date:  2022-01-28       Impact factor: 5.810

6.  Construction and Simulation of Music Style Prediction Model under Improved Sparse Neural Network.

Authors:  Junfang Wu; Junbiao Lu
Journal:  Comput Intell Neurosci       Date:  2022-04-08

7.  Accurate identification of RNA D modification using multiple features.

Authors:  Lijun Dou; Wenyang Zhou; Lichao Zhang; Lei Xu; Ke Han
Journal:  RNA Biol       Date:  2021-03-17       Impact factor: 4.652

8.  Discrimination of Thermophilic Proteins and Non-thermophilic Proteins Using Feature Dimension Reduction.

Authors:  Zifan Guo; Pingping Wang; Zhendong Liu; Yuming Zhao
Journal:  Front Bioeng Biotechnol       Date:  2020-10-22

9.  NonClasGP-Pred: robust and efficient prediction of non-classically secreted proteins by integrating subset-specific optimal models of imbalanced data.

Authors:  Chao Wang; Jin Wu; Lei Xu; Quan Zou
Journal:  Microb Genom       Date:  2020-11-27

10.  CRBPDL: Identification of circRNA-RBP interaction sites using an ensemble neural network approach.

Authors:  Mengting Niu; Quan Zou; Chen Lin
Journal:  PLoS Comput Biol       Date:  2022-01-20       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.