Literature DB >> 32920048

iDRBP_MMC: Identifying DNA-Binding Proteins and RNA-Binding Proteins Based on Multi-Label Learning Model and Motif-Based Convolutional Neural Network.

Jun Zhang1, Qingcai Chen2, Bin Liu3.   

Abstract

DNA-binding protein (DBP) and RNA-binding protein (RBP) are playing crucial roles in gene expression. Accurate identification of them is of great significance, and accurately computational predictors are highly required. In previous studies, DBP recognition and RBP recognition were treated as two separate tasks. Because the functional and structural similarities between DBPs and RBPs are high, the DBP predictors tend to predict RBPs as DBPs, while the RBP predictors tend to predict the DBPs as the RBPs, leading to high cross-prediction rate and low prediction precision. Here we introduced a multi-label learning model based on the motif-based convolutional neural network, and a sequence-based computational method called iDRBP_MMC was proposed to solve the cross-prediction problem so as to improve the predictive performance of DBPs and RBPs. The results on four test datasets showed that it outperformed other state-of-the-art DBP predictors and RBP predictors. When applied to analyze the tomato genome, the results reveal the ability of iDRBP_MMC for large-scale data analysis. Moreover, iDRBP_MMC can identify the proteins binding to both DNA and RNA, which is beyond the scope of existing DBP predictors or RBP predictors. The web-server of iDRBP_MMC is freely available at http://bliulab.net/iDRBP_MMC.
Copyright © 2020 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  cross-prediction problem; motif-based convolutional neural network; multi-label learning; nucleic acid binding protein prediction

Mesh:

Substances:

Year:  2020        PMID: 32920048     DOI: 10.1016/j.jmb.2020.09.008

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  7 in total

1.  RBP-TSTL is a two-stage transfer learning framework for genome-scale prediction of RNA-binding proteins.

Authors:  Xinxin Peng; Xiaoyu Wang; Yuming Guo; Zongyuan Ge; Fuyi Li; Xin Gao; Jiangning Song
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

2.  Comparative Analysis on Alignment-Based and Pretrained Feature Representations for the Identification of DNA-Binding Proteins.

Authors:  Die Chen; Hua Zhang; Zeqi Chen; Bo Xie; Ye Wang
Journal:  Comput Math Methods Med       Date:  2022-06-28       Impact factor: 2.809

3.  BioSeq-BLM: a platform for analyzing DNA, RNA and protein sequences based on biological language models.

Authors:  Hong-Liang Li; Yi-He Pang; Bin Liu
Journal:  Nucleic Acids Res       Date:  2021-12-16       Impact factor: 16.971

4.  EDLMFC: an ensemble deep learning framework with multi-scale features combination for ncRNA-protein interaction prediction.

Authors:  Jingjing Wang; Yanpeng Zhao; Weikang Gong; Yang Liu; Mei Wang; Xiaoqian Huang; Jianjun Tan
Journal:  BMC Bioinformatics       Date:  2021-03-19       Impact factor: 3.169

5.  The Characterization of Structure and Prediction for Aquaporin in Tumour Progression by Machine Learning.

Authors:  Zheng Chen; Shihu Jiao; Da Zhao; Quan Zou; Lei Xu; Lijun Zhang; Xi Su
Journal:  Front Cell Dev Biol       Date:  2022-02-01

6.  DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins.

Authors:  Feifei Cui; Shuang Li; Zilong Zhang; Miaomiao Sui; Chen Cao; Abd El-Latif Hesham; Quan Zou
Journal:  Comput Struct Biotechnol J       Date:  2022-04-26       Impact factor: 6.155

7.  DNA- and RNA-Binding Proteins Linked Transcriptional Control and Alternative Splicing Together in a Two-Layer Regulatory Network System of Chronic Myeloid Leukemia.

Authors:  Chuhui Wang; Xueqing Zong; Fanjie Wu; Ricky Wai Tak Leung; Yaohua Hu; Jing Qin
Journal:  Front Mol Biosci       Date:  2022-08-16
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.