Literature DB >> 33451286

Identify RNA-associated subcellular localizations based on multi-label learning using Chou's 5-steps rule.

Hao Wang1, Yijie Ding2, Jijun Tang1,3, Quan Zou4, Fei Guo5.   

Abstract

BACKGROUND: Biological functions of biomolecules rely on the cellular compartments where they are located in cells. Importantly, RNAs are assigned in specific locations of a cell, enabling the cell to implement diverse biochemical processes in the way of concurrency. However, lots of existing RNA subcellular localization classifiers only solve the problem of single-label classification. It is of great practical significance to expand RNA subcellular localization into multi-label classification problem. <br> RESULTS: In this study, we extract multi-label classification datasets about RNA-associated subcellular localizations on various types of RNAs, and then construct subcellular localization datasets on four RNA categories. In order to study Homo sapiens, we further establish human RNA subcellular localization datasets. Furthermore, we utilize different nucleotide property composition models to extract effective features to adequately represent the important information of nucleotide sequences. In the most critical part, we achieve a major challenge that is to fuse the multivariate information through multiple kernel learning based on Hilbert-Schmidt independence criterion. The optimal combined kernel can be put into an integration support vector machine model for identifying multi-label RNA subcellular localizations. Our method obtained excellent results of 0.703, 0.757, 0.787, and 0.800, respectively on four RNA data sets on average precision. <br> CONCLUSION: To be specific, our novel method performs outstanding rather than other prediction tools on novel benchmark datasets. Moreover, we establish user-friendly web server with the implementation of our method.

Entities:  

Keywords:  Hilbert-Schmidt independence criterion; Multi-label classification; Multiple kernel learning; RNA subcellular localization; Web server

Mesh:

Substances:

Year:  2021        PMID: 33451286      PMCID: PMC7811227          DOI: 10.1186/s12864-020-07347-7

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


  48 in total

1.  Nuc-PLoc: a new web-server for predicting protein subnuclear localization by fusing PseAA composition and PsePSSM.

Authors:  Hong-Bin Shen; Kuo-Chen Chou
Journal:  Protein Eng Des Sel       Date:  2007-11-10       Impact factor: 1.650

2.  iLearn: an integrated platform and meta-learner for feature engineering, machine-learning analysis and modeling of DNA, RNA and protein sequence data.

Authors:  Zhen Chen; Pei Zhao; Fuyi Li; Tatiana T Marquez-Lago; André Leier; Jerico Revote; Yan Zhu; David R Powell; Tatsuya Akutsu; Geoffrey I Webb; Kuo-Chen Chou; A Ian Smith; Roger J Daly; Jian Li; Jiangning Song
Journal:  Brief Bioinform       Date:  2020-05-21       Impact factor: 11.622

3.  iPromoter-2L: a two-layer predictor for identifying promoters and their types by multi-window-based PseKNC.

Authors:  Bin Liu; Fan Yang; De-Shuang Huang; Kuo-Chen Chou
Journal:  Bioinformatics       Date:  2018-01-01       Impact factor: 6.937

4.  The lncLocator: a subcellular localization predictor for long non-coding RNAs based on a stacked ensemble classifier.

Authors:  Zhen Cao; Xiaoyong Pan; Yang Yang; Yan Huang; Hong-Bin Shen
Journal:  Bioinformatics       Date:  2018-07-01       Impact factor: 6.937

5.  Design powerful predictor for mRNA subcellular location prediction in Homo sapiens.

Authors:  Zhao-Yue Zhang; Yu-He Yang; Hui Ding; Dong Wang; Wei Chen; Hao Lin
Journal:  Brief Bioinform       Date:  2021-01-18       Impact factor: 11.622

6.  PseAAC-Builder: a cross-platform stand-alone program for generating various special Chou's pseudo-amino acid compositions.

Authors:  Pufeng Du; Xin Wang; Chao Xu; Yang Gao
Journal:  Anal Biochem       Date:  2012-03-27       Impact factor: 3.365

Review 7.  Impacts of bioinformatics to medicinal chemistry.

Authors:  Kuo-Chen Chou
Journal:  Med Chem       Date:  2015       Impact factor: 2.745

8.  PseKNC: a flexible web server for generating pseudo K-tuple nucleotide composition.

Authors:  Wei Chen; Tian-Yu Lei; Dian-Chuan Jin; Hao Lin; Kuo-Chen Chou
Journal:  Anal Biochem       Date:  2014-04-13       Impact factor: 3.365

9.  Hum-mPLoc: an ensemble classifier for large-scale human protein subcellular location prediction by incorporating samples with multiple sites.

Authors:  Hong-Bin Shen; Kuo-Chen Chou
Journal:  Biochem Biophys Res Commun       Date:  2007-02-23       Impact factor: 3.575

10.  CD-HIT Suite: a web server for clustering and comparing biological sequences.

Authors:  Ying Huang; Beifang Niu; Ying Gao; Limin Fu; Weizhong Li
Journal:  Bioinformatics       Date:  2010-01-06       Impact factor: 6.937

View more
  9 in total

1.  iAIPs: Identifying Anti-Inflammatory Peptides Using Random Forest.

Authors:  Dongxu Zhao; Zhixia Teng; Yanjuan Li; Dong Chen
Journal:  Front Genet       Date:  2021-11-30       Impact factor: 4.599

2.  KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest.

Authors:  Yuran Jia; Shan Huang; Tianjiao Zhang
Journal:  Front Genet       Date:  2021-11-29       Impact factor: 4.599

Review 3.  Identify DNA-Binding Proteins Through the Extreme Gradient Boosting Algorithm.

Authors:  Ziye Zhao; Wen Yang; Yixiao Zhai; Yingjian Liang; Yuming Zhao
Journal:  Front Genet       Date:  2022-01-28       Impact factor: 4.599

4.  Circ-LocNet: A Computational Framework for Circular RNA Sub-Cellular Localization Prediction.

Authors:  Muhammad Nabeel Asim; Muhammad Ali Ibrahim; Muhammad Imran Malik; Andreas Dengel; Sheraz Ahmed
Journal:  Int J Mol Sci       Date:  2022-07-26       Impact factor: 6.208

5.  EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction.

Authors:  Muhammad Nabeel Asim; Muhammad Ali Ibrahim; Muhammad Imran Malik; Christoph Zehe; Olivier Cloarec; Johan Trygg; Andreas Dengel; Sheraz Ahmed
Journal:  Comput Struct Biotechnol J       Date:  2022-07-26       Impact factor: 6.155

6.  Identification of Vesicle Transport Proteins via Hypergraph Regularized K-Local Hyperplane Distance Nearest Neighbour Model.

Authors:  Rui Fan; Bing Suo; Yijie Ding
Journal:  Front Genet       Date:  2022-07-13       Impact factor: 4.772

7.  4mCPred-MTL: Accurate Identification of DNA 4mC Sites in Multiple Species Using Multi-Task Deep Learning Based on Multi-Head Attention Mechanism.

Authors:  Rao Zeng; Song Cheng; Minghong Liao
Journal:  Front Cell Dev Biol       Date:  2021-05-10

8.  Assessing Dry Weight of Hemodialysis Patients via Sparse Laplacian Regularized RVFL Neural Network with L2,1-Norm.

Authors:  Xiaoyi Guo; Wei Zhou; Qun Lu; Aiyan Du; Yinghua Cai; Yijie Ding
Journal:  Biomed Res Int       Date:  2021-02-04       Impact factor: 3.411

9.  CRBPDL: Identification of circRNA-RBP interaction sites using an ensemble neural network approach.

Authors:  Mengting Niu; Quan Zou; Chen Lin
Journal:  PLoS Comput Biol       Date:  2022-01-20       Impact factor: 4.475

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.