Literature DB >> 33388743

SubLocEP: a novel ensemble predictor of subcellular localization of eukaryotic mRNA based on machine learning.

Jing Li1, Lichao Zhang2, Shida He1, Fei Guo1, Quan Zou1.   

Abstract

MOTIVATION: mRNA location corresponds to the location of protein translation and contributes to precise spatial and temporal management of the protein function. However, current assignment of subcellular localization of eukaryotic mRNA reveals important limitations: (1) turning multiple classifications into multiple dichotomies makes the training process tedious; (2) the majority of the models trained by classical algorithm are based on the extraction of single sequence information; (3) the existing state-of-the-art models have not reached an ideal level in terms of prediction and generalization ability. To achieve better assignment of subcellular localization of eukaryotic mRNA, a better and more comprehensive model must be developed.
RESULTS: In this paper, SubLocEP is proposed as a two-layer integrated prediction model for accurate prediction of the location of sequence samples. Unlike the existing models based on limited features, SubLocEP comprehensively considers additional feature attributes and is combined with LightGBM to generated single feature classifiers. The initial integration model (single-layer model) is generated according to the categories of a feature. Subsequently, two single-layer integration models are weighted (sequence-based: physicochemical properties = 3:2) to produce the final two-layer model. The performance of SubLocEP on independent datasets is sufficient to indicate that SubLocEP is an accurate and stable prediction model with strong generalization ability. Additionally, an online tool has been developed that contains experimental data and can maximize the user convenience for estimation of subcellular localization of eukaryotic mRNA.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Keywords:  LightGBM; ensemble model; feature extraction; subcellular localization of eukaryotic mRNA

Year:  2021        PMID: 33388743     DOI: 10.1093/bib/bbaa401

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  2 in total

1.  LGFC-CNN: Prediction of lncRNA-Protein Interactions by Using Multiple Types of Features through Deep Learning.

Authors:  Lan Huang; Shaoqing Jiao; Sen Yang; Shuangquan Zhang; Xiaopeng Zhu; Rui Guo; Yan Wang
Journal:  Genes (Basel)       Date:  2021-10-24       Impact factor: 4.096

2.  EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction.

Authors:  Muhammad Nabeel Asim; Muhammad Ali Ibrahim; Muhammad Imran Malik; Christoph Zehe; Olivier Cloarec; Johan Trygg; Andreas Dengel; Sheraz Ahmed
Journal:  Comput Struct Biotechnol J       Date:  2022-07-26       Impact factor: 6.155

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.