Literature DB >> 33080214

KD-KLNMF: Identification of lncRNAs subcellular localization with multiple features and nonnegative matrix factorization.

Shengli Zhang1, Huijuan Qiao2.   

Abstract

Long non-coding RNAs (lncRNAs) refer to functional RNA molecules with a length more than 200 nucleotides and have minimal or no function to encode proteins. In recent years, more studies show that lncRNAs subcellular localization has valuable clues for their biological functions. So it is count for much to identify lncRNAs subcellular localization. In this paper, a novel statistical model named KD-KLNMF is constructed to predict lncRNAs subcellular localization. Firstly, k-mer and dinucleotide-based spatial autocorrelation are incorporated as the feature vector. Then, Synthetic Minority Over-sampling Technique is used to deal with the imbalance dataset. Next, Kullback-Leibler divergence-based nonnegative matrix factorization is applied to select optimal features. And then we utilize support vector machine as the classifier after comparing with other classifiers. Finally, the jackknife test is performed to evaluate the model. The overall accuracies reach 97.24% and 92.86% on training dataset and independent dataset, respectively. The results are better than the previous methods, which indicate that our model will be a useful and feasible tool to identify lncRNAs subcellular localization. The datasets and source code are freely available at https://github.com/HuijuanQiao/KD-KLNMF.
Copyright © 2020 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  K-mer; KLNMF; LncRNAs subcellular localization; SMOTE; Spatial autocorrelation

Mesh:

Substances:

Year:  2020        PMID: 33080214     DOI: 10.1016/j.ab.2020.113995

Source DB:  PubMed          Journal:  Anal Biochem        ISSN: 0003-2697            Impact factor:   3.365


  2 in total

1.  Accurate Prediction of Anti-hypertensive Peptides Based on Convolutional Neural Network and Gated Recurrent unit.

Authors:  Hongyan Shi; Shengli Zhang
Journal:  Interdiscip Sci       Date:  2022-04-27       Impact factor: 3.492

2.  EL-RMLocNet: An explainable LSTM network for RNA-associated multi-compartment localization prediction.

Authors:  Muhammad Nabeel Asim; Muhammad Ali Ibrahim; Muhammad Imran Malik; Christoph Zehe; Olivier Cloarec; Johan Trygg; Andreas Dengel; Sheraz Ahmed
Journal:  Comput Struct Biotechnol J       Date:  2022-07-26       Impact factor: 6.155

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.