Literature DB >> 35006529

EnANNDeep: An Ensemble-based lncRNA-protein Interaction Prediction Framework with Adaptive k-Nearest Neighbor Classifier and Deep Models.

Lihong Peng1,2, Jingwei Tan3, Xiongfei Tian3, Liqian Zhou4.   

Abstract

lncRNA-protein interactions (LPIs) prediction can deepen the understanding of many important biological processes. Artificial intelligence methods have reported many possible LPIs. However, most computational techniques were evaluated mainly on one dataset, which may produce prediction bias. More importantly, they were validated only under cross validation on lncRNA-protein pairs, and did not consider the performance under cross validations on lncRNAs and proteins, thus fail to search related proteins/lncRNAs for a new lncRNA/protein. Under an ensemble learning framework (EnANNDeep) composed of adaptive k-nearest neighbor classifier and Deep models, this study focuses on systematically finding underlying linkages between lncRNAs and proteins. First, five LPI-related datasets are arranged. Second, multiple source features are integrated to depict an lncRNA-protein pair. Third, adaptive k-nearest neighbor classifier, deep neural network, and deep forest are designed to score unknown lncRNA-protein pairs, respectively. Finally, interaction probabilities from the three predictors are integrated based on a soft voting technique. In comparing to five classical LPI identification models (SFPEL, PMDKN, CatBoost, PLIPCOM, and LPI-SKF) under fivefold cross validations on lncRNAs, proteins, and LPIs, EnANNDeep computes the best average AUCs of 0.8660, 0.8775, and 0.9166, respectively, and the best average AUPRs of 0.8545, 0.8595, and 0.9054, respectively, indicating its superior LPI prediction ability. Case study analyses indicate that SNHG10 may have dense linkage with Q15717. In the ensemble framework, adaptive k-nearest neighbor classifier can separately pick the most appropriate k for each query lncRNA-protein pair. More importantly, deep models including deep neural network and deep forest can effectively learn the representative features of lncRNAs and proteins.
© 2022. International Association of Scientists in the Interdisciplinary Areas.

Entities:  

Keywords:  Adaptive k-nearest neighbor; Deep forest; Deep neural network; Ensemble learning; lncRNA–protein interaction

Mesh:

Substances:

Year:  2022        PMID: 35006529     DOI: 10.1007/s12539-021-00483-y

Source DB:  PubMed          Journal:  Interdiscip Sci        ISSN: 1867-1462            Impact factor:   2.233


  27 in total

1.  The activatory long non-coding RNA DBE-T reveals the epigenetic etiology of facioscapulohumeral muscular dystrophy.

Authors:  Miguel Vizoso; Manel Esteller
Journal:  Cell Res       Date:  2012-06-19       Impact factor: 25.617

2.  LPGNMF: Predicting Long Non-Coding RNA and Protein Interaction Using Graph Regularized Nonnegative Matrix Factorization.

Authors:  Tianyi Zhang; Minghui Wang; Jianing Xi; Ao Li
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2018-07-30       Impact factor: 3.710

3.  Compared analysis of LncRNA expression profiling in pdk1 gene knockout mice at two time points.

Authors:  Hailang Liu; Guixian Song; Lijuan Zhou; Xiaoshan Hu; Ming Liu; Junwei Nie; Shuangshuang Lu; Xiangqi Wu; Yunshan Cao; Lichan Tao; Ling Chen; Lingmei Qian
Journal:  Cell Physiol Biochem       Date:  2013-11-29

4.  HLPI-Ensemble: Prediction of human lncRNA-protein interactions based on ensemble strategy.

Authors:  Huan Hu; Li Zhang; Haixin Ai; Hui Zhang; Yetian Fan; Qi Zhao; Hongsheng Liu
Journal:  RNA Biol       Date:  2018-06-06       Impact factor: 4.652

5.  A path-based computational model for long non-coding RNA-protein interaction prediction.

Authors:  Hui Zhang; Zhong Ming; Chunlong Fan; Qi Zhao; Hongsheng Liu
Journal:  Genomics       Date:  2019-10-19       Impact factor: 5.736

Review 6.  LncRNA HOXA-AS2 and its molecular mechanisms in human cancer.

Authors:  Jicai Wang; Zhilei Su; Shounan Lu; Wen Fu; Zhifa Liu; Xingming Jiang; Sheng Tai
Journal:  Clin Chim Acta       Date:  2018-07-04       Impact factor: 3.786

7.  LncDisease: a sequence based bioinformatics tool for predicting lncRNA-disease associations.

Authors:  Junyi Wang; Ruixia Ma; Wei Ma; Ji Chen; Jichun Yang; Yaguang Xi; Qinghua Cui
Journal:  Nucleic Acids Res       Date:  2016-02-16       Impact factor: 16.971

Review 8.  Long non-coding RNAs and complex diseases: from experimental results to computational models.

Authors:  Xing Chen; Chenggang Clarence Yan; Xu Zhang; Zhu-Hong You
Journal:  Brief Bioinform       Date:  2017-07-01       Impact factor: 11.622

Review 9.  SNHG12: An LncRNA as a Potential Therapeutic Target and Biomarker for Human Cancer.

Authors:  Suraksha Tamang; Varnali Acharya; Deepronil Roy; Rinka Sharma; Apeksha Aryaa; Uttam Sharma; Akanksha Khandelwal; Hridayesh Prakash; Karen M Vasquez; Aklank Jain
Journal:  Front Oncol       Date:  2019-09-18       Impact factor: 6.244

10.  Projection-Based Neighborhood Non-Negative Matrix Factorization for lncRNA-Protein Interaction Prediction.

Authors:  Yingjun Ma; Tingting He; Xingpeng Jiang
Journal:  Front Genet       Date:  2019-11-20       Impact factor: 4.599

View more
  3 in total

1.  Editorial: Machine Learning-Based Methods for RNA Data Analysis.

Authors:  Lihong Peng; Jialiang Yang; Minxian Wang; Liqian Zhou
Journal:  Front Genet       Date:  2022-05-25       Impact factor: 4.772

2.  Finding Lung-Cancer-Related lncRNAs Based on Laplacian Regularized Least Squares With Unbalanced Bi-Random Walk.

Authors:  Zhifeng Guo; Yan Hui; Fanlong Kong; Xiaoxi Lin
Journal:  Front Genet       Date:  2022-07-22       Impact factor: 4.772

3.  Prioritizing potential circRNA biomarkers for bladder cancer and bladder urothelial cancer based on an ensemble model.

Authors:  Qiongli Su; Qiuhong Tan; Xin Liu; Ling Wu
Journal:  Front Genet       Date:  2022-09-15       Impact factor: 4.772

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.