Literature DB >> 32058946

Predicting potential miRNA-disease associations by combining gradient boosting decision tree with logistic regression.

Su Zhou1, Shulin Wang2, Qi Wu3, Riasat Azim1, Wen Li1.   

Abstract

MicroRNAs (miRNAs) have been proved to play an indispensable role in many fundamental biological processes, and the dysregulation of miRNAs is closely correlated with human complex diseases. Many studies have focused on the prediction of potential miRNA-disease associations. Considering the insufficient number of known miRNA-disease associations and the poor performance of many existing prediction methods, a novel model combining gradient boosting decision tree with logistic regression (GBDT-LR) is proposed to prioritize miRNA candidates for diseases. To balance positive and negative samples, GBDT-LR firstly adopted k-means clustering to screen negative samples from unknown miRNA-disease associations. Then, the gradient boosting decision tree (GBDT) model, which has an intrinsic advantage in finding many distinguishing features and feature combinations is applied to extract features. Finally, the new features extracted by the GBDT model are input into a logistic regression (LR) model for predicting the final miRNA-disease association score. The experimental results show that the average AUC of GBDT-LR in 5-fold cross-validation (CV) can achieve 0.9274. Besides, in the case studies, 90 %, 94 % and 88 % of the top 50 miRNAs potentially associated with colon cancer, gastric cancer, and pancreatic cancer were confirmed by databases, respectively. Compared with the other three state-of-the-art methods, GBDT-LR can achieve the best prediction performance. The source code and dataset of GBDT-LR are freely available at https://github.com/Pualalala/GBDT-LR.
Copyright © 2020 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Diseases; Gradient boosting decision tree; Logistic regression; miRNA-disease association; miRNAs

Mesh:

Substances:

Year:  2020        PMID: 32058946     DOI: 10.1016/j.compbiolchem.2020.107200

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  8 in total

1.  PMDFI: Predicting miRNA-Disease Associations Based on High-Order Feature Interaction.

Authors:  Mingyan Tang; Chenzhe Liu; Dayun Liu; Junyi Liu; Jiaqi Liu; Lei Deng
Journal:  Front Genet       Date:  2021-04-09       Impact factor: 4.599

2.  Employment of Artificial Intelligence Based on Routine Laboratory Results for the Early Diagnosis of Multiple Myeloma.

Authors:  Wei Yan; Hua Shi; Tao He; Jian Chen; Chen Wang; Aijun Liao; Wei Yang; Huihan Wang
Journal:  Front Oncol       Date:  2021-03-29       Impact factor: 6.244

3.  Radiomics-Based Machine Learning Models for Predicting P504s/P63 Immunohistochemical Expression: A Noninvasive Diagnostic Tool for Prostate Cancer.

Authors:  Yun-Fan Liu; Xin Shu; Xiao-Feng Qiao; Guang-Yong Ai; Li Liu; Jun Liao; Shuang Qian; Xiao-Jing He
Journal:  Front Oncol       Date:  2022-06-20       Impact factor: 5.738

4.  Application of Bidirectional Generative Adversarial Networks to Predict Potential miRNAs Associated With Diseases.

Authors:  Long Xu; Xiaokun Li; Qiang Yang; Long Tan; Qingyuan Liu; Yong Liu
Journal:  Front Genet       Date:  2022-07-12       Impact factor: 4.772

5.  A clustering-based sampling method for miRNA-disease association prediction.

Authors:  Zheng Wei; Dengju Yao; Xiaojuan Zhan; Shuli Zhang
Journal:  Front Genet       Date:  2022-09-13       Impact factor: 4.772

6.  Predicting potential miRNA-disease associations based on more reliable negative sample selection.

Authors:  Ruiyu Guo; Hailin Chen; Wengang Wang; Guangsheng Wu; Fangliang Lv
Journal:  BMC Bioinformatics       Date:  2022-10-17       Impact factor: 3.307

7.  Seq-SymRF: a random forest model predicts potential miRNA-disease associations based on information of sequences and clinical symptoms.

Authors:  Jinlong Li; Xingyu Chen; Qixing Huang; Yang Wang; Yun Xie; Zong Dai; Xiaoyong Zou; Zhanchao Li
Journal:  Sci Rep       Date:  2020-10-21       Impact factor: 4.379

8.  ANMDA: anti-noise based computational model for predicting potential miRNA-disease associations.

Authors:  Xue-Jun Chen; Xin-Yun Hua; Zhen-Ran Jiang
Journal:  BMC Bioinformatics       Date:  2021-07-02       Impact factor: 3.169

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.