Literature DB >> 32339094

Diagnostic classification of cancers using extreme gradient boosting algorithm and multi-omics data.

Baoshan Ma1, Fanyu Meng2, Ge Yan2, Haowen Yan2, Bingjie Chai2, Fengju Song3.   

Abstract

Accurate diagnostic classification of cancers can greatly help physicians to choose surveillance and treatment strategies for patients. Following the explosive growth of huge amounts of biological data, the shift from traditional biostatistical methods to computer-aided means has made machine-learning methods as an integral part of today's cancer prognosis prediction. In this work, we proposed a classification model by leveraging the power of extreme gradient boosting (XGBoost) and using increasingly complex multi-omics data with the aim to separate early stage and late stage cancers. We applied XGBoost model to four kinds of cancer data downloaded from TCGA and compared its performance with other popular machine-learning methods. The experimental results showed that our method obtained statistically significantly better or comparable predictive performance. The results of this study also revealed that DNA methylation outperforms other molecular data (mRNA expression and miRNA expression) in terms of accuracy and stability for discriminating between early stage and late stage groups. Furthermore, integration of multi-omics data by autoencoder can enhance the classification accuracy of cancer stage. Finally, we conducted bioinformatics analyses to assess the medical utility of the significant genes ranked by their importance using XGBoost algorithm. Extensively comparative experiments demonstrated that the XGBoost method has a remarkable performance in predicting the stage of cancer patients with multi-omics data. Moreover, identification of novel candidate genes associated with cancer stages would contribute to further elucidate disease pathogenesis and develop novel therapeutics.
Copyright © 2020 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Cancer; Diagnostic classification; Extreme gradient boosting; Machine learning; Multi-omics data

Mesh:

Substances:

Year:  2020        PMID: 32339094     DOI: 10.1016/j.compbiomed.2020.103761

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  9 in total

1.  A Systematic Evaluation of Supervised Machine Learning Algorithms for Cell Phenotype Classification Using Single-Cell RNA Sequencing Data.

Authors:  Xiaowen Cao; Li Xing; Elham Majd; Hua He; Junhua Gu; Xuekui Zhang
Journal:  Front Genet       Date:  2022-02-23       Impact factor: 4.599

2.  Diagnostic classification of cancers using DNA methylation of paracancerous tissues.

Authors:  Baoshan Ma; Bingjie Chai; Heng Dong; Jishuang Qi; Pengcheng Wang; Tong Xiong; Yi Gong; Di Li; Shuxin Liu; Fengju Song
Journal:  Sci Rep       Date:  2022-06-23       Impact factor: 4.996

3.  VirVACPRED: A Web Server for Prediction of Protective Viral Antigens.

Authors:  Jesús Herrera-Bravo; Jorge G Farías; Fernanda Parraguez Contreras; Lisandra Herrera-Belén; Juan-Alejandro Norambuena; Jorge F Beltrán
Journal:  Int J Pept Res Ther       Date:  2021-12-17       Impact factor: 1.931

4.  Predictions of cervical cancer identification by photonic method combined with machine learning.

Authors:  Michał Kruczkowski; Anna Drabik-Kruczkowska; Anna Marciniak; Martyna Tarczewska; Monika Kosowska; Małgorzata Szczerska
Journal:  Sci Rep       Date:  2022-03-08       Impact factor: 4.379

5.  A novel liver cancer diagnosis method based on patient similarity network and DenseGCN.

Authors:  Ge Zhang; Zhen Peng; Chaokun Yan; Jianlin Wang; Junwei Luo; Huimin Luo
Journal:  Sci Rep       Date:  2022-04-26       Impact factor: 4.996

6.  Application of Machine Learning in Hospitalized Patients with Severe COVID-19 Treated with Tocilizumab.

Authors:  Antonio Ramón; Marta Zaragozá; Ana María Torres; Joaquín Cascón; Pilar Blasco; Javier Milara; Jorge Mateo
Journal:  J Clin Med       Date:  2022-08-12       Impact factor: 4.964

7.  DBP-iDWT: Improving DNA-Binding Proteins Prediction Using Multi-Perspective Evolutionary Profile and Discrete Wavelet Transform.

Authors:  Farman Ali; Omar Barukab; Ajay B Gadicha; Shruti Patil; Omar Alghushairy; Akram Y Sarhan
Journal:  Comput Intell Neurosci       Date:  2022-09-28

8.  Classifying Breast Cancer Subtypes Using Deep Neural Networks Based on Multi-Omics Data.

Authors:  Yuqi Lin; Wen Zhang; Huanshen Cao; Gaoyang Li; Wei Du
Journal:  Genes (Basel)       Date:  2020-08-04       Impact factor: 4.096

9.  Cluster Analysis of Cell Nuclei in H&E-Stained Histological Sections of Prostate Cancer and Classification Based on Traditional and Modern Artificial Intelligence Techniques.

Authors:  Subrata Bhattacharjee; Kobiljon Ikromjanov; Kouayep Sonia Carole; Nuwan Madusanka; Nam-Hoon Cho; Yeong-Byn Hwang; Rashadul Islam Sumon; Hee-Cheol Kim; Heung-Kook Choi
Journal:  Diagnostics (Basel)       Date:  2021-12-22
  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.