Literature DB >> 35028863

Hybrid gene selection approach using XGBoost and multi-objective genetic algorithm for cancer classification.

Xiongshi Deng1,2, Min Li3,4, Shaobo Deng1,2, Lei Wang1,2.   

Abstract

Microarray gene expression data are often accompanied by a large number of genes and a small number of samples. However, only a few of these genes are relevant to cancer, resulting in significant gene selection challenges. Hence, we propose a two-stage gene selection approach by combining extreme gradient boosting (XGBoost) and a multi-objective optimization genetic algorithm (XGBoost-MOGA) for cancer classification in microarray datasets. In the first stage, the genes are ranked using an ensemble-based feature selection using XGBoost. This stage can effectively remove irrelevant genes and yield a group comprising the most relevant genes related to the class. In the second stage, XGBoost-MOGA searches for an optimal gene subset based on the most relevant genes' group using a multi-objective optimization genetic algorithm. We performed comprehensive experiments to compare XGBoost-MOGA with other state-of-the-art feature selection methods using two well-known learning classifiers on 14 publicly available microarray expression datasets. The experimental results show that XGBoost-MOGA yields significantly better results than previous state-of-the-art algorithms in terms of various evaluation criteria, such as accuracy, F-score, precision, and recall.
© 2022. International Federation for Medical and Biological Engineering.

Entities:  

Keywords:  Classification; Feature selection; Microarray gene expression; Multi-objective genetic algorithm; XGBoost

Mesh:

Year:  2022        PMID: 35028863     DOI: 10.1007/s11517-021-02476-x

Source DB:  PubMed          Journal:  Med Biol Eng Comput        ISSN: 0140-0118            Impact factor:   2.602


  14 in total

1.  A comparative study on feature selection methods for drug discovery.

Authors:  Ying Liu
Journal:  J Chem Inf Comput Sci       Date:  2004 Sep-Oct

2.  Development of a two-stage gene selection method that incorporates a novel hybrid approach using the cuckoo optimization algorithm and harmony search for cancer classification.

Authors:  V Elyasigomari; D A Lee; H R C Screen; M H Shaheed
Journal:  J Biomed Inform       Date:  2017-02-03       Impact factor: 6.317

3.  Genetic algorithm based cancerous gene identification from microarray data using ensemble of filter methods.

Authors:  Manosij Ghosh; Sukdev Adhikary; Kushal Kanti Ghosh; Aritra Sardar; Shemim Begum; Ram Sarkar
Journal:  Med Biol Eng Comput       Date:  2018-08-01       Impact factor: 2.602

4.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays.

Authors:  U Alon; N Barkai; D A Notterman; K Gish; S Ybarra; D Mack; A J Levine
Journal:  Proc Natl Acad Sci U S A       Date:  1999-06-08       Impact factor: 11.205

5.  Radiation Dose Exposure for Lumbar Transforaminal Epidural Steroid Injections and Facet Joint Blocks Under CT vs. Fluoroscopic Guidance.

Authors:  Paolo Maino; Stefano Presilla; Paola A Colli Franzone; Sander M J van Kuijk; Roberto S G M Perez; Eva Koetsier
Journal:  Pain Pract       Date:  2018-02-05       Impact factor: 3.183

6.  Combined FV and FVIII deficiency (F5F8D) in a Chinese family with a novel missense mutation in MCFD2 gene.

Authors:  A Wang; X Liu; J Wu; X Cai; W Zhu; Z Sun
Journal:  Haemophilia       Date:  2014-11       Impact factor: 4.287

7.  DWFS: a wrapper feature selection tool based on a parallel genetic algorithm.

Authors:  Othman Soufan; Dimitrios Kleftogiannis; Panos Kalnis; Vladimir B Bajic
Journal:  PLoS One       Date:  2015-02-26       Impact factor: 3.240

8.  CD59 is a potential biomarker of esophageal squamous cell carcinoma radioresistance by affecting DNA repair.

Authors:  Yuzhen Zhou; Li Chu; Qi Wang; Weixing Dai; Xiaozhou Zhang; Jianfeng Chen; Ling Li; Peipei Ding; Long Zhang; Hongyu Gu; Luying Li; Xinyue Lv; Wei Zhang; Danlei Zhou; Pingzhao Zhang; Guoxiang Cai; Kuaile Zhao; Weiguo Hu
Journal:  Cell Death Dis       Date:  2018-08-30       Impact factor: 8.469

9.  Gene Expression Value Prediction Based on XGBoost Algorithm.

Authors:  Wei Li; Yanbin Yin; Xiongwen Quan; Han Zhang
Journal:  Front Genet       Date:  2019-11-12       Impact factor: 4.599

10.  A Novel XGBoost Method to Infer the Primary Lesion of 20 Solid Tumor Types From Gene Expression Data.

Authors:  Sijie Chen; Wenjing Zhou; Jinghui Tu; Jian Li; Bo Wang; Xiaofei Mo; Geng Tian; Kebo Lv; Zhijian Huang
Journal:  Front Genet       Date:  2021-02-03       Impact factor: 4.599

View more
  4 in total

1.  Deep learning-based microarray cancer classification and ensemble gene selection approach.

Authors:  Khosro Rezaee; Gwanggil Jeon; Mohammad R Khosravi; Hani H Attar; Alireza Sabzevari
Journal:  IET Syst Biol       Date:  2022-07-04       Impact factor: 1.468

2.  Cancer Detection and Prediction Using Genetic Algorithms.

Authors:  Aradhita Bhandari; B K Tripathy; Khurram Jawad; Surbhi Bhatia; Mohammad Khalid Imam Rahmani; Arwa Mashat
Journal:  Comput Intell Neurosci       Date:  2022-05-16

3.  Performance Analysis of Ovarian Cancer Detection and Classification for Microarray Gene Data.

Authors:  M Kalaiyarasi; Harikumar Rajaguru
Journal:  Biomed Res Int       Date:  2022-07-15       Impact factor: 3.246

4.  Comparative Study of Classification Algorithms for Various DNA Microarray Data.

Authors:  Jingeun Kim; Yourim Yoon; Hye-Jin Park; Yong-Hyuk Kim
Journal:  Genes (Basel)       Date:  2022-03-11       Impact factor: 4.096

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.