Literature DB >> 33338982

Application of decision tree-based ensemble learning in the classification of breast cancer.

Mohammad M Ghiasi1, Sohrab Zendehboudi2.   

Abstract

As a common screening and diagnostic tool, Fine Needle Aspiration Biopsy (FNAB) of the suspicious breast lumps can be used to distinguish between malignant and benign breast cytology. In this study, we first review published works on the classification of breast cancer where the machine learning and data mining algorithms have been applied by using the Wisconsin Breast Cancer Database (WBCD). This work then introduces useful new tools, based on Random Forest (RF) and Extremely Randomized Trees or Extra Trees (ET) algorithms to classify breast cancer. The RF and ET strategies use the decision trees as proper classifiers to attain the ultimate classification. The RF and ET approaches include four main stages: input identification, determination of the optimal number of trees, voting analysis, and final decision. The models implemented in this research consider important factors such as uniformity of cell size, bland chromatin, mitoses, and clump thickness as the input parameters. According to the statistical analysis, the proposed methods are able to classify the type of breast cancer accurately. The error analysis results reveal that the designed RF and ET models offer easy-to-use outcomes and the highest diagnostic performance, compared to previous tools/models in the literature for the WBCD classification. The highest and lowest magnitudes of relative importance are attributed to the uniformity of cell size and mitoses among the factors. It is expected that the RF and ET algorithms play an important role in medicine and health systems for screening and diagnosis in the near future.
Copyright © 2020 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Breast cancer; Classification; Ensemble learning; Random forest/extra trees; Wisconsin breast cancer database

Year:  2020        PMID: 33338982     DOI: 10.1016/j.compbiomed.2020.104089

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  4 in total

1.  Cancer Classification with a Cost-Sensitive Naive Bayes Stacking Ensemble.

Authors:  Yueling Xiong; Mingquan Ye; Changrong Wu
Journal:  Comput Math Methods Med       Date:  2021-04-24       Impact factor: 2.238

2.  Developing practical clinical tools for predicting neonatal mortality at a neonatal intensive care unit in Tanzania.

Authors:  Dory Kovacs; Delfina R Msanga; Stephen E Mshana; Muhammad Bilal; Katarina Oravcova; Louise Matthews
Journal:  BMC Pediatr       Date:  2021-12-01       Impact factor: 2.125

3.  Application of machine learning models based on decision trees in classifying the factors affecting mortality of COVID-19 patients in Hamadan, Iran.

Authors:  Samad Moslehi; Niloofar Rabiei; Ali Reza Soltanian; Mojgan Mamani
Journal:  BMC Med Inform Decis Mak       Date:  2022-07-24       Impact factor: 3.298

4.  Machine learning of microvolt-level 12-lead electrocardiogram can help distinguish takotsubo syndrome and acute anterior myocardial infarction.

Authors:  Masato Shimizu; Makoto Suzuki; Hiroyuki Fujii; Shigeki Kimura; Mitsuhiro Nishizaki; Tetsuo Sasano
Journal:  Cardiovasc Digit Health J       Date:  2022-07-16
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.