Breast cancer data analysis for survivability studies and prediction.

Nagesh Shukla¹, Markus Hagenbuchner², Khin Than Win², Jack Yang³.

Abstract

BACKGROUND: Breast cancer is the most common cancer affecting females worldwide. Predicting breast cancer survivability is a challenging and complex research task. Existing approaches use statistical methods or supervised machine learning to assess or predict the survival prospects of patients.
OBJECTIVE: The main objective of this paper is to develop a robust data analytical model which can assist in (i) a better understanding of breast cancer survivability in the presence of missing data, (ii) providing better insights into factors associated with patient survivability, and (iii) establishing cohorts of patients that share similar properties.
METHODS: Unsupervised data mining methods, namely the self-organising map (SOM) and density-based spatial clustering of applications with noise (DBSCAN), were used to create patient cohort clusters. These clusters, with their associated patterns, were used to train a multilayer perceptron (MLP) model for improved patient survivability analysis. A large dataset available from the SEER program was used in this study to identify patterns associated with the survivability of breast cancer patients. Information gain was computed for the purpose of variable selection. All of these methods are data-driven and require little (if any) input from users or experts.
RESULTS: The SOM consolidated patients into cohorts with similar properties. From these, DBSCAN identified and extracted nine cohorts (clusters). Patients in each of the nine clusters were found to have different survival times. The separation of patients into clusters improved the overall survival prediction accuracy of the MLP and revealed intricate conditions that affect the accuracy of a prediction.
CONCLUSIONS: A new, entirely data-driven approach based on unsupervised learning methods improves understanding and helps identify patterns associated with the survivability of patients. The results of the analysis can be used to segment the historical patient data into clusters or subsets which share common variable values and survivability. The survivability prediction accuracy of an MLP is improved by using the identified patient cohorts as opposed to using raw historical data. Analysis of variable values in each cohort provides better insights into the survivability of a particular subgroup of breast cancer patients.
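
The abstract states that information gain was computed for variable selection but does not give the formulation. As an illustration only, the sketch below uses the standard entropy-based definition for categorical variables and a binary outcome label. The function names, variable names, and toy records are hypothetical and are not taken from the paper or from SEER data.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(values, labels):
    """Entropy of the labels minus the weighted entropy left after
    splitting the records on one categorical variable."""
    total = len(labels)
    groups = {}
    for value, label in zip(values, labels):
        groups.setdefault(value, []).append(label)
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


# Hypothetical toy records (assumption: not real patient data).
survived = ["yes", "yes", "no", "no"]
candidates = {
    "stage": ["I", "I", "IV", "IV"],                    # separates the outcome perfectly
    "laterality": ["left", "right", "left", "right"],   # carries no information here
}

# Rank the candidate variables from most to least informative.
ranking = sorted(
    candidates,
    key=lambda name: information_gain(candidates[name], survived),
    reverse=True,
)
```

In this toy setting `ranking` places `stage` ahead of `laterality`. A variable-selection step of this kind would keep the highest-ranked variables before clustering; any threshold for how many to keep is a choice the abstract does not specify.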

Keywords: Breast cancer survivability study; Machine learning; SEER data

Year: 2017 PMID: 29512500 DOI: 10.1016/j.cmpb.2017.12.011

Source DB: PubMed Journal: Comput Methods Programs Biomed ISSN: 0169-2607 Impact factor: 5.428

Cited

9 in total

1. Artificial Intelligence in Cardiovascular Imaging: JACC State-of-the-Art Review. (Review)

Authors: Damini Dey; Piotr J Slomka; Paul Leeson; Dorin Comaniciu; Sirish Shrestha; Partho P Sengupta; Thomas H Marwick
Journal: J Am Coll Cardiol Date: 2019-03-26 Impact factor: 24.094

2. Dynamic Risk Prediction via a Joint Frailty-Copula Model and IPD Meta-Analysis: Building Web Applications.

Authors: Takeshi Emura; Hirofumi Michimae; Shigeyuki Matsui
Journal: Entropy (Basel) Date: 2022-04-22 Impact factor: 2.738

3. Simultaneous Integrated Boost in Once-weekly Hypofractionated Radiotherapy for Breast Cancer in the Elderly: Preliminary Evidence.

Authors: Marina Guenzi; Renzo Corvò; Elisabetta Bonzano; Liliana Belgioia; Giorgia Polizzi; Guido Siffredi; Piero Fregatti; Daniele Friedman; Stefania Garelli; Marco Gusinu; Elena Maria Luisa Vaccara
Journal: In Vivo Date: 2019 Nov-Dec Impact factor: 2.155

4. Subtyping CKD Patients by Consensus Clustering: The Chronic Renal Insufficiency Cohort (CRIC) Study.

Authors: Zihe Zheng; Sushrut S Waikar; Insa M Schmidt; J Richard Landis; Chi-Yuan Hsu; Tariq Shafi; Harold I Feldman; Amanda H Anderson; Francis P Wilson; Jing Chen; Hernan Rincon-Choles; Ana C Ricardo; Georges Saab; Tamara Isakova; Radhakrishna Kallem; Jeffrey C Fink; Panduranga S Rao; Dawei Xie; Wei Yang
Journal: J Am Soc Nephrol Date: 2021-01-18 Impact factor: 14.978

5. Individual-patient prediction of meningioma malignancy and survival using the Surveillance, Epidemiology, and End Results database.

Authors: Jeremy T Moreau; Todd C Hankinson; Sylvain Baillet; Roy W R Dudley
Journal: NPJ Digit Med Date: 2020-01-30

6. Development and Validation of a Personalized Survival Prediction Model for Uterine Adenosarcoma: A Population-Based Deep Learning Study.

Authors: Wenjie Qu; Qingqing Liu; Xinlin Jiao; Teng Zhang; Bingyu Wang; Ningfeng Li; Taotao Dong; Baoxia Cui
Journal: Front Oncol Date: 2021-02-18 Impact factor: 6.244

7. Machine Learning With K-Means Dimensional Reduction for Predicting Survival Outcomes in Patients With Breast Cancer.

Authors: Melissa Zhao; Yushi Tang; Hyunkyung Kim; Kohei Hasegawa
Journal: Cancer Inform Date: 2018-11-09

8. Cohort profile: the MCC-Spain follow-up on colorectal, breast and prostate cancers: study design and initial results.

Authors: Jessica Alonso-Molero; Antonio J Molina; Jose Juan Jiménez-Moleón; Beatriz Pérez-Gómez; Vicente Martin; Victor Moreno; Pilar Amiano; Eva Ardanaz; Silvia de Sanjose; Inmaculada Salcedo; Guillermo Fernandez-Tardon; Juan Alguacil; Dolores Salas; Rafael Marcos-Gragera; Maria Dolores Chirlaque; Nuria Aragonés; Gemma Castaño-Vinyals; Marina Pollán; Manolis Kogevinas; Javier Llorca
Journal: BMJ Open Date: 2019-11-21 Impact factor: 2.692

9. Application of data mining for predicting hemodynamics instability during pheochromocytoma surgery.

Authors: Yueyang Zhao; Li Fang; Lei Cui; Song Bai
Journal: BMC Med Inform Decis Mak Date: 2020-07-20 Impact factor: 2.796
