Literature DB >> 29512500

Breast cancer data analysis for survivability studies and prediction.

Nagesh Shukla1, Markus Hagenbuchner2, Khin Than Win2, Jack Yang3.   

Abstract

BACKGROUND: Breast cancer is the most common cancer affecting females worldwide. Breast cancer survivability prediction is challenging and a complex research task. Existing approaches engage statistical methods or supervised machine learning to assess/predict the survival prospects of patients.
OBJECTIVE: The main objectives of this paper is to develop a robust data analytical model which can assist in (i) a better understanding of breast cancer survivability in presence of missing data, (ii) providing better insights into factors associated with patient survivability, and (iii) establishing cohorts of patients that share similar properties.
METHODS: Unsupervised data mining methods viz. the self-organising map (SOM) and density-based spatial clustering of applications with noise (DBSCAN) is used to create patient cohort clusters. These clusters, with associated patterns, were used to train multilayer perceptron (MLP) model for improved patient survivability analysis. A large dataset available from SEER program is used in this study to identify patterns associated with the survivability of breast cancer patients. Information gain was computed for the purpose of variable selection. All of these methods are data-driven and require little (if any) input from users or experts.
RESULTS: SOM consolidated patients into cohorts of patients with similar properties. From this, DBSCAN identified and extracted nine cohorts (clusters). It is found that patients in each of the nine clusters have different survivability time. The separation of patients into clusters improved the overall survival prediction accuracy based on MLP and revealed intricate conditions that affect the accuracy of a prediction.
CONCLUSIONS: A new, entirely data driven approach based on unsupervised learning methods improves understanding and helps identify patterns associated with the survivability of patient. The results of the analysis can be used to segment the historical patient data into clusters or subsets, which share common variable values and survivability. The survivability prediction accuracy of a MLP is improved by using identified patient cohorts as opposed to using raw historical data. Analysis of variable values in each cohort provide better insights into survivability of a particular subgroup of breast cancer patients.
Copyright © 2017 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Breast cancer survivability study; Machine learning; SEER data

Mesh:

Year:  2017        PMID: 29512500     DOI: 10.1016/j.cmpb.2017.12.011

Source DB:  PubMed          Journal:  Comput Methods Programs Biomed        ISSN: 0169-2607            Impact factor:   5.428


  9 in total

Review 1.  Artificial Intelligence in Cardiovascular Imaging: JACC State-of-the-Art Review.

Authors:  Damini Dey; Piotr J Slomka; Paul Leeson; Dorin Comaniciu; Sirish Shrestha; Partho P Sengupta; Thomas H Marwick
Journal:  J Am Coll Cardiol       Date:  2019-03-26       Impact factor: 24.094

2.  Dynamic Risk Prediction via a Joint Frailty-Copula Model and IPD Meta-Analysis: Building Web Applications.

Authors:  Takeshi Emura; Hirofumi Michimae; Shigeyuki Matsui
Journal:  Entropy (Basel)       Date:  2022-04-22       Impact factor: 2.738

3.  Simultaneous Integrated Boost in Once-weekly Hypofractionated Radiotherapy for Breast Cancer in the Elderly: Preliminary Evidence.

Authors:  Marina Guenzi; Renzo Corvò; Elisabetta Bonzano; Liliana Belgioia; Giorgia Polizzi; Guido Siffredi; Piero Fregatti; Daniele Friedman; Stefania Garelli; Marco Gusinu; Elena Maria Luisa Vaccara
Journal:  In Vivo       Date:  2019 Nov-Dec       Impact factor: 2.155

4.  Subtyping CKD Patients by Consensus Clustering: The Chronic Renal Insufficiency Cohort (CRIC) Study.

Authors:  Zihe Zheng; Sushrut S Waikar; Insa M Schmidt; J Richard Landis; Chi-Yuan Hsu; Tariq Shafi; Harold I Feldman; Amanda H Anderson; Francis P Wilson; Jing Chen; Hernan Rincon-Choles; Ana C Ricardo; Georges Saab; Tamara Isakova; Radhakrishna Kallem; Jeffrey C Fink; Panduranga S Rao; Dawei Xie; Wei Yang
Journal:  J Am Soc Nephrol       Date:  2021-01-18       Impact factor: 14.978

5.  Individual-patient prediction of meningioma malignancy and survival using the Surveillance, Epidemiology, and End Results database.

Authors:  Jeremy T Moreau; Todd C Hankinson; Sylvain Baillet; Roy W R Dudley
Journal:  NPJ Digit Med       Date:  2020-01-30

6.  Development and Validation of a Personalized Survival Prediction Model for Uterine Adenosarcoma: A Population-Based Deep Learning Study.

Authors:  Wenjie Qu; Qingqing Liu; Xinlin Jiao; Teng Zhang; Bingyu Wang; Ningfeng Li; Taotao Dong; Baoxia Cui
Journal:  Front Oncol       Date:  2021-02-18       Impact factor: 6.244

7.  Machine Learning With K-Means Dimensional Reduction for Predicting Survival Outcomes in Patients With Breast Cancer.

Authors:  Melissa Zhao; Yushi Tang; Hyunkyung Kim; Kohei Hasegawa
Journal:  Cancer Inform       Date:  2018-11-09

8.  Cohort profile: the MCC-Spain follow-up on colorectal, breast and prostate cancers: study design and initial results.

Authors:  Jessica Alonso-Molero; Antonio J Molina; Jose Juan Jiménez-Moleón; Beatriz Pérez-Gómez; Vicente Martin; Victor Moreno; Pilar Amiano; Eva Ardanaz; Silvia de Sanjose; Inmaculada Salcedo; Guillermo Fernandez-Tardon; Juan Alguacil; Dolores Salas; Rafael Marcos-Gragera; Maria Dolores Chirlaque; Nuria Aragonés; Gemma Castaño-Vinyals; Marina Pollán; Manolis Kogevinas; Javier Llorca
Journal:  BMJ Open       Date:  2019-11-21       Impact factor: 2.692

9.  Application of data mining for predicting hemodynamics instability during pheochromocytoma surgery.

Authors:  Yueyang Zhao; Li Fang; Lei Cui; Song Bai
Journal:  BMC Med Inform Decis Mak       Date:  2020-07-20       Impact factor: 2.796

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.