Literature DB >> 15894176

Predicting breast cancer survivability: a comparison of three data mining methods.

Dursun Delen1, Glenn Walker, Amit Kadam.   

Abstract

OBJECTIVE: The prediction of breast cancer survivability has been a challenging research problem for many researchers. Since the early dates of the related research, much advancement has been recorded in several related fields. For instance, thanks to innovative biomedical technologies, better explanatory prognostic factors are being measured and recorded; thanks to low cost computer hardware and software technologies, high volume better quality data is being collected and stored automatically; and finally thanks to better analytical methods, those voluminous data is being processed effectively and efficiently. Therefore, the main objective of this manuscript is to report on a research project where we took advantage of those available technological advancements to develop prediction models for breast cancer survivability. METHODS AND MATERIAL: We used two popular data mining algorithms (artificial neural networks and decision trees) along with a most commonly used statistical method (logistic regression) to develop the prediction models using a large dataset (more than 200,000 cases). We also used 10-fold cross-validation methods to measure the unbiased estimate of the three prediction models for performance comparison purposes.
RESULTS: The results indicated that the decision tree (C5) is the best predictor with 93.6% accuracy on the holdout sample (this prediction accuracy is better than any reported in the literature), artificial neural networks came out to be the second with 91.2% accuracy and the logistic regression models came out to be the worst of the three with 89.2% accuracy.
CONCLUSION: The comparative study of multiple prediction models for breast cancer survivability using a large dataset along with a 10-fold cross-validation provided us with an insight into the relative prediction ability of different data mining methods. Using sensitivity analysis on neural network models provided us with the prioritized importance of the prognostic factors used in the study.

Entities:  

Mesh:

Year:  2005        PMID: 15894176     DOI: 10.1016/j.artmed.2004.07.002

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  107 in total

1.  Computational modeling and multilevel cancer control interventions.

Authors:  Joseph P Morrissey; Kristen Hassmiller Lich; Rebecca Anhang Price; Jeanne Mandelblatt
Journal:  J Natl Cancer Inst Monogr       Date:  2012-05

2.  Diagnosing breast masses in digital mammography using feature selection and ensemble methods.

Authors:  Shu-Ting Luo; Bor-Wen Cheng
Journal:  J Med Syst       Date:  2010-05-14       Impact factor: 4.460

3.  A study on hepatitis disease diagnosis using multilayer neural network with levenberg marquardt training algorithm.

Authors:  M Serdar Bascil; Feyzullah Temurtas
Journal:  J Med Syst       Date:  2009-10-16       Impact factor: 4.460

4.  Tuberculosis disease diagnosis using artificial neural networks.

Authors:  Orhan Er; Feyzullah Temurtas; A Cetin Tanrikulu
Journal:  J Med Syst       Date:  2010-06       Impact factor: 4.460

5.  Diagnosis of breast cancer in light microscopic and mammographic images textures using relative entropy via kernel estimation.

Authors:  Sevcan Aytac Korkmaz; Mehmet Fatih Korkmaz; Mustafa Poyraz
Journal:  Med Biol Eng Comput       Date:  2015-09-07       Impact factor: 2.602

6.  Breast alert: an on-line tool for predicting the lifetime risk of women breast cancer.

Authors:  Joel J P C Rodrigues; Nuno Reis; José A F Moutinho; Isabel de la Torre
Journal:  J Med Syst       Date:  2010-10-02       Impact factor: 4.460

7.  Data mining and medical world: breast cancers' diagnosis, treatment, prognosis and challenges.

Authors:  Rozita Jamili Oskouei; Nasroallah Moradi Kor; Saeid Abbasi Maleki
Journal:  Am J Cancer Res       Date:  2017-03-01       Impact factor: 6.166

8.  Using methods from the data-mining and machine-learning literature for disease classification and prediction: a case study examining classification of heart failure subtypes.

Authors:  Peter C Austin; Jack V Tu; Jennifer E Ho; Daniel Levy; Douglas S Lee
Journal:  J Clin Epidemiol       Date:  2013-02-04       Impact factor: 6.437

9.  Impact of Machine Learning With Multiparametric Magnetic Resonance Imaging of the Breast for Early Prediction of Response to Neoadjuvant Chemotherapy and Survival Outcomes in Breast Cancer Patients.

Authors:  Amirhessam Tahmassebi; Georg J Wengert; Thomas H Helbich; Zsuzsanna Bago-Horvath; Sousan Alaei; Rupert Bartsch; Peter Dubsky; Pascal Baltzer; Paola Clauser; Panagiotis Kapetas; Elizabeth A Morris; Anke Meyer-Baese; Katja Pinker
Journal:  Invest Radiol       Date:  2019-02       Impact factor: 6.016

10.  A comparative study on chronic obstructive pulmonary and pneumonia diseases diagnosis using neural networks and artificial immune system.

Authors:  Orhan Er; Cengiz Sertkaya; Feyzullah Temurtas; A Cetin Tanrikulu
Journal:  J Med Syst       Date:  2009-12       Impact factor: 4.460

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.