Literature DB >> 25712814

Applying under-sampling techniques and cost-sensitive learning methods on risk assessment of breast cancer.

Jia-Lien Hsu1, Ping-Cheng Hung, Hung-Yen Lin, Chung-Ho Hsieh.   

Abstract

Breast cancer is one of the most common cause of cancer mortality. Early detection through mammography screening could significantly reduce mortality from breast cancer. However, most of screening methods may consume large amount of resources. We propose a computational model, which is solely based on personal health information, for breast cancer risk assessment. Our model can be served as a pre-screening program in the low-cost setting. In our study, the data set, consisting of 3976 records, is collected from Taipei City Hospital starting from 2008.1.1 to 2008.12.31. Based on the dataset, we first apply the sampling techniques and dimension reduction method to preprocess the testing data. Then, we construct various kinds of classifiers (including basic classifiers, ensemble methods, and cost-sensitive methods) to predict the risk. The cost-sensitive method with random forest classifier is able to achieve recall (or sensitivity) as 100 %. At the recall of 100 %, the precision (positive predictive value, PPV), and specificity of cost-sensitive method with random forest classifier was 2.9 % and 14.87 %, respectively. In our study, we build a breast cancer risk assessment model by using the data mining techniques. Our model has the potential to be served as an assisting tool in the breast cancer screening.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25712814     DOI: 10.1007/s10916-015-0210-x

Source DB:  PubMed          Journal:  J Med Syst        ISSN: 0148-5598            Impact factor:   4.460


  7 in total

Review 1.  Multi-state Markov models in cancer screening evaluation: a brief review and case study.

Authors:  Z Uhry; G Hédelin; M Colonna; B Asselain; P Arveux; A Rogel; C Exbrayat; C Guldenfels; I Courtial; P Soler-Michel; F Molinié; D Eilstein; S W Duffy
Journal:  Stat Methods Med Res       Date:  2010-03-15       Impact factor: 3.021

2.  Latent semantic analysis.

Authors:  Nicholas E Evangelopoulos
Journal:  Wiley Interdiscip Rev Cogn Sci       Date:  2013-08-28

3.  Cancer statistics, 2013.

Authors:  Rebecca Siegel; Deepa Naishadham; Ahmedin Jemal
Journal:  CA Cancer J Clin       Date:  2013-01-17       Impact factor: 508.702

4.  Combined screening with ultrasound and mammography vs mammography alone in women at elevated risk of breast cancer.

Authors:  Wendie A Berg; Jeffrey D Blume; Jean B Cormack; Ellen B Mendelson; Daniel Lehrer; Marcela Böhm-Vélez; Etta D Pisano; Roberta A Jong; W Phil Evans; Marilyn J Morton; Mary C Mahoney; Linda Hovanessian Larsen; Richard G Barr; Dione M Farria; Helga S Marques; Karan Boparai
Journal:  JAMA       Date:  2008-05-14       Impact factor: 56.272

Review 5.  A systematic review of the effectiveness of magnetic resonance imaging (MRI) as an addition to mammography and ultrasound in screening young women at high risk of breast cancer.

Authors:  S J Lord; W Lei; P Craft; J N Cawson; I Morris; S Walleser; A Griffiths; S Parker; N Houssami
Journal:  Eur J Cancer       Date:  2007-08-02       Impact factor: 9.162

6.  Breast cancer survivability prediction using labeled, unlabeled, and pseudo-labeled patient data.

Authors:  Juhyeon Kim; Hyunjung Shin
Journal:  J Am Med Inform Assoc       Date:  2013-03-06       Impact factor: 4.497

7.  Effect of three decades of screening mammography on breast-cancer incidence.

Authors:  Archie Bleyer; H Gilbert Welch
Journal:  N Engl J Med       Date:  2012-11-22       Impact factor: 91.245

  7 in total
  3 in total

1.  Applying Data Mining Techniques to Improve Breast Cancer Diagnosis.

Authors:  Joana Diz; Goreti Marreiros; Alberto Freitas
Journal:  J Med Syst       Date:  2016-08-06       Impact factor: 4.460

2.  Preprocessing Breast Cancer Data to Improve the Data Quality, Diagnosis Procedure, and Medical Care Services.

Authors:  Zeinab Sajjadnia; Raof Khayami; Mohammad Reza Moosavi
Journal:  Cancer Inform       Date:  2020-05-27

3.  Comparison of Machine Learning Techniques for Mortality Prediction in a Prospective Cohort of Older Adults.

Authors:  Salvatore Tedesco; Martina Andrulli; Markus Åkerlund Larsson; Daniel Kelly; Antti Alamäki; Suzanne Timmons; John Barton; Joan Condell; Brendan O'Flynn; Anna Nordström
Journal:  Int J Environ Res Public Health       Date:  2021-12-04       Impact factor: 3.390

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.