| Literature DB >> 27200218 |
Woojae Kim1, Ku Sang Kim2, Rae Woong Park3.
Abstract
OBJECTIVES: Breast cancer has a high rate of recurrence, resulting in the need for aggressive treatment and close follow-up. However, previously established classification guidelines, based on expert panels or regression models, are controversial. Prediction models based on machine learning show excellent performance, but they are not widely used because they cannot explain their decisions and cannot be presented on paper in the way that knowledge is customarily represented in the clinical world. The principal objective of this study was to develop a nomogram based on a naïve Bayesian model for the prediction of breast cancer recurrence within 5 years after breast cancer surgery.Entities:
Keywords: Breast Neoplasms; Data Mining; Decision Support Techniques; Neural Networks; Support Vector Machine; Survival Analysis
Year: 2016 PMID: 27200218 PMCID: PMC4871850 DOI: 10.4258/hir.2016.22.2.89
Source DB: PubMed Journal: Healthc Inform Res ISSN: 2093-3681
Summary of patient data
MLN: metastatic lymph nodes.
aCategorical value 1 represents negative and 2 represents positive.
Figure 1Process of selecting prognostic factors in the model using both previously established clinical knowledge and statistical analysis.
Figure 2Proposed nomogram for the prediction of breast cancer recurrence within 5 years after breast cancer surgery. By using a measure, each score of the variables can be transferred into the total score, which is linked to the responding probability.
Comparison of clinicopathologic characteristics between training & testing datasets
Values are presented as number (%) or mean ± standard deviation.
aPearson chi-square test, bStudent t-test.
Figure 3Receiver operating characteristics (ROC) curve and calibration plot for the naïve Bayesian classifier at 5 years after breast cancer surgery. (A) The area under the ROC curve (AUC) was 0.81 for naïve Bayesian classifier. (B) The x-axis represents the predicted probability of recurrence; the y-axis represents observed probability. TP: true positive, FP: false positive.
Classification result of the naïve Bayesian classifier at 5 years after breast cancer surgery
AUC: area under the receiver operating characteristics curve.