| Literature DB >> 36033513 |
Lichen Ji1,2,3,4, Wei Zhang3,4,5, Xugang Zhong3,4,5, Tingxiao Zhao1,3,4, Xixi Sun6, Senbo Zhu1,2,3,4, Yu Tong1,3,4, Junchao Luo1,2,3,4, Youjia Xu7, Di Yang1,3,4, Yao Kang1,3,4, Jin Wang8,9,10, Qing Bi1,3,4.
Abstract
The risk of osteoporosis in breast cancer patients is higher than that in healthy populations. The fracture and death rates increase after patients are diagnosed with osteoporosis. We aimed to develop machine learning-based models to predict the risk of osteoporosis as well as the relative fracture occurrence and prognosis. We selected 749 breast cancer patients from two independent Chinese centers and applied six different methods of machine learning to develop osteoporosis, fracture and survival risk assessment models. The performance of the models was compared with that of current models, such as FRAX, OSTA and TNM, by applying ROC, DCA curve analysis, and the calculation of accuracy and sensitivity in both internal and independent external cohorts. Three models were developed. The XGB model demonstrated the best discriminatory performance among the models. Internal and external validation revealed that the AUCs of the osteoporosis model were 0.86 and 0.87, compared with the FRAX model (0.84 and 0.72)/OSTA model (0.77 and 0.66), respectively. The fracture model had high AUCs in the internal and external cohorts of 0.93 and 0.92, which were higher than those of the FRAX model (0.89 and 0.86). The survival model was also assessed and showed high reliability via internal and external validation (AUC of 0.96 and 0.95), which was better than that of the TNM model (AUCs of 0.87 and 0.87). Our models offer a solid approach to help improve decision making.Entities:
Keywords: breast cancer; estrogen; fracture; machine learning; osteoporosis; prognosis
Year: 2022 PMID: 36033513 PMCID: PMC9417646 DOI: 10.3389/fonc.2022.973307
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 5.738
Figure 1Flow diagram of the study population selected from Zhejiang Provincial People’s Hospital and the Second Affiliated Hospital of Soochou University. According to the inclusion and exclusion criteria, a total of 599 patient were included in this study,and they were randomly cut into the training and internal test sets in a 7:3 ratio. Data from the Second Affiliated Hospital of Soochou University as an external test set.
Figure 2(A) Five-fold cross-validation results of different machine models in training set. Abbreviations: DT: Decision tree; LR: Logistic regression; MLP: Multilayer Pecepreon; NBC: Naive Bayes classification; RF: Random Forest; XGB: eXtreme gradient boosting. (B) The ROC curve of different machine learning models,FRAX score and OSTA score in training test set. (C) The ROC curve of different machine learning models, FRAX score and OSTA score in internal test set. (D) Prediction performance of different models, FRAX score and OSTA score in internal test set. (E) The DCA curve of different machine learning models, FRAX score and OSTA score in internal test set. (F) The ROC curve of different machine learning models, FRAX score and OSTA score in external test set. (G) Prediction performance of different models, FRAX score and OSTA score in external test set. (H) The DCA curve of different machine learning models, FRAX score and OSTA score in external test set. (I) Summary plots for SHAP values. For each feature, one point corresponds to a single patient. A point’s position along the x axis (i.e., the actual SHAP value) represents the impact that feature had on the model’s output for that specific patient. (osteoporosis predicting model).
Figure 3(A) Five-fold cross-validation results of different machine models in training set. (B) The ROC curve of different machine learning models, FRAX score in training test set. (C) The ROC curve of different machine learning models, FRAX score in internal test set. (D) Prediction performance of different models, FRAX score in internal test set. (E) The DCA curve of different machine learning models, FRAX score in internal test set. (F) The ROC curve of different machine learning models, FRAX score in external test set. (G) Prediction performance of different models, FRAX score in external test set. (H) The DCA curve of different machine learning models, FRAX score in external test set. (I) Feature importance plot for the XGB osteoporosis prediction model. All the features are shown in this figure. The blue and red points in each row represent nodules having low to high values of the specific feature, while the x-axis shows the SHAP value, indicating the impact on the model. (fracture predicting model).
Figure 4(A) Five-fold cross-validation results of different machine models in training set. (B) The ROC curve of different machine learning models, TNM stage model in training test set. (C) The ROC curve of different machine learning models and TNM stage model in internal test set. (D) Prediction performance of different models and TNM stage model in internal test set. (E) The DCA curve of different machine learning models and TNM stage model in internal test set. (F) The ROC curve of different machine learning models and TNM stage model in external test set. (G) Prediction performance of different models and TNM stage model in external test set. (H) The DCA curve of different machine learning models and TNM stage model in external test set. (I) Feature importance plot for the XGB osteoporosis prediction model. All the features are shown in this figure. The blue and red points in each row represent nodules having low to high values of the specific feature, while the x-axis shows the SHAP value, indicating the impact on the model. (survival predicting model for 8 years).
Figure 5Screenshot of the web-based model. Screenshot of the XGB osteoporosis predicting model, which is available at https://share.streamlit.io/lry4000/osteoporosis/main.