| Literature DB >> 32161842 |
Sung Hye Kong1, Daehwan Ahn2, Buomsoo Raymond Kim3, Karthik Srinivasan3, Sudha Ram3, Hana Kim1, A Ram Hong4, Jung Hee Kim1, Nam H Cho5, Chan Soo Shin1.
Abstract
The prediction of fracture risk in osteoporotic patients has been a topic of interest for decades, and models have been developed for the accurate prediction of fracture, including the fracture risk assessment tool (FRAX). As machine-learning methodologies have recently emerged as a potential model for medical prediction tools, we aimed to develop a novel fracture prediction model using machine-learning methods in a prospective community-based cohort. In this study, 2227 participants (1257 females) with a baseline bone mineral density (BMD) and trabecular bone score were enrolled from the Ansung cohort. The primary endpoint was the fragility fractures reported by patients or confirmed by X-rays. We used 3 different models: CatBoost, support vector machine (SVM), and logistic regression. During a mean 7.5-year follow-up (range, 2.5 to 10 years), fragility fractures occurred in 537 (25.6%) of participants. In predicting total fragility fractures, the area under the curve (AUC) values of the CatBoost, SVM, and logistic regression models were 0.688, 0.500, and 0.614, respectively. The AUC value of CatBoost was significantly better than that of FRAX (0.663; p < 0.001), whereas the the SVM and logistic regression models were not. Compared with the conventional models such as SVM and logistic regression, the CatBoost model had the best performance in predicting total fragility fractures (p < 0.001). According to feature importance in the CatBoost model, the top predicting factors (listed in order) were total hip, lumbar spine, and femur neck BMD, subjective arthralgia score, serum creatinine, and homocysteine. The latter three factors were listed higher than conventional predictors such as age or previous fracture history. In summary, we hereby report the development of a prediction model for fragility fractures using a machine-learning method, CatBoost, which outperforms the FRAX model as well as two conventional machine-learning models. The model was also able to propose novel high-ranking predictors.Entities:
Keywords: FRACTURE; MACHINE LEARNING; PREDICTION MODEL; PROSPECTIVE COHORT
Year: 2020 PMID: 32161842 PMCID: PMC7059838 DOI: 10.1002/jbm4.10337
Source DB: PubMed Journal: JBMR Plus ISSN: 2473-4039
Performance in AUC of Machine‐Learning Models
| Total fracture | Vertebral fracture | Hip fracture | |
|---|---|---|---|
| CatBoost model with all variables | 0.688 | 0.684 | 0.656 |
| CatBoost model with top‐20 variables | 0.688 | 0.656 | 0.653 |
| Logistic regression model with all variables | 0.614 (0.612–0.616) | 0.663 (0.661–0.664) | 0.606 (0.598–0.614) |
| Logistic regression model with top‐20 variables | 0.565 | 0.628 | 0.622 |
| SVM model with all variables | 0.500 (0.500–0.501) | 0.502 (0.501–0.502) | 0.502 (0.502–0.502) |
| SVM model with top‐20 variables | 0.542 | 0.563 | 0.503 (0.497–0.510) |
Evaluation of the performance of the prediction models were done in area under the curve (AUC) score with randomly selected threefold cross‐validation for 1000 times.
SVM = support vector machine.
Refers to p < 0.001 of model with top‐20 variables compared with model with all variables.
Refers to p < 0.001 of CatBoost model compared with the logistic regression model.
Refers to p < 0.001 of CatBoost model compared with the SVM model.
Refers to p < 0.001 of CatBoost model with top‐20 variables compared with the logistic regression model with top‐20 variables.
Refers to p < 0.001 of CatBoost model with top‐20 variables compared with the SVM model with the top‐20 variables.
Figure 1Study participants and used models. SVM = support vector machine.
Clinical Characteristics of Participants
| Total ( | Without fracture ( | With fracture ( |
| |
|---|---|---|---|---|
| Age, years | 61.2 ± 8.7 | 60.4 ± 8.7 | 63.7 ± 8.2 | <0.001 |
| Female | 1257 (56.4) | 927 (54.9%) | 330 (61.5%) | 0.008 |
| BMI, kg/m2 | 24.4 ± 3.3 | 24.4 ± 3.2 | 24.3 ± 3.3 | 0.551 |
| Menarche, years | 16.1 ± 1.9 | 16.0 ± 1.8 | 16.4 ± 1.9 | <0.001 |
| Menopause, years | 46.5 ± 10.7 | 46.0 ± 10.9 | 47.8 ± 10.0 | 0.203 |
| Ever smoker | 746 (33.6%) | 580 (34.4%) | 166 (31.0%) | 0.166 |
| Ever drinker | 349 (16.9%) | 266 (17.1%) | 83 (16.0%) | 0.603 |
| History of previous fracture | 206 (9.3%) | 120 (7.1%) | 86 (16.0%) | <0.001 |
| Diabetes | 284 (12.8%) | 220 (13.0%) | 64 (12.0%) | 0.564 |
| Hypertension | 934 (1.8%) | 593 (35.2%) | 210 (39.3%) | 0.188 |
| Osteoporosis | 514 (23.1%) | 338 (20.0%) | 176 (32.9%) | <0.001 |
| Arthritis | 866 (39.8%) | 968 (58.4%) | 341 (66.0%) | 0.003 |
| Arthralgia, score | 1.6 ± 3.1 | 1.4 ± 2.1 | 2.1 ± 5.0 | 0.001 |
| K‐MMSE, score | 23.2 ± 6.2 | 23.6 ± 6.0 | 22.2 ± 6.5 | 0.001 |
| K‐GDS, score | 4.3 ± 4.0 | 4.0 ± 3.9 | 5.1 ± 4.2 | <0.001 |
| Hba1c, % | 5.9 ± 1.0 | 5.9 ± 1.0 | 5.8 ± 1.0 | 0.774 |
| Creatinine, mg/dL | 0.9 ± 0.2 | 0.9 ± 0.2 | 0.9 ± 0.2 | 0.007 |
| ALT, mg/dL | 24.9 ± 16.6 | 25.0 ± 16.7 | 24.7 ± 16.3 | 0.652 |
| AST, mg/dL | 27.2 ± 13.3 | 27.1 ± 12.6 | 27.7 ± 15.1 | 0.348 |
| CRP, mg/dL | 1.8 ± 5.2 | 1.7 ± 5.0 | 1.9 ± 5.7 | 0.510 |
| Homocysteine, μmol/L | 12.1 ± 5.0 | 12.1 ± 5.1 | 12.2 ± 4.6 | 0.579 |
| TSH, μIU/mL | 1.7 ± 1.7 | 1.7 ± 1.8 | 1.6 ± 1.4 | 0.146 |
| HOMA‐β cell | 106.0 ± 82.6 | 105.4 ± 87.8 | 107.8 ± 63.6 | 0.496 |
| Lumbar BMD, g/cm2 | 1.007 ± 0.194 | 1.030 ± 0.184 | 0.956 ± 0.192 | <0.001 |
| Femur neck BMD, g/cm2 | 0.834 ± 0.146 | 0.858 ± 0.142 | 0.793 ± 0.139 | <0.001 |
| Total hip BMD, g/cm2 | 0.899 ± 0.154 | 0.924 ± 0.151 | 0.850 ± 0.148 | <0.001 |
| Lumbar TBS, score | 1.406 ± 0.112 | 1.392 ± 0.094 | 1.357 ± 0.097 | <0.001 |
| Follow‐up duration, years | 7.5 ± 1.6 | 7.7 ± 1.3 | 6.9 ± 2.3 | <0.001 |
| Mortality | 128 (5.7%) | 105 (6.2%) | 23 (4.3%) | 0.117 |
| FRAX (major, without BMD), % | 5.2 ± 3.1 | 4.9 ± 2.8 | 6.1 ± 3.6 | <0.001 |
| FRAX (hip, without BMD), % | 1.6 ± 1.6 | 1.4 ± 1.5 | 2.0 ± 1.8 | <0.001 |
| FRAX (major, with BMD), % | 4.5 ± 2.9 | 4.2 ± 2.7 | 5.5 ± 3.4 | <0.001 |
| FRAX (hip, with BMD), % | 1.1 ± 1.7 | 0.9 ± 1.5 | 1.5 ± 2.0 | <0.001 |
| FRAX (major, with TBS), % | 4.4 ± 2.9 | 4.0 ± 2.6 | 5.5 ± 3.6 | <0.001 |
| FRAX (hip, with TBS), % | 0.9 ± 1.5 | 0.8 ± 1.2 | 1.5 ± 1.9 | <0.001 |
Continuous variables are expressed as mean ± SD, or median [interquartile range], and categorical variables as numbers (percentages). Comparisons between groups were analyzed by performing Student's t test, whereas a χ2 test was used for categorical variables.
ALT = alanine aminotransferase; AST = aspartate aminotransferase; CRP = C‐reactive protein; FRAX = fracture risk assessment tool; HOMA‐β = homeostasis model assessment of β‐cell function; K‐GDS = Korean geriatric depression score tool; K‐MMSE = Korean mini‐mental status examination; TBS = trabecular bone score; TSH = thyroid‐stimulating hormone (thyrotropin).
Top‐20 Features Derived From the CatBoost Model
| Ranking | Risk factor | Feature importance |
|---|---|---|
| 1 | Total hip BMD | 0.222 |
| 2 | Lumbar spine BMD | 0.112 |
| 3 | Femur neck BMD | 0.101 |
| 4 | Arthralgia score | 0.100 |
| 5 | Creatinine | 0.090 |
| 6 | Homocysteine | 0.086 |
| 7 | AST | 0.076 |
| 8 | Lumbar spine TBS | 0.072 |
| 9 | Fasting glucose | 0.068 |
| 10 | Age | 0.062 |
| 11 | Triglyceride | 0.062 |
| 12 | K‐MMSE | 0.061 |
| 13 | CRP | 0.060 |
| 14 | BMI | 0.058 |
| 15 | Menarche | 0.055 |
| 16 | Platelet | 0.049 |
| 17 | Income status | 0.043 |
| 18 | Previous fracture history | 0.041 |
| 19 | TSH | 0.040 |
| 20 | K‐GDS | 0.038 |
AST = Aspartate aminotransferase; CRP = C‐reactive protein; K‐GDS = Korean geriatric depression score; K‐MMSE = Korean mini‐mental status examination; TBS = trabecular bone score; TSH = thyroid‐stimulating hormone (thyrotropin).
Figure 2Impact of features on prediction model output. Red and blue colors represent high and low levels of each predictor. The x‐axis represents the SHAP value. A positive SHAP value means likely to have a fracture; a negative value means unlikely to have a fracture. AST = aspartate aminotransferase; TSH = thyroid‐stimulating hormone (thyrotropin); TBS = trabecular bone score; KMMSE = Korean mini‐mental status examination; CRP = C‐reactive protein; K‐GDS = Korean geriatric depression score; SHAP = Shapley additive explanations.
Figure 3Impact on prediction model output of (A) total hip BMD, (B) lumbar spine BMD, (C) subjective arthralgia score, and (D) homocysteine level. Red and blue colors represent old and young age. The y‐axis represents the SHAP value. A positive SHAP value means likely to have a fracture; a negative value means unlikely to have a fracture. SHAP = Shapley additive explanations.
Performance in AUC of Machine Learning and FRAX Score
| Total fracture |
| Hip fracture |
| |
|---|---|---|---|---|
| Machine‐learning model (CatBoost) | 0.688 (0.687–0.688) | 0.656 (0.655–0.656) | ||
| FRAX (major, without BMD), % | 0.638 | <0.001 | ‐ | ‐ |
| FRAX (major, with BMD), % | 0.660 | <0.001 | ‐ | ‐ |
| FRAX (major, with TBS), % | 0.663 | <0.001 | ‐ | ‐ |
| FRAX (hip, without BMD), % | ‐ | ‐ | 0.528 | <0.001 |
| FRAX (hip, with BMD), % | ‐ | ‐ | 0.545 | <0.001 |
| FRAX (hip, with TBS), % | ‐ | ‐ | 0.549 | <0.001 |
AUC = area under the curve; FRAX = fracture risk assessment tool; TBS = trabecular bone score.
FRAX scores compared with the machine‐learning model.