| Literature DB >> 36229500 |
Chingching Foocharoen1, Wilaiphorn Thinkhamrop2, Nathaphop Chaichaya2, Ajanee Mahakkanukrauh1, Siraphop Suwannaroj1, Bandit Thinkhamrop3.
Abstract
Clinical predictors of mortality in systemic sclerosis (SSc) are diversely reported due to different healthcare conditions and populations. A simple predictive model for early mortality among patients with SSc is needed as a precise referral tool for general practitioners. We aimed to develop and validate a simple predictive model for predicting mortality among patients with SSc. Prognostic research with a historical cohort study design was conducted between January 1, 2013, and December 31, 2020, in adult SSc patients attending the Scleroderma Clinic at a university hospital in Thailand. The data were extracted from the Scleroderma Registry Database. Early mortality was defined as dying within 5 years after the onset of SSc. Deep learning algorithms with Adam optimizer and different machine learning algorithms (including Logistic Regression, Decision tree, AdaBoost, Random Forest, Gradient Boosting, XGBoost, and Autoencoder neural network) were used to classify SSc mortality. In addition, the model's performance was evaluated using the area under the receiver operating characteristic curve (auROC) and its 95% confidence interval (CI) and values in the confusion matrix. The predictive model development included 528 SSc patients, 343 (65.0%) were females and 374 (70.8%) had dcSSc. Ninety-five died within 5 years after disease onset. The final 2 models with the highest predictive performance comprise the modified Rodnan skin score (mRSS) and the WHO-FC ≥ II for Model 1 and mRSS and WHO-FC ≥ III for Model 2. Model 1 provided the highest predictive performance, followed by Model 2. After internal validation, the accuracy and auROC were good. The specificity was high in Models 1 and 2 (84.8%, 89.8%, and 98.8% in model 1 vs. 84.8%, 85.6%, and 98.8% in model 2). This simplified machine learning model for predicting early mortality among patients with SSc could guide early referrals to specialists and help rheumatologists with close monitoring and management planning. External validation across multi-SSc clinics should be considered for further study.Entities:
Mesh:
Year: 2022 PMID: 36229500 PMCID: PMC9563044 DOI: 10.1038/s41598-022-22161-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Flow of patients.
Figure 2Learning curves of Autoencoder model (mRSS and WHO-FC ≥ II). Title = Autoencoder model loss. X-axis = Loss. Y-axis = number of epochs.
Figure 3Learning curves of Autoencoder model (mRSS and WHO-FC ≥ III). Title = Autoencoder model loss. X-axis = Loss. Y-axis = number of epochs.
Overall clinical characteristics.
| Clinical characteristic | N = 528 |
|---|---|
| Male sex (%) | 185 (35.0) |
| Age at onset (years); mean ± SD | 49.7 ± 11.9 |
| Age (years); mean ± SD | 59.4 ± 10.6 |
| Diffuse cutaneous SSc subset (%) | 374 (70.8) |
| mRSS (points); median (IQR) | 2 (0–8) |
| WHO-FC ≥ II (%) | 311 of 524 (59.4) |
| WHO-FC ≥ III (%) | 78 of 524 (14.9) |
| Raynaud’s phenomenon (%) | 253 (47.9) |
| Digital ulcer (%) | 109 (20.6) |
| Salt and pepper skin appearance (%) | 198 (37.5) |
| Calcinosis cutis (%) | 27 (5.1) |
| Tendon friction rub (%) | 67 (12.9) |
| Hand deformity (%) | 203 (38.5) |
| Synovitis (%) | 21 (4.0) |
| Pulmonary fibrosis (%) | 241 of 525 (45.9) |
| Pulmonary arterial hypertension (%) | 77 of 525 (14.7) |
| Cardiac involvement (%) | 6 of 55 (10.9) |
| Renal crisis (%) | 11 (2.1) |
| Esophageal involvement (%) | 220 (41.7) |
| Intestinal involvement (%) | 81 of 521 (15.6) |
| Weight loss (%) | 76 (14.4) |
| Anemia (%) | 294 (55.7) |
SD standard deviation, SSc systemic sclerosis, mRSS modified Rodnan skin score, IQR interquartile range, WHO World Health Organization.
Clinical characteristics of survived and non-survivors.
| Clinical characteristics | Survive | Non-survive | Difference (95% CI) | p-value |
|---|---|---|---|---|
| Overall (%) | 82.0 | 18.0 | ||
| Female sex; n (%) | 300 69.3) | 43 (45.3) | 24.0 (13.1 to 34.9) | < 0.001 |
| Age at onset (years); mean ± SD | 48.1 (11.4) | 57.2 (11.5) | − 9.1 (− 11.7 to − 6.1) | < 0.001 |
| Age (years); mean ± SD | 59.3 (10.5) | 59.7 (11.3) | − 0.4 (− 2.8 to 1.9) | 0.709 |
| Diffuse cutaneous SSc subset; n (%) | 298 (68.8) | 76 (80.0) | − 11.2 (− 20.3 to − 2.0) | 0.030 |
| mRSS (points); median (IQR) | 2 (0–4) | 15 (4–29) | NA | < 0.001 |
| WHO-FC ≥ II; n (%) | 233 (54.2) | 78 (83.0) | − 28.8 (− 37.7 to − 19.9) | < 0.001 |
| WHO-FC ≥ III; n (%) | 54 (12.6) | 24 (25.5) | − 13.0 (− 22.3 to − 3.6) | 0.001 |
| Tendon friction rub; n (%) | 47 (10.9) | 20 (22.0) | − 11.0 (− 20.1 to − 2.0) | 0.004 |
| Pulmonary fibrosis; n (%) | 202 (46.9) | 39 (41.5) | 5.4 (− 5.6 to 16.4) | 0.343 |
| Pulmonary arterial hypertension; n (%) | 69 (16.0) | 8 (8.5) | 7.5 (1 to 14.1) | 0.063 |
| Cardiac involvement; n (%) | 4 (8.7) | 2 (22.2) | − 0.14 (− 0.42 to 0.15) | 0.23 |
| Renal crisis; n (%) | 4 (0.9) | 7 (7.37) | − 0.06 (− 0.12 to − 0.01) | < 0.001 |
| Esophageal involvement; n (%) | 160 (37.0) | 60 (63.1) | − 0.26 (− 0.37 to − 0.15) | < 0.001 |
| Intestinal involvement; n (%) | 65 (15.2) | 16 (17.6) | − 0.02 (− 0.11 to 0.06) | 0.56 |
| Anemia; n (%) | 229 (52.8) | 65 (68.4) | − 0.16 (− 0.26 to − 0.05) | 0.01 |
SD standard deviation, SSc systemic sclerosis, mRSS modified Rodnan skin score, IQR interquartile range, WHO-FC World Health Organization functional class.
Performance of the different algorithms.
| Predictive model | Model performance | |||||
|---|---|---|---|---|---|---|
| Accuracy (%) | AUC (%) | Precision (%) | Spec (%) | Recall (%) | F1-score (%) | |
| Model 1 (mRSS and WHO FC ≥ II) | 84.8 | 89.8 | 88.9 | 98.8 | 34.8 | 70.5 |
| Model 2 (mRSS and WHO FC ≥ III) | 84.8 | 85.6 | 88.9 | 98.8 | 34.8 | 70.5 |
| Decision Tree | 81.9 | 77.0 | 45.5 | 93.1 | 27.8 | 34.5 |
| AdaBoost | 81.9 | 78.1 | 45.5 | 93.1 | 27.8 | 34.5 |
| Random Forest | 81.0 | 80.7 | 41.7 | 92.0 | 27.8 | 33.3 |
| Gradient Boosting | 81.9 | 78.7 | 45.5 | 93.1 | 27.8 | 34.5 |
| XGBoost | 81.9 | 81.0 | 45.5 | 93.1 | 27.8 | 34.5 |
| Logistic regression | 85.7 | 80.0 | 80.0 | 98.9 | 22.2 | 34.8 |
| Decision Tree | 83.8 | 66.0 | 53.8 | 93.1 | 38.9 | 45.2 |
| AdaBoost | 83.8 | 61.5 | 54.5 | 94.3 | 33.3 | 41.4 |
| Random Forest | 82.9 | 71.6 | 50.0 | 92.0 | 38.9 | 43.8 |
| Gradient Boosting | 83.8 | 66.3 | 53.8 | 93.1 | 38.8 | 45.2 |
| XGBoost | 82.9 | 74.5 | 50.0 | 93.1 | 33.3 | 40.0 |
| Logistic regression | 84.8 | 75.4 | 66.7 | 96.5 | 31.6 | 42.9 |
| Model 1 (mRSS and WHO FC ≥ II) | 82.3 | 73.0 | 62.5 | 92.7 | 44.1 | 70.4 |
| Model 2 (mRSS and WHO FC ≥ III) | 79.1 | 81.0 | 51.3 | 84.7 | 58.8 | 70.6 |
| Decision Tree | 81.6 | 79.2 | 60.0 | 93.6 | 36.4 | 45.3 |
| AdaBoost | 81.6 | 79.4 | 60.0 | 93.6 | 36.4 | 45.3 |
| Random Forest | 82.3 | 80.3 | 61.9 | 93.6 | 39.4 | 48.1 |
| Gradient Boosting | 81.6 | 78.4 | 60.0 | 93.6 | 36.4 | 45.3 |
| XGBoost | 81.6 | 80.0 | 58.3 | 92.0 | 42.4 | 49.1 |
| Logistic regression | 82.9 | 80.8 | 80.0 | 98.4 | 24.2 | 37.2 |
| Models for mRSS and WHO FC ≥ III | Results were not shown due to the modest variations between the models for mRSS and WHO FC II | |||||
AUC Area under the receiver operating characteristics (ROC), mRSS modified Rodnan skin score, WHO FC World Health Organization functional class, Spec. Specificity, precision—positive predictive value, recall—sensitivity, CV cross-validation.
Generalizability of selected model(s) presented as accuracy, area under ROC, positive predictive value, specificity, and sensitivity.
| Selected model | Accuracy | AUC (95%) | PPV (95%) | Specificity (95%) | Sensitivity (95%) |
|---|---|---|---|---|---|
| Model 1 mRSS and WHO FC ≥ II | 84.8 | 89.8 (82.7–96.8) | 88.9 (51.8–99.7) | 98.8 (93.4–100.0) | 34.8 (16.4–57.3) |
| Model 2 mRSS and WHO FC ≥ III | 84.8 | 85.6 (74.9–96.2) | 88.9 (51.8–99.7) | 98.8 (93.4–100.0) | 34.8 (16.4–57.3) |
95% CI 95% confidence interval, AUC Area under the receiver operating characteristics (ROC), mRSS modified Rodnan skin score, WHO FC World Health Organization functional class.
Figure 4Learning curves of Model 1 (mRSS and WHO-FC ≥ II), (a) Training and validation loss, (b) the model accuracy in training and validation. Title = (a) Model loss (b) Model accuracy. X-axis = (a) Loss (b) Accuracy. Y-axis = (a) number of epochs (b) number of epochs.
Figure 5Learning curves of Model 2 (mRSS and WHO-FC ≥ III), (a) Training and validation loss, (b) the model accuracy in training and validation. Title = (a) Model loss (b) Model accuracy. X-axis = (a) Loss (b) Accuracy. Y-axis = (a) number of epochs (b) number of epochs.
Figure 6Confusion Matrix for both deep learning models without autoencoder. Title = Deep learning Confusion Matrix. X-axis (Left) = True Labels (upper row = Survivors, lower row = Non-survivors). X-axis (Right) = Number of cases. Y-axis = Predicted Labels (right column = Survivors, left column = Non-survivors).