| Literature DB >> 35508670 |
Hidemasa Matsuo1, Mayumi Kamada2, Akari Imamura3, Madoka Shimizu3, Maiko Inagaki3, Yuko Tsuji3, Motomu Hashimoto4,5, Masao Tanaka4, Hiromu Ito4,6,7, Yasutomo Fujii3.
Abstract
Recent effective therapies enable most rheumatoid arthritis (RA) patients to achieve remission; however, some patients experience relapse. We aimed to predict relapse in RA patients through machine learning (ML) using data on ultrasound (US) examination and blood test. Overall, 210 patients with RA in remission at baseline were dichotomized into remission (n = 150) and relapse (n = 60) based on the disease activity at 2-year follow-up. Three ML classifiers [Logistic Regression, Random Forest, and extreme gradient boosting (XGBoost)] and data on 73 features (14 US examination data, 54 blood test data, and five data on patient information) at baseline were used for predicting relapse. The best performance was obtained using the XGBoost classifier (area under the receiver operator characteristic curve (AUC) = 0.747), compared with Random Forest and Logistic Regression (AUC = 0.719 and 0.701, respectively). In the XGBoost classifier prediction, ten important features, including wrist/metatarsophalangeal superb microvascular imaging scores, were selected using the recursive feature elimination method. The performance was superior to that predicted by researcher-selected features, which are conventional prognostic markers. These results suggest that ML can provide an accurate prediction of relapse in RA patients, and the use of predictive algorithms may facilitate personalized treatment options.Entities:
Mesh:
Year: 2022 PMID: 35508670 PMCID: PMC9068780 DOI: 10.1038/s41598-022-11361-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Flow chart depicting patient selection. From 563 RA patients enrolled in the KURAMA cohort in 2015, 390 patients with available follow-up data in 2017 were selected. Next, 323 patients whose US data were available in 2015 were selected. Of the 323 patients, DAS28-CRP data in 2015 and 2017 were available in 293 patients and 81 patients with non-remission (DAS28-CRP ≥ 2.3) in 2015 were excluded. Two of the 212 patients in remission (DAS28-CRP < 2.3) in 2015 lacking the most blood test data (> 80%) were excluded. Finally, the remaining 210 patients were divided into Group 1 (patients with remission in 2017, n = 150) and Group 2 (patients with relapse in 2017, n = 60). KURAMA Kyoto University Rheumatoid Arthritis Management Alliance, US ultrasound, DAS28 disease activity score on 28 joints, CRP C-reactive protein.
Patient characteristics (n = 210).
| Median (range) or n (%) | |||||
|---|---|---|---|---|---|
| Remission (n = 150) | Relapse (n = 60) | ||||
| Age, years | 63.8 | (28.1–81.4) | 66.8 | (20.4–91.4) | 0.18# |
| Female, n (%) | 123 | (82.0) | 49 | (81.7) | 1.00§ |
| Disease duration, years | 7.1 | (0.9–71.6) | 9.9 | (1.2–41.7) | 0.06# |
| RF positive, n (%) | 111 | (74.0) | 50 | (83.3) | 0.20§ |
| Anti-CCP positive, n (%) | 109 | (72.7) | 46 | (76.7) | 0.61§ |
| CRP, mg/dl | 0.1 | (0.0–3.6) | 0.1 | (0.0–3.6) | 0.51# |
| DAS28-CRP | 1.3 | (1.0–2.3) | 1.6 | (1.0–2.3) | 0.007# |
| SDAI | 2.5 | (0.0–8.8) | 3.5 | (0.2–10.4) | 0.0009# |
| CDAI | 2.2 | (0.0–8.8) | 3.2 | (0.1–9.8) | 0.002# |
| HAQ | 0.3 | (0.0–1.9) | 0.5 | (0.0–2.5) | 0.007# |
| Pt-VAS (mm) | 10.0 | (0.0–70.0) | 13.0 | (0.0–80.0) | 0.03# |
| Use of glucocorticoid, n (%) | 27 | (18.0) | 15 | (25.0) | 0.26§ |
| Use of methotrexate, n (%) | 116 | (77.3) | 34 | (56.7) | 0.004§ |
| Use of biologics, n (%) | 74 | (49.3) | 26 | (43.3) | 0.45§ |
| TNF inhibitors, n (%) | 45 | (30.0) | 15 | (25.0) | 0.50§ |
| Tocilizumab, n (%) | 18 | (12.0) | 6 | (10.0) | 0.81§ |
| Abatacept, n (%) | 10 | (6.7) | 4 | (6.7) | 1.00§ |
| Tofacitinib, n (%) | 1 | (0.7) | 1 | (1.7) | 0.49§ |
The groups are defined as follows: Remission: Patients with remission in both 2015 and 2017. Relapse: Patients with remission in 2015 and relapse in 2017.
RF rheumatoid factor, CCP cyclic citrullinated peptide antibody, CRP C-reactive protein, DAS28 disease activity score on 28 joints, SDAI simplified disease activity index, CDAI clinical disease activity index, HAQ Health Assessment Questionnaire, Pt-VAS patient global assessment with visual analog scale. #Mann–Whitney U test; §Fisher’s exact test.
AUCs for predicting relapse in RA patients using US examination data, blood test data, or all data calculated by each model.
| Model | AUC | ||
|---|---|---|---|
| US | Blood | All | |
| Logistic Regression | 0.659 | 0.571 | 0.645 |
| Random Forest | 0.621 | 0.615 | 0.677 |
| XGBoost | 0.650 | 0.577 | 0.664 |
RA rheumatoid arthritis, US ultrasound, AUC area under the curve.
Prediction results of relapse in RA patients using researcher/RFE-selected features calculated by each model.
| Model | AUC | Accuracy | Precision | Recall | F1-Score | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Researcher | RFE | Researcher | RFE | Researcher | RFE | Researcher | RFE | Researcher | RFE | |
| Logistic Regression | 0.643 | 0.701 | 0.629 | 0.667 | 0.576 | 0.626 | 0.590 | 0.647 | 0.571 | 0.625 |
| Random Forest | 0.658 | 0.719 | 0.695 | 0.729 | 0.619 | 0.691 | 0.562 | 0.595 | 0.552 | 0.596 |
| XGBoost | 0.590 | 0.747 | 0.610 | 0.776 | 0.528 | 0.735 | 0.532 | 0.703 | 0.524 | 0.706 |
RA rheumatoid arthritis, RFE recursive feature elimination, AUC area under the curve.
Figure 2ROC curves for predicting relapse in RA patients. ROC curves of Logistic Regression, Random Forest, and XGBoost for predicting relapse in RA patients are shown. ROC receiver operating characteristics, RA rheumatoid arthritis.
Comparison of the researcher/RFE-selected features.
| Selected by | Features |
|---|---|
| Researcher | Gender, Disease duration, Age, Wrist SMI score, MTP SMI score, ESR (1 h), CRP, RF, anti-CCP, MMP-3 |
| RFE | Height, Wrist SMI score, MTP SMI score, Lisfranc GS score, Cuneonavicular GS score, LYMPH, ESR (1 h), PLT, ALT, CRE |
In RFE-selected features, those selected in XGBoost were shown.
SMI superb microvascular imaging, ESR erythrocyte sedimentation rate, CRP C-reactive protein, RF rheumatoid factor, CCP cyclic citrullinated peptide antibody, MMP-3 matrix metalloproteinase-3, RFE recursive feature elimination, MTP metatarsophalangeal, LYMPH lymphocyte count, PLT platelet count, ALT alanine aminotransferase, GS gray scale, CRE creatinine.
Figure 3Feature importance for predicting relapse and comparison of each feature between RA patients with remission and relapse. (A) Importance of features for predicting relapse calculated by XGBoost model. (B) Comparison of each feature between RA patients with remission and relapse. (C) Visualization of the characteristics of the selected features in XGBoost model using tSNE. *P < 0.05, **P < 0.01, ***P < 0.001. SMI superb microvascular imaging, MTP metatarsophalangeal, LYMPH lymphocyte count, ESR erythrocyte sedimentation rate, PLT platelet count, ALT alanine aminotransferase, GPT glutamic pyruvic transaminase, GS gray scale, CRE creatinine, US ultrasound.