| Literature DB >> 35778714 |
Stephanie Q Duong1, Cynthia S Crowson1,2, Arjun Athreya3, Elizabeth J Atkinson1, John M Davis2, Kenneth J Warrington2, Eric L Matteson2, Richard Weinshilboum3, Liewei Wang3, Elena Myasoedova4,5.
Abstract
BACKGROUND: Methotrexate is the preferred initial disease-modifying antirheumatic drug (DMARD) for rheumatoid arthritis (RA). However, clinically useful tools for individualized prediction of response to methotrexate treatment in patients with RA are lacking. We aimed to identify clinical predictors of response to methotrexate in patients with rheumatoid arthritis (RA) using machine learning methods.Entities:
Keywords: Machine learning; Methotrexate; Rheumatoid arthritis; Treatment
Mesh:
Substances:
Year: 2022 PMID: 35778714 PMCID: PMC9248180 DOI: 10.1186/s13075-022-02851-5
Source DB: PubMed Journal: Arthritis Res Ther ISSN: 1478-6354 Impact factor: 5.606
Baseline comparisons between responder groups and training/test sets. Baseline demographic and clinical characteristics were compared (a) between the good and poor responders defined using latent class mixed modeling and (b) between the unimputed training and test sets
| 50 (40, 58) ( | 50 (41, 57) ( | 0.66 | 50 (40, 57) | 50 (40, 59) ( | 0.40 | 50 (40, 58) ( | |
| 402 (78.8) | 215 (81.1) | 0.45 | 286 (78.4) | 331 (80.7) | 0.41 | 617 (79.6) | |
| 269 (52.7) | 157 (59.2) | 0.08 | 239 (65.5) | 187 (45.6) | < 0.001 | 426 (55) | |
| 6.5 (5.9, 7.2) | 7.6 (6.9, 8.1) | < 0.001 | 7.1 (6.4, 7.8) | 6.7 (6.0, 7.4) | < 0.001 | 6.9 (6.2, 7.6) | |
| 14 (9, 20) | 21 (16, 26) | < 0.001 | 18 (12, 24) | 15 (10, 22) | < 0.001 | 16 (11, 23) | |
| 10.5 (7, 15) | 15 (11, 22) | < 0.001 | 13 (9, 19) | 11 (8, 16) | < 0.001 | 12 (8, 17) | |
| 42.5 (31, 64) | 63 (42, 88) | < 0.001 | 57 (38, 81) | 42.5 (32, 64) | < 0.001 | 49 (34, 74.5) | |
| 65 (50, 79) | 76 (61, 88) | < 0.001 | 71 (55, 83) | 67 (50, 82) | < 0.001 | 69 (52, 83) | |
| 2.3 (0.9, 5.0) ( | 3.1 (1.4, 7.2) ( | < 0.001 | 2.4 (1.3, 4.4) ( | 2.7 (0.8, 8.9) | 0.25 | 2.5 (1.0, 6.2) ( | |
| 66 (52, 76) ( | 72 (62, 83) | < 0.001 | 70 (56, 80) ( | 68 (53.2, 77) | 0.034 | 68.5 (55, 78) ( | |
| 463 (91.1) ( | 241 (90.9) | 0.93 | 329 (90.1) | 375 (91.9) ( | 0.39 | 704 (91.1) ( | |
| 447 (88.3) ( | 230 (87.8) ( | 0.82 | 325 (90.3) ( | 352 (86.3) ( | 0.09 | 677 (88.2) ( | |
| 124 (46.4) ( | 56 (36.4) ( | 0.044 | 87 (48.9) ( | 93 (38.3) ( | 0.030 | 180 (42.8) ( | |
| 1.6 (1.1, 2.0) ( | 2.0 (1.6, 2.4) ( | < 0.001 | 2.0 (1.6, 2.2) ( | 1.6 (1.1, 2.0) ( | < 0.001 | 1.8 (1.2, 2.1) ( | |
| 3.7 (1.5, 7.9) | 5.3 (2.2, 13.4) | < 0.001 | 6.9 (3.6, 17.2) | 2.2 (1.0, 5.7) | < 0.001 | 4.1 (1.7, 9.2) | |
| 199 (54.5) | 311 (75.9) | < 0.001 | |||||
Median (interquartile range) are reported unless noted otherwise
Abbreviations: ACPA, anti-citrullinated protein antibody, CRP, C-reactive protein, DAS28-ESR, disease activity score including 28-joint counts and ESR, erythrocyte-sedimentation rate, HAQ, Health Assessment Questionnaire, PtGA, Patient’s Global Assessment of Disease Activity, PhGA, Physician’s Global Assessment of Disease Activity, RA, rheumatoid arthritis, RF, rheumatoid factor, SJC28, swollen joint count, TJC28, 28-tender joint count
*p-values are reported from the Kruskal–Wallis rank sum test for continuous data and Pearson’s chi-squared test for categorical data
Fig. 1Two patient class trajectories identified with latent class modeling of DAS28-ESR (N = 775)
Model performances on the training (N = 365) and test (N = 410) sets. Two models were investigated. The “DAS28-ESR” model consisted of baseline DAS28-ESR, age, sex, race, RA duration, RF status, ACPA status, glucocorticoid use, and HAQ score. The “components of DAS28-ESR” model consisted of TJC28, SJC28, ESR, PtGA, CRP, PhGA, age, sex, race, RA duration, RF status, ACPA status, glucocorticoids use, and HAQ score. High sensitivity values observed in both LASSO models using the test set implied both models performed well in identifying those who were good responders to methotrexate. For calculating sensitivity and specificity, the cutoff was set to 0.5, with predictions greater than or equal to 0.5 classified as the “good” responder group
| Algorithm | Model | Training set ( | Test set ( | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LASSO | DAS28-ESR | 0.76 (0.71, 0.81) | 0.73 | 0.66 | 0.70 | 0.72 | 0.67 | 0.79 (0.74, 0.84) | 0.83 | 0.61 | 0.78 | 0.87 | 0.53 |
| Components of DAS28-ESR | 0.77 (0.72, 0.81) | 0.75 | 0.65 | 0.70 | 0.72 | 0.68 | 0.79 (0.74, 0.84) | 0.86 | 0.58 | 0.79 | 0.86 | 0.56 | |
| Random forests | DAS28-ESR | 0.96 (0.94, 0.98) | 0.97 | 0.96 | 0.96 | 0.97 | 0.96 | 0.68 (0.62, 0.73) | 0.81 | 0.55 | 0.75 | 0.85 | 0.48 |
| Components of DAS28-ESR | 1 | 1 | 1 | 1 | 1 | 1 | 0.68 (0.63, 0.74) | 0.77 | 0.60 | 0.73 | 0.86 | 0.45 | |
Abbreviations: AUC, area under the curve; DAS28-ESR, Disease Activity Score including 28-joint counts and erythrocyte sedimentation rate, LASSO, least absolute shrinkage and selection operator, NPV, negative predictive value, PPV, positive predictive value, RCTs, randomized clinical trials
Fig. 2Receiver-operating characteristic (ROC) curves of algorithms validated on the test set for the “DAS28-ESR” model (N = 410). The area under the curve (AUC) for the least absolute shrinkage and selection operator (LASSO; dashed) and random forests (RF; solid) methods validated on the test set were 0.79 and 0.68, respectively. LASSO, least absolute shrinkage and selection operator; RF, random forests
Fig. 3Feature importance plots of characteristics for A LASSO and B random forests. Feature importance plots for A the DAS28-ESR model with LASSO algorithm, B the components of DAS28-ESR model with LASSO algorithm, C the DAS28-ESR model with random forests methods, and D the components of DAS28-ESR model with random forests methods are provided below. Feature importance was determined based on standardized LASSO coefficients and the Gini score for random forests. The most important feature was set to 100, and the rest is relative to that feature. DAS28ESR, Disease Activity Score with 28-joint count with erythrocyte sedimentation rate; RA, rheumatoid arthritis; SJC66, 66 Swollen Joint Count; ESR, erythrocyte sedimentation rate; ACPA, anti-citrullinated protein antibodies; TJC68, 68 Tender Joint Count; CRP, C-reactive protein; HAQ, Health Assessment Questionnaire Score; PhGA, Physician’s Global Assessment of Disease Activity; PtGA, Patient’s Global Assessment of Disease Activity; MI, missing indicator
Matrix prediction model. Probability of achieving a good response to methotrexate at 24 weeks
| ≤ 2 | > 2 | ||||
|---|---|---|---|---|---|
| ≤ 7.4 | 80.1 (76.4, 83.8) | 77.3 (70.6, 84) | Positive | ||
| 77.1 (68.6, 85.6) | 74.1 (63.3, 84.9) | Negative | |||
| > 7.4 | 40.3 (32.1, 48.5) | 36.5 (29.3, 43.6) | Positive | ||
| 36.2 (23.3, 49.1) | 32.5 (20.9, 44.1) | Negative | |||
The number in each cell represents the percentage and 95% CI of achieving the outcome, based on the combination of predictors at baseline. DAS28-ESR, Disease Activity Score with 28-joint count with erythrocyte sedimentation rate, HAQ, Health Assessment Questionnaire; ACPA, anti-citrullinated protein antibody