| Literature DB >> 35999028 |
Vincent Bouget1, Julien Duquesne1, Signe Hassler2,3, Paul-Henry Cournède4, Bruno Fautrel5,6, Francis Guillemin7, Marc Pallardy8,9, Philippe Broët2,3, Xavier Mariette10, Samuel Bitoun11.
Abstract
OBJECTIVES: Around 30% of patients with rheumatoid arthritis (RA) do not respond to tumour necrosis factor inhibitors (TNFi). We aimed to predict patient response to TNFi using machine learning on simple clinical and biological data.Entities:
Keywords: Outcome Assessment, Health Care; Rheumatoid Arthritis; Tumor Necrosis Factor Inhibitors
Mesh:
Substances:
Year: 2022 PMID: 35999028 PMCID: PMC9403109 DOI: 10.1136/rmdopen-2022-002442
Source DB: PubMed Journal: RMD Open ISSN: 2056-5933
Characteristics of the training set (ESPOIR) and validation set (ABIRISK) at last visit before treatment initiation
| Feature’s name | ESPOIR | ABIRISK |
| n=161 | n=118 | |
| Age, years | 49 (13) | 52 (13) |
| Female, n (%) | 115 (71) | 89 (76) |
| Weight, kg | 69 (15) | 75 (19) |
| Height, cm | 166 (9) | 167 (9) |
| Body mass index | 24.9 (4.6) | 25.6 (5.5) |
| Autoimmunity family history, n (%) | 48 (30) | 37 (32) |
| Ever smokers, n (%) | 72 (45) | 72 (61) |
| Current smokers, n (%) | 28 (17) | 33 (28) |
| Smoking cumulative dose, pack-year | 8 (13) | 15 (14) |
| Past pregnancy (among sex=female), n (%) | 92 (74) | 70 (79) |
| DAS28 | 4.6 (1.6) | 4.4 (1.1) |
| CRP, mg/L | 17 (27) | 12 (16) |
| Erythrocyte sedimentation rate, mm | 26 (23) | 23 (19) |
| Creatinine, µmol/L | 74 (17) | 66 (14) |
| AST, UI/L | 22 (8) | 25 (13) |
| ALT, UI/L | 22 (13) | 27 (20) |
| White blood, cells/109 | 7.9 (2.6) | 8.5 (3.2) |
| Neutrophils, cells/109 | 5.4 (2.4) | 6.1 (3.2) |
| Lymphocytes, cells/109 | 1.7 (0.68) | 2.1 (2.9) |
| Presence of anti-citrullinated protein antibody, n (%) | 113 (70) | 81 (70) |
| Presence of rheumatoid factor IgM | 119 (74%) | 78 (68%) |
| TNFi treatment sequences, N | N=208 | N=118 |
| Etanercept sequences, N (%) | 100 (48) | 68 (58) |
| Monoclonal anti-TNF antibodies sequences, N (%) | 108 (52) | 50 (42) |
| Adalimumab sequences, N (%) | 80 (74) | 39 (78) |
| Infliximab sequences, N (%) | 17 (16) | 11 (22) |
| Certolizumab sequences, N (%) | 8 (7) | 0 (0) |
| Golimumab sequences, N (%) | 3 (3) | 0 (0) |
| First TNFi line, N (%) | 153 (74) | 107 (91) |
| Non-responder imputation, N (%) | 10 (4.8) | 21 (18) |
| Responder to sequences, N (%) | 122 (59) | 72 (61) |
| Etanercept, N (%) | 64 (64) | 42 (62) |
| Monoclonal anti-TNF antibodies, N (%) | 58 (54) | 30 (60) |
| Co-treated with corticosteroids, N (%) | 94 (45) | 51 (43) |
| Co-treated with MTX, N (%) | 152 (73) | 64 (54) |
Results are presented as follows: mean (SD) for continuous variables and amount (percentage) for binary variables.
ALT, aspartate aminotransferase; AST, alanine transaminase; CRP, C reactive protein; DAS28, disease activity score; MTX, methotrexate; TNFi, tumour necrosis factor inhibitors.
Result of the variable selection process for the prediction of the EULAR response
| All TNFi | Etanercept | Monoclonal antibodies TNFi |
| DAS28 | DAS28 | DAS28 |
| Age | Sex | Sex |
| Ever smoked | Ever smoked | Ever smoked |
| Weight | BMI | Weight |
| Lymphocytes | ESR | |
| Neutrophils | ||
| ALT |
The feature selection was run on the training set (ESPOIR).
ALT, alanine transaminase; BMI, body mass index; DAS28, disease activity score; ESR, erythrocyte sedimentation rate; TNFi, tumour necrosis factor inhibitors.
Figure 1Performances of the models predicting the EULAR response calculated on the training set. Cross-validated AUROC of our models for each drug class on the training set with the 95% CI. The higher the AUROC, the better. Stars legend the p value ns: 5.00e-02
Performances of the best models for the EULAR response prediction
| Drug | Best model | Best missing value imputer | AUROC (train) | AUROC (validation) |
| Overall TNFi | CatBoost | MICE | 0.72 (0.68 to 0.73) | Not evaluated since worse than drug-class-specific models on the training set |
| Etanercept | Random forest | Median | 0.74 (0.68 to 0.75) | 0.70 (0.57 to 0.82) |
| Monoclonal anti-TNF antibodies | CatBoost | MICE | 0.74 (0.69 to 0.77) | 0.71 (0.55 to 0.86) |
The best model and best missing value imputer were selected on the training set (ESPOIR) using AUROC. The replication of the results was assessed on the validation set (ABIRISK). Numbers in brackets refer to 95% CIs.
AUROC, area under the receiver operating characteristic curve; MICE, Multiple Imputation by Chained Equations; TNFi, tumour necrosis factor inhibitors.
Figure 2Performances of the best model predicting the EULAR response on the training and validation sets. ROC curves of the best models for the prediction of response to overall TNFi (A), etanercept (B) and monoclonal anti-TNF antibodies (C). ROC, receiver operating characteristic; TNFi, tumour necrosis factor inhibitors.
Metrics of interest regarding the prediction of the EULAR response
| Drug | Strategy 1 (high confidence in response) | Strategy 2 (high confidence in non-response) | ||||||
| Sensitivity | Specificity | PPV | NPV | Sensitivity | Specificity | PPV | NPV | |
| Etanercept | 60% (44% to 74%) | 73% (55% to 89%) | 78% (63% to 92%) | 53% (36% to 69%) | 95% (88% to 100%) | 15% (4% to 30%) | 64% (52% to 76%) | 67% (20% to 100%) |
| Monoclonal anti-TNF antibodies | 37% (20% to 55%) | 95% (83% to 100%) | 92% (73% to 100%) | 50% (35% to 66%) | 90% (78% to 100%) | 40% (19% to 62%) | 69% (54% to 84%) | 73% (44% to 100%) |
For the two strategies presented in methods, we display the metrics on the validation set (ABIRISK). Numbers in brackets refer to 95% CIs.
NPV, negative predictive value; PPV, positive predictive value; TNF, tumour necrosis factor.
Figure 3SHAP values of the best models for the prediction of the EULAR response to overall TNFi (A), etanercept (B) and monoclonal anti-TNF antibodies (C). The SHAP values are computed on a concatenation of the training and validation sets. Only the four most potent variables are displayed. Each dot represents a patient’s data at treatment initiation and is placed on the y-axis according to its SHAP value and on the x-axis depending on the variable value. Positive (resp. negative) SHAP values influence the outcome towards a response (resp. inadequate response). This influence is proportional to the SHAP value. Female sex is encoded by 0. Jitter was added to binary variable to facilitate the reading. BMI, body mass index; DAS28, Disease Activity Score; ESR, erythrocyte sedimentation rate; ETN, etanercept; mAB, monoclonal antibodies; TNFi, tumour necrosis factor inhibitors.