| Literature DB >> 35450069 |
Vincenzo Venerito1, Giacomo Emmi2, Luca Cantarini3, Pietro Leccese4, Marco Fornaro1, Claudia Fabiani5, Nancy Lascaro4, Laura Coladonato1, Irene Mattioli2, Giulia Righetti1, Danilo Malandrino2, Sabina Tangaro6,7, Adalgisa Palermo2, Maria Letizia Urban2, Edoardo Conticini3, Bruno Frediani3, Florenzo Iannone1, Giuseppe Lopalco1.
Abstract
Background: Inferential statistical methods failed in identifying reliable biomarkers and risk factors for relapsing giant cell arteritis (GCA) after glucocorticoids (GCs) tapering. A ML approach allows to handle complex non-linear relationships between patient attributes that are hard to model with traditional statistical methods, merging them to output a forecast or a probability for a given outcome. Objective: The objective of the study was to assess whether ML algorithms can predict GCA relapse after GCs tapering.Entities:
Keywords: algorithm; giant cell (temporal) arteritis; glucocorticoids; machine learning; precision medicine
Mesh:
Substances:
Year: 2022 PMID: 35450069 PMCID: PMC9017227 DOI: 10.3389/fimmu.2022.860877
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 8.786
Cohort characteristics.
| Cohort Characteristics | BASELINE | RELAPSE | ||
|---|---|---|---|---|
| Av. Obs. | Av. Obs | |||
| Female, n (%) | 107 | 73 (68.2) | ||
| Age, mean (SD) | 107 | 74.1 (8.5) | ||
| Disease duration, weeks, mean (SD) | 107 | 17.7 (37.5) | ||
| BMI, mean (SD) | 107 | 26.1 (4.2) | ||
|
| 107 | 21 (19.6) | ||
| Cardiovascular diseases, n (%) | 104 | 68 (65.4) | ||
| Temporal artery abnormalities, n (%) | 105 | 33 (31.4) | ||
| Positive temporal artery biopsy, n (%) | 27 | 15 (55.6) | ||
| Halo sign, n (%) | 78 | 67 (15.9) | ||
|
| 101 | 56.3 (27.3) | 40 | 10.5 (4.1) |
| CRP, mg/l, mean (SD) | 102 | 50.5 (80.8) | 38 | 46.4 (22.1) |
| Fever (>38°C), n (%) | 107 | 41 (38.3) | 40 | 2 (5) |
| Headache, n (%) | 107 | 82 (76.6) | 40 | 10 (25) |
| Ophthalmologic symptoms, n (%) | 107 | 33 (30.8) | 40 | 3 (7.5) |
| Weight loss (>2 kg), n (%) | 103 | 32 (31.07) | ||
| Jaw claudication, n (%) | 107 | 36 (33.6) | ||
| Aortic aneurysm, n (%) | 106 | 3 (2.83) | ||
|
| 107 | 58 (54.2) | 40 | 27 (67.5) |
| Glucocorticoid dose, mg PDN, mean (SD) | 107 | 27.7 (17.6) | 40 | 10.5 (4.1) |
| Time to remission, weeks, mean (SD) | 57 | 3.8 (2.4) | ||
| Relapse at three months for steroid tapering, n (%) | 107 | 407 (37.4) | ||
CRP, C reactive protein; DM, Diabetes mellitus; ESR, Erythrocyte Sedimentation Rate; GCs, Glucocorticoid.
The wrapper algorithm automatically selected among all the gathered attributes at baseline the best number of features based on their importance for predictions of GCA flare. The attributes selected as the core set to train algorithms are all in bold.
Figure 1Attribute core set used for training and validation of the algorithms ranked for feature importance score. DM, Diabetes mellitus; ESR, Erythrocyte Sedimentation Rate; PMR, Polymyalgia Rheumatica.
Figure 2Receiver operating characteristic curves of the assessed algorithms. AUROC, Area under the receiver operating characteristic curve; DT, Decision tree; LR, Logistic Regression; RF, Random Forest.
Performance of the ML algorithms.
| Cut-off | Accuracy (%) | SD | Recall (%) | SD | Precision (%) | SD | AUROC | SD | |
|---|---|---|---|---|---|---|---|---|---|
| LR | 0.52 | 70.4 | 0.11 | 62.5* | 0.25* | 62.6 | 0.08 | 0.73 | 0.08 |
| RF | 0.46* | 71.4* | 0.06* | 60 | 0.14 | 72.1* | 0.05* | 0.76* | 0.05* |
| DT | 1 | 62.9 | 0.07 | 47.5 | 0.14 | 50.8 | 0.11 | 0.65 | 0.11 |
*p < 0.001.
AUROC, area under the receiver operating characteristic curve; DT, Decision tree; LR, Logistic Regression; RF, Random Forest.
Figure 3A sample decision tree among those included into RF. At each node data are split according to ESR, presence/absence DM or presence/absence of PMR. DM, Diabetes mellitus; ESR, Erythrocyte Sedimentation Rate; PMR, Polymyalgia Rheumatica.
Figure 4Calibration curve before (upper panel) and after (lower panel) isotonic calibration. After calibration, GCA flare roughly happened with an observed relative frequency (dotted line) consistent with the forecast value (solid orange line), showing an acceptable calibration.