| Literature DB >> 35935756 |
Johannes Knitza1,2,3, Lena Janousek4, Felix Kluge4, Cay Benedikt von der Decken5,6,7, Stefan Kleinert7,8,9, Wolfgang Vorbrüggen7,10, Arnd Kleyer1,2, David Simon1,2, Axel J Hueber1,11, Felix Muehlensiepen3,12, Nicolas Vuillerme3,13,14, Georg Schett1,2, Bjoern M Eskofier4, Martin Welcker7,15, Peter Bartz-Bazzanella6,7.
Abstract
Introduction: Rheport is an online rheumatology referral system allowing automatic appointment triaging of new rheumatology patient referrals according to the respective probability of an inflammatory rheumatic disease (IRD). Previous research reported that Rheport was well accepted among IRD patients. Its accuracy was, however, limited, currently being based on an expert-based weighted sum score. This study aimed to evaluate whether machine learning (ML) models could improve this limited accuracy. Materials and methods: Data from a national rheumatology registry (RHADAR) was used to train and test nine different ML models to correctly classify IRD patients. Diagnostic performance was compared of ML models and the current algorithm was compared using the area under the receiver operating curve (AUROC). Feature importance was investigated using shapley additive explanation (SHAP).Entities:
Keywords: artificial intelligence; decision support system (DSS); digital health; machine learning; rheumatology; symptom checker; triage
Year: 2022 PMID: 35935756 PMCID: PMC9354580 DOI: 10.3389/fmed.2022.954056
Source DB: PubMed Journal: Front Med (Lausanne) ISSN: 2296-858X
Example of a questionnaire summary report including the respective score calculation.
| Question | Answers | Factor | Weight | Sub-score |
| Gender | Female | 0 | 0% | 0.00 |
| Age | <60 years | 0 | 2% | 0.00 |
| Weight loss | No | 0 | 3% | 0.00 |
| Duration of complaints | 6 months, <12 months | 3 | 5% | 0.15 |
| Preceding injury | No | 5 | 1% | 0.05 |
| Preceding infection | No | 0 | 1% | 0.00 |
| Preceding tick sting | No | 0 | 0% | 0.00 |
| Referring physician | General practitioner | 0 | 1% | 0.00 |
| Lab results | No lab results available | 0 | 10% | 0.00 |
| Family history | No | 0 | 0% | 0.00 |
| Joint pain | With movement | 1 | 10% | 0.10 |
| Joint swelling | Big toe | 10 | 10% | 1.00 |
| Finger swelling | Whole finger | 10 | 4.5% | 0.45 |
| Duration of joint swelling | >6 weeks, <half a year | 10 | 2.5% | 0.25 |
| Joint stiffness | All day long | 1 | 7.5% | 0.075 |
| Headache | Neck and back of head | 1 | 7.5% | 0.075 |
| Lower back pain | No | 0 | 7.5% | 0.00 |
| Other pain | No | 0 | 7.5% | 0.00 |
| Pain related limitation of movement | No | 0 | 2.5% | 0.00 |
| Muscle weakness | No | 0 | 10% | 0.00 |
| General symptoms | Disrupted sleep, often tired | 0 | 2.5% | 0.00 |
| Other symptoms | Fever >38°C | 4 | 2.5% | 0.10 |
| Comorbidities | Psoriasis | 4 | 2.5% | 0.10 |
| Total score | 2.35 | |||
Rheport’s triage levels, respective total score thresholds and appointment time frame.
| Triage levels | Total score thresholds | Appointment time frame | |
| 1 | Very urgent | >4.0 | Within 1 week |
| 2 | Urgent | 2.4–4.0 | Within 2 weeks |
| 3 | Intermediate | 1.0–2.4 | Within 1 month |
| 4 | IRD unlikely | <1 | Transfer (back) to GP |
Diagnostic categories.
| Diagnostic categories | |
| Inflammatory rheumatic disease | 690 (30.5) |
| Rheumatoid arthritis | 339 (15.0) |
| Psoriatic arthritis | 103 (4.5) |
| Polymyalgia rheumatica | 84 (3.7) |
| Axial spondyloarthritis | 56 (2.5) |
| Undifferentiated arthritis | 44 (1.9) |
| Reactive arthritis | 21 (0.9) |
| Systemic lupus erythematosus | 18 (0.8) |
| Crystal arthropathies | 12 (0.5) |
| Systemic sclerosis | 5 (0.2) |
| Inflammatory idiopathic myositis | 3 (0.1) |
| Behcet’s disease | 2 (0.1) |
| ANCA-associated vasculitis | 2 (0.1) |
| Giant cell arteritis | 1 (0.04) |
| Non-inflammatory rheumatic disease | 1,575 (69.5) |
FIGURE 1Flowchart of patient case selection.
FIGURE 2Comparison of machine learning model performance (A) and comparison of best performing machine learning model and current Rheport algorithm (B).
Specificity, PPV, NPV of current Rheport algorithm and best performing machine learning model according to targeted sensitivity.
| Current Rheport algorithm | Logistic regression | |||||
|
|
| |||||
| Targeted sensitivity (%) | Specificity (%) | PPV (%) | NPV (%) | Specificity (%) | PPV (%) | NPV (%) |
| 95 | 10 | 33 | 81 | 20 | 34 | 89 |
| 90 | 17 | 34 | 78 | 33 | 37 | 88 |
| 70 | 36 | 34 | 72 | 67 | 48 | 84 |
PPV, positive predictive value; NPV, negative predictive value.
FIGURE 3Feature importance of logistic regression model using SHAP values. High SHAP values (x-axis) represent a high impact on model output. negative values imply impact toward non-IRD classification and positive values direct toward classification as IRD. The y-axis describes the value levels; low representing 0 and negative answers.