| Literature DB >> 36087130 |
Markus Gräf1,2, Johannes Knitza3,4,5, Jan Leipe6, Martin Krusche7, Martin Welcker8, Sebastian Kuhn9, Johanna Mucke10, Axel J Hueber1,11, Johannes Hornig12, Philipp Klemm13, Stefan Kleinert14, Peer Aries15, Nicolas Vuillerme16,17,18, David Simon1,2, Arnd Kleyer1,2, Georg Schett1,2, Johanna Callhoff19,20.
Abstract
Symptom checkers are increasingly used to assess new symptoms and navigate the health care system. The aim of this study was to compare the accuracy of an artificial intelligence (AI)-based symptom checker (Ada) and physicians regarding the presence/absence of an inflammatory rheumatic disease (IRD). In this survey study, German-speaking physicians with prior rheumatology working experience were asked to determine IRD presence/absence and suggest diagnoses for 20 different real-world patient vignettes, which included only basic health and symptom-related medical history. IRD detection rate and suggested diagnoses of participants and Ada were compared to the gold standard, the final rheumatologists' diagnosis, reported on the discharge summary report. A total of 132 vignettes were completed by 33 physicians (mean rheumatology working experience 8.8 (SD 7.1) years). Ada's diagnostic accuracy (IRD) was significantly higher compared to physicians (70 vs 54%, p = 0.002) according to top diagnosis. Ada listed the correct diagnosis more often compared to physicians (54 vs 32%, p < 0.001) as top diagnosis as well as among the top 3 diagnoses (59 vs 42%, p < 0.001). Work experience was not related to suggesting the correct diagnosis or IRD status. Confined to basic health and symptom-related medical history, the diagnostic accuracy of physicians was lower compared to an AI-based symptom checker. These results highlight the potential of using symptom checkers early during the patient journey and importance of access to complete and sufficient patient information to establish a correct diagnosis.Entities:
Keywords: Artificial intelligence; Diagnosis; Diagnostic decision support system; Rheumatology; Symptom checker; Telemedicine
Mesh:
Year: 2022 PMID: 36087130 PMCID: PMC9548469 DOI: 10.1007/s00296-022-05202-4
Source DB: PubMed Journal: Rheumatol Int ISSN: 0172-8172 Impact factor: 3.580
Fig. 1Example of the Ada symptom assessment report excerpt presented to physicians (adapted from original report and translated to English)
Participant demographics
| Participant demographics | Value |
|---|---|
| Age (years), mean (SD) | 39 (8.2) |
| Females, n (%) | 15 (46) |
| Board-certified specialist, | 22 (67) |
| Professional experience in years: mean (SD) | 11.6 (7.4) |
| Professional experience in rheumatology in years: mean (SD) | 8.8 (7.1) |
| Working environment | |
| University hospital, | 16 (49) |
| Other hospital, | 1 (3) |
| Rheumatology practice, | 16 (49) |
Accuracy, sensitivity, specificity, positive and negative predictive value of Ada and physicians for correct classification of inflammatory rheumatic diseases
| Origin of diagnosis | Diagnoses considered | Accuracy | Sensitivity | Specificity | Positive likelihood ratio | Negative likelihood ratio |
|---|---|---|---|---|---|---|
| Physicians | Top 1 | 53% | 64% | 47% | 1.2 | 0.77 |
| Top 2 | 50% | 77% | 35% | 1.2 | 0.66 | |
| Top 3 | 50% | 81% | 33% | 1.2 | 0.58 | |
| Ada | Top 1 | 70% | 71% | 69% | 2.3 | 0.42 |
| Top 2 | 55% | 71% | 46% | 1.3 | 0.63 | |
| Top 3 | 60% | 86% | 46% | 1.6 | 0.30 | |
| Top 4 | 60% | 86% | 46% | 1.6 | 0.30 | |
| Top 5 | 60% | 86% | 46% | 1.6 | 0.30 |
Fig. 2Percentage of correctly classified IRD status by diagnosis rank, vignette difficulty and IRD status
Fig. 3Percentage of correct exact diagnoses by diagnosis rank, vignette difficulty and IRD status
Fig. 4Probabilities of diagnosis. The bars show the interquartile range. Correct and incorrect refers to the top diagnosis compared to the actual diagnosis