| Literature DB >> 34082719 |
Abstract
BACKGROUND: Artificial Intelligence has the potential to revolutionize healthcare, and it is increasingly being deployed to support and assist medical diagnosis. One potential application of AI is as the first point of contact for patients, replacing initial diagnoses prior to sending a patient to a specialist, allowing health care professionals to focus on more challenging and critical aspects of treatment. But for AI systems to succeed in this role, it will not be enough for them to merely provide accurate diagnoses and predictions. In addition, it will need to provide explanations (both to physicians and patients) about why the diagnoses are made. Without this, accurate and correct diagnoses and treatments might otherwise be ignored or rejected.Entities:
Mesh:
Year: 2021 PMID: 34082719 PMCID: PMC8176739 DOI: 10.1186/s12911-021-01542-6
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Week 4 (top panel) and 5 (bottom panel) Probability Chart and Explanation. Initial diagnosis of IBS changes as information emerges, and the explanations constitute the relative certainty of each disease in these bar charts and text description
Post-hoc Analysis for final understanding
| Control- local Ex | Control-global Ex | Local–global | |
|---|---|---|---|
| 1. I do not understand what MediBot is doing | |||
| 2. I understand MediBot is following a systematic elimination method | |||
| 3. I think MediBot is behaving erratically | |||
| 4. I understand why MediBot changed its mind between week 4 and week 5 |
Each pair-wise comparison (n = 80) was performed with a pairwise Tukey HSD test
Fig. 2Results for explanation satisfaction scales
Results from Type-III factorial ANOVA for explanation satisfaction scales (n = 80)
| Time (week) | Explanation | Explanation: time | |
|---|---|---|---|
| Satisfaction | ηp2 = 0.04 | ηp2 = 0.01 | ηp2 = 0.02 |
| Sufficiency | ηp2 = 0.03 | ηp2 = 0.03 | ηp2 = 0.03 |
| Completeness | ηp2 = 0.03 | ηp2 = 0.04 | ηp2 = 0.02 |
| Usefulness | ηp2 = 0.06 | ηp2 = 0.01 | ηp2 = 0.03 |
| Accuracy | ηp2 = 0.08 | ηp2 = 0.06 | ηp2 = 0.02 |
| Trust | ηp2 = 0.07 | ηp2 = 0.04 | ηp2 = 0.01 |
Time and explanation refer to the main effects and Explanation: Time refers to the interaction between these independent variables
Fig. 3Results from statement ratings
Fig. 4Sample explanations used in Experiment 2. Top panel shows visualizing feature weights and rationale; bottom panel shows example-based explanations
Fig. 5Rating for explanation satisfaction scales
Results from Type- III factorial ANOVA for explanation satisfaction scales (n = 113)
| Time (week) | Explanation | Explanation: time | |
|---|---|---|---|
| Satisfaction | ηp2 = 0.25 | ηp2 = 0.04 | ηp2 = 0.06 |
| Sufficiency | ηp2 = 0.25 | ηp2 = 0.06 | ηp2 = 0.07 |
| Completeness | ηp2 = 0.24 | ηp2 = 0.08 | ηp2 = 0.06 |
| Usefulness | ηp2 = 0.25 | ηp2 = 0.02 | ηp2 = 0.05 |
| Accuracy | ηp2 = 0.20 | ηp2 = 0.16 | ηp2 = 0.07 |
| Trust | ηp2 = 0.16 | ηp2 = 0.08 | ηp2 = 0.04 |
Time and explanation refer to the main effects and Explanation: Time refers to the interaction between these independent variables
Significant differences between conditions (n = 113) at each Set according to the Tukey test, any pairing not mentioned was not significantly different for that Set
| Time 1 | Time 2 | Time 3 | |
|---|---|---|---|
| Satisfaction | None | Visual; examples > rationale; control | None |
| Sufficiency | None | Visual; examples > rationale; control | None |
| Completeness | None | Visual; examples > rationale; control | None |
| Usefulness | None | Visuals were better than control | None |
| Accuracy | Example > control | Visual; examples > rationale; control | None |
| Trust | None | Visuals; examples > control | None |
Fig. 6Mean rating for Overall Satisfaction