| Literature DB >> 33032582 |
Amie J Barda1, Christopher M Horvat2,3,4,5, Harry Hochheiser6,7.
Abstract
BACKGROUND: There is an increasing interest in clinical prediction tools that can achieve high prediction accuracy and provide explanations of the factors leading to increased risk of adverse outcomes. However, approaches to explaining complex machine learning (ML) models are rarely informed by end-user needs and user evaluations of model interpretability are lacking in the healthcare domain. We used extended revisions of previously-published theoretical frameworks to propose a framework for the design of user-centered displays of explanations. This new framework served as the basis for qualitative inquiries and design review sessions with critical care nurses and physicians that informed the design of a user-centered explanation display for an ML-based prediction tool.Entities:
Keywords: Clinical decision support systems; Explainable artificial intelligence; In-hospital mortality; Machine learning; Pediatric intensive care units; User-computer interface
Mesh:
Year: 2020 PMID: 33032582 PMCID: PMC7545557 DOI: 10.1186/s12911-020-01276-x
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Proposed framework for designing user-centered displays of explanation
Fig. 2Summary of an initial context of use and a possible space of explanation designs. Note that any supporting information needs or dimensionality preferences would be determined based on input from users
Design options and rationales for main factors to consider for explanation presentation
| Factor | Design Options | Rationale |
|---|---|---|
| Unit of explanation | Individual features | Lower information granularity can reduce cognitive load and processing time. Evidence supports the use of lower information granularity for non-AI/ML experts via feature groupings or extractions [ |
| Feature groups | ||
| Explanation unit organization | None | Explanations including causes that are abnormal or controllable (i.e., modifiable) might be preferred [ |
| Influence groups | ||
| Assessment groups | ||
| Dimensionality (size & granularity) | Static | Dimensionality can be reduced through information removal (e.g., reducing explanation size) or aggregation (e.g., reducing explanation granularity). The desired dimensionality of an explanation may vary by individual and prediction, [ |
| Interactive | ||
| Risk representation | Probability | Critical care providers should be comfortable with the risk representation format. Risk information in feature influence explanations has been previously reported in terms of odds and probability, [ |
| Odds | ||
| Explanation display format | Force plot | Visual representations of risk information may facilitate comprehension of risk [ |
| Tornado plot |
Fig. 3Prototypes of explanation displays utilizing different design options. Design options used in each prototype are listed as follows: a) unit of explanation, b) organization of explanation units, c) dimensionality, d) risk representation, and e) explanation display format. Prototype 1 options: a) individual features, b) none, c) interactive explanation size, static explanation unit granularity, d) odds, e) tornado plot. Prototype 2 options: a) feature groups, b) influence groups, c) interactive explanation size, interactive explanation unit granularity, d) probability, e) tornado plot. Prototype 3 options: a) feature groups, b) influence groups, c) static explanation size, static explanation unit granularity, d) probability, e) force plot. Prototype 4 options: a) individual features, b) influence groups, c) interactive explanation size, static explanation unit granularity, d) probability, e) tornado plot. Prototype 5 options: a) feature groups, b) influence groups and assessment groups, c) interactive explanation size, interactive explanation unit granularity, d) probability, e) tornado plot
Fig. 4Example of complete prototype explanation display with supporting information. In addition to the explanation plot and model prediction (top left), each prototype included demographic information (bottom left), a list of current diagnoses (bottom right), a table of raw values of the features used in the model (middle right), and an interactive plot where the raw values of time series data from laboratory tests and vital signs could be viewed (top right). Please note that length of stay has been redacted to protect patient privacy
Perceptions of focus group participants on context of use and perceived influences on model perceptions
| User goal (why) | User characteristic (who) | Desired information | Positive (+) and negative (−) influences on perceptions | |
|---|---|---|---|---|
| Verification | Predictive modeling knowledge | Detailed | Predictive performance Alignment with domain knowledge Comparison with existing models Modeling processes | Credibility + high predictive performance + predictions that aligned with clinical knowledge - influential outliers or data errors - counterintuitive risk factors - model limitations |
| Basic | Predictive performance Alignment with domain knowledge | |||
| Learning | Clinical role | Physician | Obtain patient insights: Prioritization Assessment of status Highlight patients/info of concern | Utility + training on use/interpretation - clinically irrelevant information |
| Nurse | Actionable information Alerts to changes Information to intervene or justify consult | Usability + appropriate alerts - high cognitive effort or attention - large time investments | ||
Explanation design preferences
| Desired content (what) | Benefits | Preferred design |
|---|---|---|
| Explanations | Help assess model credibility and utility | Risk expressed as percent probability High-level information with details available on demand Interactive options to support different displays/organizations for various users |
| Table of raw feature values | Interpret discretized features Examine trend-based features | Directionality for trend-based features Simpler terminology |
| Time-series data plot | Investigate suspicious values Assess trends and baselines | Multiple plots Highlight points related to features Auto-population of data |
| Contextual information | Clinically meaningful interpretation Context for risk prediction | Providing clinical context information Prominent display of baseline risk Inclusion of risk trends |
Fig. 5Provider preferences on prototype design options by clinical role
Fig. 6Final user-centered explanation display. The predicted risk and baseline risk are displayed in percent probability at the top of the figure. The explanation plot (top left) uses feature groups as the unit of explanation, but has hover-box capability to view individual features within each feature group. In the hover-box, trend-based features are summarized by trend direction (e.g., “Cr has increased since minimum value”). The plot includes interactive controls to view additional feature groups (e.g., scrolling down the explanation plot) and view different sets of feature groups (e.g., view laboratory test feature groups). The table of raw feature values (bottom left) includes the description, value, and contribution to the predicted risk for each individual feature. This table also includes the trend direction for trend-based features. The time-series plots to display raw values of laboratory test and vital sign data (right) highlight the points used to compute features and include interactive controls to zoom in on and select regions of data. These plots also have hover functionality that can be used to show the value and time of a specific point. To facilitate data exploration, interactivity is linked across plots and tables (e.g., selecting a predictor on the explanation plot will highlight it in the raw feature table and load the appropriate laboratory test/vital sign in the time-series plot)