| Literature DB >> 19561711 |
Paul Walsh1, Donal Doyle, Kemedy K McQuillen, Joshua Bigler, Caleb Thompson, Ed Lin, Padraig Cunningham.
Abstract
BACKGROUND: Decision-support tools (DST) are typically developed by computer engineers for use by clinicians. Prototype testing DSTs may be performed relatively easily by one or two clinical experts. The costly alternative is to test each prototype on a larger number of diverse clinicians, based on the untested assumption that these evaluations would more accurately reflect those of actual end users. HYPOTHESIS: We hypothesized substantial or better agreement (as defined by a kappa statistic greater than 0.6) between the evaluations of a case based reasoning (CBR) DST predicting ED admission for bronchiolitis performed by the clinically diverse end users, to those of two clinical experts who evaluated the same DST output.Entities:
Year: 2008 PMID: 19561711 PMCID: PMC2672244
Source DB: PubMed Journal: West J Emerg Med ISSN: 1936-900X
Figure 1Case flow through the study.
Figure 2Age and severity of illness of patients.
Figure 3Evaluation (raw scores) of the DST by the end users and expert reviewers.
Agreement between evaluators on the predicted disposition. The values in parentheses are the results obtained when the five categories are collapsed to three.
| CBR DST predicted disposition: Do you agree with the suggested course of action?
| |||||
|---|---|---|---|---|---|
| Evaluator 5 point scale (3 point scale) | Observed Agreement | Agreement expected by chance alone | κ | 95% C.I. | Interpretation |
| End users & Expert 1 | 93.5% (89.9%) | 87.2% (79.6%) | 0.49 (0.51) | 0.25 – 0.69 (0.25 – 0.71) | Moderate (Moderate) |
| End users & Expert 2 | 93.6% (91.6%) | 86.4% (79.9%) | 0.53 (0.58) | 0.33 – 0.68 (0.36 – 0.76) | Moderate (Moderate) |
| Expert 1 & Expert 2 | 94.5% (91.6%) | 87.3% (80.9%) | 0.56 (0.56) | 0.38– 0.70 (0.33 – 0.74) | Moderate (Moderate) |
Agreement between evaluators on the value of the explanatory dialog. The values in parentheses are the results obtained when the five categories are collapsed to three.
| CBR DST explanatory dialog: Did you find the supporting dialog useful?
| |||||
|---|---|---|---|---|---|
| Evaluator 5 point scale (3 point scale) | Observed Agreement | Agreement expected by chance alone | κ | 95% C.I. | Interpretation |
| End users & Expert 1 | 87.2% (66.0%) | 83.6% (56.7%) | 0.21 (0.21) | 0.03 – 0.40 (0.05 – 0.38) | Fair (Fair) |
| End users & Expert 2 | 84.1% (62.0%) | 83.4% (58.7%) | 0.04 (0.08) | (0.13 – 0.22) (−0.10 – 0.26) | Poor (Poor) |
| Expert 1 & Expert 2 | 78.3% (60.3%) | 79.2% (56.9%) | −0.04 (0.08) | −0.20 – 0.12 (−0.09 – 0.25) | None (Poor) |
| Features | Patient | Explanation Case |
|---|---|---|
| Age | 1.2 | 1.8 |
| Birth | Vaginal | Vaginal |
| Smoking Mother | No | No |
| Hydration before treatment | Normal | Normal |
| O2 saturation before treatment | 99.0 | 98.0 |
| Retraction severity before treatment | None | Mod |
| Heart rate after treatment | 129 | 129 |
| Overall increase in work of breathing | None | None |
| Oxygen saturation under 92 after treatment | No (100.0) | No (99.0) |
| Respiratory rate over 60 after treatment | No (42) | No (38) |
| Temperature over 100.4 after treatment | No (98.0) | No (99.9) |
| Work of breathing after treatment | Same | Improved |
| Disposition | Admit |
| Definitely Not | No | Maybe | Yes | Absolutely | |
|---|---|---|---|---|---|
| Q1. Do you agree with the suggested course of action? | □ | □ | □ | □ | □ |
| Q2. Did you find the explanation case useful? | □ | □ | □ | □ | □ |
| Q3. Did you find the supporting dialog useful? | □ | □ | □ | □ | □ |
Agreement between evaluators on the value of the explanatory case. The values in parentheses are the results obtained when the five categories are collapsed to three.
| CBR DST explanatory example: Did you find the explanation case useful?
| |||||
|---|---|---|---|---|---|
| Evaluator5 point scale(3 point scale) | Observed Agreement | Agreement expected by chance alone | κ | 95% C.I. | Interpretation |
| End users & Expert 1 | 89.2% (70.8%) | 83.0% (58.4%) | 0.36 (0.30) | 0.19 – 0.53 (0.14 – 0.46) | Fair (Fair) |
| End users & Expert 2 | 83.96% (46.0%) | 84.3% (43.2%) | −0.02 (0.05) | −0.10 – 0.04 (0.04 – 0.14) | None* (Poor) |
| Expert 1 & Expert 2 | 87.03% (59.2%) | 87.3% (52.6%) | −0.01 (0.14) | −0.08 – 0.07 (0.03 – 0.26) | None* (Poor) |