| Literature DB >> 22618855 |
Rebecca S Crowley1, Elizabeth Legowski, Olga Medvedeva, Kayse Reitmeyer, Eugene Tseytlin, Melissa Castine, Drazen Jukic, Claudia Mello-Thoms.
Abstract
The purpose of this study is threefold: (1) to develop an automated, computer-based method to detect heuristics and biases as pathologists examine virtual slide cases, (2) to measure the frequency and distribution of heuristics and errors across three levels of training, and (3) to examine relationships of heuristics to biases, and biases to diagnostic errors. The authors conducted the study using a computer-based system to view and diagnose virtual slide cases. The software recorded participant responses throughout the diagnostic process, and automatically classified participant actions based on definitions of eight common heuristics and/or biases. The authors measured frequency of heuristic use and bias across three levels of training. Biases studied were detected at varying frequencies, with availability and search satisficing observed most frequently. There were few significant differences by level of training. For representativeness and anchoring, the heuristic was used appropriately as often or more often than it was used in biased judgment. Approximately half of the diagnostic errors were associated with one or more biases. We conclude that heuristic use and biases were observed among physicians at all levels of training using the virtual slide system, although their frequencies varied. The system can be employed to detect heuristic use and to test methods for decreasing diagnostic errors resulting from cognitive biases.Entities:
Mesh:
Year: 2012 PMID: 22618855 PMCID: PMC3728442 DOI: 10.1007/s10459-012-9374-z
Source DB: PubMed Journal: Adv Health Sci Educ Theory Pract ISSN: 1382-4996 Impact factor: 3.853
Fig. 1Data collection system showing a virtual slide and diagnostic reasoning interface and b confidence measurement interface and case summary
Summary description of heuristics and biases
| Heuristic | Definition | Unit of analysis | Heuristic and/or bias | Detection algorithm |
|---|---|---|---|---|
| Anchoring | Locking on to salient evidence early in the diagnostic process leading to an initial diagnosis | Single case | Heuristic | Participant adds a hypothesis, then adds findings, and then adds a diagnosis supported by the new findings. The diagnosis may be (1) the same as the hypothesis ( |
| Bias | Participant adds a hypothesis, then adds findings, and then adds a diagnosis that is not supported by the new findings. Participant does not subsequently add hypothesis or diagnosis consistent with all new findings ( | |||
| Availability | The disposition to judge things as either more likely or as frequently occurring if they come to mind readily | Case sequence | Bias | In a sequence of three cases where the third case has a different diagnosis than the first two cases, the participant makes an incorrect diagnosis in the third case. The incorrect diagnosis is identical to the correct diagnosis in the two immediately preceding cases |
| Confirmation bias | The tendency to look for confirming evidence to support a diagnosis rather than looking for disconfirming evidence to refute it | Single case | Bias | Participants adds an incorrect diagnosis, and then adds findings that support this incorrect diagnosis |
| Gambler’s Fallacy | The belief that when deviations from expected results occur in repeated independent events, it increases the likelihood of deviations in the opposing direction, leading the clinician to reject a diagnosis because the entity has been observed more frequently than expected in recent cases | Case sequence | Bias | In a sequence of three cases where all three cases have the same diagnosis, the participant makes a diagnosis in the first case, the same diagnosis in the second case, but a different and incorrect diagnosis in the third case. In the first two cases, the participant’s diagnosis may be either correct or incorrect |
| Representativeness (Type 2) | The tendency to judge an entity against a mental model based on similarity to a prototype, leading the clinican to rigidly associate a feature with a single disease, based on a learned model | Case sequence | Heuristic | Participant adds a finding and then immediately (in the next action) adds a correct diagnosis. This sequence of actions must have been observed more than once during the session. The relationship between the finding and the diagnosis must have been included in the study period |
| Bias | Participant adds a finding and then immediately (in the next action) adds an incorrect diagnosis. This sequence of actions must have been observed more than once during the session. The relationship between the finding and the diagnosis must have been included in the study period | |||
| Search Satisficing | The tendency to call off a search once something has been found, leading to premature diagnostic closure | Case sequence | Bias | Participant adds a hypothesis and then adds a diagnosis without intervening findings. The diagnosis must be incorrect or the differential diagnosis must be incomplete. The case must be immediately closed. If any findings are added during the case, they must precede the addition of the hypotheses |
| Overconfidence and underconfidence | The belief in one’s own performance, with extremes representing opposite ends of a spectrum of feeling-of-knowing | Case sequence | Bias | Based on the comparison of participant self assessment to performance for all items. Bias is computed as result of the subtraction of total correct from total sure, divided by total items, and range from +1 (completely overconfident) to −1 (completely underconfident), with an optimum value of zero indicating perfect matching between their confidence and performance |
Fig. 2Sample user data showing examples of several heuristics
Task metrics (overall and by level of training)
| All | Level of training | |||||||
|---|---|---|---|---|---|---|---|---|
| Level 1 Y1 and Y2 residents (N = 22) | Level 2 Y3 and Y4 residents (N = 26) | Level 3 | ||||||
| Mean | SD | Mean | SD | Mean | SD | Mean | SD | |
| Total number of cases completed | 31.4 | 6.8 | 27.8 | 6.3 | 32.5 | 7.1 | 33.7 | 5.7 |
| Subepidermal vesicular (only) | 15.7 | 4.4 | 14.4 | 4.5 | 15.9 | 4.2 | 16.8 | 4.4 |
| Nodular and diffuse (only) | 15.7 | 4.7 | 13.4 | 5.0 | 16.5 | 4.5 | 16.9 | 4.2 |
Diagnostic and finding accuracy (overall and by level of training)
| All (N = 71) | Level of training | ANOVA | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Level 1 | Level 2 | Level 3 |
|
| ||||||
| Mean (%) | SD (%) | Mean (%) | SD (%) | Mean (%) | SD (%) | Mean (%) | SD (%) | |||
| Overall diagnostic accuracy | 37.0 | 10.4 | 38.6 | 9.9 | 34.2 | 10.0 | 38.7 | 10.9 | 1.56 | .22 |
| Subepidermal vesicular (only) | 36.0 | 14.3 | 36.8 | 13.0 | 35.6 | 14.0 | 35.8 | 16.4 | 0.04 | .96 |
| Nodular and diffuse (only) | 39.1 | 12.6 | 41.9 | 10.3 | 33.2 | 12.4 | 43.0 | 12.9 | 4.92 | .01 |
| Overall finding accuracy | 65.9 | 7.9 | 66.3 | 8.4 | 64.1 | 8.9 | 67.5 | 5.9 | 1.18 | .31 |
| Subepidermal vesicular (only) | 64.1 | 9.3 | 62.7 | 10.2 | 63.9 | 9.9 | 65.8 | 7.6 | 0.62 | .54 |
| Nodular and diffuse (only) | 68.0 | 11.7 | 70.6 | 10.5 | 64.4 | 14.3 | 69.5 | 8.8 | 1.90 | .16 |
Frequency of occurence of Availability, Confirmation, Gambler’s Fallacy and Satisficing
| All | Level of training | |||||||
|---|---|---|---|---|---|---|---|---|
| Level 1 Y1 and Y2 residents | Level 2 Y3 and Y4 residents | Level 3 | ||||||
| Median | SD | Median | SD | Median | SD | Median | SD | |
| Availability bias | ||||||||
| Number of cases | 1.0 | 1.00 | 1.0 | 0.85 | 1.0 | 1.26 | 1.0 | 0.83 |
| % of possible cases | 20.00 | 21.01 | 20.00 | 22.02 | 20.00 | 22.77 | 20.00 | 18.79 |
| Confirmation bias | ||||||||
| Number of cases | 0.0 | 0.43 | 0.0 | 0.22 | 0.0 | 0.35 | 0.0 | 0.64 |
| % of possible cases | 0.0 | 1.34 | 0.0 | 0.87 | 0.0 | 1.31 | 0.0 | 1.75 |
| Gambler’s Fallacy bias | ||||||||
| Number of cases | 0.0 | 0.94 | 0.0 | 0.79 | 1.0 | 0.85 | 1.0 | 0.80 |
| % of possible cases | 0.0 | 11.39 | 0.0 | 11.38 | 10.00 | 9.22 | 10.00 | 8.21 |
| Satisficing bias | ||||||||
| Number of cases | 6.0 | 8.04 | 3.50 | 5.09 | 9.50 | 9.63 | 10.0 | 6.32 |
| % of possible cases | 22.50 | 20.82 | 13.81 | 16.55 | 26.67 | 24.51 | 30.00 | 18.68 |
Cases classified as bias per participant (overall and by level of training)
Legend: For each user, the denominator used for percent of possible cases is the number of cases that the user completed in which the heuristic could occur. Therefore, the denominator differs across users
Frequency of occurence of Anchoring and Representativeness
| All | Level of training | |||||||
|---|---|---|---|---|---|---|---|---|
| Level 1 | Level 2 | Level 3 | ||||||
| Median | SD | Median | SD | Median | SD | Median | SD | |
| Anchoring heuristic | ||||||||
| Number of cases | 2.0 | 2.29 | 2.0 | 2.72 | 2.0 | 2.02 | 2.0 | 2.23 |
| % of possible cases | 5.26 | 9.64 | 5.71 | 12.15 | 5.06 | 9.35 | 5.13 | 7.27 |
| Anchoring bias | ||||||||
| Number of cases | 2.0 | 3.25 | 2.50 | 3.46 | 1.0 | 3.87 | 3.0 | 2.50 |
| % of possible cases | 5.88 | 12.24 | 8.84 | 15.19 | 4.00 | 13.71 | 7.50 | 7.51 |
| Representativeness heuristic | ||||||||
| Number of cases | 1.0 | 1.87 | 1.0 | 2.05 | 1.0 | 1.62 | 1.0 | 2.04 |
| % of possible cases | 3.45 | 6.32 | 4.65 | 7.08 | 3.33 | 5.12 | 3.45 | 6.75 |
| Representativeness bias | ||||||||
| Number of cases | 0.0 | 0.94 | 0.0 | 1.07 | 0.0 | 1.00 | 0.0 | 0.77 |
| % of possible cases | 0.0 | 3.29 | 0.00 | 3.85 | 0.00 | 3.48 | 0.00 | 2.57 |
Cases classified as heuristic versus bias per participant (overall and by level of training)
Legend: For each user, the denominator used for percent of possible cases is the number of cases that user completed in which the heuristic could occur. Therefore, the denominator differs across users
Heuristic and bias use by cases with correct and incorrect diagnoses
| All | Level of training | |||
|---|---|---|---|---|
| Level 1 | Level 2 | Level 3 | ||
| Percentage of cases with a final | 67.6 % (21.2) | 68.4 % (19.0) | 69.7 % (22.6) | 64.6 % (21.8) |
| Percentage of cases with a final | 51.2 % (10.9) | 43.5 % (8.3) | 57.1 % (12.9) | 50.5 % (11.0) |
| Percentage of cases with a final | 32.4 % (10.2) | 31.6 % (8.8) | 30.3 % (9.9) | 35.3 % (11.9) |
| Percentage of cases with a final | 21.0 % (2.1) | 24.4 % (2.1) | 19.5 % (1.9) | 20.1 % (2.4) |
| Frequency of bias occurrence when the diagnosis was incorrect (normalized for opportunity) | ||||
| Percentage of cases with diagnostic error where | 10.7 % (2.3) | 12.7 % (2.4) | 10.4 % (2.4) | 9.4 % (2.0) |
| Percentage of cases with diagnostic error where | 32.7 % (1.0) | 30.8 % (0.8) | 34.7 % (1.2) | 31.6 % (1.0) |
| Percentage of cases with diagnostic error where | 0.7 % (0.2) | 0.3 % (0.1) | 0.5 % (0.1) | 1.4 % (0.3) |
| Percentage of cases with diagnostic error where | 14.0 % (0.6) | 11.0 % (0.4) | 16.6 % (0.7) | 13.8 % (0.7) |
| Percentage of cases with diagnostic error where | 2.4 % (0.5) | 3.1 % (0.6) | 2.3 % (0.5) | 1.9 % (0.4) |
| Percentage of cases with diagnostic error where | 33.5 % (7.1) | 23.2 % (4.4) | 41.0 % (9.3) | 33.3 % (7.3) |
| Frequency of heuristic occurrence when the diagnosis was correct (normalized for opportunity) | ||||
| Percentage of correctly diagnosed cases where | 9.9 % (1.0) | 13.5 % (1.2) | 7.8 % (0.8) | 9.5 % (1.1) |
| Percentage of correctly diagnosed cases where | 12.1 % (1.1) | 11.6 % (1.0) | 12.7 % (1.2) | 11.8 % (1.3) |
Fig. 3Distribution of bias scores. Number of participants (by level of training) with average bias scores at 0.1 intervals. Bias scores on x axis reflect center of interval. The optimum point at zero is marked with a vertical line