| Literature DB >> 27293430 |
Richard R McNeer1, Roman Dudaryk2, Nicholas B Nedeff2, Christopher L Bennett3.
Abstract
Introduction. Medical simulators are used for assessing clinical skills and increasingly for testing hypotheses. We developed and tested an approach for assessing performance in anesthesia residents using screen-based simulation that ensures expert raters remain blinded to subject identity and experimental condition. Methods. Twenty anesthesia residents managed emergencies in an operating room simulator by logging actions through a custom graphical user interface. Two expert raters rated performance based on these entries using custom Global Rating Scale (GRS) and Crisis Management Checklist (CMC) instruments. Interrater reliability was measured by calculating intraclass correlation coefficients (ICC), and internal consistency of the instruments was assessed with Cronbach's alpha. Agreement between GRS and CMC was measured using Spearman rank correlation (SRC). Results. Interrater agreement (GRS: ICC = 0.825, CMC: ICC = 0.878) and internal consistency (GRS: alpha = 0.838, CMC: alpha = 0.886) were good for both instruments. Subscale analysis indicated that several instrument items can be discarded. GRS and CMC scores were highly correlated (SRC = 0.948). Conclusions. In this pilot study, we demonstrated that screen-based simulation can allow blinded assessment of performance. GRS and CMC instruments demonstrated good rater agreement and internal consistency. We plan to further test construct validity of our instruments by measuring performance in our simulator as a function of training level.Entities:
Year: 2016 PMID: 27293430 PMCID: PMC4879220 DOI: 10.1155/2016/9348478
Source DB: PubMed Journal: Anesthesiol Res Pract ISSN: 1687-6962
Figure 1Graphical user interface (GUI) used in simulation experiments. Parameters were updated at one-second intervals based on values read from an XLS file. The GUI featured a responsive pulse oximeter auditory display and IEC alarms that would annunciate when parameter alarm thresholds were transcended. Subjects entered answers to distractor questions and responses to state changes in the text entry box.
Figure 2Screenshot of a portion of the “symptomatic bradycardia” XLS script. Note cell AA752 which shows the first time the heart rate drops below 60 bpm and surpasses an alarm threshold. Cell AE752 programmatically changes to a value of 1 which instructs the GUI to annunciate the appropriate IEC alarm, in this case the medium priority cardiac alarm “cardmed.”
Figure 3Plot showing the changes to relevant parameters in the “hypovolemia” scenario. Near the beginning, heart rate gradually increases over 5 minutes but does not surpass the alarm threshold. Later in the scenario, a low blood pressure is measured and the appropriate alarm sound enunciated. All parameters normalize and revert back to baseline levels before the end of the scenario.
Figure 4Stem plot showing the responses entered by a subject into the GUI during the “hypovolemia” scenario. Note that the y-axis scale does not have any informative value. The diamond and square red markers represent the times when state changes and alarm annunciation occur in the script. The filled circles are color coded based on the legend and show the relative times and text responses entered by subjects. The red triangles represent the average times subjects took to detect, diagnose, and treat scripted problems.
Internal consistency of Global Rating Scale considering all emergency scenarios.
| Item | Corrected item-total correlation† | Cronbach's |
|---|---|---|
| Overall performance | 0.909 | 0.726 |
| State change detection | 0.663 | 0.800 |
| Situational awareness | 0.828 | 0.747 |
| Therapy/resource utilization | 0.794 | 0.760 |
| Subject perception of crisis resolution | 0.117 | 0.930 |
#Cronbach's α = 0.838.
†Analysis performed on average of rater responses.
Internal consistency of Crisis Management Checklist considering all emergency scenarios.
| Item | Corrected item-total correlation† | Cronbach's |
|---|---|---|
| State change detection | ||
| Timely/prompt detection | 0.457 | 0.885 |
| Complete detection | 0.545 | 0.879 |
| Missed detection | 0.533 | 0.881 |
| Situational awareness | ||
| Complete/correct differential | 0.800 | 0.862 |
| Prioritized differential list | 0.774 | 0.864 |
| Reassesses situation | 0.759 | 0.867 |
| One or more incorrect diagnoses | 0.183 | 0.894 |
| Therapy/resource utilization | ||
| Timely therapy | 0.696 | 0.870 |
| Prioritized actions | 0.771 | 0.864 |
| Appropriate therapy/action | 0.772 | 0.864 |
| One or more inappropriate actions | 0.177 | 0.893 |
#Cronbach's α = 0.886.
†Analysis performed on average of rater responses.
Interrater agreement for the Global Rating Scale.
| Item | Circuit disconnect | Bradycardia | Endobronchial intubation | Hypovolemia | Pulmonary embolism | Light anesthesia | All scenarios |
|---|---|---|---|---|---|---|---|
| Overall performance | 0.756 | 0.900 | 0.772 | 0.789 | 0.908 | 0.844 | 0.804 |
| State change detection | 0.796 | 0.907 | 0.883 | 0.888 | 0.828 | 0.537 | 0.819 |
| Situational awareness | 0.889 | 0.848 | 0.872 | 0.673 | 0.964 | 0.835 | 0.866 |
| Therapy/resource utilization | 0.756 | 0.798 | 0.739 | 0.825 | 0.805 | 0.867 | 0.787 |
| Subject perception of crisis resolution | 0.683 | 0.633 | 0.740 | 0.529 | 0.414 | 0.538 | 0.624 |
| Total# | 0.798/0.852 | 0.899/0.906 | 0.812/0.860 | 0.849/0.838 | 0.892/0.929 | 0.760/0.826 | 0.825/0.856 |
Interrater agreement assessed by calculating the intraclass correlation coefficient (ICC) using a two-way mixed effects for consistency between two expert rater responses.
Not significant.
#The second ICC does not include the “subject perception of crisis resolution” item.
Interrater agreement for the Crisis Management Checklist.
| Item | Circuit disconnect | Bradycardia | Endobronchial intubation | Hypovolemia | Pulmonary embolism | Light anesthesia | All scenarios |
|---|---|---|---|---|---|---|---|
| State change detection∧ | 0.699/0.732 | 0.521 | 0.807/0.839 | 0.856/0.817 | 0.495 | 0.121 | 0.639/0.674 |
| Timely/prompt detection | 0.730 | 0.463 | 0.791 | 0.733 | 0.333 | 0.301 | 0.593 |
| Complete detection | 0.640 | 0.838 | 0.729 | 0.618 | 0.506 | 0.649 | 0.655 |
| Missed detection | 0.248 | † | 0.518 | 0.000 | 0.487 | (−)0.366 | 0.088 |
| Situational awareness% | 0.835/0.838 | 0.946/0.952 | 0.834/0.856 | 0.773/0.773 | 0.907/0.937 | 0.798/0.803 | 0.844/0.852 |
| Complete/correct differential | 0.753 | 0.885 | 0.820 | 0.724 | 0.889 | 0.790 | 0.821 |
| Prioritized differential list | 0.710 | 0.913 | 0.790 | 0.825 | 0.857 | 0.794 | 0.807 |
| Reassesses situation | 0.733 | 0.387 | 0.739 | 0.533 | 0.647 | 0.708 | 0.620 |
| One or more incorrect diagnoses | 0.910 | 0.654 | 0.158 | † | 0.627 | 0.557 | 0.565 |
| Therapy/resource utilization& | 0.917/0.917 | 0.886/0.886 | 0.705/0.711 | 0.888/0.888 | 0.763/0.784 | 0.934/0.934 | 0.842/0.852 |
| Timely therapy | 0.945 | 0.647 | 0.594 | 0.795 | 0.798 | 0.871 | 0.793 |
| Prioritized actions | 0.857 | 0.681 | 0.570 | 0.914 | 0.733 | 0.840 | 0.780 |
| Appropriate therapy/action | 0.770 | 0.432 | 0.609 | 0.681 | 0.515 | 0.851 | 0.658 |
| One or more inappropriate actions | † | † | 0.000 | † | 0.487 | † | 0.314 |
| Total# | 0.903/0.898 | 0.954/0.965 | 0.871/0.860 | 0.917/0.931 | 0.870/0.908 | 0.850/0.881 | 0.878/0.890 |
Interrater agreement assessed by calculating the intraclass correlation coefficient (ICC) using a two-way mixed effects for consistency between two expert rater responses.
Not significant.
†Item responses were all zero.
∧The second ICC does not include the “missed detection” item.
%The second ICC does not include the “one or more incorrect diagnoses” item.
&The second ICC does not include the “one or more inappropriate actions” item.
#The second ICC does not include the previously excluded items.
Figure 5Overall subject performance assessment scores from the Global Rating Scale (a) and Crisis Management Checklist (b). Individual rater and average ratings are shown. The bars depict standard deviations.
Figure 6Subject performance assessment scores from the Global Rating Scale (a) and Crisis Management Checklist (b) based on emergency scenario. Individual rater and average ratings are shown. The bars depict standard deviations.
Estimation of effect sizes.
| Mean 1 | Mean 2 | Difference | % difference∧ | Cohen's | |
|---|---|---|---|---|---|
| GRS | |||||
| Median# | 15.1 (2.8) | 20.2 (1.9) | 5.1 | 25.2 | 1.5 |
| Quartile | 17.2 | 18.6 | 1.4 | 7.4 | 0.4 |
| Scenario& | 15.8 | 19.8 | 4.0 | 25.2 | 0.6 |
| CMC | |||||
| Median# | 10.0 (2.2) | 13.0 (0.9) | 3.0 | 23.3 | 1.3 |
| Quartile | 11.6 | 12.3 | 0.6 | 5.0 | 0.3 |
| Scenario& | 10.2 | 12.5 | 2.4 | 23.1 | 0.6 |
#Mean 1 and Mean 2 are average of scores from lower and upper 50th percentiles, respectively.
Mean 1 and Mean 2 are average of scores from 2nd and 3rd quartiles, respectively.
&Mean 1 and Mean 2 are average of scores from all subjects for “endobronchial intubation” and “unstable bradycardia” emergency scenarios, respectively.
∧Calculated as difference divided by average of means multiplied by 100.
!Calculated as difference divided by pooled standard deviation from all 20 subject scores.