INTRODUCTION: Defining valid, reliable, defensible, and generalizable standards for the evaluation of learner performance is a key issue in assessing both baseline competence and mastery in medical education. However, before setting these standards of performance, the reliability of the scores yielding from a grading tool must be assessed. Accordingly, the purpose of this study was to assess the reliability of scores generated from a set of grading checklists used by nonexpert raters during simulations of American Heart Association (AHA) Megacodes. METHODS: The reliability of scores generated from a detailed set of checklists, when used by 4 nonexpert raters, was tested by grading team leader performance in 8 Megacode scenarios. Videos of the scenarios were reviewed and rated by trained faculty facilitators and a group of nonexpert raters. The videos were reviewed "continuously" and "with pauses." The grading made by 2 content experts served as the reference standard, and 4 nonexpert raters were used to test the reliability of the checklists. RESULTS: Our results demonstrate that nonexpert raters are able to produce reliable grades when using the checklists under consideration, demonstrating excellent intrarater reliability and agreement with a reference standard. The results also demonstrate that nonexpert raters can be trained in the proper use of the checklist in a short amount of time, with no discernible learning curve thereafter. Finally, our results show that a single trained rater can achieve reliable scores of team leader performance during AHA Megacodes when using our checklist in a continuous mode because measures of agreement in total scoring were very strong [Lin's (Biometrics 1989;45:255-268) concordance correlation coefficient, 0.96; intraclass correlation coefficient, 0.97]. CONCLUSIONS: We have shown that our checklists can yield reliable scores, are appropriate for use by nonexpert raters, and are able to be used during continuous assessment of team leader performance during the review of a simulated Megacode. This checklist may be more appropriate for use by advanced cardiac life support instructors during Megacode assessments than the current tools provided by the AHA.
INTRODUCTION: Defining valid, reliable, defensible, and generalizable standards for the evaluation of learner performance is a key issue in assessing both baseline competence and mastery in medical education. However, before setting these standards of performance, the reliability of the scores yielding from a grading tool must be assessed. Accordingly, the purpose of this study was to assess the reliability of scores generated from a set of grading checklists used by nonexpert raters during simulations of American Heart Association (AHA) Megacodes. METHODS: The reliability of scores generated from a detailed set of checklists, when used by 4 nonexpert raters, was tested by grading team leader performance in 8 Megacode scenarios. Videos of the scenarios were reviewed and rated by trained faculty facilitators and a group of nonexpert raters. The videos were reviewed "continuously" and "with pauses." The grading made by 2 content experts served as the reference standard, and 4 nonexpert raters were used to test the reliability of the checklists. RESULTS: Our results demonstrate that nonexpert raters are able to produce reliable grades when using the checklists under consideration, demonstrating excellent intrarater reliability and agreement with a reference standard. The results also demonstrate that nonexpert raters can be trained in the proper use of the checklist in a short amount of time, with no discernible learning curve thereafter. Finally, our results show that a single trained rater can achieve reliable scores of team leader performance during AHA Megacodes when using our checklist in a continuous mode because measures of agreement in total scoring were very strong [Lin's (Biometrics 1989;45:255-268) concordance correlation coefficient, 0.96; intraclass correlation coefficient, 0.97]. CONCLUSIONS: We have shown that our checklists can yield reliable scores, are appropriate for use by nonexpert raters, and are able to be used during continuous assessment of team leader performance during the review of a simulated Megacode. This checklist may be more appropriate for use by advanced cardiac life support instructors during Megacode assessments than the current tools provided by the AHA.
Authors: Nick Sevdalis; Melinda Lyons; Andrew N Healey; Shabnam Undre; Ara Darzi; Charles A Vincent Journal: Ann Surg Date: 2009-06 Impact factor: 12.969
Authors: Brenda S Bray; Megan N Willson; Jennifer D Robinson; Gregory T Matsuura; Catrina R Schwartz; Douglas L Weeks Journal: Am J Pharm Educ Date: 2013-09-12 Impact factor: 2.047
Authors: Matthew D McEvoy; William R Hand; W David Stoll; Cory M Furse; Paul J Nietert Journal: Reg Anesth Pain Med Date: 2014 Jul-Aug Impact factor: 6.288
Authors: Larry C Field; Matthew D McEvoy; Jeremy C Smalley; Carlee A Clark; Michael B McEvoy; Horst Rieke; Paul J Nietert; Cory M Furse Journal: Resuscitation Date: 2013-09-19 Impact factor: 5.262
Authors: Matthew D McEvoy; William R Hand; Cory M Furse; Larry C Field; Carlee A Clark; Vivek K Moitra; Paul J Nietert; Michael F O'Connor; Mark E Nunnally Journal: Simul Healthc Date: 2014-10 Impact factor: 1.929
Authors: Scott C Watkins; Paul J Nietert; Elisabeth Hughes; Eric T Stickles; Tracy E Wester; Matthew D McEvoy Journal: Am J Med Sci Date: 2016-10-03 Impact factor: 2.378
Authors: Matthew D McEvoy; Larry C Field; Haley E Moore; Jeremy C Smalley; Paul J Nietert; Sheila H Scarbrough Journal: Resuscitation Date: 2013-10-05 Impact factor: 5.262
Authors: Ithan D Peltan; David Guidry; Katie Brown; Naresh Kumar; William Beninati; Samuel M Brown Journal: Chest Date: 2022-01-19 Impact factor: 10.262
Authors: Matthew D McEvoy; Deborah J Dewaay; Allison Vanderbilt; Louise A Alexander; Marna C Stilley; Maura C Hege; Donna H Kern Journal: Acad Med Date: 2014-04 Impact factor: 6.893