Alison Walzak1, Maria Bacchus, Jeffrey P Schaefer, Kelly Zarnke, Jennifer Glow, Charlene Brass, Kevin McLaughlin, Irene W Y Ma. 1. A. Walzak is clinical instructor, Department of Medicine, University of British Columbia, Victoria, British Columbia, Canada. M. Bacchus is associate professor, Department of Medicine, University of Calgary, Calgary, Alberta, Canada. J.P. Schaefer is clinical professor, Department of Medicine, University of Calgary, Calgary, Alberta, Canada. K. Zarnke is associate professor, Department of Medicine, University of Calgary, Calgary, Alberta, Canada. J. Glow is internal medicine residency program administrator, University of Calgary, Calgary, Alberta, Canada. C. Brass is internal medicine residency program assistant, University of Calgary, Calgary, Alberta, Canada. K. McLaughlin is associate professor, Department of Medicine, University of Calgary, Calgary, Alberta, Canada. I.W.Y. Ma is associate professor, Department of Medicine, University of Calgary, Calgary, Alberta, Canada.
Abstract
PURPOSE: To compare procedure-specific checklists and a global rating scale in assessing technical competence. METHOD: Two trained raters used procedure-specific checklists and a global rating scale to independently evaluate 218 video-recorded performances of six bedside procedures of varying complexity for technical competence. The procedures were completed by 47 residents participating in a formative simulation-based objective structured clinical examination at the University of Calgary in 2011. Pass/fail (competent/not competent) decisions were based on an overall global assessment item on the global rating scale. Raters provided written comments on performances they deemed not competent. Checklist minimum passing levels were set using traditional standard-setting methods. RESULTS: For each procedure, the global rating scale demonstrated higher internal reliability and lower interrater reliability than the checklist. However, interrater reliability was almost perfect for decisions on competence using the overall global assessment (Kappa range: 0.84-1.00). Clinically significant procedural errors were most often cited as reasons for ratings of not competent. Using checklist scores to diagnose competence demonstrated acceptable discrimination: The area under the curve ranged from 0.84 (95% CI 0.72-0.97) to 0.93 (95% CI 0.82-1.00). Checklist minimum passing levels demonstrated high sensitivity but low specificity for diagnosing competence. CONCLUSIONS: Assessment using a global rating scale may be superior to assessment using a checklist for evaluation of technical competence. Traditional standard-setting methods may establish checklist cut scores with too-low specificity: High checklist scores did not rule out incompetence. The role of clinically significant errors in determining procedural competence should be further evaluated.
PURPOSE: To compare procedure-specific checklists and a global rating scale in assessing technical competence. METHOD: Two trained raters used procedure-specific checklists and a global rating scale to independently evaluate 218 video-recorded performances of six bedside procedures of varying complexity for technical competence. The procedures were completed by 47 residents participating in a formative simulation-based objective structured clinical examination at the University of Calgary in 2011. Pass/fail (competent/not competent) decisions were based on an overall global assessment item on the global rating scale. Raters provided written comments on performances they deemed not competent. Checklist minimum passing levels were set using traditional standard-setting methods. RESULTS: For each procedure, the global rating scale demonstrated higher internal reliability and lower interrater reliability than the checklist. However, interrater reliability was almost perfect for decisions on competence using the overall global assessment (Kappa range: 0.84-1.00). Clinically significant procedural errors were most often cited as reasons for ratings of not competent. Using checklist scores to diagnose competence demonstrated acceptable discrimination: The area under the curve ranged from 0.84 (95% CI 0.72-0.97) to 0.93 (95% CI 0.82-1.00). Checklist minimum passing levels demonstrated high sensitivity but low specificity for diagnosing competence. CONCLUSIONS: Assessment using a global rating scale may be superior to assessment using a checklist for evaluation of technical competence. Traditional standard-setting methods may establish checklist cut scores with too-low specificity: High checklist scores did not rule out incompetence. The role of clinically significant errors in determining procedural competence should be further evaluated.
Authors: Hong-En Chen; Mary A Yovanoff; David F Pepley; Rohan S Prabhu; Cheyenne C Sonntag; David C Han; Jason Z Moore; Scarlett R Miller Journal: J Surg Res Date: 2018-09-08 Impact factor: 2.192
Authors: Sachin Wani; Rajesh Keswani; Matt Hall; Samuel Han; Meer Akbar Ali; Brian Brauer; Linda Carlin; Amitabh Chak; Dan Collins; Gregory A Cote; David L Diehl; Christopher J DiMaio; Andrew Dries; Ihab El-Hajj; Swan Ellert; Kimberley Fairley; Ashley Faulx; Larissa Fujii-Lau; Srinivas Gaddam; Seng-Ian Gan; Jonathan P Gaspar; Chitiki Gautamy; Stuart Gordon; Cynthia Harris; Sarah Hyder; Ross Jones; Stephen Kim; Srinadh Komanduri; Ryan Law; Linda Lee; Rawad Mounzer; Daniel Mullady; V Raman Muthusamy; Mojtaba Olyaee; Patrick Pfau; Shreyas Saligram; Cyrus Piraka; Amit Rastogi; Laura Rosenkranz; Fadi Rzouq; Aditi Saxena; Raj J Shah; Violette C Simon; Aaron Small; Jayaprakash Sreenarasimhaiah; Andrew Walker; Andrew Y Wang; Rabindra R Watson; Robert H Wilson; Patrick Yachimski; Dennis Yang; Steven Edmundowicz; Dayna S Early Journal: Clin Gastroenterol Hepatol Date: 2017-06-16 Impact factor: 11.382
Authors: Yasser A Noureldin; David M Hoenig; Philip Zhao; Sammy E Elsamra; Joshua Stern; Geoffrey Gaunay; Piruz Motamedinia; Zeph Okeke; Ardeshir R Rastinehad; Robert M Sweet Journal: World J Urol Date: 2018-02-17 Impact factor: 4.226