Cornelis J Hopmans1, Pieter T den Hoed2, Lijckle van der Laan3, Erwin van der Harst4, Maarten van der Elst5, Guido H H Mannaerts6, Imro Dawson7, Reinier Timman8, Bas P L Wijnhoven8, Jan N M IJzermans8. 1. Department of Surgery, Erasmus MC University Medical Center, Rotterdam, The Netherlands. Electronic address: c.hopmans@erasmusmc.nl. 2. Department of Surgery, Ikazia Hospital, Rotterdam, The Netherlands. 3. Department of Surgery, Amphia Hospital, Breda, The Netherlands. 4. Department of Surgery, Maasstad Hospital, Rotterdam, The Netherlands. 5. Department of Surgery, Reinier de Graaf Groep, Delft, The Netherlands. 6. Department of Surgery, Sint Franciscus Gasthuis, Rotterdam, The Netherlands. 7. Department of Surgery, IJsselland Hospital, Capelle aan den IJssel, The Netherlands. 8. Department of Surgery, Erasmus MC University Medical Center, Rotterdam, The Netherlands.
Abstract
BACKGROUND: With the implementation of competency-based curricula, Objective Structured Assessment of Technical Skills (OSATS) increasingly is being used for the assessment of operative skills. Although evidence for its usefulness has been demonstrated in experimental study designs, data supporting OSATS application in the operating room are limited. This study evaluates the validity and reliability of the OSATS instrument to assess the operative skills of surgery residents in the operating theater. METHODS: Twenty-four residents were recruited from seven hospitals within a general surgical training region and classified equally into three groups according to postgraduate training year (PGY). Each resident had to perform five different types of operations. Surgical performance was measured using a modified OSATS consisting of three scales: Global Rating Scale, Overall Performance Scale, and Alphabetic Summary Scale. Validity and reliability metrics included construct validity (Kruskal-Wallis test) and internal consistency reliability (Cronbach's α coefficient). Spearman's correlation coefficients were calculated to determine correlations between the different scales. RESULTS: Eighteen residents (PGY 1-2 [n = 7]; PGY 3-4 [n = 8]; PGY 5-6 [n = 3]) performed 249 operations. Comparisons of the performance scores revealed that evidence for construct validity depended on the difficulty level of the selected procedures. For individual operations, internal consistency reliability of the Global Rating Scale ranged from 0.93 to 0.95. Scores on the different scales correlated strongly (r = 0.62-0.76, P < .001). CONCLUSION: Assessment of operative skills in the operating theater using this modified OSATS instrument has the potential to establish learning curves, allowing adequate monitoring of residents' progress in achieving operative competence. The Alphabetic Summary Scale seems to be of additional value. Use of the Overall Performance Scale should be reconsidered.
BACKGROUND: With the implementation of competency-based curricula, Objective Structured Assessment of Technical Skills (OSATS) increasingly is being used for the assessment of operative skills. Although evidence for its usefulness has been demonstrated in experimental study designs, data supporting OSATS application in the operating room are limited. This study evaluates the validity and reliability of the OSATS instrument to assess the operative skills of surgery residents in the operating theater. METHODS: Twenty-four residents were recruited from seven hospitals within a general surgical training region and classified equally into three groups according to postgraduate training year (PGY). Each resident had to perform five different types of operations. Surgical performance was measured using a modified OSATS consisting of three scales: Global Rating Scale, Overall Performance Scale, and Alphabetic Summary Scale. Validity and reliability metrics included construct validity (Kruskal-Wallis test) and internal consistency reliability (Cronbach's α coefficient). Spearman's correlation coefficients were calculated to determine correlations between the different scales. RESULTS: Eighteen residents (PGY 1-2 [n = 7]; PGY 3-4 [n = 8]; PGY 5-6 [n = 3]) performed 249 operations. Comparisons of the performance scores revealed that evidence for construct validity depended on the difficulty level of the selected procedures. For individual operations, internal consistency reliability of the Global Rating Scale ranged from 0.93 to 0.95. Scores on the different scales correlated strongly (r = 0.62-0.76, P < .001). CONCLUSION: Assessment of operative skills in the operating theater using this modified OSATS instrument has the potential to establish learning curves, allowing adequate monitoring of residents' progress in achieving operative competence. The Alphabetic Summary Scale seems to be of additional value. Use of the Overall Performance Scale should be reconsidered.
Authors: Leah K Taylor; Geb W Thomas; Matthew D Karam; Clarence D Kreiter; Donald D Anderson Journal: J Surg Educ Date: 2016-05-12 Impact factor: 2.891