OBJECTIVE: To collect evidence for the validity and reliability of an assessment tool for simulated subcuticular suturing. STUDY DESIGN: Three subjects were videotaped while closing a simulated incision in a plastic model. The 3 trials were viewed independently by 7 faculty examiners masked to subject identity. Global rating and task-specific scales were used to assess subject competence. The mean scores were compared among the 3 subjects and 7 evaluators using analysis of variance. RESULTS: Significant differences were found among the mean global rating scores for the 3 subjects but not among the evaluators. Similarly, significant differences were found between mean task-specific scale scores for the 3 subjects but not among the evaluators. Cronbach's alpha for global rating (0.89) and task-specific (0.93) scores suggested high internal consistency for each scale. CONCLUSIONS: These findings provide evidence for the discriminant validity, internal consistency, and inter-rater reliability of both the global rating and task-specific scales of our assessment tool.
OBJECTIVE: To collect evidence for the validity and reliability of an assessment tool for simulated subcuticular suturing. STUDY DESIGN: Three subjects were videotaped while closing a simulated incision in a plastic model. The 3 trials were viewed independently by 7 faculty examiners masked to subject identity. Global rating and task-specific scales were used to assess subject competence. The mean scores were compared among the 3 subjects and 7 evaluators using analysis of variance. RESULTS: Significant differences were found among the mean global rating scores for the 3 subjects but not among the evaluators. Similarly, significant differences were found between mean task-specific scale scores for the 3 subjects but not among the evaluators. Cronbach's alpha for global rating (0.89) and task-specific (0.93) scores suggested high internal consistency for each scale. CONCLUSIONS: These findings provide evidence for the discriminant validity, internal consistency, and inter-rater reliability of both the global rating and task-specific scales of our assessment tool.