Anthony D Roth1, Pamela Myles-Hooton2, Amanda Branson2. 1. Research Department of Clinical, Educational and Health Psychology, University College London, London, UK. 2. Charlie Waller Institute, University of Reading, Reading, UK.
Abstract
BACKGROUND: One method for appraising the competence with which psychological therapy is delivered is to use a structured assessment tool that rates audio or video recordings of therapist performance against a standard set of criteria. AIMS: The present study examines the inter-rater reliability of a well-established instrument (the Cognitive Therapy Scale - Revised) and a newly developed scale for assessing competence in CBT. METHOD: Six experienced raters working independently and blind to each other's ratings rated 25 video recordings of therapy being undertaken by CBT therapists in training. RESULTS: Inter-rater reliability was found to be low on both instruments. CONCLUSIONS: It is argued that the results represent a realistic appraisal of the accuracy of rating scales, and that the figures often cited for inter-rater reliability are unlikely to be generalizable outside the specific context in which they were achieved. The findings raise concerns about the use of these scales for making summative judgements of clinical competence in both educational and research contexts.
BACKGROUND: One method for appraising the competence with which psychological therapy is delivered is to use a structured assessment tool that rates audio or video recordings of therapist performance against a standard set of criteria. AIMS: The present study examines the inter-rater reliability of a well-established instrument (the Cognitive Therapy Scale - Revised) and a newly developed scale for assessing competence in CBT. METHOD: Six experienced raters working independently and blind to each other's ratings rated 25 video recordings of therapy being undertaken by CBT therapists in training. RESULTS: Inter-rater reliability was found to be low on both instruments. CONCLUSIONS: It is argued that the results represent a realistic appraisal of the accuracy of rating scales, and that the figures often cited for inter-rater reliability are unlikely to be generalizable outside the specific context in which they were achieved. The findings raise concerns about the use of these scales for making summative judgements of clinical competence in both educational and research contexts.
Entities:
Keywords:
assessment; clinical supervision; therapist competence; training