| Literature DB >> 27707897 |
Alan Boyd1, Rachael Addicott2, Ruth Robertson3, Shilpa Ross4, Kieran Walshe5.
Abstract
The credibility of a regulator could be threatened if stakeholders perceive that assessments of performance made by its inspectors are unreliable. Yet there is little published research on the reliability of inspectors' assessments of health care organizations' services. Objectives We investigated the inter-rater reliability of assessments made by inspectors inspecting acute hospitals in England during the piloting of a new regulatory model implemented by the Care Quality Commission (CQC) during 2013 and 2014. Multi-professional teams of inspectors rated service provision on a four-point scale for each of five domains: safety; effectiveness; caring; responsiveness; and leadership. Methods In an online survey, we asked individual inspectors to assign a domain and a rating to each of 10 vignettes of service information extracted from CQC inspection reports. We used these data to simulate the ratings that might be produced by teams of inspectors. We also observed inspection teams in action, and interviewed inspectors and staff from hospitals that had been inspected. Results Levels of agreement varied substantially from vignette to vignette. Characteristics such as professional background explained only a very small part of the variation. Overall, agreement was higher on ratings than on domains, and for groups of inspectors compared with individual inspectors. A number of potential causes of disagreement were identified, such as differences regarding the weight that should be given to contextual factors and general uncertainty about interpreting the rating and domain categories. Conclusion Groups of inspectors produced more reliable assessments than individual inspectors, and there is evidence to support the utility of appropriate discussions between inspectors in improving reliability. The reliability of domain allocations was lower than for ratings. It is important to define categories and rating levels clearly, and to train inspectors in their use. Further research is needed to replicate these results now that the model has been fully implemented, and to understand better the impact that inspector uncertainty and disagreement may have on published CQC ratings.Entities:
Keywords: Care Quality Commission; hospitals; inspections; regulation; reliability
Mesh:
Year: 2016 PMID: 27707897 PMCID: PMC5207299 DOI: 10.1177/1355819616669736
Source DB: PubMed Journal: J Health Serv Res Policy ISSN: 1355-8196
The new regulatory model for assessing NHS acute hospitals in England.
| Domains | Rating categories | Service areas | Organization of inspections |
|---|---|---|---|
| • Safety • Effectiveness • Caring • Responsiveness • Leadership | • Inadequate • Requires improvement • Good • Outstanding | • Children and young people • Maternity and gynaecology • Urgent and emergency services • Outpatients and diagnostic imaging • Surgery • Medical care, including older people’s care • Critical care • End of life care | • Large team • Sub-teams of 3–5 inspectors rate performance for each service area with regard to each domain. Sub-team membership: ^ Led by an experienced inspector employed by CQC ^ A doctor, a nurse, and a manager with experience of the area ^ Patient advocate, trainee doctor or nurse in some sub-teams • Typically 1–2 days inspection per hospital site. Announced in advance • Investigate pertinent issues (‘key lines of enquiry’), drawing on a generic list and statistics provided by CQC • Twice daily ‘corroboration’ discussion of likely ratings – within the sub-team and across the whole team • Optional unannounced follow-up visit to gather further data |
NHS: National Health Service.
Individual Inspector agreement on domain allocation and rating for the vignettes – Krippendorff’s alpha (Ka) and percentage agreement (PA).
| Vignette | Description | Domain | Rating | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| S | E | C | R | W | PA | I | RI | G | O | PA |
| |||
| V1 | Interpreting services are easily accessible | 5.1% | 15.4% | 8.2% | 70.9% | 0.3% | 53% | 0.41 | 1.0% | 3.7% | 92.9% | 2.4% | 87% | – |
| V2 | Complementary therapies are available to patients nearing the end of life to aid relaxation and symptom control | 0.7% | 11.6% | 73.3% | 14.0% | 0.3% | 57% | 0.45 | 0.7% | 2.4% | 59.4% | 37.5% | 49% | – |
| V3 | Staff left ampules of medicines in labour rooms instead of locking them away | 97.6% | 1.4% | 0.7% | 0.3% | 0.0% | 95% | 0.94 | 63.7% | 35.3% | 1.0% | 0.0% | 53% | – |
| V4 | Managers are developing a plan to address bullying following concerns reported in the national annual staff survey | 2.4% | 1.4% | 0.3% | 18.2% | 77.7% | 64% | 0.54 | 1.0% | 50.7% | 47.6% | 0.7% | 48% | – |
| V5 | The children's community nursing team cannot access local authority systems to check for safeguarding issues on discharge | 69.5% | 19.2% | 0.3% | 8.9% | 3.1% | 53% | 0.40 | 53.6% | 45.1% | 1.0% | 0.3% | 49% | – |
| V6 | Nurses undertake hourly rounds | 26.4% | 25.6% | 26.0% | 8.9% | 3.1% | 27% | 0.07 | 0.0% | 3.4% | 87.4% | 9.2% | 77% | – |
| V7 | New medication was researched so that a patient with a very complex condition could return home to die as they preferred | 1.0% | 16.1% | 30.5% | 51.7% | 0.7% | 38% | 0.22 | 0.3% | 1.4% | 37.3% | 61.0% | 51% | – |
| V8 | 40% of staff are not up to date with their mandatory training | 42.5% | 17.5% | 0.0% | 2.4% | 37.7% | 35% | 0.18 | 45.2% | 54.4% | 0.3% | 0.0% | 50% | – |
| V9 | Systems ensure that medical patients remain under the care of the medical team when moved to another ward | 26.7% | 49.3% | 2.4% | 14.0% | 7.5% | 34% | 0.16 | 0.0% | 5.5% | 91.4% | 3.1% | 84% | – |
| V10 | Frail elderly patients with complex needs are given additional guidance and rehabilitation to prepare for surgery | 3.1% | 34.2% | 26.7% | 34.9% | 1.0% | 31% | 0.12 | 0.3% | 1.7% | 75.5% | 22.4% | 63% | – |
| Overall agree ment | 49% | 0.35 | 61% | 0.79 | ||||||||||
| LL95%CI | 38% | 0.17 | 53% | 0.41 | ||||||||||
| UL95%CI | 62% | 0.53 | 71% | 0.90 | ||||||||||
Key for column headings: domains: S, safe; E, effective; C, caring; R, responsive; W, well-led.
Ratings: I, Inadequate; RI, Requires Improvement; G, Good; O, Outstanding.
Factors affecting levels of agreement on rating and domain allocations: statistically significant variables in the logistic regressions.
| Vignette | Potential sources/indicators of agreement/disagreement | |
|---|---|---|
| Rating level | Domain allocation | |
| V1 | • Little variation/High agreement; no significant variables | • Little variation/High agreement; no significant variables |
| V2 | • Ratings given to other vignettes • Experience of inspections • Seniority: rated Outstanding by 72% of junior-level inspectors, compared with 36% of more senior colleagues • Content knowledge: rated Outstanding by 54% of End of life care sub-team members, compared with 35% of members of other sub-teams | • Profession: Fewer CQC staff (65%) and patients/experts by experience (65%) allocated Caring, compared with doctors (89%) and nurses/allied health professionals (73%) |
| V3 | • Ratings given to other vignettes • Experience of inspections • Profession: rated Inadequate by 97% of Experts by Experience, compared with 60% of other inspectors | • Little variation/High agreement; no significant variables |
| V4 | • Ratings given to other vignettes | • No significant variables |
| V5 | • Experience of inspections • Seniority: rated Inadequate by 73% of junior level inspectors and Experts by Experience, compared with 50% of other inspectors • Profession: rated Requires Improvement by 60% of CQC inspectors, compared with 39% of other inspectors | • No significant variables |
| V6 | • Profession: rated Outstanding by 30% of doctors, compared with 3% of other inspectors | • Perceived accuracy of ratings • Ratings given across all vignettes |
| V7 | • Ratings given to other vignettes • Profession: rated Outstanding by 74% of nurses, compared with 57% of other inspectors | • No significant variables |
| V8 | • No significant variables | • Profession: More CQC staff (28%) allocated Effectiveness, compared with nurses/allied health professionals (9%), doctors (11%) and patients/experts by experience (15%) |
| V9 | • Little variation/High agreement; no significant variables | • Profession: Safety was the most popular domain allocation among doctors (48%); Effectiveness the most popular among patients/experts by experience (65%), CQC staff (58%) and nurses/allied health professionals (49%) |
| V10 | • Ratings given to other vignettes | • Experience of inspections: Allocated Responsiveness by 28% of inexperienced inspectors, rising to 63% for the most experienced inspectors |