| Literature DB >> 27550650 |
Mirelle Hanskamp-Sebregts1, Marieke Zegers2, Charles Vincent3, Petra J van Gurp1, Henrica C W de Vet4, Hub Wollersheim2.
Abstract
OBJECTIVES: Record review is the most used method to quantify patient safety. We systematically reviewed the reliability and validity of adverse event detection with record review.Entities:
Mesh:
Year: 2016 PMID: 27550650 PMCID: PMC5013509 DOI: 10.1136/bmjopen-2016-011078
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Definitions of reliability and validity in the context of record review
| Terms | Definition (expressed by) | Comments relevant to record review |
|---|---|---|
| Inter-rater reliability | Measures consensus in the scores when different raters using the same measurement instrument in the same group of patients. Mostly expressed as a reliability measure (κ), or % agreement | Two independent reviewers assess patient records without discussion between the reviewers during the review process |
| Face validity | The degree to which the content of an instrument is an adequate reflection of the construct to be measured (descriptive, expert opinion) | |
| Concurrent validity | The extent to which scores on a new measure are related to scores from a criterion measure administered at the same time (Se, Sp, PPV and NPV) | Clinical data registries, autopsy or direct observations of patient care have the potential to be a criterion measure for record review |
NPV, negative predictive value; PPV, positive predictive value; Se, sensitivity; Sp, specificity;
Description of the Global Trigger Tool and Harvard Medical Practice Study
| Instrument | Description | Safety outcomes | Conducted by | Scale |
|---|---|---|---|---|
| Global Trigger Tool | Two-stage retrospective record review | |||
| Stage 1: Screening records for the presence of triggers and determining the adverse event that caused harm to patients | Triggers (mostly narrow) | Stage 1: Trained nurses or hospital pharmacists (primary reviewers, mostly two reviewers per records) | Dichotomous: yes/no trigger | |
| Stage 2: Confirming or dismissing the occurrence and category of the adverse event | Adverse events | Stage 2: Trained physicians (second reviewers, mostly one reviewer) | Dichotomous: yes/no AE | |
| Medical record review based on HMPS | Two-stage or three-stage retrospective record review | |||
| Stage 1: Screening records using criteria | (Broad) Screening criteria (triggers) | Stage 1: Trained nurses* | Dichotomous: yes/no trigger | |
| Stage 2: Detailed review to confirm the presence of adverse events and their preventability | (Preventable) Adverse events | Stage 2: Trained physicians (one or two reviewers per record) | AE determination is based on three criteria:
Unintended injury to the patient (dichotomous: yes/no) Resulted in prolongation of hospital stay, temporary or permanent disability or death (dichotomous: yes/no) Caused by healthcare management (six-point scale) | |
| Stage 3: Discussion or independently supervising review (consensus stage)† | Stage 3: Supervising physician | |||
*With the exception of the study of Brennan et al,32 in which medical records were reviewed by medical-record-room administrators.
†In some studies, a third stage was used.3 32 39–42
AEs, adverse events; HMPS, Harvard Medical Practice Study.
Differences in pooled κ values (n=20) among subgroups according to number of reviewers, reviewer experience and reviewer training
| n | Pooled κ* (SD) | 95% CI | p Value† | |
|---|---|---|---|---|
| Group of reviewers | ||||
| Max 5 | 7 | 0.80 (0.07) | 0.66 to 0.94 | 0.006 |
| >5–20 | 7 | 0.52 (0.06) | 0.40 to 0.64 | |
| >20 | 6 | 0.54 (0.02) | 0.50 to 0.59 | |
| Total | 20 | |||
| Reviewer experience (records/reviewer) | ||||
| <100 | 7 | 0.71 (0.06) | 0.58 to 0.84 | 0.062 |
| 100–300 | 6 | 0.51 (0.04) | 0.43 to 0.58 | |
| >300 | 7 | 0.53 (0.04) | 0.45 to 0.62 | |
| Total | 20 | |||
| Training | ||||
| <1 day | 4 | 0.53 (0.07) | 0.37 to 0.68 | 0.809 |
| 1 day | 4 | 0.56 (0.14) | 0.25 to 0.87 | |
| >1 day | 5 | 0.57 (0.05) | 0.45 to 0.67 | |
| Total | 13 | |||
*Pooled κ weighted for the number of records on which the κ value is calculated.
†p Values are obtained with the prevalence rate as covariate.
The reviewer experience, reviewer training and the prevalence of AEs in the three groups of reviewers
| Max 5 reviewers | >5–20 reviewers | >20 reviewers | |||||
|---|---|---|---|---|---|---|---|
| Median* (IQR) | Min–Max | Median* (IQR) | Min–Max | Median* (IQR) | Min–Max | p Value† | |
| Reviewer experience (records/reviewer) | 213 (60–1138) | 50–4043 | 95 (39–317) | 38–591 | 129 (109–616) | 78–675 | 0.351 |
| Training hours | 6 (0–6) | 0–12 | 16 (5–20) | 2–24 | 8 (3–10) | 2–16 | 0.317 |
| Prevalence AEs (%) | 17.1 (7.8) | 7.2–27 | 13.5 (4.0) | 7.5–21 | 12.7 (8.5) | 2.9–25.1 | 0.480 |
*Unweighted statistics for reviewer experience, training and prevalence rate.
†p Values are obtained by the non-parametric Kruskal-Wallis test.
‡p Value is obtained with an ANOVA.
AEs, adverse events; Min, minimum; Max, maximum.