| Literature DB >> 24275499 |
Jim McCambridge1, John Witton2, Diana R Elbourne3.
Abstract
OBJECTIVES: This study aims to (1) elucidate whether the Hawthorne effect exists, (2) explore under what conditions, and (3) estimate the size of any such effect. STUDY DESIGN ANDEntities:
Keywords: Assessment; Hawthorne effect; Observation; Reactivity; Research methods; Research participation
Mesh:
Year: 2013 PMID: 24275499 PMCID: PMC3969247 DOI: 10.1016/j.jclinepi.2013.08.015
Source DB: PubMed Journal: J Clin Epidemiol ISSN: 0895-4356 Impact factor: 6.437
Fig. 1PRISMA flowchart.
Fig. 2Binary outcome data. RR, relative risk; CI, confidence interval.
Extent and effects of heterogeneity in 14 studies of the Hawthorne effect with binary outcomesa
| df | ||||
|---|---|---|---|---|
| All 14 studies | 194.47 | 13 | <0.001 | 93.3 |
| 5 Randomized controlled trials | 15.98 | 4 | 0.003 | 75.0 |
| 5 Quasiexperimental studies | 2.35 | 4 | 0.67 | 0 |
| 4 Observational studies | 32.63 | 3 | <0.001 | 90.8 |
| 8 Studies of being observed or studied | 38.25 | 7 | <0.001 | 81.7 |
| 6 Studies of answering questions | 23.23 | 5 | <0.001 | 78.5 |
| 5 Studies of self-completing health questionnaires | 8.9 | 4 | 0.064 | 55.0 |
Rows 2-4 are mutually exclusive, as are rows 5 and 6. The final row 7 is a subset of data in row 6.
Study characteristics and findings of randomized controlled trials evaluating the Hawthorne effect
| Characteristic | Granberg and Holmberg | Van Rooyen et al. | Feil et al. | O'Sullivan et al. | Kypri et al. | Kypri et al. | McCambridge and Day | Evans et al. |
|---|---|---|---|---|---|---|---|---|
| Population | Swedish general population | Academic peer reviewers | Adolescent dental patients | Colorectal cancer screening population | University students | University students | University students | Male prostate cancer-testing population |
| Setting | Community | Correspondence | Dental clinic | Community | Student health service | Student health service | Student union | General practice initiated internet |
| Operationalization of HE | Being interviewed before an election | Awareness of study participation | Participation in experimental arm of clinical trial | Being given a questionnaire with screening kit | Completing a questionnaire | Completing a questionnaire | Completing a questionnaire on alcohol | Completing a questionnaire |
| Comparison group | Being interviewed after an election | No awareness of study participation | Usual care | No questionnaire with screening kit | No questionnaire | No questionnaire | No questionnaire on alcohol | No questionnaire |
| Participant blinding | Not clear if aware of outcome assessment | Yes. Control group blinded to study conduct | Yes. HE group blinded to study purpose. Control group blinded to all aspects of study participation. | Not clear if aware of outcome assessment | Yes. Both groups blinded to conduct of trial and study purpose | Yes. Both groups blinded to conduct of trial and study purpose. | Yes. Both groups blinded to study purpose and focus on drinking, HE group capable of inferring the latter | Not clear if aware of outcome assessment |
| Outcome measure | Objectively ascertained voting records | Quality of reviews produced. | Objectively ascertained plaque scores | Uptake of screening ascertained in records | 4 Self-reported health behaviors | Self-reported drinking and related problems | Self-reported drinking and related problems | Uptake of prostate cancer test in medical records |
| Follow-up intervals | Not reported | Not reported | 3 and 6 mo | 6 wk and 6 mo | 6 wk | 6 and 12 mo | 2-3 mo | 6 mo |
| Sample size | Preelection interview: 4,720, postelection interview: 4,999 | 316 unaware, 149 aware | 40, 20 per group | 1,944 sent a questionnaire, 10,413 not sent one | 74 completed a questionnaire, 72 did not | 147 completed a questionnaire, 146 did not | 217 completed a questionnaire, 204 did not | 150 per group |
| Attrition | None | None | Two lost to follow-up (one in each group) | None | 86%, Not differential by group | 84% and 86% not differential by group | 77%, Not differential by group | 83%, Not differential by group |
| Summary of reported findings | People interviewed before the election were more likely to vote (95% vs. 93%), and this effect was stronger for those with low political interest (93% vs. 90%) | No evidence of any difference | Large between-group differences in plaque score at both 3 (54 ± 13.79 vs. 78 ± 12.18) and 6 mo (52 ± 13.04 vs. 79 ± 10.76) | Small statistically significant differences in uptake at 6 wk (54.4% vs. 51.9%), no longer significant at 6 mo (64.7% vs. 62.9%) | No differences detected | No differences detected at 6 mo, 3 of 7 statistically significant differences in outcomes at 12 mo | Small statistically significant differences in 4 of 9 outcomes including primary outcome (0.23 standard deviations) | Those completing questionnaire more likely to undergo test (11 of 123 vs. 2 of 126) |
| Reviewer comments including on principal risks of bias | Completed interviews analyzed, not ITT. Various other sample refinements. Not a formal research report, details of study design and methods unclear. | Validated outcome measure completed by author. Potential for information bias unclear. | Family of three randomized ad hoc as cluster. Small study. | Sequence generation for "every sixth person" not described. Effects of reminder letters not reported. | Self-report. | Self-report. Reasons for lack of effect at 6 mo unclear. | Self-report. Attrition. | Two of four arms in online trial evaluating a decision support aid described. Limited information in report. Small numbers. |
Abbreviation: HE, Hawthorne effect.
Study characteristics and findings of quasiexperimental evaluations of the Hawthorne effect
| Characteristic | Murray et al. | Malotte and Morisky | McCusker et al. | Ertem et al. | Fernald et al. |
|---|---|---|---|---|---|
| Population | Secondary school children | Nonactive tuberculosis patients | General practitioners | Breast-feeding mothers of new born children | General practitioners |
| Setting | School | Tuberculosis clinic | General practice | Hospital | General practice |
| Operationalization of HE | Completing five annual questionnaires in cohort study | Participation in a usual care control arm of clinical trial | Practitioner completing a questionnaire on older patients' mental health | Participation in a cohort study | Completion of case reviews with researchers |
| Comparison group | Completing questionnaire only in the final year | Usual care group not participating in trial | No questionnaire | Eligible nonparticipants | No case reviews |
| Allocation method | Schools randomly selected at different times | Comparison group formed of all patients after monthly trial recruitment quota is reached | Alternate patient numbers, clinician-level data not reported | Alternate recruitment days | Random sample of 25% invited to participate. |
| Participant blinding | Yes to HE study purpose, not to focus on smoking behavior | HE group aware of trial participation, comparison group unaware of study | No | HE group aware of the cohort study of behavior. Comparison group unaware of the study | Not clear |
| Outcome measure | Self-reported smoking | Treatment retention | Recording | Data in routine records | Prescribing of antibiotics in abscess and cellulitis cases |
| Follow-up intervals | Cumulative exposure to 4 yr of surveys | 6 and 12 mo | 3 mo | 2 wk, 2 and 4 mo | No follow-up. 6- to 7-mo study period |
| Sample size | 5,615 annual questionnaires, 1,934 final year only, genders presented separately | 46 in trial, 85 in nontrial group | 41 with questionnaire, 53 without | 64 in cohort study, 61 in nonparticipating group | 91 clinicians, 14 participating in case reviews, and 77 comparisons |
| Attrition/response rates | 75% in 48 HE schools, 84% in 12 of 20 control schools, differential between groups | No attrition, medical records used | No attrition, medical records used | No attrition, medical records used | No attrition, medical records used |
| Summary of reported findings | Two statistically significant differences in girls, one among boys, of four outcomes assessed: 23% vs. 29% nonsmokers among girls, 25% vs. 27% among boys. Outcomes aggregated for | At 6 mo, 30% vs. 21% comparing trial with nontrial groups, 12 mo 20% vs. 19%. Median time in treatment greater for trial group than nontrial group (13 vs. 5 wk). First follow-up data used in | No differences in recording of mental health problems: 7 (17%) of 41 questionnaire; 10 (19%) of 53 no questionnaire. | No differences in discontinuation of breast-feeding: 2 wk 34% vs. 38%; 2 mo 73% vs. 70%; 4 mo 84% vs. 89% (nonparticipants vs. cohort study). First follow-up data used in | No differences in antibiotic prescribing in reviewed abscess cases (9 of 21 vs. 60 of 127) or in reviewed cellulitis cases (105 of 250 vs. 465 of 1,108). First outcome data used in |
| Reviewer comments including on principal risks of bias | Response rates differential at final survey. Between-group equivalence not demonstrated, vulnerable to selection bias. | Nonequivalent groups. Those not consenting to trial were excluded, no consent procedure for control group. Small sample size. | No data provided on clinicians. Unclear why imbalance in numbers allocated. May be weak manipulation of intended sense of being studied. Absence of blinding. Small study. | No consent procedure for control group, no information on refusals to consent to cohort study. Small sample size. | Outcome data reported comparing approximately 15% who participated in case reviews (rather than those randomized) with approximately 85% who did not. Brief report. |
Abbreviation: HE, Hawthorne effect.
Study characteristics and findings of observational study evaluations of the Hawthorne effect
| Characteristic | Campbell et al. | Mangione-Smith | Eckmanns et al. | Leonard and Masatu | Maury et al. | Fox et al. |
|---|---|---|---|---|---|---|
| Population | Paramedics | Pediatricians | Clinicians | Clinicians | Clinicians | Obstetricians |
| Setting | Emergency services | Community practices | Hospital intensive care units | Outpatient clinics | Hospital intensive care unit | Hospital birth unit |
| Operationalization of HE | Announcement of study in a memo | Impact of audio taping consultations and completing questionnaires on inappropriate antibiotic prescribing | Announcement of 10-day direct observation study of hand hygiene | Direct observation of consultations by researchers | Announcement of observational study of hand hygiene in two time periods by two clinicians | Impact of awareness of being studied on diagnostic accuracy (EFW) |
| Comparisons | Prior awareness-raising memo | Unobserved consultations (neither audio/questionnaire) during same time period (and also later) | Covert observational period 10 mo earlier (same research nurse observer) | Nonobserved consultations before research team arrival | Two covert observed periods (by same two clinicians) when clinicians were unaware of being observed | Accuracy of estimates found in consecutive equivalent records in an earlier period |
| Blinding | No | Blinded to prescribing focus, consented to communication study | Only during covert observation | No | Only during covert observation | Were aware of study of accuracy, not of HE |
| Outcome measure | Documentation rates of medication, allergy, and medical history | Antibiotic prescribing in viral cases by direct observation or in medical records | Observed use of AHR | Patient-reported quality of care in postconsultation questionnaire | Observed hand hygiene compliance | Differences in accuracy (proportions of EFWs at birth weight ±10%) |
| Sample size | 145 Practitioners and 30,828 reports | 10 Pediatricians. 91 Nonobserved consultations, 149 observed | 2,808 Indications for AHR use, 937 in period 1 and 1,871 in period 2 | Not clear | 4,142 Opportunities for hand hygiene compliance in four periods (1,064, 1,045, 1,038, and 995 each) | 187 in each group |
| Summary of reported findings | Study led to increases in 2 of 3 recording outcomes (medication and allergy, not medical history). Regression coefficients and | Inappropriate antibiotic prescribing in viral cases was 29% lower when observed (46% vs. 17%) | AHR compliance increased from 29% to 45% during the overt observation period (OR, 2.33; 95% CI: 1.95, 2.78; higher in multivariate model) | Patient-reported quality of care increased by 13% with direct observation, and returned to preobservation levels between 10 and 15 consultations | Both observed periods saw similar increases in compliance (47-55% and 48-56%). | No differences in main analysis (67.9% vs. 68.5% control/study). Possible difference in expected direction in small subgroup of heavy babies (37% vs. 53% control/study) |
| Reviewer comments including on principal risks of bias | No evaluation of confounding. Differences seen after awareness-raising memo, effect may be contingent. | Information bias judged likely. Reporting complex. | Big difference in indications for use in two periods reflects different observation procedures. Observer bias also possible. | Both the analyses and the reporting of outcome data are complex. | Not a formal report, so scant detail on data collection and other study procedures. | Equivalence problematic, clinicians different in two periods, although baby characteristics similar. |
Abbreviations: HE, Hawthorne effect; EFW, estimate of fetal weight; AHR, antiseptic hand rub; OR, odds ratio; CI, confidence interval.