| Literature DB >> 32293514 |
Jennifer F Bobb1,2, Hongxiang Qiu3, Abigail G Matthews4, Jennifer McCormack4, Katharine A Bradley5,6,7.
Abstract
BACKGROUND: Pragmatic trials provide the opportunity to study the effectiveness of health interventions to improve care in real-world settings. However, use of open-cohort designs with patients becoming eligible after randomization and reliance on electronic health records (EHRs) to identify participants may lead to a form of selection bias referred to as identification bias. This bias can occur when individuals identified as a result of the treatment group assignment are included in analyses.Entities:
Keywords: Cluster-randomized trials; Electronic health records; Identification bias; Implementation trial; Open-cohort trial; Opioid use disorder; Recruitment bias
Mesh:
Year: 2020 PMID: 32293514 PMCID: PMC7092580 DOI: 10.1186/s13063-020-4148-z
Source DB: PubMed Journal: Trials ISSN: 1745-6215 Impact factor: 2.279
Fig. 1Analytic samples available for inclusion in analyses of PROUD intervention effects before and after randomization. Boxes not drawn to scale. * Increase in documentation of an OUD diagnosis may be due to increased skill in diagnosing and treating OUD or increased patient disclosure due to reduced stigma. ** Includes patients who are attracted to the intervention site because they are seeking OUD treatment (e.g. due to limited access to treatment elsewhere or lower barriers to receiving care in the PROUD intervention site)
Hypothetical example illustrating identification biasa when individuals identified using post-randomization data are included in analyses
| Research Question: Does the intervention decrease the number of days of acute care utilization among patients with OUD (patient-level outcome)? | |
Assumption: Assume the intervention has no effect on reducing acute care utilization Analytic sample: An open cohort of individuals with an OUD diagnosis documented in the EHR (pre- and/or post-randomization) Patients identified using pre-randomization data: • Suppose the number of patients with documented OUD pre-randomization is 100 in each trial arm (control and intervention) • Assume an average of 9 days of acute care utilization per year at baseline among these patients with OUD in each arm Patients identified using post-randomization data: • Control: 25 patients receive a new documented OUD diagnosis post randomization. These patients have an average of 9 days of acute care per year at baseline • Intervention: 50 patients receive a new documented OUD diagnosis. Of these, 25 are diagnosed as part of the intervention program and the other 25 are diagnosed through other mechanisms as in the control sites • Suppose patients diagnosed via the intervention program are sicker as compared to those diagnosed through other mechanisms, with an average of 12 days of acute care per year (versus 9) at baseline Estimated intervention effect: • Control: among 125 patients with a diagnosis, there is an average of 9 days of acute care per year of follow-upb • Intervention: among 150 patients with a diagnosis, there is an average of 9.5 days of acute care per year of follow-upb [= 9*125/150 + 12*25/150] Summary: We would estimate that the intervention results in greater acute care utilization relative to control, even if there is truly no effect. The bias could go in the other direction if patients diagnosed as part of the intervention program are healthier (rather than sicker) than patients diagnosed through other mechanisms. |
EHR electronic health record, OUD opioid use disorder
a Identification bias is a form of selection bias that can occur in open-cohort cluster-randomized trials when the randomized intervention group assignment affects who is identified as eligible for a particular analysis
b For simplicity, here we assume no time trend (i.e., that average number of days of acute care per year of follow up is the same as the average per year at baseline)
Considerations in using different sets of eligibility criteria for defining the analytic sample
| Analytic sample | General considerations | Implications for effectiveness outcome |
|---|---|---|
| Analytic samples not affected by identification biasa | ||
| Sample 1. Patients with documented OUD before randomization | • Misses potential improvements in outcomes attributable to the intervention among new patients with OUD who were attracted to receive care after randomization due to the PROUD intervention. These patients could comprise a substantial proportion of patients with OUD treated due to the PROUD intervention (70%–90%) • Patients with a prior documented OUD may not reflect the general population of patients with true OUDb • Given that OUD is underdiagnosed, restricting to patients with documented OUD before randomization reduces the sample size and therefore power for patient-level outcomes, relative to an open cohort design that includes those diagnosed after randomization (Sample 4 below) | Estimates of intervention effects within this select population of patients with documented OUD before randomization may not generalize to the broader population of individuals with true OUD who may be treated as part of the intervention (and therefore may not detect the true benefit). |
| Sample 2. All patients with primary care visits before randomization | • Same as bullet #1 for Sample 1 above • Sample includes patients with undocumented OUDs before randomization who might benefit from the intervention • Sample also includes many individuals without true OUD who would not be impacted by the intervention | • Since most individuals in the site population do not have OUD, the effect of the intervention on acute care utilization would be diluted, resulting in attenuation of the treatment effect toward the null • Relative to Sample 1, power could either be increased due to the higher sample size of patients with OUD or decreased due to including patients without true OUD in the analysis |
| Sample 3. Patients with primary care visits before randomization who have documented OUD or are at “increased risk” of OUDc | • Same as bullet #1 for Sample 1 above • Need to develop a definition of “increased risk” of OUD that seeks to include as many patients who truly have OUD (maximizing sensitivity) while limiting the number of patients included who do not truly have OUD (maximizing specificity). For example, this definition could be selected to target a high specificity (Sample 3a), a high sensitivity (Sample 3b), or a balanced sensitivity and specificity option (Sample 3c) | • Relative to Sample 1, results in a larger sample size of patients with true OUD (higher sensitivity). This could increase power • At the same time could lead to attenuated intervention effect estimates relative to Sample 1, since more of the identified individuals would not have OUD (lower specificity) |
| Analytic samples potentially affected by identification biasa | ||
| Sample 4. Patients with documented OUD (before and/or after randomization) | • Patients diagnosed after randomization in the intervention arm may not be comparable to patients diagnosed after randomization in the control arm • Diagnosis of newly recognized OUD is expected to continue over time. Consequently, including individuals diagnosed after randomization could increase the sample size (and therefore power) as compared to Sample 1 | Individuals diagnosed with OUD after randomization in the intervention arm are likely to be different (either sicker or healthier) with respect to their propensity for acute care utilization than individuals diagnosed with OUD after randomization in the control arm. This could lead to bias (see Table |
| Sample 5. All patients with primary care visits (before and/or after randomization) | • Patients new to intervention sites after randomization may not be comparable to patients new to the control sites after randomization • As in Sample 2, sample includes many individuals without true OUD who would not be impacted by the intervention • Captures outcomes of all patients with OUD who could be treated: patients seen previously in the clinic (including those with and without documented OUD before randomization) and those attracted to receive care as part of the PROUD intervention | As in Sample 2, the effect of the intervention would be diluted in the entire site population relative to Samples 1 or 4 and power could either be increased or decreased. |
| Sample 6. Patients with primary care visits who have documented OUD or are at “increased risk” of OUDc (before and/or after randomization) | • Patients identified as at “increased risk” of OUD after randomization in the intervention arm may not be comparable to patients identified as at “increased risk” of OUD after randomization in the control arm • As in Sample 3, need to develop a definition of “increased risk” of OUD that seeks to include as many patients who truly have OUD while limiting the proportion of patients who do not truly have OUD who are included | As in Sample 3, results in a larger sample size of patients with true OUD, but also could include many patients without OUD; thus, power could either be increased or decreased relative to Sample 4. |
EHR electronic health record, OUD opioid use disorder
a Identification bias is a form of selection bias that can occur in open-cohort cluster-randomized trials when the randomized intervention group assignment affects who is identified as eligible for a particular analysis. Identifying eligibility for inclusion in trial analyses before randomization (or using data collected pre-randomization) avoids this source of bias
b Documented OUD refers to patients with an OUD diagnosis documented in the EHR; True OUD refers to patients with OUD regardless of its recognition by clinicians and/or documentation in the EHR
c Planned definition of “increased risk” of OUD included individuals with any documented OUD diagnosis at baseline or anyone with both chronic opioid therapy (outside of end of life, palliative care, or active cancer treatment) and at least one of the following risk factors associated with increased risk of OUD: high morphine equivalent dose, alcohol or other substance use disorders, mental health disorders, concurrent sedative use, or pain in two or more body regions (e.g. headache and back pain)
Assumed values of sensitivity and specificity for each analytic sample using pre-randomization data considered in the power evaluation
| True OUD prevalence (π) | ||||||
|---|---|---|---|---|---|---|
| 0.01 | 0.02 | 0.04 | ||||
| Samplea | Assumptions / Notes | Specificity | Sensitivityb | |||
| 1 | Documented OUD | • Assumes all individuals with documented OUD do in fact have true OUD (specificity = 1) • Sensitivity selected to be consistent with the observed proportion of patients with documented OUD based on Phase 1 data (0.5%) and the specific choice of the true prevalence (π) | 1 | 0.5 | 0.25 | 0.125 |
| 2 | All patients | By definition, sensitivity = 1 and specificity = 0 | 0 | 1 | 1 | 1 |
| 3a | High specificity | Selected to have slightly higher sensitivity than Sample 1 (1.2 times the value), at the cost of slightly reduced specificity | 0.95 | 0.6 | 0.3 | 0.15 |
| 3b | High sensitivity | • Sensitivity was selected based on a previously developed algorithmc to identify individuals with opioid abuse and addiction, among patients on long-term opioid therapy • We considered a lower specificity (0.5 vs 0.64c) given that our initial sample is the entire site population, not restricted to long-term opioid users | 0.5 | 0.85 | 0.85 | 0.85 |
| 3c | Equal sens./spec. | Selected to have lower sensitivity and higher specificity than Sample 3b | 0.6 | 0.6 | 0.6 | 0.6 |
OUD opioid use disorder
a All options identify the study population using baseline (pre-randomization) data; see Table 2
b For some of the options, sensitivity was allowed to vary across the assumed prevalence of true OUD (π)
c Different cut-points for the developed risk score [39] achieved different values of sensitivity and specificity. We considered the “high sensitivity” scenario that achieved a sensitivity of 0.85 and a specificity of 0.64 in the validation sample (D. Carrell, personal communication, 5 July 2017)
Fig. 2Comparison of statistical power across different options for the analytic sample for the effectiveness analysis. The x-axis shows the intervention effect size, parameterized as the percentage decrease in the expected number of days of acute care utilization comparing patients with OUD (recognized or unrecognized) in the intervention versus usual care arm. All options for the analytic sample (described in Table 2) use pre-randomization data. Each panel represents a different true prevalence of OUD (1%, 2%, or 4%). Options 3a, 3b, and 3c correspond to different assumptions of the properties of an algorithm for defining “increased risk” of OUD (see Table 3). Higher sensitivity includes more patients with true OUD whereas higher specificity excludes more patients without true OUD. Power calculations were based on closed form sample size formula based on Poisson regression (details are in the Additional File)