| Literature DB >> 34779122 |
Michael W Beets1, Lauren von Klinggraeff1, Sarah Burkart1, Alexis Jones1, John P A Ioannidis2, R Glenn Weaver1, Anthony D Okely3, David Lubans4, Esther van Sluijs5, Russell Jago6, Gabrielle Turner-McGrievy7, James Thrasher7, Xiaoming Li7.
Abstract
Biases introduced in early-stage studies can lead to inflated early discoveries. The risk of generalizability biases (RGBs) identifies key features of feasibility studies that, when present, lead to reduced impact in a larger trial. This meta-study examined the influence of RGBs in adult obesity interventions. Behavioral interventions with a published feasibility study and a larger scale trial of the same intervention (e.g., pairs) were identified. Each pair was coded for the presence of RGBs. Quantitative outcomes were extracted. Multilevel meta-regression models were used to examine the impact of RGBs on the difference in the effect size (ES, standardized mean difference) from pilot to larger scale trial. A total of 114 pairs, representing 230 studies, were identified. Overall, 75% of the pairs had at least one RGB present. The four most prevalent RGBs were duration (33%), delivery agent (30%), implementation support (23%), and target audience (22%) bias. The largest reductions in the ES were observed in pairs where an RGB was present in the pilot and removed in the larger scale trial (average reduction ES -0.41, range -1.06 to 0.01), compared with pairs without an RGB (average reduction ES -0.15, range -0.18 to -0.14). Eliminating RGBs during early-stage testing may result in improved evidence.Entities:
Keywords: intervention; pilot; scaling; translation
Mesh:
Year: 2021 PMID: 34779122 PMCID: PMC8755584 DOI: 10.1111/obr.13369
Source DB: PubMed Journal: Obes Rev ISSN: 1467-7881 Impact factor: 10.867
FIGURE 1Diagram of search procedures of locating pilot studies and larger scale trial pairs
Operational definition of risk of generalizability biases
| Risk of generalizability bias | Example of bias | |
|---|---|---|
| What is the potential for difference(s) between … | ||
| Intervention intensity bias | … the number and length of contacts in the pilot study compared with the number and length of contacts in the larger scale trial of the intervention? | 7 sessions in 7 weeks in pilot/feasibility study vs. 4 sessions over 12 weeks in larger scale trial. 24 contacts per week for 12 weeks in pilot/feasibility vs. 2 contacts per week for 12 weeks in larger scale trial |
| Implementation support bias | … the amount of support provided to implement the intervention in the pilot study compared with the amount of support provided in the larger scale trial? |
Any adherence issues noted were immediately addressed in ongoing supervision. At the end of each session, the researcher debriefed with the interventionist to discuss reasons for the variation in approach and to maintain standardization, integrity of implementation, and reliability among interventionists. |
| Intervention delivery agent bias | … the level of expertise of the individual(s) who delivered the intervention in the pilot study compared with who delivered the intervention in the larger scale trial |
All intervention sessions were led by the first author. The interventionists were highly trained doctoral students or a postdoc. |
| Target audience bias | … the demographics of those who received the intervention in the pilot study compared with those who received the intervention in the larger scale trial |
Affluent and educated background, mostly White non‐Hispanic. Participants were predominately healthy and well‐educated. |
| Intervention duration bias | … the length of the intervention provided in the pilot study compared with the length of the intervention in larger scale trial? | 8‐week intervention in pilot/feasibility study to 12‐month intervention in larger scale trial. |
| Setting bias | … the type of setting where the intervention was delivered in the pilot study compared with the setting in the larger scale trial |
A convenience sample of physicians, in one primary care office practice agreed to participate. They were approached because of a personal relationship with one of the investigators who also was responsible for providing physician training in the counseling intervention. The study was conducted at a university health science center. |
| Measurement bias | … the measures employed in the pilot study compared with the measures used in the larger scale trial of the intervention for primary/secondary outcomes? | Use of objective measures in pilot to self‐report measures in larger scale trial. |
| Directional conclusions | Are the intervention effect(s) in the hypothesized direction for the pilot study compared with those in the larger scale trial? | Outcomes in the opposite direction (e.g., control group improved more so than treatment group). |
Note: Based on definitions originally appearing in Beets et al.
FIGURE 2PRISMA diagram of systematic literature search and final studies included in analyses
FIGURE 3Classification of the presence (red circle) and absence (green circle) risk of generalizability biases across pilot and larger scale trial pairs
FIGURE 4Forest plot of the change in the standardized difference in means (SDM) of the presence, absence, or carry forward of risk of generalizability biases from a pilot/feasibility study to a larger scale trial. No pairs contained directional conclusion bias in both the pilot and larger scale trial. Intervention duration, intervention intensity, and measurement describe differences between smaller and larger scale studies, so they cannot be present in both studies
Odds for detecting a nominally statistically significant p value (p < 0.05 and p < 0.005) in a larger scale trial from the presence of a risk of generalizability bias
| Risk of generalizability bias | Odds ratio for | Odds ratio for | ||||||
|---|---|---|---|---|---|---|---|---|
| OR | (95% CI) | % Effects | OR | (95% CI) | % Effects | |||
| Delivery agent | Not present | Reference | 34% | Reference | 23% | |||
| Pilot only |
|
| 24% |
|
| 15% | ||
| Both | 1.39 | (0.97, 2.00) | 42% | 0.96 | (0.63, 1.47) | 22% | ||
| Implementation support | Not present | Reference | 34% | Reference | 22% | |||
| Pilot only | 0.81 | (0.42, 1.56) | 30% | 0.54 | (0.22, 1.30) | 13% | ||
| Both | 0.74 | (0.51, 1.08) | 30% | 1.03 | (0.67, 1.57) | 22% | ||
| Target audience | Not present | Reference | 32% | Reference | 20% | |||
| Pilot only | 0.72 | (0.47, 1.11) | 26% | 1.11 | (0.70, 1.75) | 21% | ||
| Both |
|
| 49% |
|
| 34% | ||
| Setting | Not present | Reference | 33% | Reference | 21% | |||
| Pilot only |
|
| 8% |
|
| 6% | ||
| Both |
|
| 48% |
|
| 35% | ||
| Intervention duration | Not present | Reference | 34% | Reference | 23% | |||
| Pilot only | 0.83 | (0.61, 1.12) | 32% |
|
|
| 17% | |
| Intervention intensity | Not present | Reference | 34% | Reference | 22% | |||
| Pilot only |
|
| 3% |
|
| 3% | ||
| Measurement | Not present | Reference | 35% | Reference | 22% | |||
| Pilot only | 0.86 | (0.52, 1.41) | 15% | 0.93 | (0.52, 1.65) | 20% | ||
| Directional conclusions | Not present | Reference | 34% | Reference | 23% | |||
| Pilot only |
|
| 32% |
|
| 8% | ||
Note: Bolded values 95% confidence intervals (CIs) do not cross 1.00. Reference group is pilot/feasibility study and larger scale trial pairs without a risk of generalizability bias present.
FIGURE 5Z value distribution of outcomes in larger scale trials by the absence (green line) of risk of generalizability bias (RGB), RGB present in pilot and absent in larger scale trial (red distribution), and RGB present in both pilot and larger scale trial (blue distribution). Positive z values indicate that the intervention was better than the control group, and negative z values indicate that the control group was better than the intervention group. Solid vertical lines represent z ± 1.96 (p = 0.05); dashed vertical lines represent z ± 2.58 (p = 0.005)