| Literature DB >> 24073634 |
Abstract
BACKGROUND: Interventions targeting individuals classified as "high-risk" have become common-place in health care. High-risk may represent outlier values on utilization, cost, or clinical measures. Typically, such individuals are invited to participate in an intervention intended to reduce their level of risk, and after a period of time, a follow-up measurement is taken. However, individuals initially identified by their outlier values will likely have lower values on re-measurement in the absence of an intervention. This statistical phenomenon is known as "regression to the mean" (RTM) and often leads to an inaccurate conclusion that the intervention caused the effect. Concerns about RTM are rarely raised in connection with most health care interventions, and it is uncommon to find evaluators who estimate its effect. This may be due to lack of awareness, cognitive biases that may cause people to systematically misinterpret RTM effects by creating (erroneous) explanations to account for it, or by design.Entities:
Mesh:
Year: 2013 PMID: 24073634 PMCID: PMC3849564 DOI: 10.1186/1471-2288-13-119
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Figure 1Actual data illustrating the regression to the mean phenomenon in Coronary Artery Disease (CAD), Congestive Heart Failure (CHF), and Chronic Obstructive Pulmonary Disease (COPD). Quintile I is the lowest cost group and V the highest. All individuals were continuously enrolled during the 2-year period. The diagonal line represents perfect correlation between the first and second year costs, which can only be achieved in the complete absence of variability between measurements and no measurement error.
Figure 2Physical Component Summary (PCS) scores on the Short Form-12 (SF-12v2), from a control group (= 118) participating in a health coaching study (Butterworth et al. 2006). All participants were surveyed twice, once at program commencement and then again at three months. Squares/circles represent mean scores and capped lines represent 95% bootstrapped confidence intervals (1000 resamples).
Results of the Monte Carlo simulation (= 10,000)
| | | | | |
| RTM (H) actual | 1417.67 | 0.33 | 1417.03 | 1418.32 |
| RTM (H) calculated | 1417.55 | 0.20 | 1417.16 | 1417.94 |
| RTM (L) actual | 354.16 | 0.18 | 353.80 | 354.51 |
| RTM (L) calculated | 354.27 | 0.06 | 354.15 | 354.39 |
| | | | | |
| RTM (H) actual | 945.21 | 0.28 | 944.66 | 945.76 |
| RTM (H) calculated | 945.10 | 0.14 | 944.82 | 945.38 |
| RTM (L) actual | 236.09 | 0.15 | 235.80 | 236.38 |
| RTM (L) calculated | 236.20 | 0.04 | 236.12 | 236.28 |
| | | | | |
| RTM (H) actual | 472.71 | 0.21 | 472.30 | 473.11 |
| RTM (H) calculated | 472.59 | 0.08 | 472.44 | 472.74 |
| RTM (L) actual | 118.03 | 0.11 | 117.83 | 118.24 |
| RTM (L) calculated | 118.11 | 0.02 | 118.07 | 118.15 |
Notes: RTM (H) is the regression to the mean effect for the high-risk group, and RTM (L) is the regression to the mean effect for the low-risk group. “Actual” represents the RTM effect derived directly from the data, and “calculated” is derived using Equation 1. ρ is the pretest-posttest correlation for the entire sample.
Regression to the mean effects for Physical Component Summary (PCS) scores on the Short Form-12 (SF-12v2), from the high-risk (PCS values ≤ 44.25) subgroup of controls (= 34) participating in a health coaching study (Butterworth et al. 2006)
| | | | | |
| Pre-test | 36.65 | 2.21 | 32.32 | 40.97 |
| Post-test | 44.93 | 3.27 | 38.51 | 51.34 |
| RTM | 8.28 | 2.01 | 4.35 | 12.21 |
| | | | | |
| Pre-test | 40.02 | 0.78 | 38.49 | 41.56 |
| Post-test | 43.41 | 1.52 | 40.44 | 46.38 |
| RTM | 3.38 | 0.94 | 1.54 | 5.22 |
| Difference (Actual – Calculated) | | | ||
| Pre-test | −3.38 | 0.83 | −5.01 | −1.74 |
| Post-test | 1.52 | 1.82 | −2.04 | 5.09 |
| RTM | 4.90 | 1.93 | 1.12 | 8.68 |
Notes: “Actual” indicates that the pre-test, post-test and RTM effects were estimated directly from the data. “Calculated” indicates that Equations 1, 5 and 6 were applied directly to the existing data. Standard errors and 95% confidence intervals were derived by bootstrap resampling of the data 1000 times.