| Literature DB >> 25519494 |
Kristin Gustavson1, Ingrid Borren.
Abstract
BACKGROUND: Medical researchers often use longitudinal observational studies to examine how risk factors predict change in health over time. Selective attrition and inappropriate modeling of regression toward the mean (RTM) are two potential sources of bias in such studies.Entities:
Mesh:
Year: 2014 PMID: 25519494 PMCID: PMC4298063 DOI: 10.1186/1471-2288-14-133
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Figure 1Diagram of the procedures for defining populations, generating data, and performing analyses.
Figure 2Population model of a situation where RTM occurs. Population model where the baseline association between health and predictor is entirely due to causes with transient effects on health. When the transient effect in the current figure comes from factors with variance > 0, it contributes to variance in change and thus to rank-order instability and RTM in health. These transient causes also affect the predictor, and RTM in health will therefore be correlated with the predictor. Thus, persons with different scores on the baseline predictor tend to regress toward the same mean on the health variable. To avoid attributing RTM in health to the predictor, the statistical method used should include an assumption of RTM. A more technical explanation of why MR is appropriate in this situation is provided in Additional file 1. The circle symbolizes a latent factor that is not observed by the researcher. Squares are observed variables used in the analyses. To simplify the figure, residual variances of observed variables are not drawn. This model corresponds to the model in Judd & Kenny [24] (p.111) where the allocation variable to control versus intervention group has time-limited effects on test scores.
Figure 3Population model of a situation where RTM does not occur. Population model where the baseline association between health and predictor is due to causes with enduring effects on health. The sum of the indirect and direct effects from these causes on follow-up health is equal to their direct effect on baseline health. The effect from the enduring causes on health implies that this latent variable does not contribute to rank-order instability, and hence not to RTM in health. The statistical method used should therefore not include an assumption of RTM. A more technical explanation of why change score analysis is appropriate in this situation is provided in Additional file 1. The circle symbolizes a latent factor not observed by the researcher. Squares are observed variables used in the analyses of the generated data. To simplify the figure, residual variances of observed variables are not drawn. This population model partly corresponds to the model in Judd & Kenny [24] (p. 118), where the allocation variable to control versus intervention group has stable effects on test scores. The two models differ because the current model assumes that baseline health affects follow-up health. Nevertheless, the current population model implies that the effect from these causes on follow-up health is equal to their effect on baseline health.
Estimated associations between baseline predictor and change in health from appropriate use of MR analyses (i.e. in situations where RTM was assumed to occur), as shown in Figure 2
| 50% attrition rate | 70% attrition rate | |||||
|---|---|---|---|---|---|---|
| Dependency (b drop) | b pred (SE) | Coverage of 95% C.I. | b pred (SE) | Coverage of 95% C.I. | ||
| Pred. | V1 | V2 | ||||
| 0 | 0 | 0 | .10 (.04) | 95% | .10 (.05) | 95% |
| .3 | .3 | 0 | .10 (.04) | 95% | .10 (.05) | 93% |
| .3 | .3 | .1 | .09 (.04) | 91% | .08 (.05) | 93% |
| .3 | .3 | .3 | .04 (.04) | 67% | .03 (.05) | 73% |
Dependency is the magnitude (bdrop) of the regression of liability of dropping out on each of the three study variables. Pred = baseline predictor. V1 = the main variable at baseline (baseline health), V2 = the main variable at follow-up (follow-up health). SE = standard error. bpred = regression coefficient from predictor to change in health. Coverage of 95% C.I. = the percentage of the 500 samples with an estimated bpred with a 95% confidence interval containing the true population value. bpred and SE are average results over the 500 generated samples. N in the original samples was 1000. The first line shows results when attrition was completely random.
The true population value was bpred = .10.
Estimated associations between baseline predictor and change in health from appropriate use of change score analyses (i.e. in situations where RTM was assumed not to occur), as shown in Figure 3
| 50% attrition rate | 70% attrition rate | |||||||
|---|---|---|---|---|---|---|---|---|
| Dependency (b drop) | b pred (SE) | Coverage of 95% C.I. | Diff | b pred (SE) | Coverage of 95% C.I. | Diff | ||
| Pred. | V1 | V2 | ||||||
| 0 | 0 | 0 | .10 (.04) | 96% | 0.00 | .10 (.05) | 96% | 0.00 |
| .3 | .3 | 0 | .13 (.04) | 86% | 0.08 | .13 (.05) | 91% | 0.11 |
| .3 | .3 | .1 | .12 (.04) | 90% | 0.04 | .13 (.05) | 92% | 0.06 |
| .3 | .3 | .3 | .10 (.04) | 94% | -0.02 | .10 (.06) | 93% | -0.03 |
Dependency is the magnitude (bdrop) of the regression of liability of dropping out on each of the three study variables. Pred = baseline predictor. V1 = the main variable at baseline (baseline health), V2 = the main variable at follow-up (follow-up health). SE = standard error. bpred = regression coefficient from predictor to change in health. Coverage of 95% C.I. = the percentage of the 500 samples with an estimated bpred with a 95% confidence interval containing the true population value. Diff = estimated change score (follow-up health minus baseline health). bpred, SE, and Diff are average results over the 500 generated samples. N in the original samples was 1000. The first line shows results when attrition was completely random.
The true population value was bpred = .10.
Estimated associations between baseline predictor and change in health from inappropriate use of MR analyses (i.e. in situations where RTM was assumed not to occur), as shown in Figure 3
| Dependency (b drop) | b pred (SE) | Coverage of 95% C.I. | ||
|---|---|---|---|---|
| Pred. | V1 | V2 | ||
| 0 | 0 | 0 | .17 (.04) | 52 |
| .3 | .3 | 0 | .17 (.04) | 52 |
| .3 | .3 | .1 | .15 (.04) | 70 |
| .3 | .3 | .3 | .11 (.04) | 94 |
Dependency is the magnitude (bdrop) of the regression of liability of dropping out on each of the three study variables. Pred = baseline predictor. V1 = the main variable at baseline (baseline health), V2 = the main variable at follow-up (follow-up health). SE = standard error. bpred = regression coefficient from predictor to change in health. Coverage of 95% C.I. = the percentage of the 500 samples with an estimated bpred with a 95% confidence interval containing the true population value. bpred and SE are average results over the 500 generated samples. N in the original samples was 1000. Attrition rate was 50%. The first line shows results when attrition was completely random.
The true population value was bpred = .10.
Estimated associations between baseline predictor and change score from inappropriate use of change score analyses (i.e. in situations where RTM was assumed to occur), as shown in Figure 2
| Dependency (b drop) | b pred (SE) | Coverage of 95% C.I. | Diff | ||
|---|---|---|---|---|---|
| Pred. | V1 | V2 | |||
| 0 | 0 | 0 | . 02 (.04) | 57 | 0.00 |
| .3 | .3 | 0 | .06 (.05) | 86 | 0.12 |
| .3 | .3 | .1 | .05 (.05) | 79 | 0.08 |
| .3 | .3 | .3 | .02 (.05) | 61 | 0.00 |
Dependency is the magnitude (bdrop) of the regression of liability of dropping out on each of the three study variables. Pred = baseline predictor. V1 = the main variable at baseline (baseline health), V2 = the main variable at follow-up (follow-up health). SE = standard error. bpred = regression coefficient from predictor to change in health. Coverage of 95% C.I. = the percentage of the 500 samples with an estimated bpred with a 95% confidence interval containing the true population value. Diff = estimated change score (follow-up health minus baseline health). bpred, SE, and Diff are average results over the 500 generated samples. N in the original samples was 1000. Attrition rate was 50%. The first line shows results when attrition was completely random.
The true population value was bpred = .10.
Estimated associations between baseline predictor and change in health from inappropriate use of MR analyses (i.e. in situations where RTM was assumed not to occur)
| Dependency (b drop) | b pred (SE) | Coverage of 95% C.I. | ||
|---|---|---|---|---|
| Pred. | V1 | V2 | ||
| 0 | 0 | 0 | -.17 (.04) | 53 |
| .3 | .3 | 0 | -.17 (.04) | 54 |
| .3 | .3 | .1 | -.15 (.04) | 69 |
| .3 | .3 | .3 | -.11 (.04) | .94 |
Dependency is the magnitude (bdrop) of the regression of liability of dropping out on each of the three study variables. Pred = baseline predictor. V1 = the main variable at baseline (baseline health), V2 = the main variable at follow-up (follow-up health). SE = standard error. bpred = regression coefficient from predictor to change in health. Coverage of 95% C.I. = the percentage of the 500 samples with an estimated bpred with a 95% confidence interval containing the true population value. bpred and SE are average results over the 500 generated samples. N in the original samples was 1000. Attrition rate was 50%. The first line shows results when attrition was completely random.
The true population value was bpred = - .10.
Estimated associations between baseline predictor and change in health from inappropriate use of change score analyses (i.e. in situations where RTM was assumed to occur)
| Dependency (b drop) | b pred (SE) | Coverage of 95% C.I. | Diff | ||
|---|---|---|---|---|---|
| Pred. | V1 | V2 | |||
| 0 | 0 | 0 | -.02 (.04) | 57% | 0.00 |
| .3 | .3 | 0 | -.05 (.05) | 84% | 0.12 |
| .3 | .3 | .1 | -.04 (.05) | 77% | 0.08 |
| .3 | .3 | .3 | -.02 (.05) | 62% | 0.00 |
Dependency is the magnitude (bdrop) of the regression of liability of dropping out on each of the three study variables. Pred = baseline predictor. V1 = the main variable at baseline (baseline health), V2 = the main variable at follow-up (follow-up health). SE = standard error. bpred = regression coefficient from predictor to change in health. Coverage of 95% C.I. = the percentage of the 500 samples with an estimated bpred with a 95% confidence interval containing the true population value. Diff = estimated change score (follow-up health minus baseline health). bpred, SE, and Diff are average results over the 500 generated samples. N in the original samples was 1000. Attrition rate was 50%. The first line shows results when attrition was completely random.
The true population value was bpred = - .10.