| Literature DB >> 30157752 |
Catherine A Welch1, Séverine Sabia2,3, Eric Brunner2, Mika Kivimäki2, Martin J Shipley2.
Abstract
BACKGROUND: Informative attrition occurs when the reason participants drop out from a study is associated with the study outcome. Analysing data with informative attrition can bias longitudinal study inferences. Approaches exist to reduce bias when analysing longitudinal data with monotone missingness (once participants drop out they do not return). However, findings may differ when using these approaches to analyse longitudinal data with non-monotone missingness.Entities:
Keywords: Informative attrition; Longitudinal observational data; Multiple imputation; Pattern mixture modelling
Mesh:
Year: 2018 PMID: 30157752 PMCID: PMC6114233 DOI: 10.1186/s12874-018-0548-0
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Characteristics of participants in each simulated dataset
| Phase | 5 | 7 | 9 |
|---|---|---|---|
| Smoking status, n (%) | |||
| Non-smoker | 4940 (49.4) | 4940 (49.4) | 4940 (49.4) |
| Ex-smoker | 4343 (43.4) | 4436 (44.4) | 4557 (45.6) |
| Current smoker | 717 (7.2) | 624 (6.2) | 503 (5.0) |
| Age Category (year), n (%) | |||
| < 50 | 2420 (24.2) | ||
| 50 and < 55 | 2967 (29.7) | ||
| 55 and < 60 | 2010 (20.1) | ||
| 60 and < 65 | 1896 (19.0) | ||
| 65 | 707 (7.1) | ||
| Employment Grade, n (%) | |||
| High | 5812 (58.1) | ||
| Intermediate | 3878 (38.8) | ||
| Low | 310 (3.1) | ||
| Education, n (%) | |||
| None | 555 (5.6) | ||
| School | 4675 (46.8) | ||
| University | 4770 (47.7) | ||
| Standardised cognitive function (SD), mean (SD) | |||
| Global | 0.00 (0.78) | −0.22 (0.78) | − 0.42 (0.79) |
| Memory | 0.05 (0.71) | − 0.10 (0.71) | − 0.26 (0.72) |
| Participation status, n (%) | |||
| Response | 8709 (87.1) | 7850 (78.5) | 7549 (75.5) |
| Died | 0 | 240 (2.4) | 560 (5.6) |
| Non-response | 1291 (12.9) | 1485 (14.9) | 1289 (12.9) |
| Attrition | 0 | 425 (4.3) | 602 (6.0) |
Correlations among variables in full simulated global cognitive function data and differences compared to correlations among variables in available case analyses data, data imputed using multiple imputation and after applying pattern mixture modelling
| Global cognitive function | Smoking status | Age | Grade | Education | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Phase | 5 | 7 | 9 | 5 | 7 | 9 | 5 | 5 | 5 | |
| Full simulated data | Global 5 | 1 | ||||||||
| Global 7 | 0.9686 | 1 | ||||||||
| Global 9 | 0.9629 | 0.9691 | 1 | |||||||
| Smoke 5 | −0.0721 | − 0.0860 | − 0.0978 | 1 | ||||||
| Smoke 7 | −0.0595 | − 0.0721 | − 0.0824 | 0.9561 | 1 | |||||
| Smoke 9 | −0.0467 | − 0.0587 | − 0.0693 | 0.9233 | 0.9537 | 1 | ||||
| Age | −0.2280 | − 0.2780 | − 0.3293 | 0.0356 | 0.0440 | 0.0538 | 1 | |||
| Grade | −0.4746 | − 0.4462 | − 0.4203 | 0.1134 | 0.0985 | 0.0905 | −0.0516 | 1 | ||
| Education | 0.3794 | 0.3721 | 0.3664 | −0.1144 | − 0.1056 | − 0.1053 | − 0.0782 | − 0.3666 | 1 | |
| Differencesa in correlations from those above | ||||||||||
| Available case | Global 5 | 0 | ||||||||
| Global 7 | 0.0055 | 0 | ||||||||
| Global 9 | 0.0025 | 0.0067 | 0 | |||||||
| Smoke 5 | −0.0051 | 0.0035 | 0.0062 | 0 | ||||||
| Smoke 7 | − 0.0058 | 0.0011 | 0.0019 | −0.0047 | 0 | |||||
| Smoke 9 | −0.0106 | − 0.0022 | − 0.0025 | − 0.0051 | −0.0011 | 0 | ||||
| Age | −0.1419 | −0.1809 | − 0.2112 | −0.0332 | − 0.0283 | −0.0265 | 0 | |||
| Grade | −0.0142 | − 0.0036 | − 0.0020 | 0.0107 | 0.0099 | 0.0086 | −0.0040 | 0 | ||
| Education | 0.0503 | 0.0535 | 0.0446 | −0.0162 | − 0.01520 | − 0.0133 | − 0.0351 | −0.0342 | 0 | |
| Multiple imputation | Global 5 | 0 | ||||||||
| Global 7 | 0.0028 | 0 | ||||||||
| Global 9 | 0.0198 | 0.0070 | 0 | |||||||
| Smoke 5 | 0.0463 | 0.0485 | 0.0593 | 0 | ||||||
| Smoke 7 | 0.0481 | 0.0505 | 0.0631 | 0.0073 | 0 | |||||
| Smoke 9 | 0.0389 | 0.0436 | 0.0564 | 0.0130 | 0.0076 | 0 | ||||
| Age | −0.0160 | −0.0205 | 0.0013 | −0.1233 | − 0.1276 | − 0.1280 | 0 | |||
| Grade | 0.0078 | 0.0141 | 0.0063 | −0.0173 | − 0.0173 | − 0.0122 | 0 | 0 | ||
| Education | 0.0009 | −0.0010 | −0.0008 | 0.0280 | 0.0260 | 0.0298 | 0 | 0 | 0 | |
| Pattern mixture modelling | Global 5 | 0 | ||||||||
| Global 7 | 0.0102 | 0 | ||||||||
| Global 9 | 0.0275 | 0.0076 | 0 | |||||||
| Smoke 5 | 0.0463 | 0.0568 | 0.0679 | 0 | ||||||
| Smoke 7 | 0.0481 | 0.0581 | 0.0710 | 0.0073 | 0 | |||||
| Smoke 9 | 0.0389 | 0.0511 | 0.0637 | 0.0130 | 0.0076 | 0 | ||||
| Age | −0.0160 | − 0.0190 | − 0.0021 | − 0.1233 | − 0.1276 | − 0.1280 | 0 | |||
| Grade | 0.0078 | 0.0173 | 0.0112 | −0.0173 | − 0.0173 | − 0.0122 | 0 | 0 | ||
| Education | 0.0009 | −0.0035 | −0.0038 | 0.0280 | 0.0260 | 0.0298 | 0 | 0 | 0 | |
aDifferences in correlations are calculated as correlation in analysis type minus correlation in full simulated data
Slope coefficients and SE from mixed effects substantive model (random intercepts and slopes) with global cognitive score outcome and 5% of participants with informative attrition
| Observed in Whitehall II study | Full simulated data | Estimation method | Estimation method | |||||
|---|---|---|---|---|---|---|---|---|
| Available case | MI | PMM | Available case | MI | PMM | |||
| Impute smokinga | Impute educationa | |||||||
| Coefficient (SE) | ||||||||
| Reference | − 0.3120 | − 0.3120 (0.0095) | − 0.3082 (0.0139) | −0.3026 (0.0141) | − 0.3401 (0.0153) | − 0.3090 0.0133) | − 0.2885 (0.0104) | − 0.3240 (0.0119) |
| Smoking status | ||||||||
| Ex-smoker | −0.3271 | − 0.3272 (0.0096) | − 0.3236 (0.0142) | − 0.3208 (0.0145) | − 0.3710 (0.0159) | − 0.3244 (0.0135) | − 0.3040 (0.0108) | − 0.3377 (0.0123) |
| Current smoker | −0.4228 | − 0.4229 (0.0110) | − 0.4185 (0.0159) | − 0.3970 (0.0162) | − 0.4558 (0.0176) | −0.4190 (0.0156) | − 0.3975 (0.0131) | −0.4447 (0.0152) |
| Age Category (year) | ||||||||
| 50 and < 55 | −0.3619 | −0.3619 (0.0091) | − 0.3577 (0.0135) | −0.3510 (0.0138) | − 0.3925 (0.0151) | −0.3585 (0.0129) | − 0.3377 (0.0100) | −0.3779 (0.0117) |
| 55 and < 60 | − 0.4400 | −0.4398 (0.0093) | −0.4350 (0.0139) | − 0.4262 (0.0142) | − 0.4789 (0.0156) | − 0.4359 (0.0132) | − 0.4133 (0.0105) | − 0.4644 (0.0124) |
| 60 and < 65 | − 0.5029 | −0.5029 (0.0095) | −0.4971 (0.0154) | − 0.4824 (0.0158) | − 0.5517 (0.0170) | − 0.4984 (0.0148) | − 0.4700 (0.0128) | − 0.5390 (0.0145) |
| 65 | −0.5699 | − 0.5703 (0.0108) | −0.5648 (0.0419) | − 0.5382 (0.0417) | −0.5717 (0.0425) | − 0.5637 (0.0433) | −0.5204 (0.0433) | − 0.5588 (0.0438) |
| Employment grade | ||||||||
| Intermediate | −0.2481 | −0.2483 (0.0091) | −0.2434 (0.0135) | − 0.2358 (0.0137) | − 0.2928 (0.0151) | − 0.2443 (0.0126) | − 0.2237 (0.0100) | − 0.2805 (0.0117) |
| Low | − 0.2178 | − 0.2173 (0.0129) | − 0.2093 (0.0198) | − 0.1974 (0.0200) | − 0.2963 (0.0226) | −0.2096 (0.0189) | − 0.1857 (0.0178) | −0.2870 (0.0208) |
| Education | ||||||||
| School | −0.3222 | −0.3223 (0.0052) | − 0.3214 (0.0063) | −0.3198 (0.0064) | − 0.3260 (0.0068) | −0.3213 (0.0059) | − 0.3165 (0.0059) | −0.3274 (0.0063) |
| University | −0.3232 | − 0.3232 (0.0043) | − 0.3229 (0.0051) | − 0.3224 (0.0052) | − 0.3220 (0.0055) | − 0.3229 (0.0052) | − 0.3241 (0.0051) | − 0.3304 (0.0054) |
MI multiple imputation, PMM pattern mixture modelling
aMissing global cognitive function scores also imputed
Fig. 1Slope coefficient bias for smoking status and education categories with global cognitive function outcome when imputing the outcome together with either smoking status or education. a - % bias, b - mean square error ratio, c – coverage. a - % bias closest to zero indicates least bias approach. b - Ratios relative to available case analysis MSE. Ratio less than one indicates less bias compared to the available case analysis. c - Coverage close to 95% indicate proper control of the type I error rate for testing a null hypothesis of no effect
Correlations among variables in full simulated memory cognitive score data and differences compared to correlations among variables in available case analyses data, data imputed using multiple imputation and after applying pattern mixture modelling
| Memory cognitive function | Smoking status | Age | Grade | Education | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Phase | 5 | 7 | 9 | 5 | 7 | 9 | 5 | 5 | 5 | |
| Full simulated data | Mem 5 | 1 | ||||||||
| Mem 7 | 0.4419 | 1 | ||||||||
| Mem 9 | 0.4423 | 0.4458 | 1 | |||||||
| Smoke 5 | −0.0806 | −0.0884 | −0.1041 | 1 | ||||||
| Smoke 7 | − 0.0715 | − 0.0800 | − 0.0948 | 0.9616 | 1 | |||||
| Smoke 9 | −0.0636 | −0.0764 | − 0.0902 | 0.9337 | 0.9592 | 1 | ||||
| Age | −0.2729 | − 0.3127 | − 0.3379 | 0.0281 | 0.0369 | 0.0451 | 1 | |||
| Grade | −0.2404 | − 0.1913 | − 0.1660 | 0.0983 | 0.0881 | 0.0817 | −0.0501 | 1 | ||
| Education | 0.2058 | 0.1919 | 0.1779 | −0.1114 | − 0.1043 | − 0.1005 | − 0.0784 | − 0.3691 | 1 | |
| Differencesa in correlations from those above | ||||||||||
| Available case | Mem 5 | 0 | ||||||||
| Mem 7 | 0.0355 | 0 | ||||||||
| Mem 9 | 0.0544 | 0.0630 | 0 | |||||||
| Smoke 5 | 0.0159 | 0.0029 | 0.0040 | 0 | ||||||
| Smoke 7 | 0.0071 | 0.0032 | 0.0004 | 0.0009 | 0 | |||||
| Smoke 9 | 0.0137 | −0.0005 | 0.0050 | 0.0025 | 0.0087 | 0 | ||||
| Age | −0.1632 | − 0.1679 | − 0.2154 | − 0.0253 | − 0.0237 | − 0.0194 | 0 | |||
| Grade | −0.0120 | − 0.0102 | − 0.0125 | 0.0066 | 0.0043 | 0.0004 | −0.0104 | 0 | ||
| Education | 0.0052 | 0.0032 | 0.0339 | −0.0050 | − 0.0022 | 0.0015 | − 0.0151 | − 0.0031 | 0 | |
| Multiple imputation | Mem 5 | 0 | ||||||||
| Mem 7 | 0.0157 | 0 | ||||||||
| Mem 9 | 0.0970 | 0.0364 | 0 | |||||||
| Smoke 5 | 0.0528 | 0.0579 | 0.0306 | 0 | ||||||
| Smoke 7 | 0.0522 | 0.0550 | 0.0360 | 0.0062 | 0 | |||||
| Smoke 9 | 0.0532 | 0.0506 | 0.0377 | 0.0083 | 0.0080 | 0 | ||||
| Age | −0.0186 | −0.0298 | − 0.0269 | − 0.1200 | − 0.1246 | − 0.1237 | 0 | |||
| Grade | 0.0067 | − 0.0012 | − 0.0070 | − 0.0116 | − 0.0145 | − 0.0079 | 0 | 0 | ||
| Education | − 0.0067 | 0.0169 | 0.0066 | 0.0282 | 0.0272 | 0.0329 | 0 | 0 | 0 | |
| Pattern mixture modelling | Mem 5 | 0 | ||||||||
| Mem 7 | 0.0115 | 0 | ||||||||
| Mem 9 | 0.0877 | 0.0099 | 0 | |||||||
| Smoke 5 | 0.0528 | 0.0675 | 0.0410 | 0 | ||||||
| Smoke 7 | 0.0522 | 0.0655 | 0.0464 | 0.0062 | 0 | |||||
| Smoke 9 | 0.0532 | 0.0612 | 0.0481 | 0.0083 | 0.0080 | 0 | ||||
| Age | −0.0186 | − 0.0204 | − 0.0194 | − 0.1200 | − 0.1246 | − 0.1237 | 0 | |||
| Grade | 0.0067 | 0.0020 | 0.0007 | −0.0116 | − 0.0145 | − 0.0079 | 0 | 0 | ||
| Education | −0.0067 | 0.0123 | −0.0001 | 0.0282 | 0.0272 | 0.0329 | 0 | 0 | 0 | |
aDifferences in correlations are calculated as correlation in analysis type minus correlation in full simulated data
Slope coefficients and SE from mixed effects substantive model (random intercepts and slopes) with memory cognitive score outcome and 5% of participants with informative attrition
| Observed in Whitehall II study | Full simulated data | Estimation method | Estimation method | |||||
|---|---|---|---|---|---|---|---|---|
| Available case | MI | PMM | Available case | MI | PMM | |||
| Imputea smoking | Imputea education | |||||||
| Coefficient (SE) | ||||||||
| Reference | − 0.2492 | − 0.2508 (0.0356) | − 0.2320 (0.0507) | − 0.2290 (0.0487) | −0.2439 (0.0486) | − 0.2331 (0.0491) | − 0.1583 (0.0413) | − 0.1765 (0.0414) |
| Smoking status | ||||||||
| Ex-smoker | −0.2978 | −0.2989 (0.0361) | −0.2792 (0.0513) | − 0.2798 (0.0489) | −0.3104 (0.0489) | − 0.2799 (0.0496) | −0.2055 (0.0414) | − 0.2246 (0.0415) |
| Current smoker | −0.2698 | −0.2718 (0.0418) | −0.2459 (0.0581) | − 0.2202 (0.0543) | −0.2564 (0.0544) | − 0.2444 (0.0574) | −0.1724 (0.0474) | − 0.2008 (0.0473) |
| Age Category (year) | ||||||||
| 50 and < 55 | −0.3059 | −0.3064 (0.0347) | −0.2821 (0.0493) | − 0.2776 (0.0476) | −0.2980 (0.0477) | − 0.2829 (0.0472) | −0.2099 (0.0401) | − 0.2343 (0.0401) |
| 55 and < 60 | −0.3405 | −0.3414 (0.0357) | −0.3076 (0.0511) | − 0.3003 (0.0492) | −0.3316 (0.0492) | − 0.3085 (0.0489) | −0.2338 (0.0411) | − 0.2687 (0.0410) |
| 60 and < 65 | −0.3967 | −0.3978 (0.0360) | −0.3486 (0.0553) | − 0.3302 (0.0523) | −0.3831 (0.0526) | − 0.3508 (0.0527) | −0.2661 (0.0456) | − 0.3218 (0.0457) |
| 65 | −0.3318 | −0.3327 (0.0410) | −0.2803 (0.1437) | − 0.2400 (0.1081) | −0.2561 (0.1078) | − 0.2764 (0.1481) | −0.1795 (0.1132) | − 0.2037 (0.1135) |
| Employment grade | ||||||||
| Intermediate | -0.1616 | − 0.1638 (0.0335) | − 0.1388 (0.0479) | − 0.1342(0.0465) | − 0.1577 (0.0463) | − 0.1390 (0.0462) | − 0.0746 (0.0389) | − 0.1027 (0.0393) |
| Low | −0.2059 | − 0.2080 (0.0484) | − 0.1703 (0.0695) | − 0.1656 (0.0664) | − 0.2031 (0.0659) | − 0.1736 (0.0662) | − 0.1182 (0.0611) | − 0.1602 (0.0617) |
| Education | ||||||||
| School | −0.2413 | − 0.2414 (0.0206) | − 0.2317 (0.0249) | − 0.2298 (0.0238) | − 0.2368 (0.0238) | − 0.2321 (0.0253) | − 0.2116 (0.0233) | − 0.2245 (0.0233) |
| University | − 0.2533 | − 0.2540 (0.0178) | − 0.2480 (0.0212) | − 0.2482 (0.0203) | − 0.2517 (0.0204) | − 0.2480 (0.0211) | − 0.2558 (0.0200) | − 0.2657 (0.0201) |
MI multiple imputation, PMM pattern mixture modelling
aMissing memory cognitive function scores also imputed
Fig. 2Slope coefficient bias for smoking status and education categories with memory cognitive function outcome when imputing the outcome together with either smoking status or education. a - % bias, b - mean square error ratio, c – coverage. a - % bias closest to zero indicates least bias approach. b - Ratios relative to available case analysis MSE. Ratio less than one indicates less bias compared to the available case analysis. c - Coverage close to 95% indicate proper control of the type I error rate for testing a null hypothesis of no effect