| Literature DB >> 27867251 |
Shaun R Seaman1, Daniel Farewell2, Ian R White1.
Abstract
Linear increments (LI) are used to analyse repeated outcome data with missing values. Previously, two LI methods have been proposed, one allowing non-monotone missingness but not independent measurement error and one allowing independent measurement error but only monotone missingness. In both, it was suggested that the expected increment could depend on current outcome. We show that LI can allow non-monotone missingness and either independent measurement error of unknown variance or dependence of expected increment on current outcome but not both. A popular alternative to LI is a multivariate normal model ignoring the missingness pattern. This gives consistent estimation when data are normally distributed and missing at random (MAR). We clarify the relation between MAR and the assumptions of LI and show that for continuous outcomes multivariate normal estimators are also consistent under (non-MAR and non-normal) assumptions not much stronger than those of LI. Moreover, when missingness is non-monotone, they are typically more efficient.Entities:
Keywords: ignorability; imputation; missing not at random; mortal cohort inference; non‐ignorable missing data; partly conditional inference
Year: 2016 PMID: 27867251 PMCID: PMC5111617 DOI: 10.1111/sjos.12225
Source DB: PubMed Journal: Scand Stat Theory Appl ISSN: 0303-6898 Impact factor: 1.396
Means and empirical SEs of estimated μ in Simulation Study 1
| Method |
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| True values | 0.250 | 0.950 | 1.790 | 2.798 | 4.008 | 5.459 |
| Means | ||||||
| Complete data | 0.249 | 0.945 | 1.784 | 2.790 | 3.998 | 5.447 |
| Complete cases | 0.249 | 1.482 | 2.471 | 4.164 | 6.009 | 8.325 |
| Estim. compens. | 0.249 | 0.944 | 1.784 | 2.798 | 4.017 | 5.469 |
| LI–LS impute | 0.249 | 0.944 | 1.786 | 2.796 | 4.007 | 5.455 |
| LI–uMVN impute | 0.249 | 0.946 | 1.786 | 2.790 | 3.997 | 5.454 |
| LI–aMVN impute | 0.249 | 0.946 | 1.787 | 2.788 | 3.999 | 5.450 |
| uMVN impute | 0.249 | 0.947 | 1.784 | 2.791 | 3.998 | 5.455 |
| aMVN impute | 0.249 | 0.947 | 1.786 | 2.789 | 4.000 | 5.450 |
| Empirical SEs | ||||||
| Complete data | 0.045 | 0.084 | 0.120 | 0.160 | 0.204 | 0.255 |
| Complete cases | 0.045 | 0.115 | 0.172 | 0.224 | 0.278 | 0.319 |
| Estim. compens. | 0.045 | 0.113 | 0.227 | 0.343 | 0.477 | 0.615 |
| LI–LS impute | 0.045 | 0.113 | 0.179 | 0.238 | 0.296 | 0.354 |
| LI–uMVN impute | 0.045 | 0.102 | 0.140 | 0.186 | 0.234 | 0.292 |
| LI–aMVN impute | 0.045 | 0.100 | 0.139 | 0.186 | 0.233 | 0.291 |
| uMVN impute | 0.045 | 0.099 | 0.137 | 0.182 | 0.231 | 0.292 |
| aMVN impute | 0.045 | 0.097 | 0.136 | 0.182 | 0.230 | 0.291 |
LI, linear increment; LS, least‐square; MVN, multivariate normal; aMVN, autoregressive MVN.
Means and empirical SEs of estimated μ in Simulation Study 2
| Method |
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| True values | 0.250 | 1.025 | 2.062 | 3.231 | 4.466 | 5.733 | 7.016 |
| Means | |||||||
| Complete data | 0.247 | 1.022 | 2.059 | 3.230 | 4.465 | 5.733 | 7.014 |
| Complete cases | 0.247 | 1.702 | 3.066 | 3.911 | 5.589 | 7.256 | 12.222 |
| Estim. compens. | 0.247 | 1.022 | 2.065 | 3.237 | 4.466 | 5.735 | 7.015 |
| LI–LS impute | 0.247 | 1.022 | 2.065 | 3.233 | 4.462 | 5.731 | 7.010 |
| LI–uMVN impute | 0.247 | 1.022 | 2.061 | 3.229 | 4.463 | 5.729 | 7.014 |
| uMVN impute | 0.247 | 1.023 | 2.061 | 3.227 | 4.463 | 5.729 | 7.014 |
| Empirical SEs | |||||||
| Complete data | 0.046 | 0.095 | 0.148 | 0.188 | 0.226 | 0.262 | 0.293 |
| Complete cases | 0.046 | 0.125 | 0.197 | 0.239 | 0.286 | 0.326 | 0.557 |
| Estim. compens. | 0.046 | 0.126 | 0.258 | 0.480 | 0.672 | 0.809 | 0.925 |
| LI–LS impute | 0.046 | 0.126 | 0.206 | 0.275 | 0.304 | 0.330 | 0.428 |
| LI–uMVN impute | 0.046 | 0.113 | 0.164 | 0.205 | 0.245 | 0.282 | 0.386 |
| uMVN impute | 0.046 | 0.110 | 0.159 | 0.200 | 0.241 | 0.282 | 0.387 |
LI, linear increment; LS, least‐square; MVN, multivariate normal.
Estimated mean earnings (and SEs) in calendar years 1991–1998 (1000's of pounds).
| Year | ||||||||
|---|---|---|---|---|---|---|---|---|
| Method | 1991 | 1992 | 1993 | 1994 | 1995 | 1996 | 1997 | 1998 |
| Estimated means | ||||||||
| Observed data | 14.35 | 14.55 | 15.28 | 15.60 | 16.85 | 17.25 | 17.96 | 18.63 |
| Compensator | 13.87 | 14.45 | 15.10 | 15.84 | 16.72 | 17.46 | 18.57 | 19.24 |
| LI–LS | 13.87 | 14.33 | 14.91 | 15.49 | 16.45 | 17.12 | 17.92 | 18.51 |
| LI–uMVN | 13.73 | 14.18 | 14.75 | 15.31 | 16.26 | 16.93 | 17.76 | 18.40 |
| uMVN | 13.48 | 13.96 | 14.49 | 15.14 | 15.99 | 16.69 | 17.75 | 18.46 |
| Compensator(2) | 13.87 | 14.45 | 15.12 | 15.89 | 16.72 | 17.41 | 18.41 | 19.11 |
| LI–LS(2) | 13.87 | 14.44 | 15.03 | 15.65 | 16.58 | 17.25 | 18.08 | 18.80 |
| LI–uMVN(2) | 13.77 | 14.33 | 14.88 | 15.49 | 16.44 | 17.14 | 18.02 | 18.74 |
| Standard errors | ||||||||
| Observed data | 0.240 | 0.228 | 0.250 | 0.263 | 0.321 | 0.283 | 0.321 | 0.352 |
| Compensator | 0.233 | 0.234 | 0.260 | 0.281 | 0.293 | 0.298 | 0.349 | 0.403 |
| LI–LS | 0.233 | 0.213 | 0.228 | 0.239 | 0.265 | 0.258 | 0.282 | 0.333 |
| LI–uMVN | 0.220 | 0.202 | 0.216 | 0.230 | 0.265 | 0.253 | 0.285 | 0.328 |
| uMVN | 0.209 | 0.192 | 0.208 | 0.228 | 0.248 | 0.252 | 0.280 | 0.331 |
| Compensator(2) | 0.233 | 0.234 | 0.266 | 0.291 | 0.301 | 0.315 | 0.336 | 0.420 |
| LI–LS(2) | 0.233 | 0.220 | 0.240 | 0.254 | 0.276 | 0.273 | 0.287 | 0.362 |
| LI–uMVN(2) | 0.224 | 0.209 | 0.226 | 0.241 | 0.276 | 0.267 | 0.297 | 0.360 |
Methods are means of observed data, estimating the compensator, LI–LS imputation, LI–uMVN imputation and uMVN imputation.
All LI methods use a LI model with autoregression of order one, unless they are marked ‘(2)’, in which case they use a LI model with second‐order autoregression.
LS, least‐square; LI, linear increments; MVN, multivariate normal.
Estimated coefficients (and standard errors) of linear regression of earnings (1000's of pounds) on current age group and year.
| Current age | Year | |||||
|---|---|---|---|---|---|---|
| Method | Intercept | 26–30 | 31–40 | 41–50 | 51–60 | –1991 |
| Estimates | ||||||
| Observed data | 8.322 | 4.018 | 7.386 | 7.948 | 5.472 | 0.475 |
| LI–LS | 8.517 | 3.879 | 7.109 | 7.774 | 5.792 | 0.457 |
| LI–uMVN | 8.557 | 3.614 | 6.855 | 7.519 | 5.496 | 0.468 |
| uMVN | 8.404 | 3.386 | 6.641 | 7.396 | 5.330 | 0.512 |
| LI–LS(2) | 8.504 | 3.783 | 7.236 | 7.907 | 6.127 | 0.473 |
| LI–uMVN(2) | 8.573 | 3.596 | 6.960 | 7.615 | 5.722 | 0.494 |
| Standard errors | ||||||
| Observed data | 0.322 | 0.377 | 0.459 | 0.525 | 0.799 | 0.049 |
| LI–LS | 0.298 | 0.335 | 0.436 | 0.520 | 0.713 | 0.046 |
| LI–uMVN | 0.284 | 0.315 | 0.422 | 0.496 | 0.691 | 0.045 |
| uMVN | 0.298 | 0.323 | 0.433 | 0.486 | 0.686 | 0.042 |
| LI–LS(2) | 0.300 | 0.338 | 0.447 | 0.541 | 0.764 | 0.047 |
| LI.uMVN(2) | 0.289 | 0.319 | 0.435 | 0.516 | 0.729 | 0.047 |
Methods are observed data, LI–LS imputation, LI–uMVN imputation and uMVN imputation. LI methods use a LI model with autoregression of order one, unless they are marked ‘(2)’, in which case they use a LI model with second‐order autoregression.
LI, linear increment; LS, least‐square; MVN, multivariate normal.