| Literature DB >> 25524532 |
Gillian M Hendry1, Rajen N Naidoo, Temesgen Zewotir, Delia North, Graciela Mentz.
Abstract
BACKGROUND: Multiple imputation is a reliable tool to deal with missing data and is becoming increasingly popular in biostatistics. However, building a model with interactions that are not specified a priori, in the presence of missing data, presents a challenge. On the one hand, the interactions are needed to impute the data, while on the other hand, the data is needed to identify the interactions. The objective of this study was to present a way in which this challenge can be addressed.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25524532 PMCID: PMC4289583 DOI: 10.1186/1471-2288-14-136
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Recommended number of imputations needed for varying fractions of missing data (Graham [9])
| Fraction of missing data | 0.1 | 0.3 | 0.5 | 0.7 | 0.9 |
| Number of imputations | 20 | 20 | 40 | 100 | >100 |
Variables, categories and the percentage missing
| Variable | Response category | % missing |
|---|---|---|
| Gender | male/female | 0 |
| Neonatal care | yes/no | 3.7 |
| Birth weight | up to 2.5 kg/>2.5 kg/don’t know | 1.0 |
| Fear in neighbourhood | yes/no | 6.5 |
| Smoked while pregnant | yes/no | 50. |
| Smokers in the home | yes/no | 0.3 |
| Smoke exposure in vehicles | yes/no | 7.6 |
| Exercise | Up to once a week/2-4 times a week/>4 times a week | 6.3 |
| TV watching | Up to an hour a day/1-3 hours a day/>3 hours a day | 6.5 |
| Number people in home | 1-4/5-7/8+ | 9.2 |
| Income (monthly) | up to R1000/R1001-R4500/R4501-R10000/R10001+ | 19.4 |
| Food availability | not always enough/enough | 8.4 |
| Perceived weight | overweight/underweight/correct weight | 6.8 |
| Work and wear | yes/no | 3.7 |
| Pets at home ever | yes/no | 1.0 |
| Area | South Durban/North Durban | 0 |
| Breakfast habits | Not every day/daily | 6.5 |
| Violence experienced | yes/no | 7.3 |
| Attacked with weapons | yes/no | 7.3 |
| Stove type | paraffin/gas/electric/none | 9.9 |
| Age | 0 | |
| Asthma severity | Moderate-severe/mild persistent/mild intermittent/no asthma | 0 |
Estimated coefficients (EST) and standard errors (SE) for the predictors selected in the different analyses
| Predictor | Reference | Category | CC (N = 216) | MVNI (N = 382) | FCS1 (N = 382) | FCS2 (N = 382) | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| Category | EST | SE | EST | SE | EST | SE | EST | SE | ||
| Gender | Female | Male | -0.441 | 0.674 | 0.129 | 0.398 | 0.030 | 0.391 | 0.017 | 0.390 |
| Neonatal care | No | Yes | 2.484* | 0.723 | 1.103* | 0.444 | 1.112* | 0.450 | 1.085* | 0.446 |
| Fear | No | Yes | -1.169 | 0.649 | -0.958* | 0.431 | -1.009* | 0.451 | -1.073* | 0.444 |
| Smoked while pregnant | No | Yes | 4.256* | 1.237 | 1.019 | 0.736 | 0.885 | 0.693 | 0 | |
| Smokers in home | No | Yes | 0.939 | 0.537 | 0.742* | 0.352 | 0.761* | 0.341 | 0.801* | 0.335 |
| Smoke in vehicles | No | Yes | -2.584* | 0.921 | -0.253 | 1.068 | -0.308 | 1.011 | -0.323 | 1.015 |
| Exercise | >4 times a week | Up to once a week | 2.805* | 1.227 | 0.892 | 0.761 | 0.692 | 0.756 | 0.624 | 0.731 |
| 2 – 4 times a week | 3.313* | 1.229 | 1.039 | 0.717 | 0.936 | 0.718 | 0.738 | 0.680 | ||
| TV watching | >3 hours a day | Up to 1 hour a day | -0.566 | 0.854 | 0.399 | 0.684 | 0.327 | 0.669 | 0.346 | 0.657 |
| 1 – 3 hours a day | 0.304 | 0.769 | 0.641 | 0.639 | 0.525 | 0.630 | 0.569 | 0.618 | ||
| Number people in home | 8+ | 1 - 4 | 0 | 1.084 | 0.554 | 1.060* | 0.539 | 1.101* | 0.526 | |
| 5 - 7 | 0 | 0.226 | 0.552 | 0.254 | 0.551 | 0.250 | 0.540 | |||
| Income | R100001+ | up to R1000 | 2.840* | 1.257 | 0.695 | 0.8 | 0.787 | 0.789 | 0.823 | 0.778 |
| R1001 – R4500 | 1.285 | 1.203 | 0.209 | 0.797 | 0.489 | 0.754 | 0.431 | 0.754 | ||
| R4501 – R10000 | 1.933 | 1.17 | 1.428 | 0.783 | 1.401* | 0.692 | 1.356 | 0.692 | ||
| Food availability | Enough | Not always enough | -0.575 | 0.64 | 0.604 | 0.503 | 0.665 | 0.464 | 0.677 | 0.455 |
| Perceived weight | Correct weight | Overweight | -0.230 | 0.743 | 0 | 0 | 0 | |||
| Underweight | 2.369* | 0.97 | 0 | 0 | 0 | |||||
| Work’nWear | No | Yes | 0 | -0.635 | 0.626 | -0.543 | 0.629 | -0.478 | 0.622 | |
| Pets ever | No | Yes | -3.770* | 0.994 | 1.658* | 0.501 | -1.483* | 0.503 | -1.413* | 0.467 |
| Area | North Durban | South Durban | 6.278* | 1.461 | 2.042* | 0.76 | 1.948* | 0.737 | 1.597* | 0.671 |
| Breakfast habits | Daily | Not daily | -4.098 | 3.04 | -0.492 | 1.512 | -0.234 | 1.548 | -0.110 | 1.518 |
| Violence | No | Yes | 0 | -0.817* | 0.382 | -0.741* | 0.377 | -0.715 | 0.373 | |
| Weapons | No | Yes | -1.147* | 0.555 | 0 | 0 | 0 | |||
| Age | -1.068* | 0.438 | -0.79* | 0.254 | -0.833* | 0.268 | -0.834* | 0.265 | ||
|
|
|
|
|
|
|
| ||||
|
|
|
|
|
|
|
|
|
| ||
| Fear*Breakfast | No/daily | Yes/not daily | 2.635* | 1.219 | 2.047* | 0.866 | 2.123* | 0.916 | 2.185* | 0.911 |
| Gender*SmokeVehicle | Female/No | Male/yes | 5.092* | 1.342 | 2.535* | 1.034 | 2.431* | 0.977 | 2.464* | 0.971 |
| SmokeVehicle*TV | No/>3 hrs | Yes/up to 1 hr | 0 | 0.891 | 1.298 | 0.675 | 1.265 | 0.722 | 1.250 | |
| Yes/1 – 3 hrs | 0 | -2.184* | 1.085 | -1.975 | 1.034 | -2.002 | 1.037 | |||
| Food*Age | enough/ | Not always enough/ | 1.762* | 0.743 | 0.925* | 0.396 | 0.786* | 0.385 | 0.778* | 0.364 |
| Exercise*Area | >4 times/ND | < once a week/SD | -4.573* | 1.533 | -1.41 | 1.031 | -1.255 | 0.954 | -1.125 | 0.923 |
| 2 – 4 times/SD | -6.331* | 1.627 | -1.981* | 0.913 | -1.805* | 0.896 | -1.551 | 0.850 | ||
| Income*Breakfast | > R10000/daily | ≤R1000/not daily | -4.051 | 2.5 | -3.921* | 1.8 | -3.666* | 1.731 | -3.808* | 1.733 |
| R1001-R4500/not daily | 0.414 | 2.408 | -1.218 | 1.636 | -1.439 | 1.530 | -1.513 | 1.516 | ||
| R4501-R10000/not daily | 2.479 | 2.395 | -1.374 | 1.541 | -1.568 | 1.454 | -1.715 | 1.431 | ||
| TV*Breakfast | >3 hrs/daily | ≤1 hr/not daily | 6.310* | 2.213 | 2.573* | 1.259 | 2.051 | 1.192 | 1.976 | 1.186 |
| 1-3 hrs/not daily | 1.974 | 2.154 | 0.192 | 1.109 | 0.270 | 1.112 | 0.192 | 1.103 | ||
| SmokeVehicle*Age | no/ | yes/ | 0 | 0.814* | 0.375 | 0.809* | 0.348 | 0.782* | 0.341 | |
| Smoke preg*Area | no/ND | yes/SD | -5.118* | 2.101 | -1.875 | 1.363 | -1.663 | 1.291 | 0 | |
| Work’nWear*Breakfast | no/not daily | yes/daily | 0 | 2.349* | 1.076 | 2.095 | 1.070 | 2.165* | 1.090 | |
ND – North Durban; SD – South Durban; preg – pregnant.
CC – Complete case.
MVNI – Multiple imputed MVNI strategies 1 and 2.
FCS1 -Multiple imputed FCS strategy 1.
FCS2 -Multiple imputed FCS strategy 2.
*Significant at the 0.05 level.
Figure 1Differences in measured (observed) and imputed data. A comparison of the distributions of the 4 variables with the most missing data for the complete case data (CC), MVNI imputed data, FCS imputed data and measured data.