| Literature DB >> 34797507 |
Inka Rösel1,2, Lina María Serna-Higuita3, Fatima Al Sayah4, Maresa Buchholz5, Ines Buchholz6, Thomas Kohlmann6, Peter Martus1, You-Shan Feng1,6.
Abstract
PURPOSE: Although multiple imputation is the state-of-the-art method for managing missing data, mixed models without multiple imputation may be equally valid for longitudinal data. Additionally, it is not clear whether missing values in multi-item instruments should be imputed at item or score-level. We therefore explored the differences in analyzing the scores of a health-related quality of life questionnaire (EQ-5D-5L) using four approaches in two empirical datasets.Entities:
Keywords: EQ-5D; Health-related quality of life; Imputation; Missing at random; Missing data
Mesh:
Year: 2021 PMID: 34797507 PMCID: PMC9023409 DOI: 10.1007/s11136-021-03037-3
Source DB: PubMed Journal: Qual Life Res ISSN: 0962-9343 Impact factor: 3.440
Fig. 1Study procedure. MCAR missing completely at random, MAR missing at random, MI multiple imputation, MM mixed model, SE standard error, MSE mean squared error, SHR single self-rated health item
Fig. 2Original patterns of missing EQ-5D-5L data in the ABCD dataset and missing data patterns simulated in the GR dataset (target percentages of missing data). MO mobility, SC self-care, UA usual activities, PD pain/discomfort and AD anxiety/depression, 0: baseline evaluation, 1: evaluation at 1 year and 2: evaluation at 2 years
Baseline characteristics of the ABCD (n = 2040) and GR dataset (n = 450)
| ABCD dataset | GR dataset (CC) | |||
|---|---|---|---|---|
| Mean (SD) or | Mean (SD) or | |||
| Age mean (SD) | 1985 | 63.07 (13.41) | 450 | 53.13 (10.43) |
Gender Male Female | 2027 | 1,110 (54.8%) 917 (45.2%) | 450 | 150 (33.3%) 300 (67.7%) |
Marital status Never married Now married or common law Separated or divorced Widowed | 2012 | 1,459 (72.5%) 127 (6.3%) 230 (11.4%) 196 (9.7%) | 450 | 59 (13.1%) 305 (67.8%) 64 (14.2%) 22 (4.9%) |
Academic status No formal schooling Completed grade school High school College/University | 2028 | 11 (0.5%) 265 (13.1%) 813 (40.1%) 939 (46.3%) | 450 | 68 (15.1%) 244 (54.2%) 112 (24.9%) 26 (5.8%) |
Single self-rated health (SRH) Excellent Very good Good Fair Poor | 2004 | 71 (3.5%) 611 (30.5%) 926 (46.2%) 331 (16.5%) 65 (3.2%) | 450 | 0 (0%) 27 (6.0%) 187 (41.6%) 200 (44.4%) 36 (8.0%) |
| EQ-VAS baseline evaluation | No data | 450 | 58.85 (19.15) | |
| EQ-5D-5L Index score | ||||
| EQ-5D-5L index baseline mean (SD) | 2019 | 0.795 (0.169) | 450 | 0.706 (0.241) |
| EQ-5D-5L index second evaluation mean (SD) | 1507 | 0.793 (0.168) | 450 | 0.762 (0.235) |
| EQ-5D-5L index third evaluation mean (SD) | 1374 | 0.788 (0.173) | Not applicable | |
CC complete cases
Fig. 3Predicted mean EQ-5D-5L (ABCD dataset) over time according to the different approaches
Percentage of item misspecifications after multiple imputation by items (GR dataset)
Gray shaded symptoms item, non-shaded items: functions items, AD anxiety/depression, PD pain/discomfort, MO mobility, SC self-care, UA usual activities
Here we display the percentage of misspecifications of item levels after multiple imputation by item. This was done by calculating the difference between true value from the GR CC dataset and the imputed estimate. Percentages were presented over all imputation and simulation sets
Fig. 4Predicted mean EQ-5D-5L index scores (GR dataset) over time according to the different approaches; MAR scenario. MAR missing at random, MM mixed model, MI multiple imputation. The black lines represent the true scores
Fig. 5Mean square error (MSE) of the models and standard error (SE) of predicted values (post-treatment T1) using different percentage of missing data (GR dataset); MAR scenario. MAR missing at random, MM mixed model, MI multiple imputation, MSE mean squared error, SE standard error