| Literature DB >> 31673410 |
Denis Mongin1, Kim Lauper1, Carl Turesson2,3, Merete Lund Hetland4,5, Eirik Klami Kristianslund6, Tore K Kvien6, Maria Jose Santos7, Karel Pavelka8, Florenzo Iannone9, Axel Finckh1, Delphine Sophie Courvoisier1.
Abstract
Objective: To compare several methods of missing data imputation for function (Health Assessment Questionnaire) and for disease activity (Disease Activity Score-28 and Clinical Disease Activity Index) in rheumatoid arthritis (RA) patients.Entities:
Keywords: DAS28; disease activity; epidemiology; outcomes research; rheumatoid arthritis
Mesh:
Year: 2019 PMID: 31673410 PMCID: PMC6802981 DOI: 10.1136/rmdopen-2019-000994
Source DB: PubMed Journal: RMD Open ISSN: 2056-5933
Figure 1Mean disease activity score based on 28 joints (left panel) and corresponding remission rate (right panel) of the data panel where the 60% of data missing completely at random at follow-up were handled with complete case analysis (CCA), last observation carried forward (LOCF), linear forward extrapolation (LFE), nearest available observation (NAO), linear extrapolation (LE), polynomial extrapolation (PE), linear mixed effect cubic regression (LME3) and multiple imputation by chained equation (MICE). Error bars are the standard errors calculated over the 1000 simulation samples. LFE and LOCF estimation are missing for t=0 since these methods use previous values, which do not exist at baseline. Black horizontal lines represent the true values for each follow-up.
Figure 2Distribution of disease activity score based on 28 joints values imputed with last observation carried forward (LOCF), linear forward extrapolation (LFE), nearest available observation (NAO), linear extrapolation (LE), polynomial extrapolation (PE), linear mixed effect cubic regression (LME3) and multiple imputation by chained equation (MICE), for two different follow-up (t=0 and t=12 months), 1000 imputations of 60% of data missing completely at random at follow-up. Black plain area is the true distribution, and colour lines are interpolations of the imputed dataset’s distribution. The vertical line corresponds to the threshold of remission rate.
Figure 3Mean disease activity score based on 28 joints (left panel) and corresponding remission rate (right panel) of the data panel where the 60% of data missing at random at follow-up were handled with complete case analysis (CCA), last observation carried forward (LOCF), linear forward extrapolation (LFE), nearest available observation (NAO), linear extrapolation (LE), polynomial extrapolation (PE), linear mixed effect cubic regression (LME3) and multiple imputation by chained equation (MICE). Error bars are the standard errors calculated over the 1000 simulation samples. LFE and LOCF estimation are missing for t=0 since these methods use previous values, which do not exist at baseline. Black horizontal lines represent the true values for each follow-up.
Summary of the main findings for each imputation method considered
| Method | Imputed mean disease activity | Imputed remission rate | Effect of random missingness | Sensitivity to quantity of data around follow-up | ||
| Baseline | Follow-up | Baseline | Follow-up | |||
| LOCF | Unusable | Over-estimated | Unusable | Under-estimated | None | Yes |
| LFE | Unusable | Under-estimated | Unusable | Over-estimated | None | Yes |
| NAO | Strong under-estimation | Close to unbiased | Strong over-estimation | Close to unbiased | None | Small |
| LE | Strong under-estimation | Close to unbiased | Strong over-estimation | Slightly biassed | None | Small |
| PE | Strong under-estimation | Slightly biassed | Strong over-estimation | Close to unbiased | None | Yes |
| LME3 | Strong under-estimation | Close to unbiased | Close to unbiased | Strong under-estimation | None | No |
| MICE | Strong under-estimation | Close to unbiased | Close to unbiased | Strong under-estimation | Reduction of bias of remission | No |
LE, linear extrapolation; LFE, linear forward extrapolation; LME3, linear mixed effects cubic regression; LOCF, last observation carried forward; MICE, multiple imputation by chained equation; NAO, nearest available observation; PE, polynomial extrapolation.