Literature DB >> 27862164

Multiple imputation in the presence of non-normal data.

Katherine J Lee1,2, John B Carlin1,2.   

Abstract

Multiple imputation (MI) is becoming increasingly popular for handling missing data. Standard approaches for MI assume normality for continuous variables (conditionally on the other variables in the imputation model). However, it is unclear how to impute non-normally distributed continuous variables. Using simulation and a case study, we compared various transformations applied prior to imputation, including a novel non-parametric transformation, to imputation on the raw scale and using predictive mean matching (PMM) when imputing non-normal data. We generated data from a range of non-normal distributions, and set 50% to missing completely at random or missing at random. We then imputed missing values on the raw scale, following a zero-skewness log, Box-Cox or non-parametric transformation and using PMM with both type 1 and 2 matching. We compared inferences regarding the marginal mean of the incomplete variable and the association with a fully observed outcome. We also compared results from these approaches in the analysis of depression and anxiety symptoms in parents of very preterm compared with term-born infants. The results provide novel empirical evidence that the decision regarding how to impute a non-normal variable should be based on the nature of the relationship between the variables of interest. If the relationship is linear in the untransformed scale, transformation can introduce bias irrespective of the transformation used. However, if the relationship is non-linear, it may be important to transform the variable to accurately capture this relationship. A useful alternative is to impute the variable using PMM with type 1 matching.
Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.

Entities:  

Keywords:  missing data; multiple imputation; non-normal data; predictive mean matching; transformation

Mesh:

Year:  2016        PMID: 27862164     DOI: 10.1002/sim.7173

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  14 in total

1.  Multiple Imputation for Bounded Variables.

Authors:  Marco Geraci; Alexander McLain
Journal:  Psychometrika       Date:  2018-04-26       Impact factor: 2.500

2.  ROBUST INFERENCE WHEN COMBINING INVERSE-PROBABILITY WEIGHTING AND MULTIPLE IMPUTATION TO ADDRESS MISSING DATA WITH APPLICATION TO AN ELECTRONIC HEALTH RECORDS-BASED STUDY OF BARIATRIC SURGERY.

Authors:  Tanayott Thaweethai; David E Arterburn; Karen J Coleman; Sebastien Haneuse
Journal:  Ann Appl Stat       Date:  2021-03       Impact factor: 1.959

3.  Multiple imputation for cause-specific Cox models: Assessing methods for estimation and prediction.

Authors:  Edouard F Bonneville; Matthieu Resche-Rigon; Johannes Schetelig; Hein Putter; Liesbeth C de Wreede
Journal:  Stat Methods Med Res       Date:  2022-06-05       Impact factor: 2.494

4.  A comparison of multiple imputation strategies to deal with missing nonnormal data in structural equation modeling.

Authors:  Fan Jia; Wei Wu
Journal:  Behav Res Methods       Date:  2022-08-29

5.  Measurement error and misclassification in electronic medical records: methods to mitigate bias.

Authors:  Jessica C Young; Mitchell M Conover; Michele Jonsson Funk
Journal:  Curr Epidemiol Rep       Date:  2018-09-10

6.  Estimating a panel MSK dataset for comparative analyses of national absorptive capacity systems, economic growth, and development in low and middle income countries.

Authors:  Muhammad Salar Khan
Journal:  PLoS One       Date:  2022-10-20       Impact factor: 3.752

7.  Patterns of Objectively Measured Sedentary Time and Emotional Disorder Symptoms Among Youth.

Authors:  Jennifer Zink; Chih-Hsiang Yang; Kelsey L McAlister; Jimi Huh; Mary Ann Pentz; Kathleen A Page; Britni R Belcher; Genevieve F Dunton
Journal:  J Pediatr Psychol       Date:  2022-07-19

8.  Multiple imputation for discrete data: Evaluation of the joint latent normal model.

Authors:  Matteo Quartagno; James R Carpenter
Journal:  Biom J       Date:  2019-03-14       Impact factor: 2.207

9.  Reference-based multiple imputation for missing data sensitivity analyses in trial-based cost-effectiveness analysis.

Authors:  Baptiste Leurent; Manuel Gomes; Suzie Cro; Nicola Wiles; James R Carpenter
Journal:  Health Econ       Date:  2019-12-17       Impact factor: 3.046

10.  Model checking in multiple imputation: an overview and case study.

Authors:  Cattram D Nguyen; John B Carlin; Katherine J Lee
Journal:  Emerg Themes Epidemiol       Date:  2017-08-23
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.