Literature DB >> 17634973

Multiple imputation using an iterative hot-deck with distance-based donor selection.

Juned Siddique1, Thomas R Belin.   

Abstract

Hot-deck imputation offers advantages in reflecting salient features of data distributions in missing-data problems, but previous implementations have lacked the appeal associated with modern Bayesian statistical-computing techniques. We outline a strategy of iterative hot-deck multiple imputation with distance-based donor selection. With distance defined as a monotonic function of the difference in predictive means between cases, donors are chosen with probability inversely proportional to their distance from the donee. This method retains the implementation ease of ad hoc techniques, while incorporating the desirable features of Bayesian approaches. Special cases of our method include nearest-neighbor imputation and a simple random hot-deck. Iterating the procedure provides an analogy to Markov Chain Monte Carlo methods and is intended to mitigate dependence on starting values. Results from imputing missing values in a longitudinal depression treatment trial as well as a simulation study are presented. We evaluate how different definitions of distance, choices of starting values, the order in which variables are chosen for imputation, and the number of iterations impact inferences. We show that our measure of distance controls the tradeoff between bias and variance of our estimates. We find that inferences from the depression treatment trial are not sensitive to most definitions of distance. In addition, while differences exist between 1 iteration and 10 iterations, there are no meaningful differences between inferences based on 10 iterations and those based on 500 iterations. The choice of starting value did not have an impact on inferences but the order in which the variables were chosen for imputation was significant even after iteration. Copyright (c) 2007 John Wiley & Sons, Ltd.

Entities:  

Mesh:

Year:  2008        PMID: 17634973     DOI: 10.1002/sim.3001

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  15 in total

1.  A Review of Hot Deck Imputation for Survey Non-response.

Authors:  Rebecca R Andridge; Roderick J A Little
Journal:  Int Stat Rev       Date:  2010-04       Impact factor: 2.217

2.  Bayesian longitudinal plateau model of adult grip strength.

Authors:  Ramzi W Nahhas; Audrey C Choh; Miryoung Lee; William M Cameron Chumlea; Dana L Duren; Roger M Siervogel; Richard J Sherwood; Bradford Towne; Stefan A Czerwinski
Journal:  Am J Hum Biol       Date:  2010 Sep-Oct       Impact factor: 1.937

3.  Design of the FRESH study: A randomized controlled trial of a parent-only and parent-child family-based treatment for childhood obesity.

Authors:  Kerri N Boutelle; Abby Braden; Jennifer M Douglas; Kyung E Rhee; David Strong; Cheryl L Rock; Denise E Wilfley; Leonard Epstein; Scott Crow
Journal:  Contemp Clin Trials       Date:  2015-09-08       Impact factor: 2.226

4.  Missing value imputation in longitudinal measures of alcohol consumption.

Authors:  Ulrike Grittner; Gerhard Gmel; Samuli Ripatti; Kim Bloomfield; Matthias Wicki
Journal:  Int J Methods Psychiatr Res       Date:  2011-03       Impact factor: 4.035

5.  A randomized trial of stress management for the prevention of new brain lesions in MS.

Authors:  David C Mohr; Jesus Lovera; Ted Brown; Bruce Cohen; Thomas Neylan; Roland Henry; Juned Siddique; Ling Jin; David Daikh; Daniel Pelletier
Journal:  Neurology       Date:  2012-07-11       Impact factor: 9.910

6.  Using an Approximate Bayesian Bootstrap to Multiply Impute Nonignorable Missing Data.

Authors:  Juned Siddique; Thomas R Belin
Journal:  Comput Stat Data Anal       Date:  2008-12-15       Impact factor: 1.681

7.  Model specification and bootstrapping for multiply imputed data: An application to count models for the frequency of alcohol use.

Authors:  W Scott Comulada
Journal:  Stata J       Date:  2015-06       Impact factor: 2.637

8.  A Bayesian Approach for Generalized Linear Models with Explanatory Biomarker Measurement Variables Subject to Detection Limit - an Application to Acute Lung Injury.

Authors:  Huiyun Wu; Qingxia Chen; Lorraine B Ware; Tatsuki Koyama
Journal:  J Appl Stat       Date:  2012-04-24       Impact factor: 1.404

9.  Addressing Missing Data Mechanism Uncertainty using Multiple-Model Multiple Imputation: Application to a Longitudinal Clinical Trial.

Authors:  Juned Siddique; Ofer Harel; Catherine M Crespi
Journal:  Ann Appl Stat       Date:  2012-12-01       Impact factor: 2.083

10.  Doubly robust multiple imputation using kernel-based techniques.

Authors:  Chiu-Hsieh Hsu; Yulei He; Yisheng Li; Qi Long; Randall Friese
Journal:  Biom J       Date:  2015-12-09       Impact factor: 2.207

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.