Zuber D Mulla1, Byungtae Seo, Ramaswami Kalamegham, Bahij S Nuwayhid. 1. Department of Obstetrics and Gynecology, Paul L. Foster School of Medicine, Texas Tech University Health Sciences Center, 4800 Alberta Ave., El Paso, TX 79905, USA. zuber.mulla@ttuhsc.edu
Abstract
PURPOSE: To present multiple imputation (MI) as an appropriate method to address missing values for a laboratory parameter (serum albumin) in an epidemiologic study. METHODS: A data set of patients who were hospitalized for invasive group A streptococcal infections was accessed. Age was the exposure of interest. The outcome was hospital mortality. Several variables, including serum albumin, were considered to be potential confounders. Of the 201 records, 91 had missing values for serum albumin. The MI procedure in SAS was used to perform 20 imputations of serum albumin by using a Markov chain Monte Carlo approach. Logistic regression was then performed on each of the 20 filled-in data sets, and the results were appropriately combined by using the MIANALYZE procedure. RESULTS: Age (> or = 55 years vs. 0-54 years) was not a risk factor for hospital mortality in the complete-case analysis (n = 110): adjusted odds ratio (OR) = 2.43 (95% confidence interval [CI]: 0.79-7.53). Age was a significant risk factor in the imputed data set (n = 201): adjusted OR = 3.08 (95% CI: 1.22-7.78). CONCLUSIONS: Epidemiologists frequently encounter data sets that contain missing values. Traditional missing data techniques such as the complete-subject analysis may lead to biased results. We have demonstrated the use of a novel technique, MI, to account for missing data.
PURPOSE: To present multiple imputation (MI) as an appropriate method to address missing values for a laboratory parameter (serum albumin) in an epidemiologic study. METHODS: A data set of patients who were hospitalized for invasive group A streptococcal infections was accessed. Age was the exposure of interest. The outcome was hospital mortality. Several variables, including serum albumin, were considered to be potential confounders. Of the 201 records, 91 had missing values for serum albumin. The MI procedure in SAS was used to perform 20 imputations of serum albumin by using a Markov chain Monte Carlo approach. Logistic regression was then performed on each of the 20 filled-in data sets, and the results were appropriately combined by using the MIANALYZE procedure. RESULTS: Age (> or = 55 years vs. 0-54 years) was not a risk factor for hospital mortality in the complete-case analysis (n = 110): adjusted odds ratio (OR) = 2.43 (95% confidence interval [CI]: 0.79-7.53). Age was a significant risk factor in the imputed data set (n = 201): adjusted OR = 3.08 (95% CI: 1.22-7.78). CONCLUSIONS: Epidemiologists frequently encounter data sets that contain missing values. Traditional missing data techniques such as the complete-subject analysis may lead to biased results. We have demonstrated the use of a novel technique, MI, to account for missing data.
Authors: Veronica T Mallett; Anna M Jezari; Thelma Carrillo; Sheralyn Sanchez; Zuber D Mulla Journal: Int Urogynecol J Date: 2017-08-02 Impact factor: 2.894
Authors: Erin Kahle; Mary Campbell; Jairam Lingappa; Deborah Donnell; Connie Celum; Raphael Ondondo; Andrew Mujugira; Kenneth Fife; Nelly Mugo; Saidi Kapiga; James I Mullins; Jared M Baeten Journal: AIDS Date: 2014-01-14 Impact factor: 4.177
Authors: Min Kyun Na; Yu Deok Won; Choong Hyun Kim; Jae Min Kim; Jin Hwan Cheong; Je Il Ryu; Myung-Hoon Han Journal: PLoS One Date: 2017-12-12 Impact factor: 3.240
Authors: Sebastian Schneeweiss; Jeremy A Rassen; Robert J Glynn; Jessica Myers; Gregory W Daniel; Joseph Singer; Daniel H Solomon; Seoyoung Kim; Kenneth J Rothman; Jun Liu; Jerry Avorn Journal: BMC Med Res Methodol Date: 2012-11-26 Impact factor: 4.615
Authors: Qinglin Cheng; Li Xie; Le Wang; Min Lu; Qingchun Li; Yifei Wu; Yinyan Huang; Qingjun Jia; Gang Zhao Journal: Front Public Health Date: 2021-05-28