Ting Hsiang Lin1. 1. Department of Statistics, National Taipei University, Taipei, Taiwan, ROC. tinghlin@mail.ntpu.edu.tw
Abstract
INTRODUCTION: This study investigated the effects of imputing missing data in the WHO Quality of Life Abbreviated Questionnaire (WHOQOL-BREF). The imputation results from both the item and domain levels were compared and the impact of the missing data rate and the number of items included for imputation were examined. METHODS: An empirical analysis and a simulation study were used to examine the effects of missing data rates and the number of items used for imputation on the accuracy for imputation. In the empirical analysis, both item-level and domain-level imputations were performed, and the missing values were imputed using different amounts of data. In the simulation study, sets of 2%, 5% and 10% of the data were drawn randomly and replaced with missing values. Twenty datasets were generated for each situation. The data were imputed and the accuracy of the imputation was reported. RESULTS: In the empirical study, the number of items used for imputation had only a small impact on the accuracy of imputation. Furthermore, in the simulation study, the accuracy rates of imputation did not significantly change as the proportions of missing data increased. However, the number of items used in the computation did contribute to some extent to the missing values imputed. Extreme responses had the worst computations and the lowest accuracy rates. CONCLUSION: It is recommended that as many items as possible be included for imputation within the same domain. However, it is not particularly helpful to use items from different domains for imputation. Researchers should exercise extra caution in interpreting the imputed values of extreme responses.
INTRODUCTION: This study investigated the effects of imputing missing data in the WHO Quality of Life Abbreviated Questionnaire (WHOQOL-BREF). The imputation results from both the item and domain levels were compared and the impact of the missing data rate and the number of items included for imputation were examined. METHODS: An empirical analysis and a simulation study were used to examine the effects of missing data rates and the number of items used for imputation on the accuracy for imputation. In the empirical analysis, both item-level and domain-level imputations were performed, and the missing values were imputed using different amounts of data. In the simulation study, sets of 2%, 5% and 10% of the data were drawn randomly and replaced with missing values. Twenty datasets were generated for each situation. The data were imputed and the accuracy of the imputation was reported. RESULTS: In the empirical study, the number of items used for imputation had only a small impact on the accuracy of imputation. Furthermore, in the simulation study, the accuracy rates of imputation did not significantly change as the proportions of missing data increased. However, the number of items used in the computation did contribute to some extent to the missing values imputed. Extreme responses had the worst computations and the lowest accuracy rates. CONCLUSION: It is recommended that as many items as possible be included for imputation within the same domain. However, it is not particularly helpful to use items from different domains for imputation. Researchers should exercise extra caution in interpreting the imputed values of extreme responses.
Authors: Tolulope T Sajobi; Lisa M Lix; Gurbakhshash Singh; Mark Lowerison; Jordan Engbers; Nancy E Mayo Journal: Qual Life Res Date: 2014-10-26 Impact factor: 4.147
Authors: Jonathan B Bricker; Jingmin Liu; Bryan A Comstock; Arthur V Peterson; Kathleen A Kealey; Patrick M Marek Journal: Psychol Addict Behav Date: 2010-09
Authors: Eva Visser; Brenda Leontine Den Oudsten; Taco Gosens; Paul Lodder; Jolanda De Vries Journal: Qual Life Res Date: 2021-01-14 Impact factor: 4.147
Authors: Sun Sun; Nan Luo; Erik Stenberg; Lars Lindholm; Klas-Göran Sahlén; Karl A Franklin; Yang Cao Journal: Int J Environ Res Public Health Date: 2022-08-30 Impact factor: 4.614