| Literature DB >> 32103723 |
Xueying Xu1, Leizhen Xia1, Qimeng Zhang1, Shaoning Wu1, Mingcheng Wu1, Hongbo Liu2.
Abstract
BACKGROUND: Incomplete data are of particular important influence in mental measurement questionnaires. Most experts, however, mostly focus on clinical trials and cohort studies and generally pay less attention to this deficiency. We aim is to compare the accuracy of four common methods for handling items missing from different psychology questionnaires according to the items non-response rates.Entities:
Keywords: Hot-deck imputation; Imputation methods; Mental measurement questionnaires; Multiple imputation
Mesh:
Year: 2020 PMID: 32103723 PMCID: PMC7045426 DOI: 10.1186/s12874-020-00932-0
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
The absolute deviation for four imputation methods
| Missingness proportion | Imputation methods | Absolute deviation for SAQ | Absolute deviation for ADL | ||||
|---|---|---|---|---|---|---|---|
| Mean (SD) | SD (SD) | Correlation coefficient (SD) | Mean (SD) | SD (SD) | Correlation coefficient (SD) | ||
| 5% | Direct deletion | 0.583 (0.195) | 0.153 (0.123) | 0.028 (0.014) | 0.218 (0.161) | 0.144 (0.115) | 0.022 (0.010) |
| Mode | 0.034 (0.021) | 0.230 (0.026) | 0.004 (0.002) | 0.492 (0.026) | 0.425 (0.039) | 0.002 (0.001) | |
| HD | 0.020 (0.015) | 0.025 (0.019) | 0.004 (0.002) | 0.016 (0.011) | 0.011 (0.008) | 0.001 (0.001) | |
| MI | 0.019 (0.013) | 0.028 (0.020) | 0.003 (0.002) | 0.012 (0.010) | 0.019 (0.014) | 0.001 (0.001) | |
| 10% | Direct deletion | 1.080 (0.324) | 0.226 (0.176) | 0.050 (0.024) | 0.417 (0.299) | 0.294 (0.214) | 0.041 (0.017) |
| Mode | 0.065 (0.034) | 0.463 (0.038) | 0.006 (0.003) | 0.999 (0.036) | 0.856 (0.052) | 0.004 (0.001) | |
| HD | 0.032 (0.024) | 0.050 (0.035) | 0.006 (0.003) | 0.045 (0.021) | 0.019 (0.014) | 0.002 (0.001) | |
| MI | 0.028 (0.022) | 0.070 (0.055) | 0.006 (0.003) | 0.020 (0.013) | 0.030 (0.024) | 0.002 (0.001) | |
| 15% | Direct deletion | 1.453 (0.506) | 0.353 (0.285) | 0.074 (0.034) | 0.721 (0.532) | 0.507 (0.339) | 0.059 (0.026) |
| Mode | 0.101 (0.060) | 0.697 (0.044) | 0.008 (0.004) | 1.511 (0.044) | 1.290 (0.060) | 0.005 (0.002) | |
| HD | 0.041 (0.030) | 0.091 (0.042) | 0.009 (0.004) | 0.106 (0.027) | 0.028 (0.022) | 0.003 (0.001) | |
| MI | 0.036 (0.033) | 0.151 (0.202) | 0.008 (0.003) | 0.023 (0.016) | 0.049 (0.036) | 0.003 (0.002) | |
| 20% | Direct deletion | 1.586 (0.690) | 0.436 (0.325) | 0.097 (0.046) | 0.972 (0.658) | 0.751 (0.525) | 0.080 (0.034) |
| Mode | 0.141 (0.084) | 0.925 (0.047) | 0.009 (0.004) | 2.019 (0.047) | 1.717 (0.059) | 0.007 (0.002) | |
| HD | 0.050 (0.033) | 0.161 (0.057) | 0.012 (0.005) | 0.182 (0.036) | 0.024 (0.019) | 0.004 (0.002) | |
| MI | 0.048 (0.031) | 0.287 (0.253) | 0.010 (0.004) | 0.024 (0.018) | 0.067 (0.044) | 0.005 (0.002) | |
HD Hot-deck imputation, MI nmultiple imputation, SD standard deviation
Fig. 1RMSE values for the 4 imputation methods. a - c. RMSE values for SAQ; d – f. RMSE values for ADL; RMSE, root mean square error; SAQ, self-acceptance scale; ADL, activities of daily living scale. SD, standard deviation
Fig. 2average relative error for the 3 imputation methods. a. average relative error of mean for SAQ. b. average relative error of SD for SAQ. c. average relative error of correlation coefficient for SAQ. d. average relative error of mean for ADL. e. average relative error of SD for ADL. f. average relative error of correlation coefficient for ADL
Fig. 3The average RSES scores of HD and MI
The result of differences test of complete and impute data for RSES
| variables | original | Missingness proportion of 5% | Missingness proportion of 10% | Missingness proportion of 15% | Missingness proportion of 20% | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| HD | MI | HD | MI | HD | MI | HD | MI | ||||||||||||
| Mean (SD) | t / F value | Mean (SD) | t / F value | Mean (SD) | t / F value | Mean (SD) | t / F value | Mean (SD) | t / F value | Mean (SD) | t / F value | Mean (SD) | t / F value | Mean (SD) | t / F value | Mean (SD) | t / F value | ||
| Gender | Male | 28.66 (4.57) | 5.92 | 28.69 (4.55) | 6.78 | 28.66 (5.51) | 5.94 | 28.71 (4.50) | 5.98 | 28.67 (5.48) | 5.94 | 28.69 (4.48) | 6.07 | 28.62 (5.44) | 5.74 | 28.62 (4.34) | 5.44 | 28.65 (5.39) | 5.79 |
| Female | 27.74 (4.64) | 27.73 (4.63) | 27.73 (5.53) | 27.79 (4.63) | 27.74 (5.49) | 27.76 (4.59) | 27.72 (5.43) | 27.80 (4.52) | 27.73 (5.34) | ||||||||||
| Grade | Junior | 28.30 (4.57) | 1.46* | 28.31 (4.56) | 1.45* | 28.29 (5.66) | 1.07* | 28.36 (4.52) | 1.83* | 28.30 (5.71) | 1.06* | 28.33 (4.48) | 1.65* | 28.27 (5.75) | 1.01* | 28.30 (4.42) | 1.34* | 28.28 (5.84) | 1.01* |
| Senior | 28.06 (4.74) | 28.07 (4.73) | 28.04 (8.11) | 28.07 (4.70) | 28.05 (8.07) | 28.06 (4.69) | 28.03 (8.18) | 28.08 (4.65) | 28.04 (8.12) | ||||||||||
| Record | Excellent | 31.09 (4.74) | 63.53 | 31.12 (4.77) | 58.28 | 31.06 (5.89) | 32.24 | 31.07 (4.79) | 61.51 | 31.04 (5.92) | 30.01 | 31.05 (4.76) | 60.70 | 30.97 (5.90) | 28.13 | 30.96 (4.80) | 54.56 | 31.00 (5.93) | 27.18 |
| Good | 29.46 (4.46) | 29.41 (4.46) | 29.43 (5.62) | 29.45 (4.41) | 29.45 (5.58) | 29.42 (4.46) | 29.38 (5.62) | 29.27 (4.35) | 29.35 (5.54) | ||||||||||
| Average | 28.13 (4.38) | 28.14 (4.37) | 28.12 (5.57) | 28.18 (4.34) | 28.12 (5.55) | 28.13 (4.30) | 28.08 (5.51) | 28.15 (4.26) | 28.10 (5.47) | ||||||||||
| Poor | 26.46 (4.38) | 26.53 (4.63) | 26.49 (5.78) | 26.56 (4.59) | 26.53 (5.72) | 26.58 (4.55) | 26.55 (5.69) | 26.66 (4.55) | 26.59 (5.70) | ||||||||||
| Residence | Urban | 28.57 (4.86) | 0.46* | 28.58 (4.98) | 0.29* | 28.59 (6.07) | 0.64* | 28.56 (4.91) | 0.25* | 28.54 (6.01) | 0.41* | 28.75 (4.86) | 0.74* | 28.54 (5.96) | 0.51* | 28.71 (4.77) | 0.67* | 28.65 (5.85) | 0.50* |
| Rural | 28.21 (4.62) | 28.22 (4.60) | 28.20 (5.87) | 28.25 (4.57) | 28.21 (5.83) | 28.22 (4.54) | 28.18 (5.78) | 28.21 (4.49) | 28.19 (5.72) | ||||||||||
| Communication | Usually | 28.87 (4.55) | 32.29 | 28.86 (4.55) | 29.15 | 28.86 (5.71) | 8.63 | 28.90 (4.49) | 31.29 | 28.87 (5.68) | 8.11 | 28.86 (4.47) | 30.47 | 28.83 (5.64) | 8.18 | 28.85 (4.42) | 30.57 | 28.86 (5.57) | 8.15 |
| Sometimes | 27.83 (4.34) | 27.85 (4.33) | 27.82 (5.49) | 27.90 (4.35) | 27.83 (5.50) | 27.87 (4.31) | 27.80 (5.44) | 27.85 (4.26) | 27.80 (5.39) | ||||||||||
| Scarcely | 26.23 (5.23) | 26.33 (5.18) | 26.28 (6.42) | 26.31 (5.04) | 26.26 (6.27) | 26.39 (5.10) | 26.26 (6.30) | 26.38 (4.94) | 26.31 (6.27) | ||||||||||
| Never | 25.68 (4.75) | 25.63 (4.84) | 25.66 (6.20) | 25.81 (4.78) | 25.79 (6.11) | 25.81 (4.63) | 25.78 (5.96) | 25.74 (4.67) | 25.72 (5.99) | ||||||||||
| Unclear | 26.45 (4.94) | 26.45 (4.92) | 26.45 (6.29) | 26.45 (5.02) | 26.44 (6.34) | 26.26 (4.99) | 26.31 (6.30) | 26.55 (4.82) | 26.35 (6.23) | ||||||||||
HD Hot-deck imputation, MI multiple imputation. SD standard deviation
* p > 0.05
Influence on self-esteem in complete and impute data by multiple logistic regression models
| Missingness | Original | HD | MI |
|---|---|---|---|
| Complete data | |||
| Gender | 0.754 (0.648–0.877) | – | – |
| Grade | 1.067 (1.022–1.115) | – | – |
| Record | 0.674 (0.619–0.733) | – | – |
| Residence | 1.054 (0.955–1.164)a | – | – |
| Communication | 0.656 (0.589–0.731) | – | – |
| 5% | |||
| Gender | – | 0.727 (0.625–0.845) | 0.739 (0.634–0.861) |
| Grade | – | 1.060 (1.015–1.106) | 1.067 (1.021–1.115) |
| Record | – | 0.680 (0.626–0.740) | 0.674 (0.617–0.736) |
| Residence | – | 1.030 (0.931–1.140) a | 1.045 (0.943–1.157) a |
| Communication | – | 0.686 (0.617–0.762) | 0.665 (0.594–0.745) |
| 10% | |||
| Gender | – | 0.718 (0.618–0.834) | 0.712 (0.611–0.830) |
| Grade | – | 1.062 (1.018–1.109) | 1.059 (1.014–1.107) |
| Record | – | 0.700 (0.645–0.760) | 0.684 (0.627–0.746) |
| Residence | – | 1.048 (0.950–1.157)a | 1.052 (0.940–1.177)a |
| Communication | – | 0.670 (0.603–0.745) | 0.675 (0.604–0.754) |
| 15% | |||
| Gender | – | 0.743 (0.639–0.864) | 0.746 (0.639–0.871) |
| Grade | – | 1.067 (1.022–1.114) | 1.059 (1.010–1.110) |
| Record | – | 0.691 (0.636–0.751) | 0.696 (0.639–0.759) |
| Residence | – | 1.082 (0.982–1.191)a | 1.038 (0.929–1.159)a |
| Communication | – | 0.708 (0.638–0.786) | 0.683 (0.599–0.779) |
| 20% | |||
| Gender | – | 0.748 (0.642–0.870) | 0.726 (0.619–0.853) |
| Grade | – | 1.074 (1.029–1.122) | 1.053 (1.006–1.102) |
| Record | – | 0.698 (0.642–0.759) | 0.710 (0.645–0.781) |
| Residence | – | 1.075 (0.975–1.185)a | 1.057 (0.914–1.221)a |
| Communication | – | 0.710 (0.639–0.789) | 0.692 (0.618–0.774) |
HD Hot-deck imputation, MI multiple imputation
aOR was not statistically significant