| Literature DB >> 36078543 |
Sun Sun1,2, Nan Luo3, Erik Stenberg4, Lars Lindholm1, Klas-Göran Sahlén1, Karl A Franklin5, Yang Cao6.
Abstract
One of the main challenges for the successful implementation of health-related quality of life (HRQoL) assessments is missing data. The current study examined the feasibility and validity of a sequential multiple imputation (MI) method to deal with missing values in the longitudinal HRQoL data from the Scandinavian Obesity Surgery Registry. All patients in the SOReg who received bariatric surgery between 1 January 2011 and 31 March 2019 (n = 47,653) were included for the descriptive analysis and missingness pattern exploration. The patients who had completed the short-form 36 (SF-36) at baseline (year 0), and one-, two-, and five-year follow-ups were included (n = 3957) for the missingness pattern simulation and the sequential MI analysis. Eleven items of the SF-36 were selected to create the six domains of SF-6D, and the SF-6D utility index of each patient was calculated accordingly. The multiply-imputed variables in previous year were used as input to impute the missing values in later years. The performance of the sequential MI was evaluated by comparing the actual values with the imputed values of the selected SF-36 items and index at all four time points. At the baseline and year 1, where missing proportions were about 20% and 40%, respectively, there were no statistically significant discrepancies between the distributions of the actual and imputed responses (all p-values > 0.05). In year 2, where the missing proportion was about 60%, distributions of the actual and imputed responses were consistent in 9 of the 11 SF-36 items. However, in year 5, where the missing proportion was about 80%, no consistency was found between the actual and imputed responses in any of the SF-36 items. Relatively high missing proportions in HRQoL data are common in clinical registries, which brings a challenge to analyzing the HRQoL of longitudinal cohorts. The experimental sequential multiple imputation method adopted in the current study might be an ideal strategy for handling missing data (even though the follow-up survey had a missing proportion of 60%), avoiding significant information waste in the multivariate analysis. However, the imputations for data with higher missing proportions warrant more research.Entities:
Keywords: SF-36; health utility; health-related quality of life; multiple imputations; real-world data
Mesh:
Year: 2022 PMID: 36078543 PMCID: PMC9518315 DOI: 10.3390/ijerph191710827
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 4.614
Figure 1Missing proportions (left) and missingness patterns (right) of the selected 11 SF-36 items at baseline in the real-world data (red cells indicating missing). BP, bodily pain; MH, mental health; PF, physical function; RE, role-emotional; RP, role participation; SF, social function; VT, vitality.
Figure 2Simulated missing proportions (left) and missingness patterns (right) of the selected 11 SF-36 items at baseline in the analytical dataset (red cells indicating missing).
Figure 3Missing proportions (left) and missingness patterns (right) of the SF-6D index at four time points in the real-world data (red cells indicating missing).
Figure 4Simulated missingness patterns of SF-6D index at four time points in the analytical dataset (red cells indicating missing).
Figure 5Flowchart of simulation, sequential multiple imputation, and assessment. MAPE, mean absolute percentage error.
Demographic characteristics of the patients at baseline.
| Variable | All | Excluded | Analytical Dataset | ||
|---|---|---|---|---|---|
| N | 46,753 | 42,796 | 3957 | ||
| Age (mean (SD)) | 41.06 (11.30) | 40.75 (11.28) | 43.82 (11.15) | <0.001 | |
| BMI (mean (SD)) | 41.57 (5.65) | 41.53 (5.66) | 42.01 (5.48) | <0.001 | |
| Sex (%) | Man | 10,933 (23.4) | 10,058 (23.5) | 875 (22.1) | 0.050 |
| Woman | 35,820 (76.6) | 32,738 (76.5) | 3082 (77.9) | ||
| Smoking (%) | No | 28,781 (61.6) | 26,402 (61.7) | 2379 (60.1) | 0.067 |
| Yes | 4765 (10.2) | 4381 (10.2) | 384 (9.7) | ||
| Quit | 8075 (17.3) | 7355 (17.2) | 720 (18.2) | ||
| Missing | 5132 (11.0) | 4658 (10.9) | 474 (12.0) | ||
| Pregnancy (%) | No | 39,862 (85.3) | 35,905 (83.9) | 3957 (100.0) | <0.001 |
| Missing | 6891 (14.7) | 6891 (16.1) | 0 (0.0) | ||
| Comorbidity (%) | No | 17,620 (37.7) | 15,869 (37.1) | 1751 (44.3) | <0.001 |
| Yes | 22,240 (47.6) | 20,034 (46.8) | 2206 (55.7) | ||
| Missing | 6893 (14.7) | 6893 (16.1) | 0 (0.0) | ||
| Sleep apnea (%) | No | 35,749 (76.5) | 32,245 (75.3) | 3504 (88.6) | <0.001 |
| Yes | 4111 (8.8) | 3658 (8.5) | 453 (11.4) | ||
| Missing | 6893 (14.7) | 6893 (16.1) | 0 (0.0) | ||
| Hypertension (%) | No | 29,890 (63.9) | 27,161 (63.5) | 2729 (69.0) | <0.001 |
| Yes | 9970 (21.3) | 8742 (20.4) | 1228 (31.0) | ||
| Missing | 6893 (14.7) | 6893 (16.1) | 0 (0.0) | ||
| Diabetes (%) | No | 34,668 (74.2) | 31,305 (73.1) | 3363 (85.0) | <0.001 |
| Yes | 5192 (11.1) | 4598 (10.7) | 594 (15.0) | ||
| Missing | 6893 (14.7) | 6893 (16.1) | 0 (0.0) | ||
| Dyslipidemia (%) | No | 36,018 (77.0) | 32,555 (76.1) | 3463 (87.5) | <0.001 |
| Yes | 3842 (8.2) | 3348 (7.8) | 494 (12.5) | ||
| Missing | 6893 (14.7) | 6893 (16.1) | 0 (0.0) | ||
| Dyspepsia (%) | No | 35,564 (76.1) | 32,037 (74.9) | 3527 (89.1) | <0.001 |
| Yes | 4296 (9.2) | 3866 (9.0) | 430 (10.9) | ||
| Missing | 6893 (14.7) | 6893 (16.1) | 0 (0.0) | ||
| Diarrhea (%) | No | 39,213 (83.9) | 35,339 (82.6) | 3874 (97.9) | <0.001 |
| Yes | 647 (1.4) | 564 (1.3) | 83 (2.1) | ||
| Missing | 6893 (14.7) | 6893 (16.1) | 0 (0.0) | ||
| Depression (%) | No | 33,355 (71.3) | 29,904 (69.9) | 3451 (87.2) | <0.001 |
| Yes | 6505 (13.9) | 5999 (14.0) | 506 (12.8) | ||
| Missing | 6893 (14.7) | 6893 (16.1) | 0 (0.0) | ||
| Other illness (%) | No | 35,519 (76.0) | 31,967 (74.7) | 3552 (89.8) | <0.001 |
| Yes | 4343 (9.3) | 3938 (9.2) | 405 (10.2) | ||
| Missing | 6891 (14.7) | 6891 (16.1) | 0 (0.0) | ||
| Obesity problem summary score (mean (SD)) | 65.06 (26.11) | 65.62 (25.98) | 60.05 (26.73) | <0.001 |
* Student’s t-test was used to compare the means and the chi-squared test was used to compare percentages.
Scores for the selected SF-36 items and SF-6D index at baseline.
| SF-6D Item | Level | All | Excluded | Analytical Dataset | |
|---|---|---|---|---|---|
| PF1 (%) | 1 | 26,464 (56.6) | 23,794 (55.6) | 2670 (67.5) | <0.001 |
| 2 | 11,523 (24.6) | 10,388 (24.3) | 1135 (28.7) | ||
| 3 | 1712 (3.7) | 1576 (3.7) | 136 (3.4) | ||
| Missing | 7054 (15.1) | 7038 (16.4) | 16 (0.4) | ||
| PF2 (%) | 1 | 4878 (10.4) | 4445 (10.4) | 433 (10.9) | <0.001 |
| 2 | 22,027 (47.1) | 19,875 (46.4) | 2152 (54.4) | ||
| 3 | 12,767 (27.3) | 11,417 (26.7) | 1350 (34.1) | ||
| Missing | 7081 (15.1) | 7059 (16.5) | 22 (0.6) | ||
| PF10 (%) | 1 | 2853 (6.1) | 2656 (6.2) | 197 (5.0) | <0.001 |
| 2 | 13,910 (29.8) | 12,576 (29.4) | 1334 (33.7) | ||
| 3 | 22,917 (49.0) | 20,507 (47.9) | 2410 (60.9) | ||
| Missing | 7073 (15.1) | 7057 (16.5) | 16 (0.4) | ||
| RP3 (%) | 1 | 17,956 (38.4) | 16,392 (38.3) | 1564 (39.5) | <0.001 |
| 2 | 21,344 (45.7) | 18,999 (44.4) | 2345 (59.3) | ||
| Missing | 7453 (15.9) | 7405 (17.3) | 48 (1.2) | ||
| RE2 (%) | 1 | 14,899 (31.9) | 13,706 (32.0) | 1193 (30.1) | <0.001 |
| 2 | 24,297 (52.0) | 21,595 (50.5) | 2702 (68.3) | ||
| Missing | 7557 (16.2) | 7495 (17.5) | 62 (1.6) | ||
| SF2 (%) | 1 | 1503 (3.2) | 1421 (3.3) | 82 (2.1) | <0.001 |
| 2 | 4393 (9.4) | 4077 (9.5) | 316 (8.0) | ||
| 3 | 8624 (18.4) | 7890 (18.4) | 734 (18.5) | ||
| 4 | 9278 (19.8) | 8382 (19.6) | 896 (22.6) | ||
| 5 | 15,447 (33.0) | 13,574 (31.7) | 1873 (47.3) | ||
| Missing | 7508 (16.1) | 7452 (17.4) | 56 (1.4) | ||
| BP1 (%) | 1 | 5669 (12.1) | 5068 (11.8) | 601 (15.2) | <0.001 |
| 2 | 4777 (10.2) | 4232 (9.9) | 545 (13.8) | ||
| 3 | 6042 (12.9) | 5416 (12.7) | 626 (15.8) | ||
| 4 | 14,001 (29.9) | 12,606 (29.5) | 1395 (35.3) | ||
| 5 | 7103 (15.2) | 6472 (15.1) | 631 (15.9) | ||
| 6 | 1927 (4.1) | 1801 (4.2) | 126 (3.2) | ||
| Missing | 7234 (15.5) | 7201 (16.8) | 33 (0.8) | ||
| BP2 (%) | 1 | 10,196 (21.8) | 9045 (21.1) | 1151 (29.1) | <0.001 |
| 2 | 9781 (20.9) | 8767 (20.5) | 1014 (25.6) | ||
| 3 | 10,232 (21.9) | 9233 (21.6) | 999 (25.2) | ||
| 4 | 6772 (14.5) | 6202 (14.5) | 570 (14.4) | ||
| 5 | 2526 (5.4) | 2343 (5.5) | 183 (4.6) | ||
| Missing | 7246 (15.5) | 7206 (16.8) | 40 (1.0) | ||
| MH1 (%) | 1 | 779 (1.7) | 731 (1.7) | 48 (1.2) | <0.001 |
| 2 | 1788 (3.8) | 1684 (3.9) | 104 (2.6) | ||
| 3 | 3688 (7.9) | 3411 (8.0) | 277 (7.0) | ||
| 4 | 6184 (13.2) | 5644 (13.2) | 540 (13.6) | ||
| 5 | 11,644 (24.9) | 10,466 (24.5) | 1178 (29.8) | ||
| 6 | 15,548 (33.3) | 13,764 (32.2) | 1784 (45.1) | ||
| Missing | 7122 (15.2) | 7096 (16.6) | 26 (0.7) | ||
| MH4 (%) | 1 | 797 (1.7) | 747 (1.7) | 50 (1.3) | <0.001 |
| 2 | 2025 (4.3) | 1903 (4.4) | 122 (3.1) | ||
| 3 | 3554 (7.6) | 3284 (7.7) | 270 (6.8) | ||
| 4 | 6072 (13.0) | 5553 (13.0) | 519 (13.1) | ||
| 5 | 13,164 (28.2) | 11,822 (27.6) | 1342 (33.9) | ||
| 6 | 13,942 (29.8) | 12,323 (28.8) | 1619 (40.9) | ||
| Missing | 7199 (15.4) | 7164 (16.7) | 35 (0.9) | ||
| VT2 (%) | 1 | 844 (1.8) | 738 (1.7) | 106 (2.7) | <0.001 |
| 2 | 3573 (7.6) | 3138 (7.3) | 435 (11.0) | ||
| 3 | 6083 (13.0) | 5408 (12.6) | 675 (17.1) | ||
| 4 | 9448 (20.2) | 8409 (19.6) | 1039 (26.3) | ||
| 5 | 11,734 (25.1) | 10,697 (25.0) | 1037 (26.2) | ||
| 6 | 7903 (16.9) | 7265 (17.0) | 638 (16.1) | ||
| Missing | 7168 (15.3) | 7141 (16.7) | 27 (0.7) | ||
| Index (mean (SD)) | 0.66 (0.13) | 0.66 (0.13) | 0.69 (0.13) | <0.001 |
* Student’s t-test was used to compare the means and the chi-squared test was used to compare percentages. BP, bodily pain; MH, mental health; PF, physical function; RE, role-emotional; RP, role participation; SF, social function; VT, vitality.
Accuracy of multiple imputations for the selected SF-36 item scores.
| Items | Score | Year 0 | Year 1 | Year 2 | Year 5 | ||||
|---|---|---|---|---|---|---|---|---|---|
| Actual | Imputed | Actual | Imputed | Actual | Imputed | Actual | Imputed | ||
| PF1 | 1 | 2584 | 2543 | 460 | 455 | 502 | 543 | 751 | 1341 |
| 2 | 1106 | 1148 | 1573 | 1560 | 1412 | 1458 | 1511 | 1267 | |
| 3 | 132 | 144 | 1778 | 1809 | 1892 | 1833 | 1559 | 1226 | |
| χ2 = 1.610 | χ2 = 0.229 | χ2 = 3.178 | χ2 = 226.622 | ||||||
| ICC = 0.868 | ICC = 0.782 | ICC = 0.671 | ICC = 0.391 | ||||||
| PF2 | 1 | 416 | 390 | 113 | 121 | 109 | 147 | 162 | 940 |
| 2 | 2086 | 2088 | 489 | 490 | 534 | 556 | 709 | 546 | |
| 3 | 1315 | 1356 | 3229 | 3223 | 3175 | 3131 | 2952 | 2348 | |
| χ2 = 1.431 | χ2 = 0.279 | χ2 = 6.358 | χ2 = 639.249 | ||||||
| ICC = 0.868 | ICC = 0.772 | ICC = 0.623 | ICC = 0.101 | ||||||
| PF10 | 1 | 190 | 170 | 53 | 68 | 73 | 142 | 85 | 754 |
| 2 | 1295 | 1278 | 215 | 236 | 293 | 357 | 371 | 421 | |
| 3 | 2334 | 2386 | 3558 | 3530 | 3460 | 3335 | 3368 | 2659 | |
| χ2 = 1.767 | χ2 = 2.940 | χ2 = 30.737 | χ2 = 619.995 | ||||||
| ICC = 0.848 | ICC = 0.745 | ICC = 0.558 | ICC = 0.048 | ||||||
| RP3 | 1 | 1521 | 1516 | 462 | 465 | 547 | 614 | 775 | 1600 |
| 2 | 2279 | 2318 | 3344 | 3369 | 3255 | 3220 | 3022 | 2334 | |
| χ2 = 0.168 | χ2 = 0.000 | χ2 = 3.796 | χ2 = 403.550 | ||||||
| ICC = 0.853 | ICC = 0.730 | ICC = 0.562 | ICC = 0.206 | ||||||
| RE2 | 1 | 1162 | 1157 | 556 | 566 | 709 | 720 | 951 | 1758 |
| 2 | 2629 | 2677 | 3234 | 3268 | 3085 | 3114 | 2843 | 2076 | |
| χ2 = 0.181 | χ2 = 0.007 | χ2 = 0.005 | χ2 = 358.890 | ||||||
| ICC = 0.863 | ICC = 0.724 | ICC = 0.591 | ICC = 0.289 | ||||||
| SF2 | 1 | 80 | 86 | 38 | 46 | 42 | 66 | 74 | 670 |
| 2 | 309 | 308 | 119 | 134 | 161 | 176 | 231 | 416 | |
| 3 | 709 | 720 | 304 | 288 | 402 | 437 | 533 | 575 | |
| 4 | 872 | 854 | 558 | 576 | 600 | 621 | 677 | 586 | |
| 5 | 1824 | 1865 | 2771 | 2790 | 2575 | 2534 | 2255 | 1588 | |
| χ2 = 0.747 | χ2 = 2.180 | χ2 = 7.769 | χ2 = 653.746 | ||||||
| ICC = 0.880 | ICC = 0.771 | ICC = 0.661 | ICC = 0.183 | ||||||
| BP1 | 1 | 587 | 603 | 1655 | 1667 | 1668 | 1642 | 1349 | 1081 |
| 2 | 528 | 556 | 701 | 744 | 527 | 576 | 524 | 404 | |
| 3 | 610 | 602 | 425 | 443 | 427 | 421 | 417 | 416 | |
| 4 | 1353 | 1349 | 677 | 645 | 784 | 790 | 923 | 1022 | |
| 5 | 610 | 606 | 269 | 263 | 307 | 293 | 445 | 660 | |
| 6 | 121 | 118 | 70 | 82 | 92 | 111 | 138 | 251 | |
| χ2 = 0.966 | χ2 = 3.264 | χ2 = 4.449 | χ2 = 124.586 | ||||||
| ICC = 0.868 | ICC = 0.760 | ICC = 0.639 | ICC = 0.354 | ||||||
| BP2 | 1 | 1122 | 1172 | 2327 | 2366 | 2224 | 2252 | 1902 | 1381 |
| 2 | 980 | 955 | 724 | 720 | 701 | 670 | 731 | 614 | |
| 3 | 962 | 986 | 444 | 438 | 525 | 543 | 621 | 569 | |
| 4 | 559 | 547 | 215 | 210 | 252 | 244 | 382 | 779 | |
| 5 | 178 | 174 | 87 | 100 | 98 | 124 | 164 | 491 | |
| χ2 = 1.742 | χ2 = 1.159 | χ2 = 4.211 | χ2 = 393.990 | ||||||
| ICC = 0.876 | ICC = 0.762 | ICC = 0.623 | ICC = 0.270 | ||||||
| MH1 | 1 | 45 | 48 | 40 | 56 | 39 | 36 | 56 | 325 |
| 2 | 100 | 108 | 78 | 88 | 105 | 126 | 128 | 300 | |
| 3 | 272 | 269 | 138 | 150 | 194 | 176 | 236 | 436 | |
| 4 | 521 | 529 | 230 | 236 | 281 | 299 | 361 | 381 | |
| 5 | 1140 | 1118 | 736 | 749 | 770 | 749 | 782 | 786 | |
| 6 | 1739 | 1762 | 2599 | 2555 | 2429 | 2449 | 2251 | 1606 | |
| χ2 = 0.810 | χ2 = 4.314 | χ2 = 3.798 | χ2 = 426.931 | ||||||
| ICC = 0.849 | ICC = 0.775 | ICC = 0.635 | ICC = 0.273 | ||||||
| MH4 | 1 | 48 | 48 | 53 | 72 | 56 | 73 | 85 | 330 |
| 2 | 120 | 126 | 104 | 104 | 142 | 174 | 188 | 408 | |
| 3 | 268 | 268 | 167 | 181 | 223 | 209 | 281 | 690 | |
| 4 | 495 | 487 | 316 | 296 | 400 | 374 | 495 | 438 | |
| 5 | 1306 | 1326 | 999 | 1037 | 1047 | 1090 | 1065 | 813 | |
| 6 | 1575 | 1580 | 2176 | 2145 | 1952 | 1913 | 1704 | 1155 | |
| χ2 = 0.302 | χ2 = 4.984 | χ2 = 8.045 | χ2 = 540.811 | ||||||
| ICC = 0.851 | ICC = 0.780 | ICC = 0.673 | ICC = 0.277 | ||||||
| VT2 | 1 | 100 | 96 | 604 | 615 | 459 | 422 | 322 | 267 |
| 2 | 417 | 432 | 1440 | 1481 | 1245 | 1300 | 949 | 602 | |
| 3 | 660 | 673 | 792 | 769 | 857 | 867 | 804 | 732 | |
| 4 | 1003 | 1008 | 458 | 471 | 505 | 487 | 663 | 637 | |
| 5 | 1014 | 992 | 340 | 302 | 472 | 474 | 617 | 780 | |
| 6 | 622 | 632 | 189 | 196 | 280 | 284 | 460 | 816 | |
| χ2 = 0.769 | χ2 = 3.556 | χ2 = 3.126 | χ2 = 204.960 | ||||||
| ICC = 0.884 | ICC = 0.781 | ICC = 0.671 | ICC = 0.435 | ||||||
Frequencies in the cells were estimated by averaging frequencies for each item across the imputations and then rounded to whole numbers. ICC, intraclass correlation coefficient.
Comparison of actual and imputed SF-6D indices at baseline and follow-ups.
| Time Point | Actual | Imputed | MAPE (%) | ICC (95% CI) | ||
|---|---|---|---|---|---|---|
| Mean | SE | Mean | SE | |||
| Baseline | 0.688 | 0.0021 | 0.698 | 0.0019 | 4.16 | 0.814 (0.811, 0.816) |
| One-year | 0.813 | 0.0022 | 0.812 | 0.0019 | 6.14 | 0.682 (0.678, 0.686) |
| Two-year | 0.796 | 0.0023 | 0.797 | 0.0018 | 8.15 | 0.598 (0.592, 0.693) |
| Five-year | 0.762 | 0.0025 | 0.766 | 0.0018 | 11.62 | 0.516 (0.510, 0.522) |
SE, standard error; MAPE, mean absolute percentage error; ICC, intraclass correlation coefficient; CI, confidence interval.