| Literature DB >> 34611428 |
Abstract
PURPOSE: This research intended to identify significant risk factors of stroke among the elderly population in the United States using the k-means clustering method. PATIENTS AND METHODS: In this cross-sectional study, we analyzed data of 4346 subjects aged ≥60 years using the National Health and Nutrition Examination Survey (NHANES) 2013-2018 datasets. Questionnaire data, dietary data, and laboratory data were accessed to acquire measurements of the potential risk factors. A pre-defined classification method was used based on the Medical Condition Questionnaire to define the stroke group. K-means clustering analysis used all potential risk factors for differentiating both groups. A stepwise logistic regression analysis examined the association between significant risk factors and the odds of stroke.Entities:
Keywords: diabetes; hyperglycemia; k-means clustering method
Year: 2021 PMID: 34611428 PMCID: PMC8487286 DOI: 10.2147/IJGM.S327075
Source DB: PubMed Journal: Int J Gen Med ISSN: 1178-7074
Figure 1Flow chart of selecting eligible participants.
Baseline Characteristics of the Overall Study Participants, NHANES 2013–2018 (n=4346)
| Baseline Characteristics | Total (n=4346) |
|---|---|
| Age, years, M [Q1,Q3] | 68.00 [63.00,75.00] |
| Gender, n (%) | |
| Male | 2178 (45.85) |
| Female | 2168 (54.15) |
| Race, n (%) | |
| Mexican American | 521 (3.98) |
| Non-Hispanic Black | 925 (8.24) |
| Non-Hispanic White | 1985 (78.09) |
| Others | 915 (9.69) |
| BMI, kg/m2, n (%) | |
| <18.5 | 1194 (27.57) |
| 18.5- | 42 (0.97) |
| 25.0- | 1709 (38.32) |
| 30.0- | 1401 (33.14) |
| Education level, n (%) | |
| Less than 12th grade | 1030 (12.69) |
| High school or above | 3316 (87.31) |
| Marital status, n (%) | |
| Divorced/separated | 772 (14.92) |
| Married | 2545 (64.72) |
| Widowed | 788 (16.24) |
| Single | 241 (4.11) |
| PIR, M [Q1,Q3] | 3.18 [1.77,5.00] |
| Smoking, n (%) | |
| No | 3268 (79.25) |
| Yes | 1078 (20.75) |
| Alcohol consumption, n (%) | |
| No | 3532 (77.82) |
| Yes | 814 (22.18) |
| CHF, n (%) | |
| Yes | 288 (5.35) |
| No | 4058 (94.65) |
| CHD, n (%) | |
| Yes | 421 (10.15) |
| No | 3925 (89.85) |
| Angina, n (%) | |
| Yes | 210 (5.09) |
| No | 4136 (94.91) |
| HA, n (%) | |
| Yes | 375 (7.84) |
| No | 3971 (92.16) |
| Diabetes, n (%) | |
| Yes | 1181 (21.75) |
| No | 3165 (78.25) |
| Hypertension, n (%) | |
| Yes | 2635 (57.39) |
| No | 1711 (42.61) |
| HDL, mg/dL, M [Q1,Q3] | 54.00 [44.00,67.00] |
| TG, mg/dL, M [Q1,Q3] | 107.00 [54.00,210.00] |
| LDL, mg/dL, M [Q1,Q3] | 101.00 [80.00,124.00] |
| TC, mg/dL, M [Q1,Q3] | 189.00 [160.00,217.00] |
| GHb, %, M [Q1,Q3] | 5.70 [5.40,6.10] |
| GLU, mg/dL, M [Q1,Q3] | 107.00 [99.00,121.00] |
| Dietary fiber, g, M [Q1,Q3] | 14.90 [10.10,21.30] |
| Vitamin A, mcg, M [Q1,Q3] | 524.00 [304.00,833.00] |
| Vitamin E, mg, M [Q1,Q3] | 7.37 [4.77,10.96] |
| Vitamin C,mg, M [Q1,Q3] | 55.60 [23.50,109.60] |
| Vitamin D, mcg, M [Q1,Q3] | 3.20 [1.40,5.80] |
| PUFA, n (%) | |
| <5 | 315 (5.29) |
| 5- | 854 (17.96) |
| 10- | 968 (21.74) |
| 15- | 785 (19.17) |
| 20- | 544 (13.33) |
| 25- | 880 (22.52) |
Abbreviations: BMI, body mass index; PIR, poverty income ratio; CHF, congestive heart failure; CHD, coronary heart disease; HA, heart attack; HDL, high-density lipoprotein; LDL, low-density lipoproteins; TG, triglycerides; TC, total cholesterol; GHb, glycohemoglobin; GLU, plasma fasting glucose; PUFA, polyunsaturated fatty acids.
Figure 2K-means clustering: centroids of each cluster.
Figure 3K-means clustering: the risk of stroke of each cluster.
Baseline Characteristics According to the Risk of Stroke, k-Means Clustering Method
| Variables | K-Means Clustering | Statistics | ||
|---|---|---|---|---|
| Cluster A* n=1384 | Cluster B† n=1962 | |||
| Age, years, M [Q1,Q3] | 70.00 [65.00,76.00] | 68.00 [63.00,75.00] | Z=667.598 | <0.001 |
| Gender, n (%)‡ | ||||
| Male | 849 (61.03) | 1329 (40.71) | <0.001 | |
| Female | 535 (38.97) | 1633 (59.29) | ||
| Race, n (%) | ||||
| Mexican American | 224 (6.61) | 297 (3.09) | <0.001 | |
| Non-Hispanic Black | 294 (10.08) | 631 (7.61) | ||
| Non-Hispanic White | 571 (71.59) | 1414 (80.30) | ||
| Others | 295 (11.72) | 620 (9.00) | ||
| BMI, kg/m2, n (%) | ||||
| <18.5 | 226 (14.04) | 968 (32.15) | <0.001 | |
| 18.5- | 5 (0.25) | 37 (1.22) | ||
| 25.0- | 506 (34.85) | 1203 (39.50) | ||
| 30.0- | 647 (50.86) | 754 (27.13) | ||
| Education level, n (%) | ||||
| Less than 12th grade | 434 (19.70) | 596 (10.32) | <0.001 | |
| High school or above | 950 (80.30) | 2366 (89.68) | ||
| Marital status, n (%) | ||||
| Divorced/separated | 220 (13.50) | 552 (15.40) | 0.150 | |
| Married | 829 (63.32) | 1716 (65.20) | ||
| Widowed | 264 (18.63) | 524 (15.44) | ||
| Single | 71 (4.55) | 170 (3.96) | ||
| Physical activity, n (%) | ||||
| Sedentary | 2182 (45.98) | 804 (54.97) | χ2=34.774 | <0.001 |
| Insufficient | 681 (16.43) | 208 (16.34) | ||
| Moderate | 545 (14.90) | 154 (13.25) | ||
| High | 938 (22.69) | 222 (15.44) | ||
| PIR, M [Q1,Q3] | 2.53 [1.43,4.48] | 3.43 [1.89,5.00] | Z=−999.692 | <0.001 |
| Smoking, n (%) | ||||
| No | 1095 (81.11) | 2173 (78.61) | 0.213 | |
| Yes | 289 (18.89) | 789 (21.39) | ||
| Alcohol consumption, n (%) | ||||
| No | 1228 (87.41) | 2304 (74.57) | <0.001 | |
| Yes | 156 (12.59) | 658 (25.43) | ||
| CHF, n (%) | ||||
| Yes | 249 (18.24) | 39 (0.98) | <0.001 | |
| No | 1135 (81.76) | 2923 (99.02) | ||
| CHD, n (%) | ||||
| Yes | 366 (32.67) | 55 (2.52) | <0.001 | |
| No | 1018 (67.33) | 2907 (97.48) | ||
| Angina, n (%) | ||||
| Yes | 176 (15.64) | 34 (1.52) | <0.001 | |
| No | 1208 (84.36) | 2928 (98.48) | ||
| HA, n (%) | ||||
| Yes | 318 (25.81) | 57 (1.75) | <0.001 | |
| No | 1066 (74.19) | 2905 (98.25) | ||
| Diabetes, n (%) | ||||
| Yes | 1046 (74.73) | 135 (3.80) | <0.001 | |
| No | 338 (25.27) | 2827 (96.20) | ||
| Hypertension, n (%) | ||||
| Yes | 1086 (77.46) | 1549 (50.59) | <0.001 | |
| No | 298 (22.54) | 1413 (49.41) | ||
| HDL, mg/dL, M [Q1,Q3] | 45.00 [38.00,54.00] | 57.00 [47.00,70.00] | Z=−2490.96 | <0.001 |
| TG, mg/dL, M [Q1,Q3] | 111.00 [43.00,197.00] | 106.00 [56.00,219.00] | Z=−230.891 | <0.001 |
| LDL, mg/dL, M [Q1,Q3] | 84.00 [68.00,104.00] | 108.00 [87.00,129.00] | Z=−2183.04 | <0.001 |
| TC, mg/dL, M [Q1,Q3] | 156.00 [139.00,183.00] | 199.00 [173.00,223.00] | Z=−2921.49 | <0.001 |
| GHb, %, M [Q1,Q3] | 6.60 [6.00,7.40] | 5.60 [5.40,5.80] | Z=3940.41 | <0.001 |
| GLU, mg/dL, M [Q1,Q3] | 128.00 [108.00,163.00] | 104.00 [97.00,113.00] | Z=3081.21 | <0.001 |
| Dietary fiber, g, M [Q1,Q3] | 14.20 [9.50,19.70] | 15.10 [10.30,22.00] | Z=−472.144 | <0.001 |
| Vitamin A, mcg, M [Q1,Q3] | 486.00 [274.00,770.00] | 538.00 [311.00,851.00] | Z=−450.076 | <0.001 |
| Vitamin E, mg, M [Q1,Q3] | 6.78 [4.30,10.01] | 7.49 [4.94,11.30] | Z=−556.366 | <0.001 |
| Vitamin C,mg, M [Q1,Q3] | 45.90 [20.60,95.00] | 58.60 [24.90,115.70] | Z=−570.544 | <0.001 |
| Vitamin D, mcg, M [Q1,Q3] | 3.10 [1.30,5.50] | 3.20 [1.50,5.90] | Z=−202.439 | <0.001 |
| PUFA, n (%) | ||||
| <5 | 110 (6.57) | 205 (4.86) | 0.109 | |
| 5- | 295 (20.29) | 559 (17.17) | ||
| 10- | 311 (21.38) | 657 (21.86) | ||
| 15- | 238 (18.46) | 547 (19.40) | ||
| 20- | 159 (10.97) | 385 (14.13) | ||
| 25- | 271 (22.32) | 609 (22.59) | ||
Notes: *Cluster A, high incidence of stroke, 7.15%; †Cluster B, low incidence of stroke, 2.80%; ‡n%, sample weights were applied to the all the percentages.
Abbreviations: BMI, body mass index; PIR, poverty income ratio; CHF, congestive heart failure; CHD, coronary heart disease; HA, heart attack; HDL, high-density lipoprotein; LDL, low-density lipoproteins; TG, triglycerides; TC, total cholesterol; GHb, glycohemoglobin; GLU, plasma fasting glucose; PUFA, polyunsaturated fatty acids.
Baseline Characteristics According to the Risk of Stroke, Pre-Defined Grouping Method
| Variables | Pre-Defined Grouping | Statistics | ||
|---|---|---|---|---|
| Stroke* n=182 | No Stroke n=4164 | |||
| Age, years, M [Q1,Q3] | 74.00 [69.00,80.00] | 68.00 [63.00,75.00] | Z=958.729 | <0.001 |
| Gender, n (%)† | ||||
| Male | 93 (45.35) | 2085 (45.87) | 0.933 | |
| Female | 89 (54.65) | 2079 (54.13) | ||
| Race, n(%) | ||||
| Mexican American | 12 (2.29) | 509 (4.05) | 0.048 | |
| Non-Hispanic Black | 34 (6.89) | 891 (8.29) | ||
| Non-Hispanic White | 107 (81.55) | 1878 (77.95) | ||
| Others | 29 (9.27) | 886 (9.70) | ||
| BMI, kg/m2, n(%) | ||||
| <18.5 | 55 (25.85) | 1139 (27.64) | 0.063 | |
| 18.5- | 1 (0.19) | 41 (1.00) | ||
| 25.0- | 74 (41.69) | 1635 (38.19) | ||
| 30.0- | 52 (32.26) | 1349 (33.17) | ||
| Education level, n(%) | ||||
| Less than 12th grade | 57 (23.24) | 973 (12.26) | 0.008 | |
| High school or above | 125 (76.76) | 3191 (87.74) | ||
| Marital status, n(%) | ||||
| Divorced/separated | 27 (11.74) | 745 (15.05) | 0.165 | |
| Married | 93 (59.63) | 2452 (64.93) | ||
| Widowed | 53 (25.63) | 735 (15.86) | ||
| Single | 9 (3.00) | 232 (4.16) | ||
| Physical activity | ||||
| Sedentary | 120 (62.52) | 2062 (45.31) | 0.009 | |
| Insufficient | 21 (13.17) | 660 (16.56) | ||
| Moderate | 17 (8.68) | 528 (15.15) | ||
| High | 24 (15.64) | 914 (22.97) | ||
| PIR, M [Q1,Q3] | 2.23 [1.42,4.19] | 3.22 [1.78,5.00] | Z=−473.070 | <0.001 |
| Smoking, n(%) | ||||
| No | 143 (85.19) | 3125 (79.00) | 0.081 | |
| Yes | 39 (14.81) | 1039 (21.00) | ||
| Alcohol consumption, n(%) | ||||
| No | 152 (76.90) | 3380 (77.85) | 0.856 | |
| Yes | 30 (23.10) | 784 (22.15) | ||
| CHF, n(%) | ||||
| Yes | 23 (10.36) | 265 (5.15) | 0.045 | |
| No | 159 (89.64) | 3899 (94.85) | ||
| CHD, n(%) | ||||
| Yes | 34 (17.33) | 387 (9.86) | 0.077 | |
| No | 148 (82.67) | 3777 (90.14) | ||
| Angina, n(%) | ||||
| Yes | 20 (10.77) | 190 (4.86) | 0.121 | |
| No | 162 (89.23) | 3974 (95.14) | ||
| HA, n(%) | ||||
| Yes | 23 (11.45) | 352 (7.69) | 0.188 | |
| No | 159 (88.55) | 3812 (92.31) | ||
| Diabetes, n(%) | ||||
| Yes | 77 (41.24) | 1104 (20.96) | 0.001 | |
| No | 105 (58.76) | 3060 (79.04) | ||
| Hypertension, n(%) | ||||
| Yes | 143 (79.72) | 2492 (56.49) | <0.001 | |
| No | 39 (20.28) | 1672 (43.51) | ||
| HDL, mg/dL, M [Q1,Q3] | 50.00 [41.00,59.00] | 54.00 [44.00,67.00] | Z=−358.113 | <0.001 |
| TG, mg/dL, M [Q1,Q3] | 102.00 [44.00,187.00] | 108.00 [55.00,210.00] | Z=−143.048 | <0.001 |
| LDL, mg/dL, M [Q1,Q3] | 88.00 [74.00,113.00] | 102.00 [81.00,125.00] | Z=−457.960 | <0.001 |
| TC, mg/dL, M [Q1,Q3] | 173.00 [145.00,199.00] | 189.00 [161.00,218.00] | Z=−559.595 | <0.001 |
| GHb, %, M [Q1,Q3] | 5.90 [5.50,6.60] | 5.70 [5.40,6.10] | Z=299.764 | <0.001 |
| GLU, mg/dL, M [Q1,Q3] | 113.00 [102.00,133.00] | 107.00 [99.00,120.00] | Z=370.373 | <0.001 |
| Dietary fiber, g, M [Q1,Q3] | 12.50 [8.80,18.00] | 15.00 [10.20,21.50] | Z=−455.767 | <0.001 |
| Vitamin A, mcg, M [Q1,Q3] | 466.00 [259.00,834.00] | 526.00 [307.00,833.00] | Z=−112.521 | <0.001 |
| Vitamin E, mg, M [Q1,Q3] | 6.45 [4.30,9.74] | 7.39 [4.80,11.00] | Z=−313.950 | <0.001 |
| Vitamin C,mg, M [Q1,Q3] | 40.70 [20.50,121.20] | 56.50 [23.60,108.60] | Z=−138.531 | <0.001 |
| Vitamin D, mcg, M [Q1,Q3] | 3.10 [1.20,5.50] | 3.20 [1.40,5.80] | Z=−82.555 | <0.001 |
| PUFA, n(%) | ||||
| <5 | 18 (10.05) | 297 (5.09) | 0.005 | |
| 5- | 47 (24.58) | 807 (17.69) | ||
| 10- | 44 (23.53) | 924 (21.67) | ||
| 15- | 31 (17.60) | 754 (19.23) | ||
| 20- | 18 (13.07) | 526 (13.34) | ||
| 25- | 24 (11.18) | 856 (22.98) | ||
Notes: *Stroke, the stroke group was defined using the medical condition questionnaire; †n%, sample weights were applied to the all the percentages.
Abbreviations: BMI, body mass index; PIR, poverty income ratio; CHF, congestive heart failure; CHD, coronary heart disease; HA, heart attack; HDL, high-density lipoprotein; LDL, low-density lipoproteins; TG, triglycerides; TC, total cholesterol; GHb, glycohemoglobin; GLU, plasma fasting glucose; PUFA, polyunsaturated fatty acids.
Logistic Regression Analysis of Stroke Risk Factors, Comparing the k-Means Clustering Method and the Pre-Defined Grouping Method
| Variables | K-Means Clustering | Pre-Defined Grouping | ||||||
|---|---|---|---|---|---|---|---|---|
| Unadjusted | Adjusted | Unadjusted | Adjusted | |||||
| OR (95% CI) | OR (95% CI) | OR (95% CI) | OR (95% CI) | |||||
| Age* | 1.031 (1.018–1.045) | <0.001 | 1.053 (1.029–1.077) | <0.001 | 1.097 (1.059–1.137) | <0.001 | 1.093 (1.054–1.132) | <0.001 |
| Diabetes | 74.026 (52.765–103.854) | <0.001 | 28.019 (19.139–41.020) | <0.001 | 2.647 (1.681–4.167) | <0.001 | 2.228 (1.432–3.466) | 0.001 |
| Hypertension | 3.386 (2.649–4.328) | <0.001 | 2.343 (1.602–3.426) | <0.001 | 3.028 (1.773–5.172) | <0.001 | 2.295 (1.338–3.938) | 0.002 |
| Dietary fiber† | 0.981 (0.971–0.991) | <0.001 | 0.980 (0.964–0.995) | 0.011 | 0.961 (0.943–0.980) | <0.001 | 0.966 (0.947–0.985) | 0.001 |
| Education level‡ | 0.473 (0.388–0.577) | <0.001 | 0.541 (0.411–0.713) | <0.001 | ||||
| GHb§ | 11.945 (8.947–15.947) | <0.001 | 2.309 (1.818–2.934) | <0.001 | ||||
| GLU¶ | 1.049 (1.041–1.058) | <0.001 | 1.017 (1.010–1.024) | <0.001 | ||||
Notes: *Age, every 1-year increase in age; †Dietary fiber, every 1-gram increase in dietary fiber consumption; ‡Education level, received education of high school or above; §GHb, every 1% increase in the GHb level; ¶GLU, every 1 mg/dL increase in the GLU level.
Abbreviations: OR, odds ratio; 95% CI, 95% confidence interval; GHb, glycohemoglobin; GLU, plasma fasting glucose.
Figure 4ROC curves evaluating the classification of diabetes.