| Literature DB >> 36199766 |
Lu Liu1,2, Min-Min Zhu1,2, Lin-Lin Cai1,2, Xiao Zhang1,2.
Abstract
Aim: This study used machine learning methods to develop a prediction model for knee pain in middle-aged and elderly individuals.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36199766 PMCID: PMC9529423 DOI: 10.1155/2022/5005195
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.809
The characteristics of the study population.
| Knee pain | No knee pain |
| |
|---|---|---|---|
|
| 1690 | 3696 | |
| Age | 64.0 (54.0, 74.0) | 64.0 (53.0, 74.0) | 0.202 |
| Gender (female), | 962 (56.9%) | 1758 (47.6%) | < 0.001 |
| Race, | 0.383 | ||
| Non-Hispanic white | 977 (57.8%) | 2150 (58.2%) | |
| Non-Hispanic black | 320 (18.9%) | 643 (17.4%) | |
| Mexican American | 296 (17.5%) | 658 (17.8%) | |
| Others | 97 (5.7%) | 245 (6.6%) | |
| Education, | 0.029 | ||
| Below high school | 591 (35.0%) | 1223 (33.1%) | |
| High school | 424 (25.1%) | 855 (23.1%) | |
| Above high school | 675 (39.9%) | 1618 (43.8%) | |
| Hypertension (yes), | 917 (54.3%) | 1648 (44.6%) | < 0.001 |
| Diabetes (yes), | 305 (18.0%) | 549 (14.9%) | 0.003 |
| Pain elsewhere (yes), | 1192 (70.5%) | 1247 (33.7%) | < 0.001 |
| Moderate activity (yes), | 683 (40.4%) | 1663 (45.0%) | 0.002 |
| Vigorous activity (yes), | 255 (15.1%) | 734 (19.9%) | < 0.001 |
| Smoking (yes), | 920 (54.4%) | 2014 (54.5%) | 0.994 |
| Drinking (yes), | 315 (18.6%) | 638 (17.3%) | 0.234 |
| BMI (kg/m2) | 29.1 (25.8, 33.3) | 27.1 (24.1, 30.6) | < 0.001 |
| Waist circumference (cm) | 102.5 (93.5, 111.8) | 98.2 (89.4, 107.3) | < 0.001 |
| Albumin (g/L) | 42.0 (40.0, 44.0) | 42.0 (40.0, 44.0) | 0.002 |
| Phosphorus (mg/dL) | 3.7 (3.4, 4.1) | 3.7 (3.4, 4.0) | 0.075 |
| Total calcium (mg/dL) | 9.5 (9.2, 9.7) | 9.5 (9.2, 9.7) | 0.590 |
| Triglycerides (mg/dL) | 128.0 (89.0, 184.8) | 123.0 (85.0, 179.0) | 0.018 |
| Cholesterol (mg/dL) | 205.0 (179.0, 234.8) | 205.0 (180.0, 233.0) | 0.608 |
| Vitamin D (nmol/L) | 55.7 (39.7, 70.6) | 58.1 (43.4, 72.9) | < 0.001 |
| eGFR (ml/min/1.73m2) | 77.9 (20.6) | 78.4 (20.6) | 0.410 |
BMI: body mass index; eGFR: estimated glomerular filtration rate.
Multivariable logistic regression.
| Odds ratio | 95% CI |
| |
|---|---|---|---|
| Age | 1.01 | 1.00-1.02 | 0.105 |
| Gender (female) | 1.28 | 1.06-1.55 | 0.011 |
| Race | |||
| Non-Hispanic white |
| ||
| Non-Hispanic black | 1.15 | 0.91-1.45 | 0.249 |
| Mexican American | 1.02 | 0.81-1.28 | 0.889 |
| Others | 0.91 | 0.65-1.26 | 0.563 |
| Education | |||
| Below high school |
| ||
| High school | 1.06 | 0.86-1.31 | 0.588 |
| Above high school | 0.95 | 0.78-1.15 | 0.598 |
| Hypertension (yes) | 1.11 | 0.95-1.31 | 0.198 |
| Diabetes (yes) | 0.91 | 0.73-1.12 | 0.376 |
| Pain elsewhere (yes) | 4.64 | 3.98-5.43 | < 0.001 |
| Moderate activity (yes) | 1.04 | 0.89-1.22 | 0.617 |
| Vigorous activity (yes) | 0.90 | 0.72-1.12 | 0.338 |
| Smoking (yes) | 1.12 | 0.96-1.31 | 0.158 |
| Drinking (yes) | 0.86 | 0.70-1.05 | 0.152 |
| BMI (kg/m2) | 1.05 | 1.02-1.08 | < 0.001 |
| Waist circumference | 1.00 | 0.99-1.02 | 0.471 |
| Albumin (g/L) | 1.01 | 0.98-1.04 | 0.584 |
| Phosphorus (mg/dL) | 0.99 | 0.85-1.15 | 0.850 |
| Total calcium (mg/dL) | 1.01 | 0.82-1.24 | 0.927 |
| Triglycerides (mg/dL) | 1.00 | 1.00-1.00 | 0.470 |
| Cholesterol (mg/dL) | 1.00 | 1.00-1.00 | 0.674 |
| Vitamin D (nmol/L) | 1.00 | 0.99-1.00 | 0.438 |
| eGFR (ml/min/1.73m2) | 1.00 | 1.00-1.01 | 0.457 |
BMI: body mass index; eGFR: estimated glomerular filtration rate.
Figure 1The receiver operating characteristic curves of the logistic regression, random forest, and XGBoost based on the test set.
Figure 2Variable importance of the random forest model evaluated by (a) mean decrease accuracy and (b) mean decrease Gini index.
Figure 3The prediction model based on logistic regression visualized by a nomogram.