| Literature DB >> 36099008 |
Zuzanna Nowinka1, M Abdulhadi Alagha1,2, Khadija Mahmoud1, Gareth G Jones1.
Abstract
BACKGROUND: Knee osteoarthritis (OA) is the most common form of OA and a leading cause of disability worldwide. Chronic pain and functional loss secondary to knee OA put patients at risk of developing depression, which can also impair their treatment response. However, no tools exist to assist clinicians in identifying patients at risk. Machine learning (ML) predictive models may offer a solution. We investigated whether ML models could predict the development of depression in patients with knee OA and examined which features are the most predictive.Entities:
Keywords: depression; knee osteoarthritis; machine learning; predictive modeling
Year: 2022 PMID: 36099008 PMCID: PMC9518113 DOI: 10.2196/36130
Source DB: PubMed Journal: JMIR Form Res ISSN: 2561-326X
Summary of all features included in the model training.
| Feature category | Features |
| Patient demographics | Age, sex, BMI, ethnicity, employment status, education status, living alone, marital status, smoking status |
| Past medical history and medication | Heart attack, heart failure, stroke, asthma, chronic obstructive pulmonary disease, peptic ulcer disease, diabetes, kidney disease, osteoporosis medication |
| Knee osteoarthritis history | Knee arthroscopy, knee meniscectomy, ligament repair, other knee surgery, arthritis of other joints, knee injury, steroid knee injections, analgesic medication for knee osteoarthritis, arthritis medication |
| Baseline examination findings | Blood pressure, 20-meter-walk test, five-stands-to-sit test, KLGa,b, CES-Dc baseline |
| Patient-reported outcome measures | WOMACa,d (Total, Pain score, Stiffness score); SF-12e (Physical components, Mental health component); PASEf |
aSeparate feature for the right and left knee.
bKLG: Kellgren-Lawrence Grade.
cCES-D: Center for Epidemiological Studies Depression Scale.
dWOMAC: Western Ontario and McMaster Universities Osteoarthritis Index.
eSF-12: 12-item Short Form Health Survey.
fPASE: Physical Activity Scale for the Elderly.
Figure 1Flowchart summarizing the project timeline and steps of model development. AUC: area under the receiver operating characteristic curve; GBM: gradient boosting machine; LASSO: least absolute shrinkage and selection operator; MOST: Multicenter Osteoarthritis Study; OAI: Osteoarthritis Initiative.
Figure 2Summary of patient flow for both databases. CES-D: Center for Epidemiological Studies Depression Scale.
Key patient demographic and clinical data.
| Characteristic | OAIa (n=3711) | MOSTb (n=2236) | |||
| Age, mean (SD) | 61.0 (9.1) | 62.1 (8.1) | |||
| BMI, mean (SD) | 28.4 (4.8) | 30.4 (5.9) | |||
| Sex (female), n (%) | 2149 (57.91) | 1297 (58.01) | |||
| Ethnicity (white), n (%) | 3082 (83.05) | 1932 (86.40) | |||
| Blood pressure (hypertension stage≥1), n (%) | 1847 (49.77) | 1008 (45.08) | |||
| Other arthritis, n (%) | 1454 (39.18) | 1071 (47.90) | |||
| Analgesic medication for knee OAc (any), n (%) | 845 (22.77) | 1804 (80.68) | |||
|
| |||||
|
| Right knee, grade 1 or higher | 2294 (61.82) | 1180 (52.77) | ||
|
| Left knee, grade 1 or higher | 2206 (59.44) | 1264 (56.53) | ||
|
| |||||
|
| Right knee | 10.7 (10.3) | 18.6 (17.5) | ||
|
| Left knee | 10.7 (10.4) | 18.3 (17.5) | ||
| Baseline CES-Df, mean (SD) | 6.3 (6.0) | 6.7 (6.2) | |||
| Depression at 2-year visit, n (%) | 342 (9.22) | 265 (11.85) | |||
aOAI: Osteoarthritis Initiative.
bMOST: Multicenter Osteoarthritis Study.
cOA: osteoarthritis.
dKLG: Kellgren-Lawrence Grade.
eWOMAC: Western Ontario and McMaster Universities Osteoarthritis Index.
fCES-D: Center for Epidemiological Studies Depression Scale.
Model performance for the internal test set and external validation set.
| Ranka | Model | Test set (OAIb), AUCc (95% CI) | External validation set (MOSTd), AUC (95% CI) |
| 1 | LASSOe | 0.869 (0.824-0.913) | 0.876 (0.853-0.899) |
| 2 | GBMf | 0.858 (0.813-0.903) | 0.872 (0.849-0.895) |
| 3 | Ridge | 0.864 (0.818-0.910) | 0.852 (0.827-0.878) |
| 4 | Random forest | 0.808 (0.741-0.874) | 0.822 (0.790-0.853) |
| 5 | Logistic regression | 0.837 (0.786-0.888) | 0.808 (0.775-0.840) |
| 6 | Decision tree | 0.673 (0.604-0.742) | 0.720 (0.685-0.755) |
aModels are ranked by their performance on the external validation data set.
bOAI: Osteoarthritis Initiative.
cAUC: area under the receiver operating characteristic curve.
dMOST: Multicenter Osteoarthritis Study.
eLASSO: least absolute shrinkage and selection operator.
fGBM: gradient boosting machine.
Figure 3AUC plot of all models tested on the OAI test set (20% of the initial OAI data set). The test set was not used at any stage of model training. AUC: area under the receiver operating characteristic curve; GBM: gradient boosting machine; LASSO: least absolute shrinkage and selection operator; MOST: Multicenter Osteoarthritis Study; OAI: Osteoarthritis Initiative.
Figure 4AUC plot of all models externally validated on the MOST data set. AUC: area under the receiver operating characteristic curve; GBM: gradient boosting machine; LASSO: least absolute shrinkage and selection operator; MOST: Multicenter Osteoarthritis Study; OAI: Osteoarthritis Initiative.
Accuracy, precision, recall, and F1 scores for the test set, ranked by the F1 score.
| Rank | Model | Accuracy | Precision | Recall | F1 |
| 1 | LASSOa | 0.902 | 0.467 | 0.515 | 0.490 |
| 2 | Random forest | 0.923 | 0.628 | 0.397 | 0.486 |
| 3 | Logistic regression | 0.906 | 0.485 | 0.485 | 0.485 |
| 4 | GBMb | 0.901 | 0.466 | 0.500 | 0.482 |
| 5 | Decision tree | 0.895 | 0.429 | 0.441 | 0.435 |
| 6 | Ridge | 0.908 | 0.500 | 0.426 | 0.460 |
aLASSO: least absolute shrinkage and selection operator.
bGBM: gradient boosting machine.
Accuracy, precision, recall, and F1 scores for the validation set, ranked by the F1 score.
| Rank | Model | Accuracy | Precision | Recall | F1 |
| 1 | LASSOa | 0.889 | 0.528 | 0.604 | 0.563 |
| 2 | Decision tree | 0.890 | 0.538 | 0.536 | 0.537 |
| 3 | GBMb | 0.865 | 0.453 | 0.657 | 0.536 |
| 4 | Random forest | 0.894 | 0.556 | 0.506 | 0.530 |
| 5 | Logistic regression | 0.886 | 0.344 | 0.698 | 0.461 |
| 6 | Ridge | 0.895 | 0.593 | 0.370 | 0.456 |
aLASSO: least absolute shrinkage and selection operator.
bGBM: gradient boosting machine.