| Literature DB >> 33789660 |
Arielle Selya1,2,3,4, Drake Anshutz5,6, Emily Griese5,7, Tess L Weber5, Benson Hsu8, Cheryl Ward5,9.
Abstract
BACKGROUND: Diabetes is a medical and economic burden in the United States. In this study, a machine learning predictive model was developed to predict unplanned medical visits among patients with diabetes, and findings were used to design a clinical intervention in the sponsoring healthcare organization. This study presents a case study of how predictive analytics can inform clinical actions, and describes practical factors that must be incorporated in order to translate research into clinical practice.Entities:
Keywords: Diabetes; Machine learning; Predictive model; Unplanned medical visits
Mesh:
Year: 2021 PMID: 33789660 PMCID: PMC8011134 DOI: 10.1186/s12911-021-01474-1
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Characteristics of patients with diabetes by unplanned visit status
| Predictor variable | No unplanned visits | ≥ 1 Unplanned visits | |
|---|---|---|---|
| Age | |||
| BMI | 32.3 (28.3–37.0) | 32.2 (28.0–37.3) | = .2454 |
| Systolic BP | 126.0 (118.0–134.0) | 126.0 (116.0–136) | = .0089 |
| Diastolic BP | |||
| LDL cholesterol | 85.0 (67.0–106.0) | 84.0 (65.0–106.0) | = .0053 |
| HDL cholesterol | |||
| A1C | |||
| Ranked smoking status | |||
| Number of diagnoses on problem list | |||
| Number of prescriptions |
Variables are summarized as median (interquartile range). A1C glycohemoglobin. BMI body mass index. BP blood pressure. HDL high-density lipoprotein. LDL low-density lipoprotein. p values are based on t-tests of each variable across groups (any vs. no unplanned visits). Bold: p < .05
Generalization performance of classifiers with optimized parameters, presented as confusion matrices and balanced accuracy ± standard deviation across five-fold cross-validation
| Classifier | Most stable parameters across outer folds | Predicted: no unplanned visits | Predicted: ≥ 1 unplanned visit | ||
|---|---|---|---|---|---|
| Linear discriminant analysis | N/A | Actual: No Unplanned Visits | 50.8% ± 1.4% | 49.2% ± 1.4% | |
| Actual: ≥ 1 Unplanned Visit | 24.8% ± 1.0% | 75.2% ± 1.0% | |||
| Average | 63.0% ± 0.7% | ||||
| Quadratic discriminant analysis | N/A | Actual: No Unplanned Visits | 82.5% ± 0.6% | 17.5% ± 0.6% | |
| Actual: ≥ 1 Unplanned Visit | 56.3% ± 0.8% | 43.7% ± 0.8% | |||
| Average | 63.3% ± 0.4% | ||||
| Linear SVM | Cost = 25 | Actual: No Unplanned Visits | 71.1% ± 0.8% | 28.9% ± 0.8% | |
| Actual: ≥ 1 Unplanned Visit | 39.8% ± 1.0% | 60.2% ± 1.0% | |||
| Average | 65.7% ± 0.3% | ||||
| Radial SVM | Cost = 50; Gamma = 0.1 | Actual: No Unplanned Visits | 57.6% ± 1.4% | 42.5% ± 1.4% | |
| Actual: ≥ 1 Unplanned Visit | 28.4% ± 0.9% | 71.6% ± 0.9% | |||
| Average | 64.6% ± 0.8% | ||||
| Single hidden layer NN | Hidden layer = 20 nodes; Iterations = 200; Decay = 0.0 | Actual: No Unplanned Visits | 50.7% ± 28.7% | 49.3% ± 28.7% | |
| Actual: ≥ 1 Unplanned Visit | 31.6% ± 20.4% | 68.4% ± 20.4% | |||
| Average | 59.5% ± 7.7% | ||||
| Triple hidden layer DNN | Hidden layers = 20 nodes; Learning = 1.0; Momentum = 0.5; Iterations = 20 | Actual: No Unplanned Visits | 65.7% ± 14.5% | 34.4% ± 14.5% | |
| Actual: ≥ 1 Unplanned Visit | 36.7% ± 14.6% | 63.3% ± 14.6% | |||
| Average | 64.5% ± 0.8% | ||||
| XG boost | Max depth = 20; Eta = 0.90; # rounds = 200; Gamma = 10; Min. child weight = 10; Ratio of column per tree = 1.0 | Actual: No Unplanned Visits | 33.9% ± 30.8% | 66.1% ± 30.8% | |
| Actual: ≥ 1 Unplanned Visit | 16.7% ± 15.3% | 83.3% ± 15.3% | |||
| Average | 58.6% ± 7.8% | ||||
| Logistic Regression | N/A | Actual: No Unplanned Visits | 60.4% ± 0.8% | 39.6% ± 0.8% | |
| Actual: ≥ 1 Unplanned Visit | 29.8% ± 0.8% | 70.2% ± 0.8% | |||
| Average | 65.3% ± 0.7% | ||||
Basic cross-validation was run for classifiers without hypermarameters (linear and quadratic discriminant analysis, logistic regression) and nested cross-validation for classifiers with hyperparameters (linear and radial SVM, single- layer NN and triple-layer DNN) to optimize hyperparameters
Cross-validation matrices show the generalization performance with respect to the actual class (rows) against the predicted class (columns), with ± standard deviation across cross-validation runs. DNN deep nets. NN neural nets. SVM support vector machines. XG boost extreme gradient boosting
Optimal hyper-parameters across each of 5 “outer” folds in nested cross-validation
| Parameter | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 |
|---|---|---|---|---|---|
| Cost | 0.1 | 25 | 25 | 25 | 25 |
| Cost | 25 | 50 | 50 | 50 | 50 |
| Gamma | 0.1 | 0.1 | 0.1 | 0.1 | 0.1 |
| Size of hidden layer | 15 | 20 | 20 | 20 | 1 |
| Maximum # iterations | 100 | 200 | 200 | 200 | 100 |
| Decay | 0.0 | 0.0 | 0.0 | 0.0 | 0.1 |
| Size of 3 hidden layers | 20, 20, 20 | 20, 20, 20 | 20, 20, 20 | 20, 20, 20 | 20, 20, 20 |
| Learning rate | 1 | 1 | 1 | 1 | 1 |
| Momentum | 0.5 | 0.5 | 0.5 | 0.5 | 0.5 |
| Number of epochs | 20 | 20 | 20 | 20 | 20 |
| Max depth | 20 | 20 | 6 | 20 | 6 |
| Eta | 0.9 | 0.9 | 0.01 | 0.9 | 0.01 |
| Nrounds | 200 | 200 | 50 | 200 | 50 |
| Gamma | 10 | 10 | 0 | 10 | 0 |
| Min. child weight | 10 | 10 | 0 | 10 | 0 |
| Ratio of columns per tree | 1.0 | 1.0 | 0.1 | 1.0 | 0.1 |
NN neural nets. DNN deep nets. SVM support vector machines. XG boost extreme gradient boosting
Training performance of classifiers with optimized parameters, presented as confusion matrices and balanced accuracy ± standard deviation across five-fold cross-validation runs
| Classifier | Predicted: no unplanned visits | Predicted: ≥ 1 unplanned visit | |
|---|---|---|---|
| Actual: No Unplanned Visits | 50.7% ± 1.1% | 49.3% ± 1.1% | |
| Actual: ≥ 1 Unplanned Visit | 24.7% ± 0.7% | 75.3% ± 0.7% | |
| Average | 63.0 ± 0.2% | ||
| Actual: No Unplanned Visits | 83.0% ± 0.2% | 17.1% ± 0.2% | |
| Actual: ≥ 1 Unplanned Visit | 56.2% ± 0.2% | 43.8% ± 0.2% | |
| Average | 63.4% ± 0.1% | ||
| Actual: No Unplanned Visits | 71.3% ± 0.8% | 28.7% ± 0.8% | |
| Actual: ≥ 1 Unplanned Visit | 39.6% ± 0.7% | 60.4% ± 0.7% | |
| Average | 65.8% ± 0.1% | ||
| Actual: No Unplanned Visits | 67.0% ± 1.1% | 33.0% ± 1.1% | |
| Actual: ≥ 1 Unplanned Visit | 21.4% ± 0.4% | 78.6% ± 0.4% | |
| Average | 72.8% ± 0.1% | ||
| Actual: No Unplanned Visits | 50.8% ± 28.7% | 49.2% ± 28.7% | |
| Actual: ≥ 1 Unplanned Visit | 31.5% ± 20.2% | 68.5% ± 20.2% | |
| Average | 59.7% ± 7.9% | ||
| Actual: No Unplanned Visits | 65.4% ± 15.0% | 34.6% ± 15.0% | |
| Actual: ≥ 1 Unplanned Visit | 36.5% ± 14.3% | 63.5% ± 14.3% | |
| Average | 64.4% ± 0.8% | ||
| Actual: No Unplanned Visits | 38.8% ± 35.3% | 61.2% ± 35.3% | |
| Actual: ≥ 1 Unplanned Visit | 12.9% ± 11.2% | 87.2% ± 11.2% | |
| Average | 63.0% ± 11.9% | ||
| Actual: No Unplanned Visits | 60.4% ± 0.2% | 39.6% ± 0.2% | |
| Actual: ≥ 1 Unplanned Visit | 29.8% ± 0.2% | 70.2% ± 0.2% | |
| Average | 65.3% ± 0.2% | ||
Basic cross-validation was run for classifiers without hypermarameters (linear and quadratic discriminant analysis, logistic regression) and nested cross-validation for classifiers with hyperparameters (linear and radial SVM, single-layer NN and triple-layer DNN) to optimize hyperparameters
Cross-validation matrices show the training performance with respect to the actual class (rows) against the predicted class (columns), with ± standard deviation across cross-validation runs. DNN deep nets. NN neural nets. SVM support vector machines. XG boost extreme gradient boosting
Sensitivity analysis showing the disruption of balanced accuracy when adding normally-distributed noise (0.3 × standard deviation) to each variable
| Variable range | New balanced accuracy (%) | Change in balanced accuracy (vs. 65.8% on original sample) (%) |
|---|---|---|
| A1C | 65.7 | − 0.1 |
| BMI | 64.7 | − 1.1 |
| BP | 64.5 | − 1.3 |
| HDL | 64.4 | − 1.4 |
| LDL | 65.8 | − 0.0 |
| Tobacco use | 65.0 | − 0.8 |
Balanced accuracy is the average of the sensitivity and specificity rates (see text), based on test sets across 25 cross-validation tests using repeated-hold-20%-out subsampling. Change in balanced accuracy is relative to the optimized classification results using the original data sample in Table 2 (65.8%)
A1C glycohemoglobin. BMI body mass index. BP blood pressure. HDL high-density lipoprotein. LDL low-density lipoprotein