| Literature DB >> 35922657 |
I Ghanzouri1, S Amal1, V Ho1, L Safarnejad1, J Cabot1, C G Brown-Johnson2, N Leeper1, S Asch2, N H Shah3, E G Ross4,5.
Abstract
Peripheral artery disease (PAD) is a common cardiovascular disorder that is frequently underdiagnosed, which can lead to poorer outcomes due to lower rates of medical optimization. We aimed to develop an automated tool to identify undiagnosed PAD and evaluate physician acceptance of a dashboard representation of risk assessment. Data were derived from electronic health records (EHR). We developed and compared traditional risk score models to novel machine learning models. For usability testing, primary and specialty care physicians were recruited and interviewed until thematic saturation. Data from 3168 patients with PAD and 16,863 controls were utilized. Results showed a deep learning model that utilized time engineered features outperformed random forest and traditional logistic regression models (average AUCs 0.96, 0.91 and 0.81, respectively), P < 0.0001. Of interviewed physicians, 75% were receptive to an EHR-based automated PAD model. Feedback emphasized workflow optimization, including integrating risk assessments directly into the EHR, using dashboard designs that minimize clicks, and providing risk assessments for clinically complex patients. In conclusion, we demonstrate that EHR-based machine learning models can accurately detect risk of PAD and that physicians are receptive to automated risk detection for PAD. Future research aims to prospectively validate model performance and impact on patient outcomes.Entities:
Mesh:
Year: 2022 PMID: 35922657 PMCID: PMC9349186 DOI: 10.1038/s41598-022-17180-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Sequential clinical data and aggregated demographic data are modelled in parallel then combined to make a final classification of PAD versus no PAD in deep learning model.
Descriptive statistics of case and control cohorts.
| Cases (N = 3,168) | Controls (N = 16,863) | ||
|---|---|---|---|
| Age (mean, y ± SD) | 74.8 (± 11) | 67.3 (± 11) | 2e-16 |
| Number (%) of Females | 1386 (40) | 9202 (54.6) | |
| Caucasians | 69.2 | 70.4 | NS |
| Black | 5.8 | 4.1 | 2.5e-05 |
| Asians | 14.1 | 14.1 | NS |
| Hispanic | 9.4 | 9.6 | NS |
| Other | 1.5 | 1.8 | NS |
| CVD (%) | 44.3 | 17.1 | |
| CAD (%) | 71.7 | 36.1 | |
| HF (%) | 46.4 | 20.2 | |
| HTN (%) | 85.6 | 54.4 | |
| Diabetes (%) | 21.7 | 8.4 | |
| HLD (%) | 39.4 | 23.5 | |
| BMI (mean, y ± SD) | 27.7 (± 6) | 27.6 (± 6) | NS |
| Current smokers (%) | 14.1 | 13.6 | 0.003 |
ASCVD—Atherosclerotic cardiovascular disease; CVD—cerebrovascular disease; CAD—coronary artery disease; HF—heart failure; HTN—hypertension; HLD—hyperlipidemia; BMI—body mass index.
Model results comparison.
| Model | Average AUC | Average specificity | Average sensitivity |
|---|---|---|---|
| Logistic regression | 0.81 | 0.73 | 0.75 |
| Nomogram | 0.64 | 0.62 | 0.62 |
| Machine learning | 0.91 | 0.81 | 0.83 |
| Deep learning | 0.96 | 0.88 | 0.95 |
The average area under the curve, sensitivity, and specificity for each model. AUC—area under the curve.
Figure 2Logistic regression (a) receiver operating characteristic curve and (b) calibration curves for five outer validation folds. AUC—area under the curve.
Figure 3Random forest (a) area under the curve and (b) calibration curves for five outer validation folds. AUC—area under the curve.
Figure 4Features most heavily weighted in discriminating between cases and controls in random forest model, based on feature importance averaged across folds.
Figure 5Deep learning model (a) area under the curve and (b) calibration curves for five outer validation folds. AUC—area under the curve.
Percentage increase in true positive cases by increasing model sophistication.
| % increase of true positives | Random forest | Deep learning |
|---|---|---|
| Logistic regression | + 9.6% | + 24.6% |
| Random forest | – | + 13.7% |
Figure 6Dashboards for presentation of patient risk of peripheral artery disease. (a) Tabbed dashboard. Further information on risk factors, patient summary, demographics and guideline recommendations are available only through directly clicking labeled links. (b) Unified Dashboard. All patient information is displayed in one frame of reference with guideline recommendations made available through clicking a link. AI—artificial intelligence; NLP—natural language processing; PAD—peripheral artery disease.
Usability themes and subthemes.
| Themes | Subthemes | Example Quote |
|---|---|---|
| Ease of understanding | Difficulty interpreting prediction output (6 of 12 participants) | “The prediction score … I don't know how to contextualize that. Is 75% to 100% when we consider screening? Some kind of scale would be helpful.” (Participant 7) |
| Ease of use | Desire for electronic health record integration (7 of 12 participants) | “I don't know if I would necessarily click on a link to go to another website. It is just one step removed that may decrease compliance.” (Participant 5) |
| Reducing clicks while navigating (8 of 12 participants) | “It took me three clicks to get here; it would have been better if it was simpler. It does affect my usage and efficiency and maybe satisfaction.” (Participant 2) | |
| Acceptability | Improving peripheral artery disease diagnosis (9 of 12 participants) | “[I]t would help remind me, especially for more complicated … it can be difficult… when you're just kind of inundated with a lot of different problems in a single 30-min visit … to continue to have this on your radar (Participant 11)” |
| “If you have somebody … on the borderline, … it would be nice to see whether you know your overall gestalt matches with the gestalt of the of the computer… If it was like 83% … I would maybe order that test.” (Participant 9) | ||
| Low priority in diagnosing peripheral artery disease amongst primary care physicians (4 of 12 participants) | If I'm trying to do other screenings, I have to put that risk benefit ratio in the context of everything else. If they haven't had their colonoscopy or mammogram, do I send them for that if they have limited bandwidth? Or do I send them for a PAD screen? (Participant 4) | |
| “[Peripheral artery disease] … it's not like coronary disease, where if you miss it, somebody is going to have an acute event and … death could ensue, right? Whereas if you have peripheral vascular disease that you haven't picked up and they're not symptomatic …, is it really going to make a big difference?” (Participant 1) | ||
| Varying perceptions of machine learning (2 of 12 participants) | “I'm for machine learning to help me be a better doctor. It's going to help me not miss diseases, and it's going to help me manage diseases better.” (Participant 4) | |
| “I think it's hard to know what to make of any particular AI prediction… I …would like to have a link to maybe a paper that was peer reviewed saying … this works” (Participant 11) |