| Literature DB >> 35856080 |
Ashish Sarraju1, Jean Coquet2,3, Alban Zammit2,3, Antonia Chan2, Summer Ngo1, Tina Hernandez-Boussard2,3,4, Fatima Rodriguez1.
Abstract
Background: Statins conclusively decrease mortality in atherosclerotic cardiovascular disease (ASCVD), the leading cause of death worldwide, and are strongly recommended by guidelines. However, real-world statin utilization and persistence are low, resulting in excess mortality. Identifying reasons for statin nonuse at scale across health systems is crucial to developing targeted interventions to improve statin use.Entities:
Keywords: Cardiology; Cardiovascular diseases; Computational biology and bioinformatics; Disease prevention; Health services
Year: 2022 PMID: 35856080 PMCID: PMC9287295 DOI: 10.1038/s43856-022-00157-w
Source DB: PubMed Journal: Commun Med (Lond) ISSN: 2730-664X
Fig. 1CONSORT-style diagram for cohort selection.
Abbreviations: ASCVD atherosclerotic cardiovascular disease, SHA Stanford Health Care Alliance (consisting of an academic hospital, a community hospital, and a community practice clinic network), NLP natural language processing. “Statin allergy documented” refers to the documentation of a statin allergy in the structured “allergies” field of the EHR.
Baseline characteristics of the ASCVD study cohort by the presence or absence of a statin prescription.
| Characteristic at index date ( | Statin prescription present ( | Statin prescription absent ( | ||
|---|---|---|---|---|
| Age (years, mean ±SD) | 68.6 ± 11.6 | 65.5 ± 14.7 | <0.001 | |
| Female | 12179 (35.6%) | 10054 (45.1%) | <0.001 | |
| Race | Non-Hispanic White | 19086 (55.7%) | 13031 (58.4%) | <0.001 |
| Non-Hispanic Black | 1687 (4.9%) | 1297 (5.8%%) | ||
| Hispanic | 3167 (9.3%) | 2139 (9.6%) | ||
| Non-Hispanic Asian | 5553 (16.2%) | 3028 (13.6%) | ||
| Other | 3123 (9.1%) | 1656 (7.4%) | ||
| Provider location | SHC | 20442 (59.7%) | 11152 (50.0%) | <0.001 |
| UHA | 12226 (35.7%) | 9779 (43.9%) | ||
| ValleyCare | 1552 (4.5%) | 1191 (5.3%) | ||
| ASCVD type | Coronary artery | 20152 (58.9) | 7577 (34.0%) | <0.001 |
| Cerebrovascular | 7155 (20.9) | 5484 (24.6%) | ||
| Peripheral Arterial | 3117 (9.1) | 3072 (13.8%) | ||
| Polyvascular | 3810 (11.1%) | 6163 (27.6%) | ||
| Current smoking | 2006 (5.9%) | 925 (4.1%) | <0.001 | |
| Hospitalizations in prior 1 year (N) | 3000 (8.8%) | 1831 (8.2%) | 0.022 | |
| Insurance status | Private | 6984 (20.4%) | 5420 (24.3%) | <0.001 |
| Medicare | 20097 (58.7%) | 11696 (52.4%) | ||
| Medicaid | 2144 (6.3%) | 1529 (6.8%) | ||
| PCSK9 inhibitors | 59 (0.2%) | 68 (0.3%) | 0.001 | |
| Ezetimibe | 1697 (5.0%) | 292 (1.3%) | <0.001 | |
| Total cholesterol (mg/dL, mean ± SD) | 167.2 ± 46.2 | 169.3 ± 39.8 | 0.698 | |
| LDL-cholesterol level at index (mg/dl, mean ± SD) | 90.2 ± 38.3 | 107.9 ± 37.2 | <0.001 | |
| Chronic kidney disease | 5286 (15.4%) | 2248 (10.0%) | <0.001 | |
| Heart failure | 6091 (17.8%) | 2937 (13.2%) | <0.001 | |
| Atrial fibrillation | 5565 (16.3%) | 3393 (15.2%) | 0.003 | |
| Liver disease | 1966 (5.7%) | 1483 (6.7%) | <0.001 | |
| Creatine kinase level (mean ± SD) | 252.3 ± 746.6 | 386.5 ± 1022 | 0.001 | |
| Encounter specialties | Cardiology | 10906 (31.8%) | 7985 (35.8%) | <0.001 |
| Internal medicine | 3176 (9.2%) | 1816 (8.1%) | ||
| Family medicine | 2263 (6.6%) | 991 (4.4%) | ||
| Vascular surgery | 1486 (4.3%) | 1093 (4.9%) | ||
| Radiology | 1181 (3.5%) | 993 (4.4%) | ||
| Neurosurgery | 664 (1.9%) | 1017 (4.6%) | ||
| Primary care | 1106 (3.2%) | 459 (2.0%) | ||
| Emergency medicine | 977 (2.8%) | 540 (2.4%) | ||
| Neurology | 802 (2.3%) | 618 (2.8%) | ||
| Anesthesiology | 791 (2.3%) | 510 (2.3%) | ||
| Others | 10882 (31.8%) | 6274 (28.1%) | ||
ASCVD atherosclerotic cardiovascular disease, LDL low-density lipoprotein, SHC Stanford Health Care (academic hospital), UHA University Health Alliance (community practice network), ValleyCare ValleyCare Hospital (community hospital), SD standard deviation.
Fig. 2Training, internal validation, and application of a deep learning model (Clinical BERT) for natural language processing to identify statin nonuse and classify reasons for statin nonuse from unstructured clinical notes of patients with ASCVD.
Abbreviations: BERT Bidirectional Encoder Representations from Transformers, NLP natural language processing.
Performance of deep learning NLP models to characterize statin nonuse from unstructured clinical notes in persons with ASCVD.
| Task | Dataset | Precision* | Recall* | F1 score* | AUC* |
|---|---|---|---|---|---|
| Binary classification of statin use | 10-fold cross-validation (N = 1,393) | 0.88 (0.86–0.90) | 0.82 (0.77-0.87) | 0.85 (0.83–0.87) | 0.94 (0.93–0.95) |
| Test set (N = 349) | 0.87 (0.82–0.91) | 0.82 (0.76–0.88) | 0.84 (0.81–0.88) | 0.94 (0.93–0.96) | |
| Two-step classifier* for statin nonuse reasons | 10-fold cross-validation (N = 800) | 0.63 (0.59–0.65) | 0.62 (0.54–0.72) | 0.62 (0.59–0.64) | 0.84 (0.81–0.85) |
| Test set (N = 200) | 0.68 (0.63–0.75) | 0.69 (0.60–0.79) | 0.68 (0.62–0.75) | 0.88 (0.86–0.91) | |
| Multilabel classification of statin nonuse reasons (simple mutlilabel model) | 10-fold cross-validation (N = 800) | 0.60 (0.58–0.64) | 0.61 (0.56–0.66) | 0.59 (0.56–0.63) | 0.85 (0.83–0.87) |
| Test set (N = 200) | 0.64 (0.61–0.70) | 0.66 (0.60–0.73) | 0.64 (0.58–0.71) | 0.86 (0.82–0.89) |
*The two-step classifier represents the predicted probabilities of multiple classifiers (each reason for statin nonuse versus others) reconciled by a Random Forest.
ASCVD atherosclerotic cardiovascular disease, NLP natural language processing.
NLP-identified reasons for statin nonuse in patients with ASCVD, stratified by type of ASCVD and race/ethnicity.
| Cohort ( | Reason for nonuse | |||||
|---|---|---|---|---|---|---|
| Side effect | ||||||
| Muscle | Other | Nonspecific | Perceived lipid control | Patient preference | ||
| Total number | 385 | 404 | 1011 | 274 | 321 | |
| Stratified by type of ASCVD ( | ||||||
| Coronary artery disease | 224 (58.2) | 233 (57.7) | 472 (46.7) | 123 (44.9) | 175 (54.5) | <0.001 |
| Peripheral artery disease | 35 (9.1) | 38 (9.4) | 129 (12.8) | 29 (10.6) | 47 (14.6) | |
| Cerebrovascular disease | 87 (22.6) | 85 (21.0) | 310 (30.7) | 95 (34.7) | 74 (23.1) | |
| Polyvascular disease | 39 (10.1) | 48 (11.9) | 100 (9.9) | 29 (10.6) | 47 (14.6) | |
| Stratified by race/ethnicity ( | ||||||
| Non-Hispanic White | 244 (63.4) | 273 (67.6) | 554 (54.8) | 132 (48.2) | 214 (66.7) | <0.001 |
| Non-Hispanic Black | 13 (3.4) | 25 (6.2) | 62 (6.1) | 19 (6.9) | 13 (4.0) | |
| Hispanic | 30 (7.8) | 15 (3.7) | 96 (9.5) | 34 (12.4) | 19 (5.9) | |
| Non-Hispanic Asian | 40 (10.4) | 46 (11.4) | 96 (9.5) | 34 (12.4) | 19 (5.9) | |
ASCVD atherosclerotic cardiovascular disease, NLP natural language processing.
Excerpts from clinical notes demonstrating the reasons for statin nonuse identified in this study.
| Category | Note excerpt |
|---|---|
| Muscle-based side-effects | “intolerant of low dose statins (started with high CK)” |
| Other side-effects | “has been intolerant to 3 different statin drugs... they cause diarrhea and nausea” |
| Perceived lipid control | “Will also check lipid panel. If LDL < 100, no indication for statin at this time”; “… LDL is well controlled” |
| Patient Preference | “Declines statins” |
| Nonspecific | “OK for no statin at this time”; “discuss statin next visit” |
ASCVD atherosclerotic cardiovascular disease, CK creatinine kinase, LDL low-density lipoprotein cholesterol, NLP natural language processing.