| Literature DB >> 34127492 |
Carlton R Moore1, Saumya Jain2, Stephanie Haas3, Harish Yadav3, Eric Whitsel2, Wayne Rosamand2, Gerardo Heiss2, Anna M Kucharska-Newton2.
Abstract
OBJECTIVES: Using free-text clinical notes and reports from hospitalised patients, determine the performance of natural language processing (NLP) ascertainment of Framingham heart failure (HF) criteria and phenotype. STUDYEntities:
Keywords: cardiac epidemiology; health informatics; heart failure
Mesh:
Year: 2021 PMID: 34127492 PMCID: PMC8204176 DOI: 10.1136/bmjopen-2020-047356
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Hospital and patient characteristics for the validation dataset (N=406)
| Hospital ID | ||||
| A | B | C | D | |
| EHR vendor | Epic | Epic | Epic | Allscripts |
| Region | South-east | South | North | East |
| Status | Academic | Academic | Academic | Non-academic |
| Hospital bed size | 873 | 700 | 385 | 247 |
| Abstracted records (N) | 122 | 46 | 117 | 121 |
| Age, mean (SD) | 73.2 (10.6) | 70.4 (10.5) | 77.4 (9.9) | 78.6 (10.3) |
| Female, % | 43.2 | 39.1 | 61.5 | 59.0 |
| Identified as white, % | 54.4 | 8.7 | 90.6 | 93.4 |
| No health insurance, % | 0.0 | 0.0 | 0.9 | 9.8 |
| Medicaid insurance, % | 4.0 | 10.9 | 13.7 | 1.6 |
EHR, electronic health record.
Framingham8 HF phenotype criteria variables
| Framingham HF phenotype criteria variables | Criteria |
| 4.5 kg weight change over 5 days during hospitalisation | Major |
| Jugular venous distension | Major |
| Hepatojugular reflux | Major |
| Paroxysmal nocturnal dyspnoea | Major |
| Orthopnea | Major |
| Pulmonary basilar rales | Major |
| S3 gallop | Major |
| Alveolar/pulmonary oedema on chest X-ray | Major |
| Cardiomegaly on chest X-ray | Major |
| Lower extremity oedema | Minor |
| Hepatomegaly | Minor |
| Dyspnoea—exertion | Minor |
| Bilateral pleural effusion | Minor |
HF is diagnosed if the following are present: (1) two major criteria or (2) one major and two minor criteria.
HF, heart failure.
Figure 1Study design. ΨRecall (sensitivity), precision (positive-predictive value); φrecall, precision, ∆ estimated HF prevalence, % agreement; EHR, electronic health record; HF, heart failure; NLP, natural language processing.
Figure 2NLP pipeline. φEmergency department notes, hospital admission notes and discharge summaries; cTAKES, clinical Text Analysis Knowledge Extraction System; EHR, electronic health record; HF, heart failure; Negation, determination of whether an HF criteria is negated (eg, patient has oedema vs patient has no oedema); NLP, natural language processing.
NLP performance for abstracting Framingham HF phenotype criteria from EHRs. Validation dataset (N=406)
| HF criteria variables (n)* | Sensitivity, % (95% CI) | Specificity, % (95% CI) | PPV, % (95% CI) | Note types used |
| Weight loss ≥4.5 kg† (27) | 81.5 (61.9 to 93.7) | 96.0 (93.6 to 97.8) | 59.5 (43.7 to 75.3) | Structured data |
| Jugular venous distension (56) | 60.7 (46.8 to 73.5) | 91.7 (87.6 to 94.8) | 61.8 (49.0 to 74.6) | ED, AN |
| Hepatojugular reflux (0) | N/A | 99.7 (98.2 to 100.0) | 0.00 | ED, AN |
| PND (27) | 55.6 (35.3 to 74.5) | 89.4 (85.2 to 92.7) | 33.3 (19.2 to 46.7) | ED, AN, DC |
| Orthopnea (64) | 59.4 (46.4 to 71.5) | 92.7 (88.7 to 95.6) | 67.9 (55.7 to 80.1) | ED, AN, DC |
| Pulmonary basilar rales (93) | 61.3 (50.6 to 71.2) | 66.4 (59.7 to 72.6) | 43.8 (35.3 to 52.3) | ED, AN, DC |
| S3 gallop (5) | 40.0 (5.3 to 85.3) | 95.1 (92.0 to 97.2) | 11.8 (0.00 to 27.14) | ED, AN, DC |
| Pulmonary oedema (48) | 91.7 (80.0 to 97.7) | 51.0 (44.5 to 57.5) | 27.3 (20.4 to 34.2) | ED, AN, DC, IR |
| Cardiomegaly (162) | 54.3 (46.3 to 62.2) | 96.0 (90.9 to 98.7) | 96.7 (93.0 to 100.0) | ED, AN, DC, IR |
| Lower extremity oedema (163) | 74.8 (67.5 to 81.3) | 75.5 (67.7 to 82.2) | 77.2 (70.7 to 83.7) | ED, AN, DC |
| Hepatomegaly (3) | 33.3 (0.8 to 90.6) | 99.0 (97.2 to 99.8) | 33.3 (0.00 to 86.2) | ED, AN, IR |
| Dyspnoea on exertion (263) | 79.1 (73.7 to 83.8) | 74.5 (59.7 to 86.1) | 94.5 (91.5 to 97.5) | ED, AN, DC |
| Bilateral pleural effusion (79) | 75.9 (65.0 to 84.9) | 73.1 (66.5 to 79.0) | 51.7 (42.6 to 60.8) | ED, AN, DC, IR |
*Instances in total cohort that criteria were identified by manual ARIC abstractors (reference standard).
†Weight loss during hospitalisation based on structured daily patient weight data.
AN, admission note; ARIC, Atherosclerosis Risk in Communities; DC, discharge summary; ED, emergency department; EHRs, electronic health records; HF, heart failure; IR, imaging report; NLP, natural language processing; PND, paroxysmal nocturnal dyspnoea; PPV, positive-predictive value.
Performance of NLP-based ascertainment of Framingham HF phenotype
| Sensitivity, % (95% CI) | Specificity, % (95% CI) | PPV, % (95% CI) | Agreement, % (95% CI) |
| 78.8 (72.8 to 83.9) | 81.7 (75.2 to 87.0) | 84.4 (79.5 to 89.3) | 80.0 (75.8 to 83.8) |
Note types: emergency department notes, hospital admission notes and discharge summaries.
HF, heart failure; NLP, natural language processing; PPV, positive-predictive value.
Figure 3EHR-based performance for Framingham HF phenotype by hospital with 95% CIs. EHR, electronic health record; HF, heart failure.