| Literature DB >> 29667017 |
Mathias Kaspar1,2, Georg Fette1,2, Gülmisal Güder1, Lea Seidlmayer1, Maximilian Ertl2,3, Georg Dietrich2, Helmut Greger3, Frank Puppe2, Stefan Störk4.
Abstract
BACKGROUND: Heart failure is the predominant cause of hospitalization and amongst the leading causes of death in Germany. However, accurate estimates of prevalence and incidence are lacking. Reported figures originating from different information sources are compromised by factors like economic reasons or documentation quality.Entities:
Keywords: Data warehouse; Electronic health records; Heart failure; ICD coding; Information extraction
Mesh:
Year: 2018 PMID: 29667017 PMCID: PMC6105250 DOI: 10.1007/s00392-018-1245-z
Source DB: PubMed Journal: Clin Res Cardiol ISSN: 1861-0684 Impact factor: 5.460
Automated advanced data warehouse interrogation to detect heart failure
| Subquery namea | Search terms (including synonyms and word parts in German language) | HF detection algorithms | ||||
|---|---|---|---|---|---|---|
|
|
|
|
|
| ||
| Echo-EF ≤ 45 | lvef ≤ 45 | × | × | × | ||
| Echo-EF < 50 | lvef < 50 | |||||
| ICD-Any-HF | I13.2, I13.0, I11, I50 (including more specific diagnosis) | × | × | × | × | |
| Text-Heart-Failure | herzschw* OR herzinsuff* | × | × | × | × | |
| Text-Cardiac-Decompensation | (card| kard| kardiopulmo| cardiopulmo| hydrop| herz| link)* dek* | × | × | × | ||
| Text-Systolic-Failure | (card| kard| cardiopulmo| kardiopulmo| hydrop| herz)* pumpvers* OR vorwärtsversag* OR (kard| card)* schock* | × | × | × | × | |
| Text-Dilated-Cardiomyopathy | dilat* (kardiomy| cardiomy)* OR dcm | × | ||||
| Text-NYHA | nyha | × | × | |||
| Text-Left-HF | (kard|linksherz)* insuff* | × | × | |||
| Text-Right-HF | (rechtsherz| diast)* insuff* | × | ||||
| Text-Reduced-LV-Function | (komp| reduzierte| eingeschränkte| verminderte)* (link| sys)* ventr* funkti* OR (komp| reduzierte| eingeschränkte| verminderte)* lv funkti* OR ventrik* (komp| reduzierte| eingeschränkte |verminderte)* funkti* OR sys* dysfunkti* | × | × | |||
| Text-Reduced-RV-Function | (komp| reduzierte| eingeschränkte| verminderte)* (rechts| dias)* ventr* funkti* OR (komp| reduzierte| eingeschränkte| verminderte)* rv funkti* OR dias* dysfunkti* | × | ||||
| Text-Pulmonary-Edema | lung*ödem* OR lung*stau* OR stau*lung* | × | ||||
| Text-Left-Ventricular-Hypertrophy | lv hypertr* OR link*ventr* hypertr* | × | ||||
| Text-Left-Atrial-Enlargement | (link*vorho*| link*atri*| la) (verg| dilat| hypertr)* | × | ||||
| Text-Diastolic-Dysfunction | (komp| eingeschr| vermind)* (ventr| dias)* funkti* OR dias* (dysfunkti*| relax*stör*) | × | ||||
| Lab-NT-proBNP ≥ 1000 | nt-probnp (pg/mL) ≥ 1000 | |||||
| Lab-NT-proBNP ≥ 3000 | nt-probnp (pg/mL) ≥ 3000 | |||||
MExpert indicates the initially defined query by the clinical expert and a computer scientist. MICD indicates the algorithm using sole ICD codes for comparison. AF1, APrecision, and ASensitivity indicate search algorithms optimized using permutation testing. Queries use Boolean “OR” operators, which means that each single hit justifies presence of heart failure
LV left ventricular, RV right ventricular, EF ejection fraction, HF heart failure, NYHA New York Heart Association, DCM dilated cardiomyopathy, NT-proBNP N-terminal prohormone of brain natriuretic peptide
aSources: Echo echocardiography report, ICD ICD diagnosis, Text unstructured text from discharge letter, Lab laboratory value from routine laboratory testing
Performance of automated heart failure detection algorithms versus reference standard
| Algorithm |
| Ref. HF+ | Ref. HF− | Precision (%) | Sensitivity (%) | F1 score (%) | |
|---|---|---|---|---|---|---|---|
|
| HF+ | 117 | 110 (TP) | 7 (FP) | 94 | 50 | 65 |
| (for comparison) | HF− | 925 | 112 (FN) | 813 (TN) | |||
|
| HF+ | 253 | 193 (TP) | 60 (FP) | 76 | 87 | 81 |
| (expert specified) | HF− | 789 | 29 (FN) | 760 (TN) | |||
|
| HF+ | 140 | 134 (TP) | 6 (FP) | 96 | 60 | 74 |
| (precision optimized) | HF− | 902 | 88 (FN) | 814 (TN) | |||
|
| HF+ | 286 | 204 (TP) | 82 (FP) | 71 | 92 | 80 |
| (sensitivity optimized) | HF− | 756 | 18 (FN) | 738 (TN) | |||
|
| HF+ | 209 | 186 (TP) | 23 (FP) | 89 | 84 | 86 |
| (F1 score optimized) | HF+ | 833 | 36 (FN) | 797 (TN) |
Ref reference standard defined by a heart failure specialist inspecting the documents, HF+ heart failure present, HF− heart failure absent, TP true positive, FP false positive, FN false negative, TN true negative. Precision: TP/(TP + FP), sensitivity: TP/(TP + FN), F1: 2 × (precision × sensitivity)/(precision + sensitivity). For details refer to “Methods”
Fig. 1Count of inpatients within the Department of Medicine I in the years 2000–2015. The solid line indicates all patients; each patient is counted once per year. Intermittent lines represent patients with heart failure identified using different automated heart failure detection algorithms: MExpert originates from the variable set pre-specified by the clinical expert; APrecision optimizes count of false positives; ASensitivity optimizes count of false negatives; AF1 optimizes overall accuracy (for details refer to“Methods”)
Frequencies of all patients with heart failure identified by the AF1 algorithm by age group, gender and the 4-year periods (each with unique patients per time period)
| All patients with heart failure | Period | ||||
|---|---|---|---|---|---|
| 2000–2003 | 2004–2007 | 2008–2011 | 2012–2015 | ||
| Sex, | |||||
| Male | 10,636 (59) | 1772 (57) | 2466 (58) | 2929 (60) | 4945 (60) |
| Female | 7531 (41) | 1328 (43) | 1778 (42) | 1976 (40) | 3252 (40) |
| Age category, | |||||
| ≤ 45 years | 702 (4) | 108 (3) | 173 (4) | 156 (3) | 301 (4) |
| 46–54 years | 1294 (7) | 167 (5) | 287 (7) | 317 (6) | 589 (7) |
| 55–64 years | 2766 (15) | 474 (15) | 571 (13) | 637 (13) | 1268 (15) |
| 65–74 years | 5169 (28) | 956 (31) | 1262 (30) | 1311 (27) | 2062 (25) |
| > 74 years | 9163 (50) | 1431 (46) | 2004 (47) | 2540 (52) | 4052 (50) |
Data are count (percent). Inpatients with heart failure subdivided by sex and age categories. Some patients were admitted more than once; on average, one patient contributed 1.8 patient cases
Fig. 2Detection of heart failure in inpatients using different approaches (percentage of all inpatients). The solid line indicates the prevalence detected when applying the automated algorithm AF1 (for details refer to “Methods”). Intermittent lines indicate detection using ICD codes or other information tags that dominantly contributed to the detection of heart failure. Each patient entered analysis only once per year; if patients attended the hospital multiple times, the first case of each patient per year was used
Fig. 3Detection of heart failure via related ICD codes (dark gray) and the additional detection through other search terms* (light gray), in inpatients with heart failure across the entire sampling period (years 2000–2015). The percentage of patients found via selective ICD code search increased in recent years, which might be explained by the foundation of the Comprehensive Heart Failure Center Würzburg in the year 2010, i.e., a facility devoted to the integration of research and care of patients with heart failure. *Executed via application of the automated algorithm AF1 (for details refer to “Methods”)
Frequency of comorbidities in inpatients with heart failure, detected by ICD codes and additionally detected via data sources provided by the data warehouse
| All patients with heart failurea | Heart failure detected by ICD codes | Heart failure detected on top via other sources | |
|---|---|---|---|
| Essential primary hypertension (I10) | 11,335 (62.4) | 8441 (63.2) | 2894 (60.2) |
| Chronic ischemic heart disease (I20, I25) | 9025 (49.7) | 6801 (50.9) | 2224 (46.3) |
| Heart valve disorders (I34-I39) | 3495 (19.2) | 3045 (22.8) | 450 (9.4) |
| COPD (J44) | 2874 (15.8) | 2392 (17.9) | 482 (10.0) |
| Acute myocardial infarction (I21) | 2837 (15.6) | 1990 (14.9) | 847 (17.6) |
| Anemia (D60-D64) | 1942 (10.7) | 1529 (11.4) | 413 (8.6) |
| Cardiomyopathy (I42) | 1283 (7.1) | 1041 (7.8) | 242 (5.0) |
| Depression (F32, F33) | 1031 (5.7) | 848 (6.3) | 183 (3.8) |
| Cerebral hemorrhage, infarction, stroke (I61, I63, I64) | 387 (2.1) | 313 (2.3) | 74 (1.5) |
| Sleep apnea (G47.3) | 287 (1.6) | 240 (1.8) | 47 (1.0) |
| Kidney failure (N19) | 196 (1.1) | 156 (1.2) | 40 (0.8) |
Comorbidities with ICD codes in descending order by prevalence in the total sample. Numbers are count (%)
HF heart failure, COPD chronic obstructive pulmonary disease
aHeart failure detection was done using the automated algorithm AF1 (for details refer to “Methods”)