| Literature DB >> 26306238 |
Travis Goodwin1, Sanda M Harabagiu1.
Abstract
In this paper, we present a probabilistic reasoning method capable of generating predictions of the progression of clinical findings (CFs) reported in the narrative portion of electronic medical records. This method benefits from a probabilistic knowledge representation made possible by a graphical model. The knowledge encoded in the graphical model considers not only the CFs extracted from the clinical narratives, but also their chronological ordering (CO) made possible by a temporal inference technique described in this paper. Our experiments indicate that the predictions about the progression of CFs achieve high performance given the COs induced from patient records.Entities:
Year: 2015 PMID: 26306238 PMCID: PMC4525214
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Clinical findings related to heart disease, based on risk factors annotated in the i2b2/UTHealth 2014 dataset.
| Clinical Finding | Criteria | Example | |
|---|---|---|---|
| CF1 = DIABETES (DBS) | (1) | diagnosis of type 1 or 2 diabetes | |
| (2) | A1c test over 6.5 | ||
| (3) | two fasting blood glucose measures over 126 | ||
|
| |||
| CF2 = CAD | (1) | diagnosis of coronary artery disease (CAD) | |
| (2) | myocardial infarction (MI, STEMI, NSTEMI) | ||
| (3) | revascularization, cardiac arrest or ischemic cardiomyopathy | ||
| (4) | stress test showing ischemia | ||
| (5) | abnormal cardiac catherization showing coronary stenoses | ||
| (6) | chest pain consistent with angina | ||
|
| |||
| CF3 = HYPERLIPIDEMIA (HLA) | (1) | diagnosis of Hyperlipidemia or Hypercholesterolemia | |
| (2) | total cholesterol measure of over 240 | ||
| (3) | LDL measurement of over 100 mg/dL | ||
|
| |||
| CF4 = HYPERTENSION (HTN) | (1) | diagnosis of Hypertension | |
| (2) | blood pressure measurement of over 140/90 mm/hg | ||
|
| |||
| CF5 = OBESITY (OBY) | (1) | a description of the patient as being obese | |
| (2) | a body mass index (BMI) over 30 | ||
| (3) | a waist circumference > 40 in. for males or 35 in. for females | ||
Temporal signals associated with risk factors in the i2b2/UTHealth 2014 dataset.
| Temporal Signal | Definition | Example |
|---|---|---|
| DURING | finding was present at the time this PR was created | |
| BEFORE | finding was present before the creation of this PR | |
| AFTER | finding is present after the creation of this PR |
Figure 1:Chronological ordering (CO) of clinical findings (CFs) for a patient.
Figure 2:A probabilistic graphical model encoding the likelihood of any possible progression of clinical findings.
Performance results over all CFs for COs of length j, where Acc = Accuracy, PPV = positive predictive value (Precision), FNR = false negative rate, FPR = false positive rate, TNR = true negative rate (Specificity), TPR = true positive rate (Recall), F1 = F1-measure defined as , TP = true positives, FP = false positives, FN = false negatives, TN = true negatives.
| j | Acc | PPV | FNR | FPR | TNR | TPR | F1 | TP | FP | FN | TN |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 84.94 | 94.22 | 22.84 | 05.69 | 94.31 | 77.16 | 84.84 | 375 | 23 | 111 | 381 |
| 2 | 81.91 | 86.63 | 17.80 | 18.51 | 81.49 | 82.20 | 84.35 | 434 | 67 | 94 | 295 |
| 3 | 86.18 | 91.13 | 13.20 | 14.91 | 85.09 | 86.80 | 88.91 | 493 | 48 | 75 | 274 |
| 4 | 86.71 | 85.21 | 06.27 | 23.38 | 76.62 | 93.73 | 89.26 | 478 | 83 | 32 | 272 |
| 5 | 88.92 | 85.29 | 02.52 | 22.60 | 77.40 | 97.48 | 90.98 | 232 | 40 | 6 | 137 |
Figure 3:Experimental results for the prediction of the progression of CFs for chronological orderings of lengths 1 ≤ j ≤ 5, where A denotes the Accuracy, P denotes the Precision, R denotes the Recall, and F1 denotes the F1-measure.