| Literature DB >> 30862607 |
Tao Chen1, Mark Dredze2, Jonathan P Weiner3, Leilani Hernandez4, Joe Kimura4, Hadi Kharrazi3,5.
Abstract
BACKGROUND: Geriatric syndromes in older adults are associated with adverse outcomes. However, despite being reported in clinical notes, these syndromes are often poorly captured by diagnostic codes in the structured fields of electronic health records (EHRs) or administrative records.Entities:
Keywords: clinical notes; conditional random fields; geriatrics; information extraction; natural language processing
Year: 2019 PMID: 30862607 PMCID: PMC6454337 DOI: 10.2196/13039
Source DB: PubMed Journal: JMIR Med Inform
Example sentences from clinical notes that contain a construct: annotated construct phrases are italicized.
| Construct | Example sentence from clinical notes (verbatim) |
| Absence of fecal control | She has also been experiencing urinary incontinence and a few |
| Dementia | Patient |
| Falls | She suffered |
| Weight loss | Sed rate had been mildly elevated except the last one over 70 but in setting of acute illness and |
| Malnutrition | |
| Pressure ulcers | She has 2 |
| Lack of social support | She |
| Severe urinary control issues | She |
| Visual impairment | Has been seen by vision rehab and |
| Walking difficulty |
Statistics related to the 10 constructs.
| Construct | Number of ICD9a codes that indicate a construct (n) | Average number of tokens per construct | Average number of mentions per patient in the test set | Perplexity on test setb |
| Absence of fecal control | 2 | 2.98 | 2.67 | 11.30 |
| Dementia | 58 | 2.76 | 13.00 | 26.28 |
| Falls | 45 | 3.37 | 9.04 | 57.68 |
| Weight loss | 15 | 3.01 | 13.53 | 33.80 |
| Malnutrition | 26 | 2.04 | 13.92 | 100.64 |
| Pressure ulcers | 35 | 3.48 | 5.67 | 66.90 |
| Lack of social support | 14 | 4.03 | 15.23 | 29.96 |
| Severe urinary control issues | 14 | 2.94 | 13.71 | 117.48 |
| Visual impairment | 55 | 3.62 | 9.31 | 57.68 |
| Walking difficulty | 31 | 3.43 | 12.59 | 84.27 |
aICD9: International Classification of Diseases 9.
bPerplexity is computed on the test set based on the construct-specific language model trained on the training set: detailed in the Error Analysis section.
The construct and nonconstruct distribution among three datasets based on manual annotation.
| Construct | Training set (3901 notes) | Validation set (1739 notes) | Test set (2802 notes) | |||
| Tokena (N=1,083,670), n (%) | Patientb (N=85), n (%) | Token (N=435,851), n (%) | Patient (N=50), n (%) | Token (N=638,694), n (%) | Patient (N=50), n (%) | |
| Absence of fecal control | 126 (0.01) | 12 (14) | 126 (0.03) | 4 (8) | 34 (0.01) | 3 (6) |
| Dementia | 631 (0.06) | 15 (18) | 276 (0.06) | 9 (18) | 403 (0.06) | 10 (20) |
| Falls | 1419 (0.13) | 37 (44) | 293 (0.07) | 21 (42) | 748 (0.12) | 23 (46) |
| Weight loss | 365 (0.03) | 21 (25) | 263 (0.06) | 14 (28) | 752 (0.12) | 19 (38) |
| Malnutrition | 115 (0.01) | 8 (9) | 82 (0.02) | 5 (10) | 312 (0.05) | 12 (24) |
| Pressure ulcers | 308 (0.03) | 9 (11) | 18 (0.00) | 4 (8) | 126 (0.02) | 6 (12) |
| Lack of social support | 2026 (0.19) | 53 (62) | 1410 (0.32) | 30 (60) | 1691 (0.26) | 30 (60) |
| Severe urinary control issues | 694 (0.06) | 16 (19) | 81 (0.02) | 4 (8) | 323 (0.05) | 7 (14) |
| Visual impairment | 324 (0.03) | 16 (19) | 141 (0.03) | 6 (12) | 395 (0.06) | 13 (26) |
| Walking difficulty | 2253 (0.21) | 56 (66) | 1315 (0.30) | 26 (52) | 1423 (0.22) | 34 (68) |
| Nonconstruct | 1,075,409 (99.24) | 85 (100) | 431,846 (99.08) | 50 (100) | 632,487 (99.03) | 50 (100) |
aDenotes the number of tokens in the dataset that were labeled as certain constructs.
bDenotes the number of patients in the dataset who were identified containing certain constructs.
Phrase-partial evaluation on the validation set.
| Feature set and features | Macroaverage | Microaverage | |||||||
| Precision | Recall | F1 | Precision | Recall | F1 | ||||
| Basic | N/Ab | 0.828 | 0.450 | 0.583 | 0.930 | 0.597 | 0.727 | ||
| Bc+Is-ICD9d-Code | <.001 | 0.874 | 0.472 | 0.613 | 0.887 | 0.640 | 0.744 | ||
| B+Is-Medical-Unit | <.001 | 0.828 | 0.402 | 0.541 | 0.959 | 0.538 | 0.689 | ||
| B+Entity-Attributes | <.001 | 0.823 | 0.398 | 0.537 | 0.948 | 0.528 | 0.678 | ||
| B+Stem | .03 | 0.856 | 0.572 | 0.686 | 0.864 | 0.678 | 0.760 | ||
| B+Section | <.001 | 0.783 | 0.544 | 0.642 | 0.874 | 0.682 | 0.766 | ||
| B+ICD9-Annotation | <.001 | 0.888 | 0.462 | 0.608 | 0.928 | 0.598 | 0.727 | ||
| B+ICD9-Annotation-Post | <.001 | 0.823 | 0.478 | 0.605 | 0.912 | 0.604 | 0.727 | ||
| B+all Enhanced (Ce+Uf+Eg+Sh)+all Context (Ti+Aj+APk) | <.001 | 0.793 | 0.633 | 0.704 | 0.757 | 0.714 | 0.735 | ||
| B+Enhanced (C+E+S)+all Context (T+A+AP) | <.001 | 0.837 | 0.483 | 0.613 | 0.895 | 0.546 | 0.678 | ||
| B+Enhanced (C+E+S)+Context (A+AP) | <.001 | 0.874 | 0.529 | 0.659 | 0.906 | 0.630 | 0.743 | ||
| B+Enhanced (C+S)+Context (A+AP) | <.001 | 0.799 | 0.509 | 0.622 | 0.896 | 0.616 | 0.730 | ||
| Only uses annotated ICD9 codes as a rule to identify constructs | <.001 | 0.803 | 0.139 | 0.236 | 0.885 | 0.059 | 0.111 | ||
aWe conducted McNemar's test to measure the difference between the results of using basic features and other features.
bN/A: not applicable.
cB: basic.
dICD9: International Classification of Diseases 9.
eC: Is-ICD9-Code.
fU: Is-Medical-Unit.
gE: Entity-Attributes.
hS: stem.
iT: section.
jA: ICD9-Annotation.
kAP: ICD9-Annotation-Post.
lThe best-performing model is italicized.
mCRF: conditional random field.
The evaluation results of the best-performing model on the test set.
| Construct | Phrase-exact | Phrase-partial | Note | Patient | |||||||||
| Precision | Recall | F1 | Precision | Recall | F1 | Precision | Recall | F1 | Precision | Recall | F1 | ||
| Absence of fecal control | 0.833 | 0.625 | 0.714 | 1 | 0.750 | 0.857 | 1 | 0.714 | 0.833 | 1 | 0.667 | 0.800 | |
| Dementia | 0.324 | 0.350 | 0.337 | 0.703 | 0.759 | 0.730 | 0.604 | 0.873 | 0.714 | 0.625 | 1 | 0.769 | |
| Falls | 0.387 | 0.279 | 0.324 | 0.942 | 0.651 | 0.770 | 0.926 | 0.719 | 0.809 | 0.864 | 0.826 | 0.844 | |
| Weight loss | 0.571 | 0.215 | 0.312 | 0.714 | 0.272 | 0.394 | 0.866 | 0.586 | 0.699 | 0.857 | 0.632 | 0.727 | |
| Malnutrition | 0.577 | 0.090 | 0.155 | 0.577 | 0.090 | 0.155 | 0.680 | 0.288 | 0.405 | 0.700 | 0.583 | 0.636 | |
| Pressure ulcers | 0.304 | 0.200 | 0.241 | 0.957 | 0.629 | 0.759 | 0.929 | 0.722 | 0.813 | 1 | 0.667 | 0.800 | |
| Lack of social support | 0.551 | 0.541 | 0.546 | 0.707 | 0.706 | 0.706 | 0.923 | 0.845 | 0.882 | 0.935 | 0.967 | 0.951 | |
| Severe urinary control issues | 0.207 | 0.124 | 0.155 | 0.690 | 0.433 | 0.532 | 0.682 | 0.556 | 0.612 | 0.857 | 0.857 | 0.857 | |
| Visual impairment | 0.687 | 0.456 | 0.548 | 1 | 0.664 | 0.798 | 1 | 0.765 | 0.867 | 1 | 0.846 | 0.917 | |
| Walking difficulty | 0.517 | 0.394 | 0.447 | 0.842 | 0.689 | 0.758 | 0.894 | 0.781 | 0.834 | 0.912 | 0.912 | 0.912 | |
| Macroaverage | 0.496 | 0.327 | 0.394 | 0.813 | 0.564 | 0.666 | 0.850 | 0.685 | 0.759 | 0.875 | 0.796 | 0.834 | |
| Microaverage | 0.493 | 0.351 | 0.410 | 0.785 | 0.571 | 0.661 | 0.806 | 0.726 | 0.787 | 0.868 | 0.834 | 0.851 | |
Figure 1The F1 scores of the best-performing model on the test set. BC: absence of fecal control; DE: dementia; FL: fall;, WL: weight loss; ML: malnutrition; PU: pressure ulcers; SS: lack of social support; UC: severe urinary control issues; VI: visual impairment; WD: walking difficulty; Macro: macroaverage; Micro: microaverage.