| Literature DB >> 32734154 |
David Chang1, Woo Suk Hong2, Richard Andrew Taylor2.
Abstract
OBJECTIVE: We learn contextual embeddings for emergency department (ED) chief complaints using Bidirectional Encoder Representations from Transformers (BERT), a state-of-the-art language model, to derive a compact and computationally useful representation for free-text chief complaints.Entities:
Keywords: BERT; chief complaint; emergency medicine; machine learning; natural language processing
Year: 2020 PMID: 32734154 PMCID: PMC7382638 DOI: 10.1093/jamiaopen/ooaa022
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Figure 1.Model performance for Top-1 to Top-5 accuracy. Label-frequency cutoff thresholds are represented by colors. The accuracy increases drastically when taking into account the first few predictions. Dotted line shows 90% accuracy.
Predictive performance by algorithm
| Algorithm | LSTM | ELMo | BERT | |
|---|---|---|---|---|
| Full dataset (434 labels) | Top-1 | 0.63 | 0.63 | 0.65 |
| Top-2 | 0.77 | 0.78 | 0.80 | |
| Top-3 | 0.84 | 0.85 | 0.87 | |
| Top-4 | 0.88 | 0.88 | 0.90 | |
| Top-5 | 0.90 | 0.90 | 0.92 | |
| Reduced dataset (188 labels) | Top-1 | 0.66 | 0.66 | 0.69 |
| Top-2 | 0.81 | 0.81 | 0.84 | |
| Top-3 | 0.88 | 0.88 | 0.90 | |
| Top-4 | 0.90 | 0.91 | 0.93 | |
| Top-5 | 0.93 | 0.93 | 0.94 |
Figure 2.Common types of mislabeling for select chief complaint labels. Top row shows three of the most common chief complaint labels, with their accuracies shown within parentheses. Bottom row shows three chief complaint labels with lowest accuracies. X-axis shows the top five most common misclassifications in decreasing order. Y-axis shows frequency of error. Note that even for low performing chief complaint labels, a high percentage of errors are due to semantic overlap.
Examples of chief complaints and their corresponding top-5 predictions
| Chief complaint | Top-5 predictions | |
|---|---|---|
| Correctly classified at second prediction | “right third finger injured in door” | FINGER INJURY, |
| “pt comes to er with cc piece of plastic stuck to back of left ear from earing” | FOREIGN BODY IN EAR, | |
| “vomiting for days, increasing yesterday. pos home preg test on Saturday” | EMESIS, | |
| “both eyes swollen & itchy & tearing after his nap” | EYE SWELLING, | |
| “fall at 0300 today, rt side weakness” | FALL, | |
| Correctly classified at fifth prediction | “Felt like heart was pounding history of CABG. missed metoprolol for about 3 days.” | PALPITATIONS, RAPID HEART RATE, TACHYCARDIA, IRREGULAR HEART BEAT, |
| “2 weeks of sore throat, aches, dry cough. Denies intervention.” | SORE THROAT, COLD LIKE SYMPTOMS, URI, COUGH, | |
| “fall down 5 stairs lace to right eyebrow” | FALL, FACIAL LACERATION, LACERATION, FALL>65, | |
| “fever to 101, diarrhea, vomiting” | FEVER-9 WEEKS TO 74 YEARS, FEVER, EMESIS, ABDOMINAL PAIN, | |
| “blister on back of foot.” | BLISTER, FOOT PAIN, FOOT INJURY, FOOT SWELLING, |
Figure 3.t-SNE visualization of averaged embeddings of common chief complaints. Embeddings for common chief complaints were grouped by their ground truth label, then their arithmetic mean visualized using t-SNE. The embeddings are distributed in a clinically meaningful way, with related concepts embedded close to each other and broader types of chief complaints clustered together. Note that t-SNE is a stochastic algorithm and, while it preserves local structure of the data, does not completely preserve its global structure. The text labels have been jittered to enhance readability. Colored groupings represent clusters as determined by gaussian mixture modeling.