| Literature DB >> 33781918 |
Kevin Lybarger1, Mari Ostendorf2, Matthew Thompson3, Meliha Yetisgen4.
Abstract
Coronavirus disease 2019 (COVID-19) is a global pandemic. Although much has been learned about the novel coronavirus since its emergence, there are many open questions related to tracking its spread, describing symptomology, predicting the severity of infection, and forecasting healthcare utilization. Free-text clinical notes contain critical information for resolving these questions. Data-driven, automatic information extraction models are needed to use this text-encoded information in large-scale studies. This work presents a new clinical corpus, referred to as the COVID-19 Annotated Clinical Text (CACT) Corpus, which comprises 1,472 notes with detailed annotations characterizing COVID-19 diagnoses, testing, and clinical presentation. We introduce a span-based event extraction model that jointly extracts all annotated phenomena, achieving high performance in identifying COVID-19 and symptom events with associated assertion values (0.83-0.97 F1 for events and 0.73-0.79 F1 for assertions). Our span-based event extraction model outperforms an extractor built on MetaMapLite for the identification of symptoms with assertion values. In a secondary use application, we predicted COVID-19 test results using structured patient data (e.g. vital signs and laboratory results) and automatically extracted symptom information, to explore the clinical presentation of COVID-19. Automatically extracted symptoms improve COVID-19 prediction performance, beyond structured data alone.Entities:
Keywords: COVID-19; Coronavirus; Information extraction; Machine learning; Natural language processing
Mesh:
Year: 2021 PMID: 33781918 PMCID: PMC7997694 DOI: 10.1016/j.jbi.2021.103761
Source DB: PubMed Journal: J Biomed Inform ISSN: 1532-0464 Impact factor: 8.000
Demographic, vital signs, and laboratory fields that are predictive of COVID-19 infection in current literature.
| age | |
| alanine aminotransferase (ALT) | |
| albumin | |
| alkaline phosphatase (ALP) | |
| aspartate aminotransferase (AST) | |
| basophils | |
| calcium | |
| C-reactive protein (CRP) | |
| D-dimer | |
| eosinophils | |
| gamma-glutamyl transferase (GGT) | |
| gender | |
| heart rate | |
| lactate dehydrogenase (LDH) | |
| lymphocytes | |
| monocytes | |
| neutrophils | |
| oxygen saturation | |
| platelets | |
| prothrombin time (PT) | |
| respiratory rate | |
| temperature | |
| troponin | |
| white blood cell (WBC) count |
Annotation guideline summary. ∗ indicates the argument is required. † indicates at least one of the arguments, Test Status or Assertion, is required.
| COVID | Trigger∗ | – | “COVID,” “COVID-19” |
| Test Status† | {positive, negative, pending, conditional, not ordered, not patient, indeterminate} | “tested positive” | |
| Assertion† | {present, absent, possible, hypothetical, not patient} | “positive,” “low suspicion” | |
| Symptom | Trigger∗ | – | “cough,” “shortness of breath” |
| Assertion∗ | {present, absent, possible, conditional, hypothetical, not patient} | “admits,” “denies” | |
| Change | {no change, worsened, improved, resolved} | “improved,” “continues” | |
| Severity | {mild, moderate, severe} | “mild,” “required ventilation” | |
| Anatomy | – | “chest wall,” “lower back” | |
| Characteristics | – | “wet productive,” “diffuse” | |
| Duration | – | “for two days,” “1 week” | |
| Frequency | – | “occasional,” “chronic” | |
Fig. 1BRAT annotation examples for COVID and Symptom (SSx) event types.
Fig. 2Annotation examples describing event extraction as a slot filling task.
Fig. 3COVID annotation summary.
Fig. 4Most frequent symptoms in the training set broken down by Assertion subtype.
Expert-derived mapping of symptoms to canonical forms.
| altered mental status | ams, confused, confusion |
| anxiety | agitated, agitation, anxious |
| arthralgia | arthralgias |
| bleeding | bleed, blood, bloody |
| bruising | bruise, bruises, ecchymosis |
| chest pain | cp |
| chills | chill |
| cough | c, c., cough cough, coughing, coughs, distress coughing, distressed coughing |
| cramping | cramps |
| decreased appetite | loss of appetite, poor appetite, poor p.o. intake, poor po intake, reduced appetite |
| deformities | deformity |
| dehydration | dehydrated |
| diarrhea | d, d., diarrhea stools, loose stools |
| disharge | drainage |
| distended | distention |
| dysphagia | difficulty swallowing, dysphagia symptoms |
| erythema | erythematous, redness |
| exudates | exudate |
| fall | falls |
| fatigue | drowsiness, drowsy, fatigued, somnolence, somnolent, tired, tiredness |
| fever | f, f., febrile, fevers |
| flu-like symptoms | flu - like symptoms, influenza - like symptoms |
| gi symptoms | abdominal symptoms |
| headache | ha, headaches |
| heartburn | gerd symptoms, heartburn symptoms |
| hematochezia | brbpr |
| ill | ill - appearing, ill appearing, ill symptoms, illness, sick |
| incontinent | incontinence |
| irritation | irritable |
| itching | itchy |
| lethargy | lethargic |
| lightheadedness | dizziness, dizzy, headedness, lightheaded |
| myalgia | ache, aches, aching, bodyaches, myalgias |
| nausea | n, n., nauseated, nauseous |
| pain | discomfort, painful, pains |
| pruritus | pruritis |
| rash | rashes |
| respiratory symptoms | uri symptoms |
| runny nose | rhinorrhea |
| seizures | seizure, seizures |
| shortness of breath | ___shortness of breath, difficult breathing, difficulty breathing, difficulty of breathing, distress breathing, distressed breathing, doe, dsypnea, dypsnea, dyspnea, dyspnea exertion, dyspnea on exertion, increase work of breathing, increased work of breathing, out of breath, respiratory distress, short of breath, shortneses of breath, shortness breath, shortness of breaths, sob, sob on exertion, trouble breathing, work of breathing |
| sore throat | pharyngitis |
| soreness | sore |
| sputum | sputum production |
| sweats | diaphoresis, nightsweats, sweating |
| swelling | edema, oedema, swollen |
| syncope | fainting |
| tenderness | tender |
| tremors | tremor |
| ulcers | ulcer, ulceration, ulcerations |
| urinary symptoms | urinary |
| urination | urinating |
| vomiting | emesis, v, v., vomitting |
| weakness | weak |
| wheezing | wheeze, wheezes |
| wounds | wound |
Fig. 5Annotator agreement.
Fig. 6Span-based Event Extractor.
Hyperparameters for the Span-based Event Extractor.
| Maximum sentence length, | 30 |
| Maximum span length, | 6 |
| Top- | sentence token count |
| Batch size | 100 |
| Number of epochs | 100 |
| Learning rate | 0.001 |
| Optimizer | Adam |
| Maximum gradient L2-norm | 100 |
| BERT embedding dropout | 0.3 |
| bi-LSTM hidden size, | 200 |
| bi-LSTM activation function | tanh |
| bi-LSTM dropout | 0.3 |
| Span classifier projection size, | 100 |
| Span classifier activation function | ReLU |
| Span classifier dropout | 0.3 |
| Role classifier projection size, | 100 |
| Role classifier activation function | ReLU |
| Role classifier dropout | 0.3 |
Extraction performance.
| COVID | Trigger | 3,931 | 0.95 | 0.97 | 0.96 | 1,497 | 0.96 | 0.97 | 0.97 |
| Assertion | 2,936 | 0.70 | 0.74 | 0.72 | 1,075 | 0.72 | 0.74 | 0.73 | |
| Test Status | 1,068 | 0.60 | 0.62 | 0.61 | 457 | 0.63 | 0.60 | 0.62 | |
| Symptom | Trigger | 13,823 | 0.82 | 0.85 | 0.83 | 5,789 | 0.81 | 0.85 | 0.83 |
| Assertion | 13,833 | 0.77 | 0.79 | 0.78 | 5,791 | 0.77 | 0.80 | 0.79 | |
| Change | 739 | 0.45 | 0.03 | 0.06 | 341 | 0.45 | 0.05 | 0.09 | |
| Severity | 743 | 0.47 | 0.30 | 0.37 | 327 | 0.45 | 0.31 | 0.37 | |
| Anatomy | 3,839 | 0.76 | 0.59 | 0.66 | 1,959 | 0.78 | 0.50 | 0.61 | |
| Characteristics | 3,145 | 0.59 | 0.26 | 0.36 | 1,441 | 0.66 | 0.25 | 0.36 | |
| Duration | 3,744 | 0.62 | 0.44 | 0.51 | 1,344 | 0.54 | 0.56 | 0.55 | |
| Frequency | 801 | 0.64 | 0.39 | 0.48 | 250 | 0.60 | 0.51 | 0.55 | |
MetaMapLite++ extraction performance for Symptom trigger and Assertion.
| Exact trigger match | Trigger | 0.53 | 0.54 | 0.54 |
| Assertion | 0.43 | 0.44 | 0.44 | |
| Any triggers overlap | Trigger | 0.66 | 0.67 | 0.66 |
| Assertion | 0.54 | 0.55 | 0.54 | |
Symptom Assertion comparison for events with equivalent triggers (exact span match).
| MetaMap++ | 3,152 | 0.81 | 0.81 | 0.81 |
| Span-based Event Extractor | 4,952 | 0.95 | 0.94 | 0.94 |
Structured fields from UW EHR used to predict COVID-19 infection. indicates the function used to aggregate multiple measurements/values. Fields that measure the same phenomena and were treated as a single feature, resulting in 29 distinct structured EHR fields: {“Temperature - C,” “Temperature (C)”}, {“HR,” “Heart Rate”}, and {“O2 Saturation (%),” “Oxygen Saturation”}. All fields numerical (e.g. “Temperature (C)” =38.1), except “Troponin I Interpretation” and ”Gender”.
| age | “AgeIn2020” | max |
| ALT | “ALT (GPT)” | max |
| albumin | “Albumin” | min |
| ALP | “Alkaline Phosphatase (Total)” | max |
| AST | “AST (GOT)” | max |
| basophils | “Basophils” and “% Basophils” | min |
| calcium | “Calcium” | min |
| CRP | “CRP, high sensitivity” | max |
| D-dimer | “D_Dimer Quant” | max |
| eosinophils | “Eosinophils” and “% Eosinophils” | min |
| GGT | “Gamma Glutamyl Transferase” | max |
| gender | “Gender” | last |
| heart rate | “Heart Rate” and “HR” | max |
| LDH | “Lactate Dehydrogenase” | max |
| lymphocytes | “Lymphocytes” and “% Lymphocytes” | min |
| monocyptes | “Monocytes” | max |
| neutrophils | “Neutrophils” and “% Neutrophils” | max |
| oxygen saturation | “Oxygen Saturation” and “O2 Saturation (%)” | min |
| platelets | “Platelet Count” | min |
| PT | “Prothrombin Time Patient” and “Prothrombin INR” | max |
| respiratory rate | “Respiratory Rate” | max |
| temperature | “Temperature - C” and “Temperature (C)” | max |
| troponin | “Troponin_I” and “Troponin_I Interpretation” | max |
| WBC count | “WBC” | min |
COVID-19 prediction hyperparameters for Random Forest Models.
| ED | structured | 200 | 4 | 2 | 1 | 10 |
| ED | notes | 200 | 4 | 4 | 1 | 6 |
| ED | all | 200 | 6 | 3 | 1 | 8 |
| Progress | structured | 200 | 18 | 6 | 1 | 6 |
| Progress | notes | 200 | 10 | 4 | 1 | 4 |
| Progress | all | 200 | 8 | 4 | 1 | 6 |
| Telephone | structured | 200 | 10 | 2 | 1 | 2 |
| Telephone | notes | 200 | 6 | 2 | 1 | 4 |
| Telephone | all | 200 | 10 | 8 | 1 | 4 |
Fig. 7Receiver operating characteristic by note type and feature set combination for repeated hold-out iterations. The solid line indicates the average ROC, and the shaded region around the solid line indicates one standard deviation.
COVID-19 prediction false positive rate at a true positive rate of 80%.
| ED | structured | 0.48 |
| symptoms | 0.63 | |
| all | 0.41 | |
| Progress | structured | 0.64 |
| symptoms | 0.64 | |
| all | 0.58 | |
| Telephone | structured | 0.66 |
| symptoms | 0.71 | |
| all | 0.64 | |
Fig. 8SHAP plot for a single Random Forest model from the ED note experimentation with the all feature set, explaining the importance of features in making predictions for the withheld test set. * indicates the feature is an automatically extracted symptom.
Fig. 9Distribution of averaged SHAP values by note type with the all feature set. The vertical lines in each violin indicate the quartiles. * indicates the feature is an automatically extracted symptom.