| Literature DB >> 32633720 |
Tyler Wagner1, Fnu Shweta2, Karthik Murugadoss1, Samir Awasthi1, A J Venkatakrishnan1, Sairam Bade3, Arjun Puranik1, Martin Kang1, Brian W Pickering2, John C O'Horo2, Philippe R Bauer2, Raymund R Razonable2, Paschalis Vergidis2, Zelalem Temesgen2, Stacey Rizza2, Maryam Mahmood2, Walter R Wilson2, Douglas Challener2, Praveen Anand3, Matt Liebers1, Zainab Doctor1, Eli Silvert1, Hugo Solomon1, Akash Anand3, Rakesh Barve3, Gregory Gores2, Amy W Williams2, William G Morice2,4, John Halamka2, Andrew Badley2, Venky Soundararajan1.
Abstract
Understanding temporal dynamics of COVID-19 symptoms could provide fine-grained resolution to guide clinical decision-making. Here, we use deep neural networks over an institution-wide platform for the augmented curation of clinical notes from 77,167 patients subjected to COVID-19 PCR testing. By contrasting Electronic Health Record (EHR)-derived symptoms of COVID-19-positive (COVIDpos; n = 2,317) versus COVID-19-negative (COVIDneg; n = 74,850) patients for the week preceding the PCR testing date, we identify anosmia/dysgeusia (27.1-fold), fever/chills (2.6-fold), respiratory difficulty (2.2-fold), cough (2.2-fold), myalgia/arthralgia (2-fold), and diarrhea (1.4-fold) as significantly amplified in COVIDpos over COVIDneg patients. The combination of cough and fever/chills has 4.2-fold amplification in COVIDpos patients during the week prior to PCR testing, in addition to anosmia/dysgeusia, constitutes the earliest EHR-derived signature of COVID-19. This study introduces an Augmented Intelligence platform for the real-time synthesis of institutional biomedical knowledge. The platform holds tremendous potential for scaling up curation throughput, thus enabling EHR-powered early disease diagnosis.Entities:
Keywords: COVID-19; SARS-CoV-2; artificial intelligence; electronic health record; human; human biology; infectious disease; machine learning; medicine; microbiology; neural networks
Mesh:
Year: 2020 PMID: 32633720 PMCID: PMC7410498 DOI: 10.7554/eLife.58227
Source DB: PubMed Journal: Elife ISSN: 2050-084X Impact factor: 8.140
Augmented curation of the unstructured clinical notes from the EHR reveals specific clinically confirmed phenotypes that are amplified in COVID patients over COVID patients in the week prior to the SARS-CoV-2 PCR testing date.
The key COVID amplified symptoms in the week preceding PCR testing (i.e. day = −7 to day = −1) are highlighted in gray (p-value<1E-10). The ratio of COVID to COVID proportions represents the fold change amplification of each phenotype in the COVID patient set (symptoms are sorted based on this column).
| Symptom | COVID+ | COVID- | (COVID+/COVID-) Relative Ratio | Relative ratio | 2-tailed p-value | BH-corrected p-value |
|---|---|---|---|---|---|---|
| Altered or diminished sense of taste or smell | 145 (6.3%) | 173 (0.2%) | 27.08 | (21.81, 33.62) | <1E-300 | <1E-300 |
| Fever/chills | 750 (32.4%) | 9421 (12.6%) | 2.57 | (2.42, 2.74) | 3.57E-169 | 4.64E-168 |
| Cough | 769 (33.2%) | 11083 (14.8%) | 2.24 | (2.11, 2.38) | 4.60E-129 | 3.99E-128 |
| Respiratory difficulty | 681 (29.4%) | 10082 (13.5%) | 2.18 | (2.04, 2.33) | 3.06E-105 | 1.99E-104 |
| Myalgia/Arthralgia | 288 (12.4%) | 4620 (6.2%) | 2.01 | (1.8, 2.25) | 5.35E-34 | 2.78E-33 |
| Rhinitis | 200 (8.6%) | 2947 (3.9%) | 2.19 | (1.92, 2.52) | 2.25E-29 | 9.75E-29 |
| Headache | 325 (14.0%) | 6124 (8.2%) | 1.71 | (1.55, 1.9) | 1.34E-23 | 4.98E-23 |
| Congestion | 228 (9.8%) | 4261 (5.7%) | 1.73 | (1.53, 1.96) | 4.45E-17 | 1.45E-16 |
| GI upset | 195 (8.4%) | 10670 (14.3%) | 0.59 | (0.52, 0.68) | 1.74E-15 | 5.03E-15 |
| Wheezing | 49 (2.1%) | 3765 (5.0%) | 0.42 | (0.32, 0.56) | 1.82E-10 | 4.73E-10 |
| Dermatitis | 26 (1.1%) | 2519 (3.4%) | 0.33 | (0.23, 0.5) | 2.60E-09 | 6.15E-09 |
| Generalized symptoms | 169 (7.3%) | 8129 (10.9%) | 0.67 | (0.58, 0.78) | 4.82E-08 | 1.04E-07 |
| Respiratory Failure | 73 (3.2%) | 1363 (1.8%) | 1.73 | (1.38, 2.19) | 3.09E-06 | 6.18E-06 |
| Diarrhea | 228 (9.8%) | 5452 (7.3%) | 1.35 | (1.19, 1.53) | 3.47E-06 | 6.44E-06 |
| Pharyngitis | 160 (6.9%) | 3635 (4.9%) | 1.42 | (1.22, 1.66) | 7.05E-06 | 1.22E-05 |
| Chest pain/pressure | 148 (6.4%) | 6122 (8.2%) | 0.78 | (0.67, 0.92) | 1.88E-03 | 3.06E-03 |
| Change in appetite/intake | 95 (4.1%) | 2271 (3.0%) | 1.35 | (1.11, 1.66) | 3.37E-03 | 5.15E-03 |
| Otitis | 13 (0.6%) | 874 (1.2%) | 0.48 | (0.29, 0.85) | 6.98E-03 | 1.01E-02 |
| Cardiac | 95 (4.1%) | 2443 (3.3%) | 1.26 | (1.03, 1.54) | 2.62E-02 | 3.59E-02 |
| Fatigue | 229 (9.9%) | 8268 (11.0%) | 0.89 | (0.79, 1.02) | 7.83E-02 | 1.02E-01 |
| Conjunctivitis | 9 (0.4%) | 167 (0.2%) | 1.74 | (0.95, 3.52) | 1.00E-01 | 1.24E-01 |
| Dry mouth | 5 (0.2%) | 316 (0.4%) | 0.51 | (0.24, 1.3) | 1.28E-01 | 1.51E-01 |
| Hemoptysis | 13 (0.6%) | 283 (0.4%) | 1.48 | (0.89, 2.65) | 1.61E-01 | 1.78E-01 |
| Dysuria | 16 (0.7%) | 732 (1.0%) | 0.71 | (0.45, 1.18) | 1.64E-01 | 1.78E-01 |
| Diaphoresis | 35 (1.5%) | 979 (1.3%) | 1.15 | (0.84, 1.63) | 3.99E-01 | 4.15E-01 |
| Neuro | 150 (6.5%) | 4952 (6.6%) | 0.98 | (0.84, 1.15) | 7.86E-01 | 7.86E-01 |
Temporal analysis of the EHR clinical notes for the week preceding PCR testing (i.e. day −7 to day −1), leading up to the day of PCR testing (day 0) in COVID and COVID patients.
Temporal enrichment for each symptom is quantified using the ratio of COVID patient proportion over the COVID patient proportion for each day. The patient proportions in the rows labeled ‘Positive’ and ‘Negative’ represent the fraction of COVID (n = 2,317) and COVID (n = 74,850) patients with the specified symptom on each day. Symptoms with p-value<1E-10 are highlighted in green and 1E-10 < p value<1E-03 in gray.
| Symptom | COVID-19 (N = 77167) | Day = −7 | Day = −6 | Day = −5 | Day = −4 | Day = −3 | Day = −2 | Day = −1 |
|---|---|---|---|---|---|---|---|---|
| Altered or diminished sense of taste or smell | Positive (n = 2317) | 4.75E-03 | 3.88E-03 | 3.45E-03 | 2.59E-03 | 1.73E-03 | 0.00E+00 | 4.75E-03 |
| Negative (n = 74850) | 1.07E-04 | 4.01E-05 | 1.07E-04 | 1.07E-04 | 9.35E-05 | 2.27E-04 | 9.75E-04 | |
| Ratio (Positive/Negative) | 44.42 | 96.91 | 32.30 | 24.23 | 18.46 | 0.00 | 4.87 | |
| Cough | Positive | 2.55E-02 | 2.29E-02 | 1.90E-02 | 1.64E-02 | 1.38E-02 | 8.63E-03 | 7.94E-02 |
| Negative | 4.88E-03 | 5.30E-03 | 5.21E-03 | 5.33E-03 | 5.73E-03 | 8.40E-03 | 8.71E-02 | |
| Ratio (Positive/Negative) | 5.22 | 4.31 | 3.64 | 3.08 | 2.41 | 1.03 | 0.91 | |
| Diarrhea | Positive | 8.20E-03 | 7.77E-03 | 6.04E-03 | 4.32E-03 | 4.75E-03 | 2.59E-03 | 2.68E-02 |
| Negative | 3.70E-03 | 4.26E-03 | 4.58E-03 | 4.09E-03 | 4.58E-03 | 5.61E-03 | 3.78E-02 | |
| Ratio (Positive/Negative) | 2.22 | 1.82 | 1.32 | 1.06 | 1.04 | 0.46 | 0.71 | |
| Fever/chills | Positive | 2.42E-02 | 2.20E-02 | 1.94E-02 | 1.68E-02 | 1.34E-02 | 6.47E-03 | 7.90E-02 |
| Negative | 3.39E-03 | 3.74E-03 | 3.90E-03 | 4.42E-03 | 4.61E-03 | 6.77E-03 | 7.48E-02 | |
| Ratio (Positive/Negative) | 7.12 | 5.88 | 4.98 | 3.81 | 2.90 | 0.96 | 1.06 | |
| Respiratory Difficulty | Positive | 2.24E-02 | 2.11E-02 | 1.81E-02 | 1.55E-02 | 1.25E-02 | 8.20E-03 | 5.35E-02 |
| Negative | 5.06E-03 | 5.70E-03 | 5.81E-03 | 5.87E-03 | 6.16E-03 | 8.66E-03 | 7.65E-02 | |
| Ratio (Positive/Negative) | 4.43 | 3.71 | 3.12 | 2.65 | 2.03 | 0.95 | 0.70 | |
| Change in appetite/intake | Positive | 1.73E-03 | 1.73E-03 | 1.73E-03 | 5.18E-03 | 4.32E-03 | 5.61E-03 | 1.86E-02 |
| Negative | 1.30E-03 | 1.36E-03 | 1.34E-03 | 1.39E-03 | 1.40E-03 | 1.91E-03 | 1.35E-02 | |
| Ratio (Positive/Negative) | 1.33 | 1.27 | 1.29 | 3.73 | 3.08 | 2.94 | 1.37 | |
| Myalgia/Arthralgia | Positive | 8.20E-03 | 9.06E-03 | 7.77E-03 | 6.47E-03 | 5.61E-03 | 2.59E-03 | 3.84E-02 |
| Negative | 2.24E-03 | 3.05E-03 | 3.17E-03 | 2.99E-03 | 2.87E-03 | 3.99E-03 | 2.72E-02 | |
| Ratio (Positive/Negative) | 3.65 | 2.98 | 2.45 | 2.16 | 1.95 | 0.65 | 1.41 | |
| Congestion | Positive | 6.91E-03 | 6.47E-03 | 5.18E-03 | 3.45E-03 | 5.18E-03 | 2.16E-03 | 1.94E-02 |
| Negative | 1.95E-03 | 2.38E-03 | 1.98E-03 | 2.36E-03 | 2.18E-03 | 2.95E-03 | 2.63E-02 | |
| Ratio (Positive/Negative) | 3.54 | 2.72 | 2.62 | 1.46 | 2.38 | 0.73 | 0.74 | |
| Rhinitis | Positive | 7.77E-03 | 6.04E-03 | 4.32E-03 | 3.02E-03 | 2.16E-03 | 8.63E-04 | 1.38E-02 |
| Negative | 1.23E-03 | 1.42E-03 | 1.32E-03 | 1.36E-03 | 1.38E-03 | 2.04E-03 | 1.96E-02 | |
| Ratio (Positive/Negative) | 6.32 | 4.27 | 3.26 | 2.22 | 1.57 | 0.42 | 0.70 | |
Figure 1.Augmented curation of the unstructured clinical notes and comparison of symptoms between COVIDpos vs. COVIDneg patients.
(a) Augmented curation of the unstructured clinical notes from Electronic Health Records (EHRs). (b) COVID-19-related symptom entity recognition, sentiment analysis and grouping of synonyms. (c) Comparison of symptoms extracted from EHR clinical notes of COVIDpos vs. COVIDneg patients.
Figure 1—figure supplement 1.SciBERT Architecture and Training Configuration.
Figure 1—figure supplement 2.Examples of Sentence Classification Used in Training a SciBERT Model for Phenotype/Symptom Sentiment Analysis.