| Literature DB >> 33200151 |
Juan Zhao1, Monika E Grabowska2, Vern Eric Kerchberger3,1, Joshua C Smith1, H Nur Eken4, QiPing Feng5, Josh F Peterson1, S Trent Rosenbloom1, Kevin B Johnson1, Wei-Qi Wei1.
Abstract
OBJECTIVE: Identifying symptoms highly specific to COVID-19 would improve the clinical and public health response to infectious outbreaks. Here, we describe a high-throughput approach - Concept-Wide Association Study (ConceptWAS) that systematically scans a disease's clinical manifestations from clinical notes. We used this method to identify symptoms specific to COVID-19 early in the course of the pandemic.Entities:
Year: 2020 PMID: 33200151 PMCID: PMC7668764 DOI: 10.1101/2020.11.06.20227165
Source DB: PubMed Journal: medRxiv
Patient characteristics of the study cohort
| Attribute | Cases: COVID-19-positive (n=1,483) | Controls: COVID-19-negative (n=18,209) | P- value |
|---|---|---|---|
| Age (mean years +/− stddev) | 41.5 (16.2) | 44.9 (16.9) | <0.0001 |
| Gender (% Male) | 48.0% | 41.7% | <0.0001 |
| Race (% White) | 49.6% | 66.7% | <0.0001 |
| Average EHR length (years, +/− stddev) | 7.3 (8.1) | 9.2 (8.5) | <0.0001 |
| Average CUIs (+/− stddev) | 46.1 (61.1) | 71.9 (96.3) | <0.0001 |
2-proportion z hypothesis test was performed. For age, EHR length, and average CUIs, a t- test was performed for comparing the median and standard deviations.
Figure 1.Volcano plot of a ConceptWAS scan for 19, 692 patients that included COVID-19-positive group (cases) and negative group (controls). The points are colored by the semantic type of the concepts. Selected associations related to signs, symptoms, or diseases/syndromes are labeled. The volcano plot indicates -log 10 (p-value) for association (y-axis) plotted against their respective log 2 (fold change) (x-axis). The dashed line represents significance level using a Bonferroni correction.
Figure 2.Forest plot comparing individual concepts between COVID-19-positive (case) and COVID-19-negative (control) patients. Selected associations include the significant signals related to semantic types of symptoms that met Bonferroni-corrected significance (p-value < 2.55E-06). The odds ratio has been adjusted for age, gender, and race. The concepts are ordered by p-value.
Figure 3.Temporal ConceptWAS using bi-weekly cumulative data. For significant signals (related to signs, symptoms) using all data (labeled in Figure 2), the plot indicates their −log 10 (p-value) for association (y-axis) against using the cumulative data started between March 8, 2020 to n weeks (x-axis). The dashed line indicates a significant association using a Bonferroni correction.
Results of chart reviews.
| Concepts | Reviewed samples | True signals | True signals percentage % | Examples of false positive |
|---|---|---|---|---|
| Absent sense of smell | 20 | 19 | 95.00% | “(−) altered/loss of smell”, were wrongly recognized as an affirmative/ positive attribute. |
| Ageustia | 20 | 19 | 95.00% | “Symptoms, n/v, fever, cough, loss of taste or smell or around anyone + for Covid 19.” |
| Mental Depression | 20 | 18 | 90.00% | One was recognized from a medical history title without any answers; the other came from a recommendation for further Psychosocial assessment. |
| Current some day smoker | 20 | 20 | 100.00% | |
| Smoking monitoring status | 20 | 19 | 95.00% | One is uncertain. “Smoking Status Not on file”. |
| Fever | 20 | 17 | 85.00% | Template issue. “The following ROS were reviewed and are negative, unless otherwise stated as +positive: |
| Pericardial Fluid (neg) | 20 | 20 | 100.00% | |
| Hydrocephalus [neg] | 20 | 20 | 100.00% | |
| Hydronephroses | 20 | 20 | 100.00% | |
| Blood group AB Rh(D) negative (finding) | 20 | 0 | 0.00% | From blood typing tests. This signal was not specific to blood type AB+, but generated by other ABO blood types and Rh-positive patients. |
| Allergy test positive (finding) | 20 | 5 | 25.00% | The false positives were wrongly mapped from a sentence like “He /She has been exposed to covid, family member or friends have tested positive.” |
| Laurin-Sandrow syndrome | 20 | 20 | 100.00% | |
| Cough nonproductive | 20 | 20 | 100.00% | |
| In total | 260 | 217 | 83.46% |