| Literature DB >> 35457497 |
George D Vavougios1,2,3, Vasileios T Stavrou2, Christoforos Konstantatos4, Pavlos-Christoforos Sinigalias5, Sotirios G Zarogiannis3,6, Konstantinos Kolomvatsos7, George Stamoulis7, Konstantinos I Gourgoulianis2,3.
Abstract
The aim of our study was to determine COVID-19 syndromic phenotypes in a data-driven manner using the survey results based on survey results from Carnegie Mellon University's Delphi Group. Monthly survey results (>1 million responders per month; 320,326 responders with a certain COVID-19 test status and disease duration <30 days were included in this study) were used sequentially in identifying and validating COVID-19 syndromic phenotypes. Logistic Regression-weighted multiple correspondence analysis (LRW-MCA) was used as a preprocessing procedure, in order to weigh and transform symptoms recorded by the survey to eigenspace coordinates, capturing a total variance of >75%. These scores, along with symptom duration, were subsequently used by the Two Step Clustering algorithm to produce symptom clusters. Post-hoc logistic regression models adjusting for age, gender, and comorbidities and confirmatory linear principal components analyses were used to further explore the data. Model creation, based on August's 66,165 included responders, was subsequently validated in data from March-December 2020. Five validated COVID-19 syndromes were identified in August: 1. Afebrile (0%), Non-Coughing (0%), Oligosymptomatic (ANCOS); 2. Febrile (100%) Multisymptomatic (FMS); 3. Afebrile (0%) Coughing (100%) Oligosymptomatic (ACOS); 4. Oligosymptomatic with additional self-described symptoms (100%; OSDS); 5. Olfaction/Gustatory Impairment Predominant (100%; OGIP). Our findings indicate that the COVID-19 spectrum may be undetectable when applying current disease definitions focusing on respiratory symptoms alone.Entities:
Keywords: COVID-19; big data; comorbidity; epidemiology; pattern recognition; phenotypes
Mesh:
Year: 2022 PMID: 35457497 PMCID: PMC9029400 DOI: 10.3390/ijerph19084630
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 4.614
Figure 1Study Workflow. Phenotype legends: 1. Afebrile (0%), Non-Coughing (0%), Oligosymptomatic (ANCOS); 2. Febrile (100%) Multisymptomatic (FMS); 3. Afebrile (0%) Coughing (100%) Oligosymptomatic (ACOS); 4. Oligosymptomatic with additional self-described symptoms (100%; OSDS); 5. Olfaction/Gustatory Impairment Predominant (100%; OGIP).
Demographics per month of the study population.
| April | May | June | July | August | September | October | November | December | ||
|---|---|---|---|---|---|---|---|---|---|---|
| COVID-19 | 4000 | 4955 | 6573 | 13,370 | 10,279 | 1773 | 5936 | 10,026 | 21,617 | |
| Age Group | 18–24 | 1783 | 2973 | 4202 | 7047 | 5965 | 1194 | 1211 | 1606 | 3312 |
| 25–34 | 4856 | 7423 | 9837 | 15,559 | 12,158 | 2204 | 3353 | 4805 | 9002 | |
| 35–44 | 4794 | 7249 | 8919 | 14,450 | 11,502 | 2116 | 3886 | 4856 | 9529 | |
| 45–54 | 4281 | 7030 | 8886 | 13,660 | 11,146 | 2119 | 3519 | 4095 | 8809 | |
| 55–64 | 3220 | 6235 | 8661 | 12,385 | 10,775 | 2173 | 3320 | 3227 | 7440 | |
| 65–74 | 1447 | 3597 | 5655 | 7791 | 7218 | 1483 | 1808 | 1539 | 3907 | |
| >75 | 312 | 907 | 1632 | 2317 | 2155 | 531 | 573 | 427 | 1181 | |
| NA | 1627 | 2629 | 3790 | 5742 | 5246 | 981 | 1467 | 2143 | 5449 | |
| Gender | M | 5001 | 9754 | 13,230 | 20,313 | 17,427 | 3434 | 4555 | 4752 | 10,684 |
| F | 15,459 | 24,985 | 33,672 | 51,556 | 42,364 | 8170 | 12,732 | 15,360 | 31,612 | |
| NB | 149 | 284 | 366 | 613 | 532 | 104 | 141 | 176 | 347 | |
| SD | 102 | 219 | 252 | 357 | 297 | 72 | 119 | 144 | 248 | |
| NA | 153 | 325 | 449 | 653 | 565 | 100 | 128 | 128 | 322 | |
| N/A | 1456 | 2486 | 3613 | 5459 | 4980 | 921 | 1462 | 2138 | 5416 | |
| Cancer | 1223 | 2338 | 3209 | 4353 | 3783 | 816 | 1082 | 1040 | 2289 | |
| HD | 6556 | 7768 | 8450 | 13,352 | 10,792 | 2036 | 4738 | 6213 | 12,229 | |
| HTN | 3952 | 4682 | 5129 | 8263 | 6486 | 1302 | 2979 | 3931 | 7785 | |
| Asthma | 12,222 | 19,035 | 24,962 | 38,645 | 32,205 | 6031 | 11,461 | 14,309 | 28,597 | |
| CLD | 9361 | 14,946 | 19,201 | 30,197 | 25,728 | 5141 | 9644 | 12,504 | 25,272 | |
| KD | 8035 | 12,264 | 14,668 | 22,830 | 19,393 | 3907 | 8203 | 10,029 | 20,615 | |
| AD | 8807 | 14,702 | 19,365 | 28,438 | 23,692 | 4587 | 8450 | 10,533 | 21,677 | |
| Diabetes | T1D | 4817 | 8313 | 9950 | 17,181 | 12,822 | 2627 | 5102 | 5565 | 12,807 |
| T2D | 4202 | 4397 | 4556 | 7698 | 5899 | 1110 | 2803 | 3933 | 7743 | |
| IC | 3091 | 5150 | 5847 | 10,235 | 7325 | 14755 | 3519 | 4087 | 9238 |
Notes: Age Groups are measured in years. Cancer was specified as any form of neoplasm except skin cancer. AD: Autoimmune Disease; CLD: Chronic Lung Disease such as COPD; F: Female; HD: Heart Disease; HTN: Hypertension; IC: Immunocompromised. KD: Kidney Disease; M: Male; N/A: Not answered; NA: Not available; NB: Non-Binary; SD: Self-Described. Please note that for November’s responders we selected participants that received wave 4 of Delphi Study Questionnaire.
Figure 2Rose charts presenting the temporal relationships between phenotypes and symptoms. Phenotype legends: 1. Afebrile (0%), Non-Coughing (0%), Oligosymptomatic (ANCOS); 2. Febrile (100%) Multisymptomatic (FMS); 3. Afebrile (0%) Coughing (100%) Oligosymptomatic (ACOS); 4. Oligosymptomatic with additional self-described symptoms (100%; OSDS); 5. Olfaction/Gustatory Impairment Predominant (100%; OGIP). The numbers in parentheses represent the cluster numbers from the validation process, e.g., cluster number n in August and cluster number c in June would both correspond to FMS.
Cluster composition and symptom-based prediction vs. COVID-19—(controls)—August.
| N | AUC | 95% CI | ||
|---|---|---|---|---|
| ANCOS (1) | 2506 | <0.5 | NA | NA |
| FMS (2) | 2266 | 0.963 | <0.001 | 0.961–0.965 |
| ACOS (3) | 2060 | 0.737 | <0.001 | 0.729–0.746 |
| OSDS (4) | 1013 | 0.777 | <0.001 | 0.762–0.792 |
| OGIP (5) | 2434 | 0.983 | <0.001 | 0.982–0.984 |
Notes: Five COVID-19 syndromes were identified in August: 1. Afebrile (0%), Non-Coughing (0%), Oligosymptomatic (ANCOS); 2. Febrile (100%) Multisymptomatic (FMS); 3. Afebrile (0%) Coughing (100%) Oligosymptomatic (ACOS); 4. Oligosymptomatic with additional self-described symptoms (100%; OSDS); 5. Olfaction/Gustatory Impairment Predominant (100%; OGIP). AUC: Area Under Curve.
Figure 3Decision tree developed using the QUEST algorithm. The decision tree’s branches are based on splits, i.e., variables that were selected based on a Chi-squared test-determined p-value. The dependent variable for this analysis was a cluster number, represented as a nominal categorical variable with 5 levels, corresponding to the initial 5 clusters detected in August.