| Literature DB >> 33741586 |
Carole H Sudre1,2,3, Karla A Lee4, Mary Ni Lochlainn4, Thomas Varsavsky5, Benjamin Murray5, Mark S Graham5, Cristina Menni4, Marc Modat5, Ruth C E Bowyer4, Long H Nguyen6, David A Drew6, Amit D Joshi6, Wenjie Ma6, Chuan-Guo Guo6, Chun-Han Lo6, Sajaysurya Ganesh7, Abubakar Buwe7, Joan Capdevila Pujol7, Julien Lavigne du Cadet7, Alessia Visconti4, Maxim B Freidin4, Julia S El-Sayed Moustafa4, Mario Falchi4, Richard Davies7, Maria F Gomez8, Tove Fall8, M Jorge Cardoso5, Jonathan Wolf7, Paul W Franks4,8, Andrew T Chan6, Tim D Spector4, Claire J Steves4, Sébastien Ourselin1.
Abstract
As no one symptom can predict disease severity or the need for dedicated medical support in coronavirus disease 2019 (COVID-19), we asked whether documenting symptom time series over the first few days informs outcome. Unsupervised time series clustering over symptom presentation was performed on data collected from a training dataset of completed cases enlisted early from the COVID Symptom Study Smartphone application, yielding six distinct symptom presentations. Clustering was validated on an independent replication dataset between 1 and 28 May 2020. Using the first 5 days of symptom logging, the ROC-AUC (receiver operating characteristic - area under the curve) of need for respiratory support was 78.8%, substantially outperforming personal characteristics alone (ROC-AUC 69.5%). Such an approach could be used to monitor at-risk patients and predict medical resource requirements days before they are required.Entities:
Mesh:
Year: 2021 PMID: 33741586 PMCID: PMC7978420 DOI: 10.1126/sciadv.abd4177
Source DB: PubMed Journal: Sci Adv ISSN: 2375-2548 Impact factor: 14.136
Fig. 4Flowchart showing entry of participants into analysis.
Fig. 1Illustrative representation of the 6 clusters.
(Top) Frequency of positive answers per symptom across days for each cluster (darker, reported more frequently) and (bottom) associated Z-score of presentation of symptoms over overall symptom distribution (red, reported more than average; blue, reported less than average). The clusters are ordered from left to right by rates of reported hospital visit with associated rates of respiratory support of 1.5, 4.4, 3.7, 8.6, 9.9, and 19.8%, respectively.
Demographic details for the app users in each cluster, stratified by training/independent replication set membership.
BMI, body mass index; Hosp, hospital visit; RS, respiratory support. For continuous variables (age and BMI), results are given as mean (SD); frailty, PRISMA7 score ≥ 3; numbers in parentheses indicate the origin of the participants (UK, United Kingdom; US, United States; SE, Sweden).
| 462 | 315 | 216 | (273-7-/) | 213 | 167 | 404 | 199 | 130 | 102 | 118 | 94 | |
| 41.1 | 43.2 | 41.0 | 43.0 | 43.7 | 43.8 | 44.2 | 46.4 | 45.9 | 48.9 | 48.1 | 46.7 | |
| 26.9 (5.5) | 27.3 (5.7) | 26.9 (5.8) | 27.4 (5.9) | 28.0 (5.9) | 28.9 (6.3) | 27.1 (5.7) | 28.2 (5.9) | 28.6 (6.5) | 27.2 (5.3) | 28.7 (6.5) | 29.3 (6.2) | |
| 29.4% | 26.3% | 19.9% | 16.8% | 27.7% | 24.6% | 21.3% | 24.6% | 21.5% | 25.5% | 28.8% | 34.0% | |
| 13.6% | 10.5% | 10.2% | 19.6% | 19.7% | 28.1% | 12.1% | 15.1% | 11.5% | 18.6% | 16.1% | 30.9% | |
| 1.5% | 1.0% | 1.9% | 2.1% | 3.8% | 4.2% | 0.7% | 2.0% | 1.5% | 3.9% | 5.9% | 1.1% | |
| 2.2% | 3.2% | 4.2% | 3.2% | 4.2% | 4.8% | 1.7% | 4.0% | 6.9% | 2.0% | 5.1% | 7.4% | |
| 0.4% | 0.6% | 0.5% | 1.1% | 0.9% | 1.8% | 0.5% | 0.5% | 0.0% | 0.0% | 1.7% | 2.1% | |
| 0.0% | 1.3% | 0.0% | 0.0% | 3.3% | 5.4% | 1.0% | 2.0% | 1.5% | 2.0% | 5.9% | 8.5% | |
| 16.0% | 17.5% | 23.6% | 24.6% | 27.2% | 45.5% | 11.6% | 13.1% | 13.8% | 24.5% | 37.3% | 50.0% | |
| 1.5% | 4.4% | 3.7% | 8.6% | 9.9% | 19.8% | 0.2% | 2.5% | 2.3% | 7.8% | 16.9% | 23.4% | |
Fig. 2Confusion matrix showing cluster prediction using projections based on 2 to 9 days after onset of symptoms.
By day 5 of COVID-19, the cluster in which a participant falls can be predicted with 72% weighted average precision.
Fig. 3Frequency of occurrence and duration of symptoms at 5 days.
(Left) Percentage of occurrence of symptoms at 5 days per cluster. (Right) Z-score in duration of symptom when occurring over the five first days.