| Literature DB >> 35478198 |
Jonathan E Millar1, Lucile Neyton1, Sohan Seth2, Jake Dunning3,4, Laura Merson5,6, Srinivas Murthy7, Clark D Russell8, Sean Keating9, Maaike Swets1,10, Carole H Sudre11, Timothy D Spector12, Sebastien Ourselin11, Claire J Steves12, Jonathan Wolf13, Annemarie B Docherty14, Ewen M Harrison14, Peter J M Openshaw4, Malcolm G Semple15, J Kenneth Baillie16.
Abstract
COVID-19 is clinically characterised by fever, cough, and dyspnoea. Symptoms affecting other organ systems have been reported. However, it is the clinical associations of different patterns of symptoms which influence diagnostic and therapeutic decision-making. In this study, we applied clustering techniques to a large prospective cohort of hospitalised patients with COVID-19 to identify clinically meaningful sub-phenotypes. We obtained structured clinical data on 59,011 patients in the UK (the ISARIC Coronavirus Clinical Characterisation Consortium, 4C) and used a principled, unsupervised clustering approach to partition the first 25,477 cases according to symptoms reported at recruitment. We validated our findings in a second group of 33,534 cases recruited to ISARIC-4C, and in 4,445 cases recruited to a separate study of community cases. Unsupervised clustering identified distinct sub-phenotypes. First, a core symptom set of fever, cough, and dyspnoea, which co-occurred with additional symptoms in three further patterns: fatigue and confusion, diarrhoea and vomiting, or productive cough. Presentations with a single reported symptom of dyspnoea or confusion were also identified, alongside a sub-phenotype of patients reporting few or no symptoms. Patients presenting with gastrointestinal symptoms were more commonly female, had a longer duration of symptoms before presentation, and had lower 30-day mortality. Patients presenting with confusion, with or without core symptoms, were older and had a higher unadjusted mortality. Symptom sub-phenotypes were highly consistent in replication analysis within the ISARIC-4C study. Similar patterns were externally verified in patients from a study of self-reported symptoms of mild disease. The large scale of the ISARIC-4C study enabled robust, granular discovery and replication. Clinical interpretation is necessary to determine which of these observations have practical utility. We propose that four sub-phenotypes are usefully distinct from the core symptom group: gastro-intestinal disease, productive cough, confusion, and pauci-symptomatic presentations. Importantly, each is associated with an in-hospital mortality which differs from that of patients with core symptoms.Entities:
Mesh:
Year: 2022 PMID: 35478198 PMCID: PMC9043502 DOI: 10.1038/s41598-022-08032-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Symptom prevelence and relationships. (a) Symptom prevalence. Upset plot, intersections describe the ‘top 10’ symptom combinations within the cohort. The upper graph charts the total number of patients exhibiting these symptom sets. The lower graph charts the total number of patients with each symptom in the cohort. (b) Symptom network graph, derived using eLasso. Lines between symptom nodes illustrate conditional dependencies. A thicker line width and darker hue represents a stronger positive conditional dependence. Red lines represent a negative conditional dependence.
Figure 2Symptom clusters. (a) Cluster identities, proportions, and patterns. Data are presented as count (percentage). (b) Distribution of age by symptom cluster. Density plots, solid lines represent the median age.
Figure 3Patient characteristics and symptom clusters. (a) Time from symptom onset to hospital admission. Data are presented as counts in singleday bins. Vertical dashed lines represent the median time (days). (b) Co-morbidities by symptom cluster. Percentage of individuals with co-morbidity at time of admission. (c) Association between patient characteristics and symptom cluster membership. Multinomial regression, presented as relative risk ratio (95% confidence interval). Core symptoms chosen as reference cluster. The age group 60-80 years serves as the reference group for age.
Figure 4Symptom clusters and survival. (a) Unadjusted 30-day in-hospital mortality by cluster. Kaplan-Meier curves, those discharged before 30 days were assumed to have survived until the end of follow-up. (b) The risk of 30 day in-hospital mortality, adjusted for age and sex. Upper panel, Forest plot, showing results of a Cox proportional hazards model. Lower panel, Hazard ratio associated with varying age, fitted with psplines. Data are presented as hazard ratio (HR) and 95% confidence intervals. Red dotted line - 95% CI Core symptoms serves as the reference symptom cluster. The cohort median age serves as the reference age.