| Literature DB >> 33087383 |
Jane Lyons1, Ashley Akbari2, Fatemeh Torabi2, Gareth I Davies2, Laura North2, Rowena Griffiths2, Rowena Bailey2, Joseph Hollinghurst2, Richard Fry2, Samantha L Turner2, Daniel Thompson2, James Rafferty2, Amy Mizen2, Chris Orton2, Simon Thompson2, Lee Au-Yeung2, Lynsey Cross2, Mike B Gravenor3, Sinead Brophy2, Biagio Lucini2, Ann John2, Tamas Szakmany4,5, Jan Davies6, Chris Davies6, Daniel Rh Thomas7, Christopher Williams7, Chris Emmerson7, Simon Cottrell7, Thomas R Connor8, Chris Taylor9, Richard J Pugh10, Peter Diggle11,12, Gareth John13, Simon Scourfield13, Joe Hunt13, Anne M Cunningham13, Kathryn Helliwell14, Ronan Lyons2.
Abstract
INTRODUCTION: The emergence of the novel respiratory SARS-CoV-2 and subsequent COVID-19 pandemic have required rapid assimilation of population-level data to understand and control the spread of infection in the general and vulnerable populations. Rapid analyses are needed to inform policy development and target interventions to at-risk groups to prevent serious health outcomes. We aim to provide an accessible research platform to determine demographic, socioeconomic and clinical risk factors for infection, morbidity and mortality of COVID-19, to measure the impact of COVID-19 on healthcare utilisation and long-term health, and to enable the evaluation of natural experiments of policy interventions. METHODS AND ANALYSIS: Two privacy-protecting population-level cohorts have been created and derived from multisourced demographic and healthcare data. The C20 cohort consists of 3.2 million people in Wales on the 1 January 2020 with follow-up until 31 May 2020. The complete cohort dataset will be updated monthly with some individual datasets available daily. The C16 cohort consists of 3 million people in Wales on the 1 January 2016 with follow-up to 31 December 2019. C16 is designed as a counterfactual cohort to provide contextual comparative population data on disease, health service utilisation and mortality. Study outcomes will: (a) characterise the epidemiology of COVID-19, (b) assess socioeconomic and demographic influences on infection and outcomes, (c) measure the impact of COVID-19 on short -term and longer-term population outcomes and (d) undertake studies on the transmission and spatial spread of infection. ETHICS AND DISSEMINATION: The Secure Anonymised Information Linkage-independent Information Governance Review Panel has approved this study. The study findings will be presented to policy groups, public meetings, national and international conferences, and published in peer-reviewed journals. © Author(s) (or their employer(s)) 2020. Re-use permitted under CC BY. Published by BMJ.Entities:
Keywords: COVID-19; epidemiology; health informatics; public health
Mesh:
Year: 2020 PMID: 33087383 PMCID: PMC7580065 DOI: 10.1136/bmjopen-2020-043010
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 3.006
Figure 1Data linkage of multiple demographic and healthcare data sources used in the creation of two population-wide cohorts: C20 and C16. ADDD, Annual District Death Daily; ADDE, Annual District Death Extract; CAPD, Postponed Admitted Procedures; CARE, Care homes Index; CCDS, Critical Care Data Set; CDDS, Consolidated Death Data Source; CVSP, COVID-19 Shielded People list; CVST, KCL Zoe Symptom Tracker App; EDDD, Emergency Department Dataset Daily; EDDS, Emergency Department Data Set; ICNARC, Intensive Care National Audit & Research Centre; LIMS, Laboratory Information Management System; LSOA, Lower Layer Super Output Area; OPDW, Out Patient Dataset for Wales; PEDW, Patient Episode; PLASC, Pupil Level Annual School Census; SAIL, Secure Anonymised Information Linkage; SQL, Structured Query Language; WDSD, Wash Demographic Service Dataset; WIMD, Welsh Index of Multiple Deprivation; WLGP, Welsh Longitudinal General Practise; WRRS, Wales Results Reporting Service.
Figure 2Flow diagram of the C20 cohort inclusion criteria. WDSD, Wales Demographic Service Dataset.
C16 and C20 cohort demographics till the end of May 2020
| Cohort | C16 | C20 |
| Individuals (N) | 3 087 032 | 3 277 114 |
| Cohort start date | 1 January 2016 | 1 January 2020 |
| Cohort end date | 31 December 2019 | 31 May 2020 |
| Deaths in period | 117 565 (3.8%) | 16 380 (0.5%) |
| Full coverage (cohort end date=31 December 2019/31 May 2020) | 2 651 957 (85.9%) | 3 237 389 (98.8%) |
| Registered with a SAIL providing practice (registration end date >cohort start date) | 2 608 761 (84.5%) | 2 666 331 (81.4%) |
| Mean age (SD) | 41.3 (23.7) | 41.9 (23.8) |
| Sex | ||
| Female | 50.1% | 50.1% |
| WIMD 2019 quintile* | ||
| 1 | 20.3% | 19.1% |
| 2 | 19.9% | 18.5% |
| 3 | 20.1% | 18.4% |
| 4 | 19.7% | 18.1% |
| 5 | 19.9% | 18.3% |
| Missing WIMD | 0.0% | 7.7% |
*WIMD 2019 quintile: 1=most deprived, 5=least deprived, please note a one decimal place rounding error.
SAIL, Secure Anonymised Information Linkage; WIMD, Welsh Index of Multiple Deprivation.