| Literature DB >> 34184982 |
Andrew J Page1, Alison E Mather1,2, Thanh Le-Viet1, Emma J Meader3, Nabil-Fareed Alikhan1, Gemma L Kay1, Leonardo de Oliveira Martins1, Alp Aydin1, David J Baker1, Alexander J Trotter1,2, Steven Rudder1, Ana P Tedim1,4, Anastasia Kolyva1,3, Rachael Stanley3, Muhammad Yasir1, Maria Diaz1, Will Potter3, Claire Stuart3, Lizzie Meadows1, Andrew Bell1, Ana Victoria Gutierrez1, Nicholas M Thomson1, Evelien M Adriaenssens1, Tracey Swingler2, Rachel A J Gilroy1, Luke Griffith2, Dheeraj K Sethi3, Dinesh Aggarwal5,6,7,8, Colin S Brown5, Rose K Davidson2, Robert A Kingsley1,2, Luke Bedford9, Lindsay J Coupland3, Ian G Charles1,2, Ngozi Elumogo1,3, John Wain1,2, Reenesh Prakash3, Mark A Webber1,2, S J Louise Smith10, Meera Chand5, Samir Dervisevic3, Justin O'Grady1,2.
Abstract
The COVID-19 pandemic has spread rapidly throughout the world. In the UK, the initial peak was in April 2020; in the county of Norfolk (UK) and surrounding areas, which has a stable, low-density population, over 3200 cases were reported between March and August 2020. As part of the activities of the national COVID-19 Genomics Consortium (COG-UK) we undertook whole genome sequencing of the SARS-CoV-2 genomes present in positive clinical samples from the Norfolk region. These samples were collected by four major hospitals, multiple minor hospitals, care facilities and community organizations within Norfolk and surrounding areas. We combined clinical metadata with the sequencing data from regional SARS-CoV-2 genomes to understand the origins, genetic variation, transmission and expansion (spread) of the virus within the region and provide context nationally. Data were fed back into the national effort for pandemic management, whilst simultaneously being used to assist local outbreak analyses. Overall, 1565 positive samples (172 per 100 000 population) from 1376 cases were evaluated; for 140 cases between two and six samples were available providing longitudinal data. This represented 42.6 % of all positive samples identified by hospital testing in the region and encompassed those with clinical need, and health and care workers and their families. In total, 1035 cases had genome sequences of sufficient quality to provide phylogenetic lineages. These genomes belonged to 26 distinct global lineages, indicating that there were multiple separate introductions into the region. Furthermore, 100 genetically distinct UK lineages were detected demonstrating local evolution, at a rate of ~2 SNPs per month, and multiple co-occurring lineages as the pandemic progressed. Our analysis: identified a discrete sublineage associated with six care facilities; found no evidence of reinfection in longitudinal samples; ruled out a nosocomial outbreak; identified 16 lineages in key workers which were not in patients, indicating infection control measures were effective; and found the D614G spike protein mutation which is linked to increased transmissibility dominates the samples and rapidly confirmed relatedness of cases in an outbreak at a food processing facility. The large-scale genome sequencing of SARS-CoV-2-positive samples has provided valuable additional data for public health epidemiology in the Norfolk region, and will continue to help identify and untangle hidden transmission chains as the pandemic evolves.Entities:
Keywords: ARTIC; NGS; SARS-CoV-2; genome; genomic epidemiology; sequencing
Mesh:
Substances:
Year: 2021 PMID: 34184982 PMCID: PMC8461472 DOI: 10.1099/mgen.0.000589
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Fig. 1.Total number of positive samples in the region per week, broken down by type. Not all of these were available for sequencing and a single individual may have been sampled multiple times. Staff (key workers) include healthcare workers and essential workers, such as police officers.
Fig. 2.Number of positive samples sequenced at Quadram Institute Bioscience (grey bars) over time compared with the number of samples collected (red dashed line). Project sample collection only officially began on 8 April 2020 and not all samples taken before this time were available for sequencing. However, archived samples for March were sequenced and are represented in this figure.
Fig. 3.Age and sex of positive cases in absolute values (a) and scaled as proportions of all positive cases sequenced (b). Data from one person over 100 years of age are not included. NA, not available.
Fig. 4.The weekly number of co-occurring global lineages, excluding lineages that were only found in a single sample.
Fig. 5.The proportion of samples represented by each lineage per week, excluding lineages represented by a single sample. Sample collection for sequencing began on 8 April 2020, and no archival samples were available for sequencing for the period 26 March to 7 April 2020. Samples from July and August were primarily repeated sampling of the same case or with only a single individual in a lineage.
Summary of the prevalence of global lineages represented by more than one sample in the Norfolk dataset compared with the public set of genomes from GISAID and COG-UK (accessed 28 August 2020)
|
Norfolk |
UK |
Global | ||||
|---|---|---|---|---|---|---|
|
Lineage |
Samples |
Per cent |
Samples |
Per cent |
Samples |
Per cent |
|
B |
21 |
2.03 |
1021 |
2.59 |
1831 |
2.97 |
|
B.1 |
63 |
6.09 |
5918 |
14.99* |
13 354 |
21.63* |
|
B.1.1 |
464 |
44.83 |
15 654 |
39.65* |
19 424 |
31.46* |
|
B.1.1.1 |
11 |
1.06 |
3658 |
9.26* |
3933 |
6.37* |
|
B.1.1.10 |
6 |
0.58 |
295 |
0.75 |
324 |
0.52 |
|
B.1.1.15 |
35 |
3.38 |
45 |
0.11* |
60 |
0.97* |
|
B.1.1.3 |
19 |
1.84 |
100 |
0.25* |
100 |
0.16* |
|
B.1.1.7 |
2 |
0.19 |
157 |
0.40 |
173 |
0.28 |
|
B.1.105 |
7 |
0.68 |
90 |
0.23 |
90 |
0.15* |
|
B.1.11 |
111 |
10.72 |
793 |
2.01* |
825 |
1.34* |
|
B.1.111 |
11 |
1.06 |
11 |
0.03* |
18 |
0.03* |
|
B.1.5 |
33 |
3.19 |
997 |
2.53 |
1996 |
3.23 |
|
B.1.5.5 |
67 |
6.47 |
73 |
0.18* |
75 |
0.12* |
|
B.1.75 |
84 |
8.12 |
152 |
0.38* |
156 |
0.25* |
|
B.1.88 |
6 |
0.58 |
39 |
0.1* |
43 |
0.07* |
|
B.2 |
51 |
4.93 |
1343 |
3.40 |
1792 |
2.90* |
|
B.2.1 |
13 |
1.26 |
2004 |
5.08* |
2153 |
3.49* |
|
B.21 |
12 |
1.16 |
26 |
0.07* |
27 |
0.04* |
|
B.3 |
9 |
0.87 |
1000 |
2.53* |
1118 |
1.81 |
a, In this column those values marked with an asterisk are significantly more/less prevalent in Norfolk samples than UK samples (P<0.05 Bonferroni corrected for multiple hypothesis testing).
†‡b, In this column those values marked with an asterisk are significantly more/less prevalent in Norfolk samples than in global samples (P<0.05 Bonferroni corrected for multiple hypothesis testing).
Fig. 6.Phylogenetic tree of the Norfolk genomes sequenced in this study. The phylogenetic tree was estimated as part of the COG-UK phylogenetic pipeline (7 September 2020). The inner circle represents the UK lineages assigned to each sequence, while the outer circle shows their equivalent global lineages. Only high-quality samples are included (837 sequences). The tree is scaled by nucleotide substitutions per site.
Fig. 7.Weekly numbers of genomes with the wild type genome (D614) or the D614G mutant in the spike protein.