Literature DB >> 32679081

Rapid implementation of SARS-CoV-2 sequencing to investigate cases of health-care associated COVID-19: a prospective genomic surveillance study.

Luke W Meredith1, William L Hamilton2, Ben Warne2, Charlotte J Houldcroft3, Myra Hosmillo1, Aminu S Jahun1, Martin D Curran4, Surendra Parmar4, Laura G Caller5, Sarah L Caddy6, Fahad A Khokhar6, Anna Yakovleva1, Grant Hall1, Theresa Feltwell3, Sally Forrest6, Sushmita Sridhar7, Michael P Weekes6, Stephen Baker6, Nicholas Brown4, Elinor Moore8, Ashley Popay9, Iain Roddick9, Mark Reacher9, Theodore Gouliouris10, Sharon J Peacock11, Gordon Dougan6, M Estée Török12, Ian Goodfellow13.   

Abstract

BACKGROUND: The burden and influence of health-care associated severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infections is unknown. We aimed to examine the use of rapid SARS-CoV-2 sequencing combined with detailed epidemiological analysis to investigate health-care associated SARS-CoV-2 infections and inform infection control measures.
METHODS: In this prospective surveillance study, we set up rapid SARS-CoV-2 nanopore sequencing from PCR-positive diagnostic samples collected from our hospital (Cambridge, UK) and a random selection from hospitals in the East of England, enabling sample-to-sequence in less than 24 h. We established a weekly review and reporting system with integration of genomic and epidemiological data to investigate suspected health-care associated COVID-19 cases.
FINDINGS: Between March 13 and April 24, 2020, we collected clinical data and samples from 5613 patients with COVID-19 from across the East of England. We sequenced 1000 samples producing 747 high-quality genomes. We combined epidemiological and genomic analysis of the 299 patients from our hospital and identified 35 clusters of identical viruses involving 159 patients. 92 (58%) of 159 patients had strong epidemiological links and 32 (20%) patients had plausible epidemiological links. These results were fed back to clinical, infection control, and hospital management teams, leading to infection-control interventions and informing patient safety reporting.
INTERPRETATION: We established real-time genomic surveillance of SARS-CoV-2 in a UK hospital and showed the benefit of combined genomic and epidemiological analysis for the investigation of health-care associated COVID-19. This approach enabled us to detect cryptic transmission events and identify opportunities to target infection-control interventions to further reduce health-care associated infections. Our findings have important implications for national public health policy as they enable rapid tracking and investigation of infections in hospital and community settings. FUNDING: COVID-19 Genomics UK funded by the Department of Health and Social Care, UK Research and Innovation, and the Wellcome Sanger Institute.
Copyright © 2020 Elsevier Ltd. All rights reserved.

Entities:  

Mesh:

Year:  2020        PMID: 32679081      PMCID: PMC7806511          DOI: 10.1016/S1473-3099(20)30562-4

Source DB:  PubMed          Journal:  Lancet Infect Dis        ISSN: 1473-3099            Impact factor:   71.421


Introduction

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) emerged in the human population in December, 2019, originating from an intermediate animal host. Owing to the error prone nature of the viral replication process, RNA viruses, such as SARS-CoV-2, accumulate mutations over time resulting in sequence diversity. The current mutation rate of SARS-CoV-2 is estimated to be approximately 2·5 nucleotides per month. Sequencing of SARS-CoV-2 can provide valuable information on virus biology, transmission, and population dynamics.4, 5, 6, 7 When linked with detailed epidemiological data and on a timescale of days, genomic data can support epidemiological investigations of potential hospital-acquired infections. On a larger population scale, genomic surveillance of SARS-CoV-2 can inform which lineages of the virus are circulating in the human population, how these change over time as an indicator of the success of control measures, how often new sources of virus are introduced from other geographical areas, and how the virus evolves in response to interventions. Health-care associated infections can affect both patients (increasing morbidity and mortality) and health-care workers (impacting on staff sickness and morale) to the detriment of patient care. It is crucial to rapidly detect and manage health-care associated infections effectively to prevent both complications and onward transmission to susceptible patients and staff. The burden of nosocomial COVID-19 infections is unknown with one early study from China reporting a prevalence of 41% among hospitalised patients. Worldwide, more than 22 000 cases of COVID-19 infection in health-care workers were reported in the WHO situation report from May, 2020, which is likely to be an underestimate. As the number of community-acquired COVID-19 cases reduces, health-care settings are likely to act as reservoirs of infection. Identifying transmission events in these settings will therefore become increasingly important to manage outbreaks and effectively monitor infection control. Evidence before this study Since the emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in December, 2019, the infection has spread worldwide, infecting more than 9·1 million people and causing more than 472 000 deaths as of June 23, 2020. Despite investigation, substantial gaps remain in our understanding of virus biology and transmission. We searched PubMed Central, medRxiv, and bioRxiv with combinations of “SARS-CoV-2”, “genome”, or “genomic” and “hospital-acquired” or “healthcare-associated” for articles published in English from database inception until May 11, 2020, and returned few relevant results. Previous studies have analysed SARS-CoV-2 biology, diversity and evolution, transmission networks, and health-care worker infections. Very few have applied genomic epidemiology to tackle health-care associated infection and, to our knowledge, none have been at the scale of a hospital COVID-19 epidemic comprising hundreds of patients in real-time. Attempts with other pathogens have often been assessed retrospectively, and in a timeframe that was not actionable. Added value of this study We present the first report, to our knowledge, applying rapid genome sequencing to systematically investigate SARS-CoV-2 health-care associated infections, integrating genomic and epidemiological data to identify transmission networks and inform targeted infection control interventions. In 6 weeks, we sequenced 1000 genomes, including 70% of all COVID-19 cases tested at our hospital. We uncovered ward outbreaks of hospital-acquired infections and substantial transmission in health-care associated community settings. Genomic analysis identified cryptic transmission that had not been suspected by clinical or infection control teams. These complex transmission networks involved patients and health-care workers and spanned hospital and community health-care settings. By feeding results back to the hospital weekly, the data could be actioned by the infection control team during the course of outbreak investigations. Implications of all the available evidence Rapid viral sequencing can contribute to health-care associated infection investigations by uncovering evidence for or against transmission events. As transmission shifts from the community to health-care settings, the use of rapid sequencing integrated with epidemiological investigations can help to reveal complex transmission chains and inform targeted infection control and public health interventions. This strategy could support the new test, track, and trace initiative of the UK Government, enabling a more targeted approach to disease control. We aimed to examine the use of rapid sequencing of SARS-CoV-2, combined with detailed epidemiological analysis, to investigate health-care associated COVID-19 infections and to inform infection control measures in our hospital.

Methods

Study design and participants

We did a prospective surveillance study of SARS-CoV-2 infections at Cambridge University Hospitals National Health Service Foundation Trust (CUH; Cambridge, UK), a secondary care provider and tertiary referral centre in the East of England. Sufficient supplies of personal protective equipment (PPE) to meet national recommendations were available in CUH at all times during the study period. We adhered to national guidance on PPE use. Patients were isolated either pre-emptively (suspected COVID-19) or immediately after a confirmed PCR positive result. Clinical specimens collected from patients presenting to 18 hospitals in the East of England were submitted to the Public Health England Clinical Microbiology and Public Health Laboratory at CUH for diagnostic testing. Samples underwent nucleic acid extraction and were tested for the presence of SARS-CoV-2 with a validated in-house RT-qPCR assay developed by the Public Health England Clinical Microbiology and Public Health Laboratory (appendix p 1). The test was reported as SARS-CoV-2 PCR positive if the cycle threshold (Ct) value was 36 or less. The study was done as part of surveillance for COVID-19 under the auspices of Section 251 of the National Health Service Act 2006. It therefore did not require individual patient consent or ethical approval. The COVID-19 Genomics UK (COG-UK) study protocol was approved by the Public Health England Research Ethics Governance Group.

Procedures

Demographic, clinical, and laboratory data were extracted from the hospital information system (Epic Systems, Verona, WI, USA; appendix p 5). Sample collection, metadata curation, and linkage to sequencing identification numbers involved coordination between multiple teams and depended on daily email communications and face-to-face contact between the sequencing and diagnostic laboratory staff (appendix p 5). Each day samples with positive SARS-CoV-2 PCR results from the last 24–72 h were identified from the hospital information system and clinical metadata was extracted, formatted, and integrated into a master metadata file. A 15 μL aliquot of the RNA extract of all CUH samples and a random selection of samples from the East of England were collected from the diagnostic microbiology laboratory for local sequencing. We aimed for each sampling site in the East of England to be represented across the study period. Typically, two positive samples per site were selected per day for sequencing, including both high and low Ct samples when possible. Samples were also included from a local health-care worker screening programme (n=37 in this dataset). Most samples were from patients with symptomatic COVID-19; two samples were from asymptomatic individuals. Samples were assigned COG-UK sequencing codes that were integrated back into the master metadata file. For samples that were not sequenced locally, the remaining RNA extract was collected from the diagnostic microbiology laboratory and sent to the Wellcome Sanger Institute (Hinxton, UK) for sequencing as part of the COG-UK consortium. 14 samples sequenced on site were also sent to the institute to compare consistency across sequencing platforms. Each week, clinical metadata and sequencing data were combined and formatted for upload to the Medical Research Council CLIMB system. Data manipulations were done in R (version 3.6.2) with the tidyverse packages (version 1.3.0) installed onto computers within the Trust network. When Ct values were available before sample selection, positive samples with a Ct value of 33 or less were sequenced with a multiplex PCR based approach according to the modified ARTIC version 2 protocol with version 3 primer set (appendix p 2).11, 12 Amplicon libraries were sequenced using MinION flow cells version 9.4.1 (Oxford Nanopore Technologies, Oxford, UK). Genomes were assembled with reference-based assembly and a bioinformatic pipeline with 20 ×  minimum coverage cutoff for any region of the genome and 50·1% cutoff for defining single nucleotide polymorphisms (SNPs).

Analysis

All sequences underwent quality control filtering and de-duplication to remove repeat samples from the same patient. Multiple sequence alignment was done with MAFFT. Phylogenetic trees were produced with IQ-TREE and visualised in Microreact for weekly hospital reports and the R package ggtree (appendix pp 2–3). A pairwise SNP distance matrix was produced from the alignment with use of the snp-dists package. Viral lineages were assigned with the PANGOLIN package, version 1.07. For epidemiological analysis, patient movement data for all SARS-CoV-2 PCR positive samples were extracted from the hospital information system and transferred to the Public Health England Field Service (Epidemiology). Epidemiological analysis was done with a cloud-based plotter application (Cluster Track; appendix pp 3–4). Clusters of COVID-19 cases including health-care workers were identified by the clinical and infection control teams and reviewed at a weekly meeting, coordinated by the patient safety team (appendix p 6). The genomic and epidemiological analyses were presented to help establish whether infections were health-care associated and to identify possible causes and interventions. A weekly report was fed back to the clinical, infection control, and hospital management teams to inform changes in infection-control practice and comply with patient safety procedures. Definitions of health-care associated COVID-19 are shown in the table .
Table

Definitions of health-care associated COVID-19

LabelDefinitionTotal (n)
1Community onset, community associatedDiagnostic sample positive within 48 h of admission and no health-care contact during the previous 14 days263
2Community onset, suspected health-care associatedDiagnostic sample positive within 48 h of admission with health-care contact during the previous 14 days32
3Hospital onset, indeterminate health-care associatedDiagnostic sample positive after 48 h but less than 7 days post-admission13
4Hospital onset, suspected health-care associatedDiagnostic sample positive 7–14 days post-admission14
5Hospital onset, health-care associatedDiagnostic sample positive more than 14 days post-admission43
6Health-care workerAny positive test identified as coming from a health-care worker*9

Different categories of hospital onset infection reflect the increasing likelihood of hospital acquired infection based on the incubation period of the virus. N=374 is the number of CUH patients included in the study period, of whom 262 (70%) had sequencing available for analysis (appendix p 9). CUH=Cambridge University Hospitals National Health Service Foundation Trust.

These data do not include 37 health-care workers identified through the CUH screening programme.

Definitions of health-care associated COVID-19 Different categories of hospital onset infection reflect the increasing likelihood of hospital acquired infection based on the incubation period of the virus. N=374 is the number of CUH patients included in the study period, of whom 262 (70%) had sequencing available for analysis (appendix p 9). CUH=Cambridge University Hospitals National Health Service Foundation Trust. These data do not include 37 health-care workers identified through the CUH screening programme.

Role of the funding source

The funder of the study had no role in study design, data collection, data analysis, data interpretation, or writing of the report. The corresponding authors had full access to all the data in the study and had final responsibility for the decision to submit for publication.

Results

Between March 13 and April 24, 2020, 1000 samples were selected for sequencing and, after quality control and de-duplication, 747 were used for downstream analysis. 299 (40%) of 747 samples were from CUH including 46 health-care worker samples, of which 37 were identified through a health-care worker screening programme (including two asymptomatic individuals). 374 patients with PCR-confirmed COVID-19 were tested at CUH between March 10 and April 24, 2020 (figure 1 , appendix pp 7–8). The median age was 64 years (range 0–98) and 233 (62%) of 374 were male. 74 (20%) of 374 patients were admitted to critical care units and 75 (20%) died. Excluding the health-care workers screening samples, 262 (70%) of 374 CUH COVID-19 samples had sequencing data available (figure 1; appendix p 9). 57 (15%) infections were suspected or highly likely to be hospital-acquired, of which 49 (86%) had genome sequences available. A further 32 (9%) admissions were community-acquired but likely to be health-care associated, and nine (2%) were health-care workers (not counting health-care workers identified through screening). The CUH epidemic curve showed that weekly admissions peaked in week 4 (commencing March 30, 2020) and then declined (figure 2 ). The UK went into full lockdown on March 23, 2020. In the early stages of the epidemic, community-onset and community-acquired infections predominated but the frequency of health-care associated infections increased from March 23 until April 6, 2020, and then declined.
Figure 1

Study profile

We prioritised CUH samples for nanopore sequencing on site for quick turnaround to investigate health-care associated infections. Of the 166 samples that did not pass quality control, two were removed as their genomes were less than 29 kb and 164 were removed as they had more than 2990 undefined nucleotides. SARS-CoV-2=severe acute respiratory syndrome coronavirus 2. CUH=Cambridge University Hospitals National Health Service Foundation Trust. *Of 3891 samples assigned for sequencing, 2940 were uploaded to the CLIMB server, as have all of the 1000 genomes sequenced on site, for national COVID-19 Genomics UK analyses. †The 37 health-care workers were identified through a CUH screening programme. In addition, nine self-presented to CUH admission units and are counted as patients (n=46).

Figure 2

Epidemic curve of COVID-19 at CUH

Data for 374 patients tested at CUH. Classification of infection: (1) community onset, community associated; (2) community onset, suspected health-care associated; (3) hospital onset, indeterminate health-care associated; (4) hospital onset, suspected health-care associated; (5) hospital onset, health-care associated; (6) health-care worker. These data do not include 37 health-care workers identified through the CUH screening programme, but do include nine health-care workers that self-presented for testing. Note that data for week commencing April 20, 2020, stops at 8 AM on April 24, 2020, so does not include the weekend. CUH=Cambridge University Hospitals National Health Service Foundation Trust.

Study profile We prioritised CUH samples for nanopore sequencing on site for quick turnaround to investigate health-care associated infections. Of the 166 samples that did not pass quality control, two were removed as their genomes were less than 29 kb and 164 were removed as they had more than 2990 undefined nucleotides. SARS-CoV-2=severe acute respiratory syndrome coronavirus 2. CUH=Cambridge University Hospitals National Health Service Foundation Trust. *Of 3891 samples assigned for sequencing, 2940 were uploaded to the CLIMB server, as have all of the 1000 genomes sequenced on site, for national COVID-19 Genomics UK analyses. †The 37 health-care workers were identified through a CUH screening programme. In addition, nine self-presented to CUH admission units and are counted as patients (n=46). Epidemic curve of COVID-19 at CUH Data for 374 patients tested at CUH. Classification of infection: (1) community onset, community associated; (2) community onset, suspected health-care associated; (3) hospital onset, indeterminate health-care associated; (4) hospital onset, suspected health-care associated; (5) hospital onset, health-care associated; (6) health-care worker. These data do not include 37 health-care workers identified through the CUH screening programme, but do include nine health-care workers that self-presented for testing. Note that data for week commencing April 20, 2020, stops at 8 AM on April 24, 2020, so does not include the weekend. CUH=Cambridge University Hospitals National Health Service Foundation Trust. Each week a sample set was locked and underwent bioinformatic analysis. Of 1000 sequenced genomes presented here, 253 were excluded from downstream analysis because they did not reach quality control thresholds (n=166), metadata were missing (n=16), or they were repeat samples from the same patient (n=71; figure 1). The median genome depth of coverage was 6612 ×. We compared Ct value versus depth of coverage and found that the latter declined at Ct values of more than 30 (appendix p 10). We also examined the location and frequency of SNPs across the genomes (appendix p 11). Genomes were assigned to a lineage based on the combination of mutations that have accumulated since the virus emerged. As of March 23, 2020, 12 lineages had been described in the UK. Most samples in both the East of England and CUH belonged to lineage B.1. There were no lineage A samples, which have mainly been identified in China, the USA, South Korea, and Australia (figure 3 , appendix pp 12–13).
Figure 3

SARS-CoV-2 lineages identified in the East of England and CUH

Map of the East of England region (A) showing the breakdown of SARS-CoV-2 lineages by collecting hospital site. Lineage B.1 was the most prevalent lineage throughout the region (B). SARS-CoV-2=severe acute respiratory syndrome coronavirus 2. CUH=Cambridge University Hospitals National Health Service Foundation Trust.

SARS-CoV-2 lineages identified in the East of England and CUH Map of the East of England region (A) showing the breakdown of SARS-CoV-2 lineages by collecting hospital site. Lineage B.1 was the most prevalent lineage throughout the region (B). SARS-CoV-2=severe acute respiratory syndrome coronavirus 2. CUH=Cambridge University Hospitals National Health Service Foundation Trust. Phylogenetic trees were used to explore potential genetic clustering and correlation with sampling ward location and cases of suspected hospital-acquired infections (figure 4 ). Samples collected from the emergency department were phylogenetically dispersed, probably reflecting unconnected transmission events within the past few days or weeks in community-acquired infections (appendix p 14). By contrast, samples collected from several wards in CUH and an outpatient dialysis unit were genetically clustered (figure 4). The putative ward clusters included a high proportion of suspected hospital-acquired infections, suggestive of linked transmission chains in the hospital.
Figure 4

Phylogenetic tree of SARS-CoV-2 genomes from CUH

Phylogenetic tree of 299 SARS-CoV-2 genomes form CUH and 30 international genomes. The tree is rooted on a December, 2019, genome from Wuhan, China. The left-hand column highlights several hospital and community associated clusters in different colours. Wards A, B, and C all had clusters of hospital-acquired infections cases with viruses of less than two single nucleotide polymorphisms different. Eight ward C cases are contained within one of the largest clusters of identical viruses. Genomic clusters containing the cases for wards B and C, the dialysis unit, and care home A all included health-care workers. The right hand column shows the classification of infection, also shown in figure 2. Classification of infection: (1) community onset, community associated; (2) community onset, suspected health-care associated; (3) hospital onset, indeterminate health-care associated; (4) hospital onset, suspected health-care associated; (5) hospital onset, health-care associated; (6) health-care worker; (7) unable to determine or missing. See appendix pp 23–24 for GISAID identification codes of included reference genomes, and appendix p 13 for their position on the East of England phylogenetic tree. SARS-CoV-2=severe acute respiratory syndrome coronavirus 2. CUH=Cambridge University Hospitals National Health Service Foundation Trust.

Phylogenetic tree of SARS-CoV-2 genomes from CUH Phylogenetic tree of 299 SARS-CoV-2 genomes form CUH and 30 international genomes. The tree is rooted on a December, 2019, genome from Wuhan, China. The left-hand column highlights several hospital and community associated clusters in different colours. Wards A, B, and C all had clusters of hospital-acquired infections cases with viruses of less than two single nucleotide polymorphisms different. Eight ward C cases are contained within one of the largest clusters of identical viruses. Genomic clusters containing the cases for wards B and C, the dialysis unit, and care home A all included health-care workers. The right hand column shows the classification of infection, also shown in figure 2. Classification of infection: (1) community onset, community associated; (2) community onset, suspected health-care associated; (3) hospital onset, indeterminate health-care associated; (4) hospital onset, suspected health-care associated; (5) hospital onset, health-care associated; (6) health-care worker; (7) unable to determine or missing. See appendix pp 23–24 for GISAID identification codes of included reference genomes, and appendix p 13 for their position on the East of England phylogenetic tree. SARS-CoV-2=severe acute respiratory syndrome coronavirus 2. CUH=Cambridge University Hospitals National Health Service Foundation Trust. SARS-CoV-2 has low genetic diversity due to its introduction into the human population within 6 months of sample collection; here, a median of eight SNPs separated any two samples at CUH (appendix pp 15–16). This low sequence diversity makes interpretation of putative clusters challenging, as samples could be identical by chance rather than because of a connected transmission chain. To investigate genomic clustering further, we adopted a combined genetic, clinical, and epidemiological approach. Samples with zero SNP differences were identified and clusters named numerically by decreasing sample size. Overall 159 (53%) of 299 genomes from CUH in 35 clusters shared at least one identical sequence. The largest cluster had 18 identical genomes. Patients' medical records (including address, social setting, clinical details, and ward movements) were reviewed for all putative genomic clusters to assess whether cases had plausible or probable linked transmission within a few days or weeks (appendix pp 17–20). Of 159 cases from 35 putative clusters, 92 (58%) cases had strong epidemiological evidence to support transmission within a few days or weeks, 32 cases (20%) had intermediate evidence, and 35 (22%) no evidence of connected transmission. Clusters with strong evidence of linked transmission within a few days or weeks included cases in which a connection was already suspected, such as groups of probable hospital-acquired infections seen on multiple CUH hospital wards (figure 4), and clusters that were previously not recognised as being linked, such as a care home outbreak involving health-care workers based in hospital and community settings. Directed by clinical and infection control teams, we established a process for focused genomic and epidemiological analyses of suspected hospital-acquired infections (appendix p 6). These were discussed in weekly meetings with an accompanying written report submitted to the hospital. A full description of clusters is detailed in the appendix (pp 17–20). In brief, epidemiologically linked cases with identical viral genomes indicating transmission events were identified in 12 hospital wards and an outpatient dialysis unit. Nine of the hospital wards were classed as green (ie, no known patients with COVID-19) at the onset of cluster cases, and three were classed as red (ie, housing patients with confirmed or highly suspected COVID-19). Additionally, community transmission events were identified in three care homes (in both residents and carers), hostel accommodation, and several households. Transmission events involving health-care workers were identified in both hospital and community settings, such as a cluster including several paramedics. In hospital cluster 1, six surgical patients on ward A (an area that housed no known patients with COVID-19) were diagnosed with COVID-19 (figure 5 ). All cases had been on the ward for more than 7 days before their specimen collection date and were considered as probable hospital-acquired infections. Genomic data were available for all cases; five differed by zero SNPs and one by a single SNP, consistent with ward-based transmission events within a few days. The genomic evidence supported the infection control decision to close the ward to new admissions and enhance surveillance of all patients who had contact with this clinical area.
Figure 5

Epidemiological timeline of ward A cluster

Six patients on ward A were diagnosed with COVID-19. All had been admitted for more than 7 days before their specimen date and were considered likely to have hospital-acquired infections. The date of the first positive sample collection is shown with a red box and patient death date indicated with a solid black bar. Five of the viral genomes (patients A to E) had zero single nucleotide polymorphism differences between them, and one of the viral genomes (patient F) differed by one single nucleotide polymorphism.

Epidemiological timeline of ward A cluster Six patients on ward A were diagnosed with COVID-19. All had been admitted for more than 7 days before their specimen date and were considered likely to have hospital-acquired infections. The date of the first positive sample collection is shown with a red box and patient death date indicated with a solid black bar. Five of the viral genomes (patients A to E) had zero single nucleotide polymorphism differences between them, and one of the viral genomes (patient F) differed by one single nucleotide polymorphism. In hospital cluster 2, four transplant patients on ward B (an area that housed no known patients with COVID-19) were diagnosed with the virus between April 1 and April 20, 2020 (appendix p 21). A fifth patient, who had been discharged from the ward within the past few days, presented to the emergency department with COVID-19 symptoms. Genomic analysis revealed that all five cases had identical genomes. Three health-care workers were found to have identical genomes in the same cluster as the ward B cases; two had worked on ward B, one of whom had professional contact with the other health-care workers. These findings led to a review of infection control and PPE procedures for staff and patients in the transplant service. Patients on renal dialysis are among the most susceptible to COVID-19 (with up to 19% mortality). Most dialysis units have challenging infection control arrangements, consisting of large open rooms with no barriers between patients. In the dialysis unit cluster, six patients with end-stage renal failure were diagnosed with COVID-19 between April 1 and April 20, 2020, testing positive in several locations including the emergency department and an acute admissions ward (appendix p 21). Their viral genomes were identical, and epidemiological investigation revealed that they dialysed at the same outpatient unit on the same days of the week. This information suggests linked transmission of community-onset, health-care associated infections. These findings led to a review of infection control procedures in patients on dialysis and identified shared patient transportation and neighbouring dialysis chairs as risk factors for transmission. Genomics was also useful for ruling out linked transmission. The renal ward (which shares patients with the outpatient dialysis unit) also had a group of COVID-19 cases at around the same time. However, the dialysis unit genomes belonged to lineage B.2 (relatively rare in East of England), whereas the renal ward genomes were the common B.1 lineage, making it very unlikely that infections between the two patient groups were related. In the community cluster, 18 patients were admitted to CUH with COVID-19 between April 5 and April 21, 2020, with genetically identical viruses. Nine patients were residents at a community care home (care home A). A review of medical records revealed that another patient in this genetic cluster worked in care home A and one was a retired nurse who worked in an unknown care home. Three cases were paramedics and two were nurses (who worked in different wards at CUH but lived with paramedics). The final case did not have any discernible epidemiological links with the others. In summary, this investigation revealed a cluster of cases with evidence of linked transmission coming from either the same care home, the ambulance service, or shared accommodation. None of these associations had been detected by clinicians or infection control. The information from these combined epidemiological and genomic investigations was fed back to the clinical, infection control, and hospital management teams. This information triggered further investigations into patient isolation, ward cleaning procedures, use of PPE, and staff physical distancing behaviour. Health-care associated infections were also assessed in relation to potential harm caused to patients and recorded in the hospital's patient safety reporting system for follow-up and further action.

Discussion

The value of real-time viral genome sequencing has been shown in previous epidemics (eg, Ebola virus, measles, Zika virus, and influenza).21, 22, 23, 24, 25 We sought to embed genomic surveillance as part of an active SARS-CoV-2 infection control process in a large UK hospital. A rapid sequencing workflow was established on March 23, 2020, with multiplex PCR-based nanopore sequencing, which has been shown to be effective in a wide range of clinical samples and viral loads. We aimed to sequence all available positive samples from CUH and a selection from each of the East of England regional hospitals submitted to the diagnostic microbiology laboratory, linking with clinical metadata pulled from the hospital electronic patient records system. We also included 37 samples collected as part of a local health-care worker screening programme. In 5 weeks, more than 1000 SARS-CoV-2 genomes had been sequenced including most of the CUH samples from this phase of the epidemic. We applied this system to investigate nosocomial and health-care worker COVID-19 cases at CUH, integrating genomics with epidemiological and clinical data. We examined the diversity in SARS-CoV-2 at CUH and found that overall genetic diversity was low and reflected the pattern seen in the East of England as a whole, with most viruses belonging to the B.1 lineage. We identified a median of eight (range 0–24) SNP differences between viruses, and 4·5% of pairwise comparisons between CUH genomes had zero to one SNP differences. Given the virus' mutation rate and infectious timeframe, cases might share linked transmission within a few days or weeks if there are fewer than approximately two SNP differences. We applied a more stringent definition of genetic clusters with zero SNP differences, and despite detailed epidemiological data, we found no identifiable connection between 22% of pairwise comparisons within clusters. This finding reflects the low genetic diversity in circulating SARS-CoV-2 during the study, and emphasises the need for in-depth epidemiological analysis to unravel potential transmission networks. The ability of genomics to resolve transmission events might increase as the virus evolves and accumulates greater diversity. Genetics can be used more confidently to rule out transmission—eg, if viruses from two patients suspected as being linked belong to different lineages. We investigated groups of patients and health-care workers at CUH in response to queries from the clinical and infection control teams. Using ward location data for patients and health-care workers, we analysed epidemiological data to establish if there had been ward-based contact. We compared the genomes of patients and health-care workers in the suspected groups with those of other patients at CUH and in hospitals in the East of England to examine relatedness. This approach enabled us to add supporting evidence or to refute linked transmission between patients and health-care workers (eg, adding confidence to our assessment of whether an infection was hospital or community acquired). Ruling out transmission was useful as a mechanism to monitor and target infection control measures (eg, showing that viruses on the renal ward belonged to a distinct lineage from those in the outpatient dialysis unit). Genomic analysis also enabled us to identify cryptic transmission (ie, additional cases that were not initially suspected to be linked to the original ward or health-care worker clusters, including multiple instances of patients with identical viruses from the same care homes). These cases illustrate the power of combining rapid genomic and epidemiological analyses in near real time. By contrast with previous studies,27, 28, 29, 30 we reported results of our investigations to the clinical, infection control, and management teams on a weekly basis, thus enabling them to respond to this information and act accordingly within the timeframe of ongoing ward outbreaks. The genomic data informed reviews of patient placement and isolation procedures, assessment of PPE use, and staff break arrangements, supporting us to better focus efforts at a time of unprecedented demand on infection control teams. Finally, these analyses are being used to inform existing patient safety review processes within our hospital, including investigations related to hospital-onset COVID-19 in which the patient has come to harm. Our study highlights the importance of understanding SARS-CoV-2 transmission within health-care settings in managing the pandemic. The transmission networks that we identified were complex, involving patients and health-care workers in both hospital and community settings such as care homes, outpatient units, and ambulance services, which have been poorly studied. Of note, we have identified transmission events on nine wards that were considered green (ie, no known patients with COVID-19) at the start of each cluster. Although there were strong epidemiological and genomic associations between cases, the mechanism and direction of transmissions within these clusters are unclear. The role of asymptomatic intermediates, fomites (including PPE), and the environment are not well understood and require further investigation. During the timeframe of this study, several infection-control interventions were implemented across the hospital, as well as national public health measures to reduce community spread. Understanding the interaction between such interventions and nosocomial transmission are complex (especially in the context of the comparably long incubation period for SARS-CoV-2 relative to other respiratory viruses), but essential in enabling health-care providers to safely deliver existing services in the context of a pandemic. We acknowledge several limitations to our study. Firstly, we were unable to sequence all genomes from samples that were collected during the study period. We might therefore have missed the opportunity to investigate all potential transmission events. 166 (17%) of 1000 sequenced genomes did not pass our quality filtering. This finding reflects the stringent coverage threshold used (90%), the desire not to bias sample selection by sequencing only low Ct samples, and the nature of the diagnostic material used for sequencing. We only had main ward location data for health-care workers and could have overlooked potential epidemiological links with patients with whom they had contact on other wards or within shared communal areas. Due to its low genetic diversity, highly similar genomes could not be used definitively to infer meaningfully linked transmission events without supportive epidemiological data. However, our experience indicates that further investigation of genomic clusters with highly similar genomes can uncover previously unknown epidemiological links. Furthermore, we were able to use rapidly generated genomic data to investigate health-care associated infections within the hospital setting. Similar approaches could be applied in future studies to assess infections in health-care workers and community settings such as care homes. As the practical challenges associated with implementing real-time genome sequencing during epidemics are overcome, unlocking the real power of genomic epidemiology will require its integration with clinical and public health systems to support decision making on local, national, and international scales. This implementation might be of particular benefit in supporting the UK Government's test, track, and trace initiative, enabling a more targeted approach to disease control. This online publication has been corrected. The first corrected version appeared at thelancet.com/infection on January 22, 2021 and the second on February 11, 2021
  23 in total

1.  Whole-Genome Sequencing of Measles Virus Genotypes H1 and D8 During Outbreaks of Infection Following the 2010 Olympic Winter Games Reveals Viral Transmission Routes.

Authors:  Jennifer L Gardy; Monika Naus; Ashraf Amlani; Walter Chung; Hochan Kim; Malcolm Tan; Alberto Severini; Mel Krajden; David Puddicombe; Vanita Sahni; Althea S Hayden; Reka Gustafson; Bonnie Henry; Patrick Tang
Journal:  J Infect Dis       Date:  2015-07-06       Impact factor: 5.226

2.  Use of Whole-Genome Sequencing of Adenovirus in Immunocompromised Pediatric Patients to Identify Nosocomial Transmission and Mixed-Genotype Infection.

Authors:  Charlotte J Houldcroft; Sunando Roy; Sofia Morfopoulou; Ben K Margetts; Daniel P Depledge; Juliana Cudini; Divya Shah; Julianne R Brown; Erika Yara Romero; Rachel Williams; Elaine Cloutman-Green; Kanchan Rao; Joseph F Standing; John C Hartley; Judith Breuer
Journal:  J Infect Dis       Date:  2018-09-08       Impact factor: 5.226

3.  Use of Genome Sequencing to Define Institutional Influenza Outbreaks, Toronto, Ontario, Canada, 2014-15.

Authors:  Derek R MacFadden; Allison McGeer; Taryn Athey; Stephen Perusini; Romy Olsha; Aimin Li; AliReza Eshaghi; Jonathan B Gubbay; William P Hanage
Journal:  Emerg Infect Dis       Date:  2018-03       Impact factor: 6.883

4.  Rapid outbreak sequencing of Ebola virus in Sierra Leone identifies transmission chains linked to sporadic cases.

Authors:  Armando Arias; Simon J Watson; Danny Asogun; Ekaete Alice Tobin; Jia Lu; My V T Phan; Umaru Jah; Raoul Emeric Guetiya Wadoum; Luke Meredith; Lucy Thorne; Sarah Caddy; Alimamy Tarawalie; Pinky Langat; Gytis Dudas; Nuno R Faria; Simon Dellicour; Abdul Kamara; Brima Kargbo; Brima Osaio Kamara; Sahr Gevao; Daniel Cooper; Matthew Newport; Peter Horby; Jake Dunning; Foday Sahr; Tim Brooks; Andrew J H Simpson; Elisabetta Groppelli; Guoying Liu; Nisha Mulakken; Kate Rhodes; James Akpablie; Zabulon Yoti; Margaret Lamunu; Esther Vitto; Patrick Otim; Collins Owilli; Isaac Boateng; Lawrence Okoror; Emmanuel Omomoh; Jennifer Oyakhilome; Racheal Omiunu; Ighodalo Yemisis; Donatus Adomeh; Solomon Ehikhiametalor; Patience Akhilomen; Chris Aire; Andreas Kurth; Nicola Cook; Jan Baumann; Martin Gabriel; Roman Wölfel; Antonino Di Caro; Miles W Carroll; Stephan Günther; John Redd; Dhamari Naidoo; Oliver G Pybus; Andrew Rambaut; Paul Kellam; Ian Goodfellow; Matthew Cotten
Journal:  Virus Evol       Date:  2016-06-22

5.  Microreact: visualizing and sharing data for genomic epidemiology and phylogeography.

Authors:  Silvia Argimón; Khalil Abudahab; Richard J E Goater; Artemij Fedosejev; Jyothish Bhai; Corinna Glasner; Edward J Feil; Matthew T G Holden; Corin A Yeats; Hajo Grundmann; Brian G Spratt; David M Aanensen
Journal:  Microb Genom       Date:  2016-11-30

6.  A pneumonia outbreak associated with a new coronavirus of probable bat origin.

Authors:  Peng Zhou; Xing-Lou Yang; Xian-Guang Wang; Ben Hu; Lei Zhang; Wei Zhang; Hao-Rui Si; Yan Zhu; Bei Li; Chao-Lin Huang; Hui-Dong Chen; Jing Chen; Yun Luo; Hua Guo; Ren-Di Jiang; Mei-Qin Liu; Ying Chen; Xu-Rui Shen; Xi Wang; Xiao-Shuang Zheng; Kai Zhao; Quan-Jiao Chen; Fei Deng; Lin-Lin Liu; Bing Yan; Fa-Xian Zhan; Yan-Yi Wang; Geng-Fu Xiao; Zheng-Li Shi
Journal:  Nature       Date:  2020-02-03       Impact factor: 69.504

Review 7.  Clinical and biological insights from viral genome sequencing.

Authors:  Charlotte J Houldcroft; Mathew A Beale; Judith Breuer
Journal:  Nat Rev Microbiol       Date:  2017-01-16       Impact factor: 60.633

8.  Screening of healthcare workers for SARS-CoV-2 highlights the role of asymptomatic carriage in COVID-19 transmission.

Authors:  Lucy Rivett; Sushmita Sridhar; Dominic Sparkes; Matthew Routledge; Nick K Jones; Ian G Goodfellow; Stephen Baker; Michael P Weekes; Sally Forrest; Jamie Young; Joana Pereira-Dias; William L Hamilton; Mark Ferris; M Estee Torok; Luke Meredith; Martin D Curran; Stewart Fuller; Afzal Chaudhry; Ashley Shaw; Richard J Samworth; John R Bradley; Gordon Dougan; Kenneth Gc Smith; Paul J Lehner; Nicholas J Matheson; Giles Wright
Journal:  Elife       Date:  2020-05-11       Impact factor: 8.140

9.  Temporal signal and the phylodynamic threshold of SARS-CoV-2.

Authors:  Sebastian Duchene; Leo Featherstone; Melina Haritopoulou-Sinanidou; Andrew Rambaut; Philippe Lemey; Guy Baele
Journal:  Virus Evol       Date:  2020-08-19

10.  The proximal origin of SARS-CoV-2.

Authors:  Kristian G Andersen; Andrew Rambaut; W Ian Lipkin; Edward C Holmes; Robert F Garry
Journal:  Nat Med       Date:  2020-04       Impact factor: 87.241

View more
  128 in total

1.  Patterns of Volatility Across the Spike Protein Accurately Predict the Emergence of Mutations within SARS-CoV-2 Lineages.

Authors:  Roberth A Rojas Chávez; Mohammad Fili; Changze Han; Syed A Rahman; Isaiah G L Bicar; Guiping Hu; Jishnu Das; Grant D Brown; Hillel Haim
Journal:  bioRxiv       Date:  2022-02-02

2.  Patterns of within-host genetic diversity in SARS-CoV-2.

Authors:  Gerry Tonkin-Hill; Inigo Martincorena; Roberto Amato; Andrew R J Lawson; Moritz Gerstung; Ian Johnston; David K Jackson; Naomi Park; Stefanie V Lensing; Michael A Quail; Sónia Gonçalves; Cristina Ariani; Michael Spencer Chapman; William L Hamilton; Luke W Meredith; Grant Hall; Aminu S Jahun; Yasmin Chaudhry; Myra Hosmillo; Malte L Pinckert; Iliana Georgana; Anna Yakovleva; Laura G Caller; Sarah L Caddy; Theresa Feltwell; Fahad A Khokhar; Charlotte J Houldcroft; Martin D Curran; Surendra Parmar; Alex Alderton; Rachel Nelson; Ewan M Harrison; John Sillitoe; Stephen D Bentley; Jeffrey C Barrett; M Estee Torok; Ian G Goodfellow; Cordelia Langford; Dominic Kwiatkowski
Journal:  Elife       Date:  2021-08-13       Impact factor: 8.140

3.  Genomic epidemiology of COVID-19 in care homes in the east of England.

Authors:  William L Hamilton; Gerry Tonkin-Hill; Emily R Smith; Dinesh Aggarwal; Charlotte J Houldcroft; Ben Warne; Luke W Meredith; Myra Hosmillo; Aminu S Jahun; Martin D Curran; Surendra Parmar; Laura G Caller; Sarah L Caddy; Fahad A Khokhar; Anna Yakovleva; Grant Hall; Theresa Feltwell; Malte L Pinckert; Iliana Georgana; Yasmin Chaudhry; Colin S Brown; Sonia Gonçalves; Roberto Amato; Ewan M Harrison; Nicholas M Brown; Mathew A Beale; Michael Spencer Chapman; David K Jackson; Ian Johnston; Alex Alderton; John Sillitoe; Cordelia Langford; Gordon Dougan; Sharon J Peacock; Dominic P Kwiatowski; Ian G Goodfellow; M Estee Torok
Journal:  Elife       Date:  2021-03-02       Impact factor: 8.140

Review 4.  Tools and Techniques for Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2)/COVID-19 Detection.

Authors:  Seyed Hamid Safiabadi Tali; Jason J LeBlanc; Zubi Sadiq; Oyejide Damilola Oyewunmi; Carolina Camargo; Bahareh Nikpour; Narges Armanfard; Selena M Sagan; Sana Jahanshahi-Anbuhi
Journal:  Clin Microbiol Rev       Date:  2021-05-12       Impact factor: 26.132

5.  Initial Insights Into the Genetic Epidemiology of SARS-CoV-2 Isolates From Kerala Suggest Local Spread From Limited Introductions.

Authors:  Chandni Radhakrishnan; Mohit Kumar Divakar; Abhinav Jain; Prasanth Viswanathan; Rahul C Bhoyar; Bani Jolly; Mohamed Imran; Disha Sharma; Mercy Rophina; Gyan Ranjan; Paras Sehgal; Beena Philomina Jose; Rajendran Vadukkoot Raman; Thulaseedharan Nallaveettil Kesavan; Kalpana George; Sheela Mathew; Jayesh Kumar Poovullathil; Sajeeth Kumar Keeriyatt Govindan; Priyanka Raveendranadhan Nair; Shameer Vadekkandiyil; Vineeth Gladson; Midhun Mohan; Fairoz Cheriyalingal Parambath; Mohit Mangla; Afra Shamnath; Sridhar Sivasubbu; Vinod Scaria
Journal:  Front Genet       Date:  2021-03-17       Impact factor: 4.599

6.  Applying prospective genomic surveillance to support investigation of hospital-onset COVID-19.

Authors:  William L Hamilton; Tom Fieldman; Aminu Jahun; Ben Warne; Christopher J R Illingworth; Chris Jackson; Beth Blane; Elinor Moore; Michael P Weekes; Sharon J Peacock; Daniela De Angelis; Ian Goodfellow; Theodore Gouliouris; M Estée Török
Journal:  Lancet Infect Dis       Date:  2021-05-10       Impact factor: 25.071

7.  The impact of testing and infection prevention and control strategies on within-hospital transmission dynamics of COVID-19 in English hospitals.

Authors:  Stephanie Evans; Emily Agnew; Emilia Vynnycky; James Stimson; Alex Bhattacharya; Christopher Rooney; Ben Warne; Julie Robotham
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2021-05-31       Impact factor: 6.237

Review 8.  SARS-CoV-2 variants, spike mutations and immune escape.

Authors:  William T Harvey; Alessandro M Carabelli; Ben Jackson; Ravindra K Gupta; Emma C Thomson; Ewan M Harrison; Catherine Ludden; Richard Reeve; Andrew Rambaut; Sharon J Peacock; David L Robertson
Journal:  Nat Rev Microbiol       Date:  2021-06-01       Impact factor: 78.297

9.  Prediction and Evolution of the Molecular Fitness of SARS-CoV-2 Variants: Introducing SpikePro.

Authors:  Fabrizio Pucci; Marianne Rooman
Journal:  Viruses       Date:  2021-05-18       Impact factor: 5.048

10.  Using Whole Genome Sequences to Investigate Adenovirus Outbreaks in a Hematopoietic Stem Cell Transplant Unit.

Authors:  Chloe E Myers; Charlotte J Houldcroft; Sunando Roy; Ben K Margetts; Timothy Best; Cristina Venturini; Jose A Guerra-Assunção; Charlotte A Williams; Rachel Williams; Helen Dunn; John C Hartley; Kanchan Rao; Kathryn J Rolfe; Judith Breuer
Journal:  Front Microbiol       Date:  2021-07-02       Impact factor: 5.640

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.