Literature DB >> 35958084

Alternative COVID-19 mitigation measures in school classrooms: analysis using an agent-based model of SARS-CoV-2 transmission.

M J Woodhouse¹, W P Aspinall^1,2, R S J Sparks¹, E Brooks-Pollock³, C Relton⁴.

Abstract

The SARS-CoV-2 epidemic has impacted children's education, with schools required to implement infection control measures that have led to periods of absence and classroom closures. We developed an agent-based epidemiological model of SARS-CoV-2 transmission in a school classroom that allows us to quantify projected infection patterns within primary school classrooms, and related uncertainties. Our approach is based on a contact model constructed using random networks, informed by structured expert judgement. The effectiveness of mitigation strategies in suppressing infection outbreaks and limiting pupil absence are considered. COVID-19 infections in primary schools in England in autumn 2020 were re-examined and the model was then used to estimate infection levels in autumn 2021, as the Delta variant was emerging and it was thought likely that school transmission would play a major role in an incipient new wave of the epidemic. Our results were in good agreement with available data. These findings indicate that testing-based surveillance is more effective than bubble quarantine, both for reducing transmission and avoiding pupil absence, even accounting for insensitivity of self-administered tests. Bubble quarantine entails large numbers of absences, with only modest impact on classroom infections. However, maintaining reduced contact rates within the classroom can have a major benefit for managing COVID-19 in school settings.

Entities: Chemical

Keywords: COVID-19 in schools; SARS-CoV-2; agent-based epidemiological model; random networks; structured expert judgement

Year: 2022 PMID： 35958084 PMCID： PMC9363991 DOI： 10.1098/rsos.211985

Source DB: PubMed Journal: R Soc Open Sci ISSN： 2054-5703 Impact factor: 3.653

Introduction

As an increasing proportion of people become vaccinated, the spread of SARS-CoV-2 becomes concentrated mainly with unvaccinated persons. Children become of particular importance in these circumstances since the benefits, to the children themselves, of vaccination are moot. While, at the time of writing, the UK government has a vaccination programme for secondary age children, primary age children have not been mass vaccinated. Factors that influence these discussions include the observation that severe COVID-19 illness is very rare in children, while there are very small risks of adverse reactions to the vaccine. Thus, the importance of vaccination, from the point of view of an individuals' protection against serious disease, is equivocal. On the other hand, schools with largely unvaccinated populations may act as environments for spreading SARS-CoV-2, with the potential for the development of new variants. Thus, from a public health perspective, schools are, potentially, a significant reservoir of infection. Measures to mitigate transmission can be very disruptive to learning and to the economy if large numbers of pupils and their families are required to self-isolate or have their work compromised by childcare responsibilities. In the latter context, there is an urgent need to understand which policies can be pursued that reduce transmission but at the same time minimize disruption of education and collateral effects. In this study, we have developed a basic stochastic model for quantifying the risk of transmission of SARS-CoV-2 occurring in primary schools in England. Evidence was emerging that respiratory aerosols expelled by infected people are a significant mode of infection transmission [1], so indoor classrooms, which may be poorly ventilated, can represent environments with increased transmission risk. However, concentrations of virions in aerosols are substantially elevated near to an infected person, and close contacts between infected and susceptible individuals are likely to be responsible for most transmissions in many situations (see the electronic supplementary material, appendix A1). Our model is based on a discrete agent-based epidemiological model of transmission in a primary school classroom, and involves an empirical representation of the probability of transmission owing to close contacts within class-sized interactive networks. With this approach, sets of several classrooms collectively can be assessed as a way of quantifying transmission probabilities in individual schools. Statistical representation of close contacts between children and adults within school settings is derived from a study of 36 primary schools in England using teachers’ expert judgements, formally elicited in spring/summer 2020 [2]; in that study, close contact was defined to be a face-to-face contact within 1 m for at least 5 min. It also included a comparison of anticipated contact rates in schools taking mitigation measures in the following September 2020, such as formation of bubbles, reduced class sizes and other social distancing rules, with previous, normal contact rates in ‘pre-COVID’ times. The model results are compared with data on school attendances, self-isolation of children who are sent home because of COVID outbreaks and prevalence of COVID in both schools and communities. These comparisons enable us to check that the model produces reasonable estimations of expected infection rates and absences. However, the main purpose of our modelling approach is to compare the effects of different mitigation strategies on infection transmission rates within schools, including sending ‘bubbles’ or whole classes home for self-isolation periods, sending home only those pupils or adults thought to have become infected, or using testing to identify infections. At the centre of our approach is a time-stepping epidemiological model of SARS-CoV-2 infection transmission in a classroom. The model we have developed combines a discrete agent-based epidemiological framework with a random network for daily person-to-person contacts within the classroom. In the basis case, the classroom population consists of a single teacher, a set number of pupils, and a small number of classroom teaching assistants. In relation to infection transmissions, the modelled classroom population is assumed to be isolated from other classes and persons within the school, so the present model does not include interactions with other people in the school. This is a major assumption and is best suited to primary school settings where classroom groups can be most effectively separated from one another as a part of infection risk management; this was a mitigation instituted in March 2020 for primary schools in England that were open for vulnerable children and those of critical workers and, later, as other schools in England reopened to selected age groups in June 2020. The simplifying assumption is also justified by evidence for limited mixing across different classes and year groups in UK primary schools [3]. As the model advances through time, it includes a daily chance of seeding infection within one or more of the classroom occupants from interactions with the outside community; this infection probability is a function of community incidence rate. We describe this aspect of our model in more detail below. We then applied our model to the autumn term (term 1) in 2020 when schools in England first reopened to full classes. Using estimates of SARS-CoV-2 infection prevalence and the incidence rate during term 1 2020, we illustrate the application of our model and show that it can simulate numbers of infections that are in good agreement with available data. We use the model over this period with different mitigation measures applied and analyse the effect of these strategies on the transmission of SARS-CoV-2 within schools. We then applied our model to estimate infection in schools for term 1 in 2021 using projections of community prevalence, presuming schools play an important role in an ensuing third wave of the SARS-CoV-2 epidemic, particularly in the face of the Delta variant and exclusion of children from vaccination programmes.

Methods

Epidemiological model

We model SARS-CoV-2 transmission within a classroom of individuals using an agent-based network model, with each member of the classroom population having their own characteristics and attributes updating on interactions and over time. We adopt simplified disease states of persons in school, based on a compartmental epidemiological model. To accommodate transmission from pre-symptomatic and asymptomatic individuals, the progression of infection is described using a variation of a stochastic compartmental epidemiological model, which includes explicit time variation in the transmissibility of SARS-CoV-2 based on time since infection. Specifically, we use the following disease states: susceptible (S), exposed (E, which includes infectious individuals), unwell (U, where an individual has symptoms), quarantined (Q), and recovered (R). We adopt a discrete-time stochastic modelling framework, with updates to the disease states occurring each day. Further details of the agent-based model are given in the electronic supplementary material, appendix A2 and the model is available from [4] with supporting data from [5]. The S population are those who have not become infected with the SARS-CoV-2 virus. On becoming infected, an individual moves to the E state and begins a period of incubation during which there is no external indication of infection, but where there is a possibility of infection transmission in contacts. For symptomatic individuals, the incubation period ends with the individual experiencing symptoms and moving into the U state. At this stage, we assume that symptoms of infection would be recognized by the infected individual or observed by others and, if such symptoms are detected or confirmed, usually by testing, then a U individual will be Q. Individuals remain quarantined for a minimum quarantine period, and for as long as they are unwell. However, others may be asymptomatic or have symptoms so mild as to be unobserved, so these persons are not migrated to the U state in the model. The R state collects infected individuals following their period of being U, or for the duration of the infectious period for asymptomatic cases, at which stage they are assumed to have acquired immunity from further SARS-CoV-2 infection for the simulation period. Using evidence derived from large datasets on the timing of secondary infections with the onset of symptoms of primary cases [6], we model infectivity through time-dependent transmission probability of an index case with a contact. The transmission probability increases during the incubation period from zero at the time of infection, peaks slightly before the time of symptom onset (for symptomatic cases), and subsequently decreases over time [6]. In [6], it is inferred from data that there is a distribution of the duration of incubation period of SARS-CoV-2 and that the duration of the pre-symptomatic infectious period is related to the duration of the incubation period, with individuals who have long incubation periods tending to have an earlier and longer lasting pre-symptomatic infectious period. Future variants of the virus could alter these distributions. These time-dependencies are encoded in an expression for the relative probability of transmission in a contact within the class that is a function of time (t, in days) since symptom onset [6], denoted here by , which depends on the incubation period of the infected individual (t, in days). For the distribution of the incubation period, we use a log-Normal distribution, [7]. For the relative probability of transmission, we adopt the parametric model determined from data by [6]:where the parameters days, σ = 1.85 days, α = 5.85 are best-fit parameters as estimated by [6] and τ = 5.42 days is the mean incubation period [7]. Note that this function is not a probability density function, rather it is applied as a time-varying weighting to the transmission probability in a contact, and the pre-factor ( is specified so that . The time variation of the relative transmission probability is illustrated in figure 1 for an unusually short (t = 2, ), a typical () and an unusually long (t = 12, ) incubation period.

Figure 1

Time variation of the relative probability of transmission from an infected individual with incubation period t. Probability of transmission increases from the time of infection, peaks close to the time of onset of symptoms, and then decreases over the duration of infectivity. Individuals with long incubation periods have longer pre-symptomatic infection periods, but following symptom onset the decrease in transmission probability is independent of t. The absolute probability of transmission in a contact is found by scaling the relative probability of transmission by a fixed peak transmission probability, denoted by pmax, which differs for adults and children [8-10], for symptomatic or asymptomatic cases [11], reflects mitigation measures, and is altered to model differences in transmissibility for SARS-CoV-2 variants. The classroom population can comprise both symptomatic and asymptomatic individuals, distinguished as these who will (respectively, will not) develop identifiable symptoms if they are infected with SARS-CoV-2. The specification of symptomatic and asymptomatic individuals occurs on initialization of the model by random Bernoulli draws with a prescribed probability. For asymptomatic cases, we adopt the same time-varying relative probability of transmission by specifying a notional incubation time (drawn from the log-Normal distribution), but the absolute probability of transmission in a contact is reduced relative to symptomatic cases by 30% (table 1).

Table 1

Stochastic and fixed parameters of the model. (CDF, cumulative distribution function; SEJ, Structured Expert Judgement.)

parameter	distribution			update
initial infection in each class member	Bernoulli with community prevalence of infection			set on initialization for class members using estimates of community prevalence
initial infection in each class member				updates on change of teacher using community prevalence time series
probability of infection from community	Bernoulli with community incidence rate			daily, using time series of community incidence rate from observations or large-scale forecast models
number of daily contacts for each class member	non-parametric CDFs derived from SEJ for pupils and classroom staff			set on initialization for pupils
	CDFs for pre-COVID and ‘new normal’ reduced contacts			updates on change of teacher
	CDFs for pre-COVID and ‘new normal’ reduced contacts			contact network altered daily
symptomatic status	Bernoulli with specified probabilities for symptomatic adults and children. Default value of 0.8 for both			set on initialization
symptomatic status				updates on change of teacher
duration of incubation period, t_i (note that asymptomatic cases have a notional t_i)	drawn from log-Normal distribution [7]; ti∼log N(1.63,0.5)			set on initialization
duration of unwell period, t_u (note that asymptomatic cases have a notional t_u)	drawn from log-Normal distributions for children and adults [12,13]			set on initialization
	adults: tu∼log N(2.40,0.840)			updates on change of teacher
	children: tu∼log N(1.61,0.923)			updates on change of teacher
probability of contact transmission (base values for original wild-type SARS-CoV-2), p_max		adults	children	fixed values
	symptomatic	0.035	0.0175	50% reduction for children relative to adults [8,9]
	asymptomatic	0.0245	0.01225	30% reduction for asymptomatic relative to symptomatic infection
soft mitigation factor, f_soft	pupils: 0.75			fixed values
soft mitigation factor, f_soft	adults: 0.6			fixed values
rapid testing	values from [14]			fixed values
	sensitivity: 0.58
	specificity: 0.9968
PCR testing	values from [15]			fixed values
	sensitivity: 0.90
	specificity: 0.999
weighting of pre-COVID contact rates to small classes contact rates, w		pupils	classroom staff	fixed values
	pre-COVID	1	1
	reduced contacts	0.5	0.4
	small classes	0	0
probability of vaccinated adult	term 1 2020: 0			fixed values
probability of vaccinated adult	term 1 2021: 95% [16]			fixed values
probability of vaccinated child	term 1 2020: 0			fixed values
probability of vaccinated child	term 1 2021: 1% [16]			fixed values
vaccine transmission factor, fvaccine transmision	0.60 [17–19]			fixed values
vaccine susceptibility factor, fvaccine susceptibility	0.40 [17–19]			fixed values
vaccine symptomatic factor	0.20 [17–19]			fixed values

Stochastic and fixed parameters of the model. (CDF, cumulative distribution function; SEJ, Structured Expert Judgement.) Symptomatic individuals become unwell following the incubation period. The duration of symptoms varies for adults and children [12,13] with adults typically having longer-lasting symptoms (median duration of 11 days) compared to children (median duration of 5 days for children aged 5–11 years [13]). However, there is pronounced variation around the median duration which we model using a log-Normal distribution. This is a two-parameter distribution, specified by using the reported proportion of people having symptoms extending beyond 28 days in [12,13] (13.3% for adults and 3.1% of primary school aged children). An additional ingredient of the population dynamics is that quarantined teachers must be replaced by a substitute teacher, drawn from outside the initial class group. Initialization of the model places the classroom members into the SEUQR states. A Bernoulli trial is performed for each individual using a community infection prevalence, specified from observation or national-scale model-based projections, with ‘failure’ resulting in the individual placed into the S state. A ‘success’ in the Bernoulli trial corresponds to an individual with infection on initialization, so a random draw is made to determine their elapsed time of infection and they are placed into the appropriate E, U or Q states on this basis. We note that initialization may result in classrooms without an infected individual, particularly when community prevalence is low. Additionally, we can model pre-existing immunity, either through previous infection or vaccination, with Bernoulli trials that adopt the probability of an individual being vaccinated or previously infected.

Transmission into the classroom

In our model, there are two pathways for susceptible members of the classroom to become exposed: contact with an infectious member of the classroom, or through interactions outside the classroom (referred to here as ‘community transmission’). To model the infection from an out-of-classroom interaction, we begin each daily time step with a Bernoulli trial for each susceptible classroom member. The probability of any susceptible classroom individual being exposed to community transmission is specified using an estimate of the daily incidence rate of new infections at the appropriate time. This approach overestimates community transmission as the individual is in school for much of the day, and classroom transmissions will contribute to the community incidence rate estimates. However, without detailed estimates of incidence rates in different settings, it is difficult to deconvolve these effects, and our approach is a pragmatic comprise, recognizing that time in school represents approximately 26% of waking time in any school week, and our model does not explicitly include effects that might promote incidence of infection connected to schools (such as increased social mixing at the start and end of the school day).

Classroom transmission network

Transmission of infection within the classroom requires a more sophisticated model. Potential pathways for SARS-CoV-2 transmission include physical contact, infection by virus contained in large respiratory droplets, contagiously from contaminated surfaces (fomites), and through inhalation of infectious aerosols (airborne transmission) that could act over long distances [20]. For all of these routes, close contacts are likely to substantially increase the probably of infection transmission (see the electronic supplementary material, appendix A1 for a discussion of airborne transmission). We, therefore, base our transmission model on close contacts between individuals, using a random network model of contacts within a classroom. The number of contacts for each individual is specified from stochastic draws from the distribution of daily classroom contacts [2]. Thus, each class member has their own number of daily contacts with others in the classroom. It is known that contact patterns in classrooms can be highly variable, affected by both individual and classroom behaviours [2]. Additionally, behavioural mitigation measures were instituted in March 2020 to reduce contacts within schools. To model the number of contacts, we use the results of a Structured Expert Judgement (SEJ) elicitation study [2] conducted in spring/summer 2020 in the UK. SEJ uses the collective knowledge of experts (here school teachers) to estimate quantities of interest and their uncertainties, and can be performed rapidly during crises to provide essential information to models and policy makers [21]. Experts' responses to a set of test questions with knowable results are used to score experts in terms of their individual statistical performance and their target item judgements are then combined jointly, using these performance scores, to derive a pooled ‘Decision Maker’ [21]; the latter represents associated groupwise judgement uncertainties through empirical cumulative distribution functions (eCDFs) [22]. In our model, eCDFs for the number of contacts in classrooms are those derived from pooled expert responses in an elicitation for primary school pupils and classroom staff [2]. That study determined different contact rates in: (i) the pre-COVID classroom, and then (ii) following implementation of bubbles and other measures to reduce the number of contacts [2]. The corresponding eCDFs are illustrated in figure 2a, showing substantial reduction in the number of contacts once mitigation measures are imposed (data available from [5]). The mean number of contacts for pupils is reduced from 16.7 per day (s.d. 16) in pre-COVID classrooms, to 6.4 per day (s.d. 5.4) when there are small numbers in the class. We note that the mean daily pupils' contact rate is similar to the value estimated by the POLYMOD social contact survey [23] (mean of 18.2 contacts per day, s.d. 12.2 for children aged 10–14 years).

Figure 2

(a) Empirical cumulative distribution functions (eCDFs) for the number of daily contacts in primary school classrooms. The eCDF derived from Structured Expert Judgement for pupils (blue) and classroom staff (teachers and teaching assistants; red) are shown for pre-COVID times (dashed lines) and following opening of schools to small numbers of children in June–July 2020 (dotted lines). The cumulative distribution function constructed from a weighted combination of the pre-COVID and small classes eCDFs are shown (solid lines). (b) Box and whisker plots (with outliers removed) comparing the number of daily contacts for pupils and classroom staff in pre-COVID times, for small classes, and modelled for full classes under conditions to reduce contacts. The contact data estimated by SEJ in [2] were obtained under the unique circumstances of June and July 2020 when between 30% and 40% of children had returned to school and the class sizes were greatly reduced, making social distancing measures much easier to implement. However, with full return to school we would not expect the same reduction in contact rates. In the study [2], teachers were asked to estimate whether the contact numbers would be affected. Their collective view (table 7 in ref. [2]) is that the contact numbers would be between their estimates in normal pre-COVID times and in new normal times. To model the reduction in the number of contacts in fully opened schools, we construct a cumulative distribution function (denoted by ) as a weighted linear combination of the eCDF in pre-COVID classrooms in normal times (denoted by ) and the eCDF in the small classrooms of June–July 2020 (denoted by ), with , where w is the weighting of the pre-COVID contact distribution. Contact rate distributions are constructed in this way for both pupils and teachers, with differing weights. In [2], teachers expected that daily pupil contacts would be halfway between the small classroom and normal pre-COVID values when classrooms become more highly occupied, and we therefore take a weight w = 0.5. However, teachers considered that their own numbers of daily contacts could be more strongly reduced in the fully open schools, so we take w = 0.4 to place greater weight on the small classroom distribution. With these weightings, the mean number of contacts per day is 11.6 (s.d. 10.9) for pupils and 15.1 (s.d. 15.9) for teachers. These summary values are similar to the contact rates for children estimated using surveys over term 1 (September–December) of 2020 when schools fully reopened (mean number of 15.1 contacts per day) [24], albeit with different definitions of a contact (in [2] face-to-face contact within 1 m for 5 min or more is used, whereas [24, p. 6] define direct contact as an interaction where ‘at least a few words were exchanged or physical contact was made’). At the start of each school day in the modelled sequence, we build a stochastic network of contacts for each class member who is not quarantined. The network is constructed using the ‘configuration model’ approach [25] , allowing multiple edges but removing self-loops through a sequence of random ‘rewirings’ (i.e. self-loop edges are randomly switched until a valid configuration is achieved). Preserving multiple edges allows us to model repeated contacts between individuals. The configuration model produces a random graph with a specified degree distribution in which nodes of the network are linked randomly using specified numbers of edges. This means that gregarious individuals with high numbers of contacts are more likely to be connected to other highly contacting individuals. However, the contact network is rebuilt each day, to reflect changing contact patterns and the removal of class members into quarantine. This is a subjective choice in our model which results in perturbations to the contact network. Since the configuration model is likely to connect high degree nodes, the daily reconstruction of the network maintains high numbers of contacts between gregarious individuals. Further discussion of this modelling choice and a brief analysis of the sensitivity of our results to the rebuilding rate of the network is provided in the electronic supplementary material, appendix A3. The configuration model requires an even number of connections; in cases where an odd number of edges is generated by the stochastic initialization of the degree distribution or from removal of nodes to quarantine, then the degree of a randomly selected node is increased by one. On non-school days, there are no classroom contacts, but community transmission can occur, and the agents continue to progress through the SEUQR states. We note that, in reality, contacts in a class are unlikely to be completely random. Younger children, for example, tend to form particular friendship groups which may mean that poorly connected networks may exist. While the configuration model with degrees specified by the contact rates distributions results in small groups with large numbers of interactions, the networks are typically connected. The configuration model does not exclude disconnected subgraphs, but we find they are rare occurrences in our simulations. Addressing the detailed structure of classroom contact networks would make an interesting development of the model. Figure 3 shows examples of the contact networks, illustrated using the ‘adjacency matrix’ (i.e. the number of edges linking each pair of nodes, corresponding to the number of daily contacts between two individuals in the classroom). An example of a randomly generated network is shown for both contact rates drawn from the pre-COVID and reduced contact distributions. For pre-COVID contacts (figure 3a), the most gregarious member of the class has 85 daily contacts, and there are four individuals with more than 50 contacts per day (pupils numbered 0, 6, 21 and 30). These highly contacting individuals have many interactions within this subgroup, with, e.g. pupils 21 and 30 having 11 contacts on this day. When reduced contact rates are employed (figure 3b), the maximum number of contacts decreases to 48, and only a single individual has more than 23 contacts in the day, and the highest number of contacts between pairs of individuals, in this case between pupils numbered 22 and 28, occurs only four times on this day. There are no disconnected subgraphs in either of the examples shown.

Figure 3

Examples of adjacency matrices (illustrated with blue shades) for stochastic contact networks produced using the configuration model with the total number of contacts for each agent (the degrees of each node, shown in red shades on the leading diagonal) drawn from (a) pre-COVID contact rate distributions and (b) the reduced contacts rate distributions. The 31 ‘agents’ in the classroom are numbered 0–30, with agent 30 corresponding to the teacher. Note the same colour scales are applied for both pre-COVID and reduced contact cases. If contact occurs between an infectious individual with another who is susceptible, there is the possibility of transmission. This is modelled as a random Bernoulli trial, with differing probabilities of transmission depending on whether the infectious individual is symptomatic (higher transmission probability) or asymptomatic (lower transmission probability), and with transmission probability varying in time. In a contact between an S person in the classroom with an infectious person, we determine the infection transmission probability aswhere and are factors (discussed further below) that reduce transmission probability owing to application of ‘soft’ mitigation measures and effect of vaccination on infectivity and susceptibility. Note that this approach combines the infectiousness of the index case with the susceptibility of their contact into a single probability of transmission. This is a simplifying assumption in the model, but includes individual susceptibility to SARS-CoV-2 for the population in the model. The SEUQR states are updated at the end of daily timesteps, so transmission can occur before recovery. An additional requirement is that the classroom must have a teacher present. If the teacher is quarantined owing to illness or a positive test result, a temporary replacement is added to the classroom population. The substitute teacher has their own number of daily contacts, drawn from the SEJ distribution, and their stage in the SEUQR progression is determined by Bernoulli trial with the infection prevalence at the time of replacement, noting that a valid substitute teacher cannot be showing symptoms (i.e. U) nor in Q, but could bring infection into the classroom population. If the permanent teacher is released from Q, or their time of illness elapses (i.e. they progress to R), then they are returned to the classroom and the substitute teacher is removed. Should the temporary teacher become U or Q, they are replaced by a different, substitute teacher. Examples of three random contact networks produced using the configuration model are illustrated in figure 4 for a small classroom of 10 pupils (enumerated from 1 to 10) and a single teacher (labelled as ‘T’), simulated over 3 days. The degree distribution is fixed; pupils 1 and 2 are the most gregarious with five contacts each day, followed by pupil 5 with five contacts on days 1 and 3 but four contacts on day 2. There are duplicated contacts (i.e. multiple edges) between pupils in each network, which present two opportunities for transmission between persons if one of the individuals is infectious and the other susceptible (which occurs in figure 4c between pupils 1 and 2).

Figure 4

An example of infection transmission within a random updating contact network in a small classroom of 10 pupils and one teacher. The panels represent a progression over 3 days (a to b to c). The degree distribution is fixed, and the edges are determined using the configuration model with self-loops removed by random rewiring. Multiple edges are allowed, as illustrated in (a) where, for example, there are two edges between nodes 1 and 10. Green nodes indicate S class members, orange nodes indicate E members and red nodes indicate U individuals. Nodes that are boldly edged (node 5 in c) are Q. Red coloured edges indicate infection transmission between infectious and susceptible members. Note the parameters for infection transmission in this example are not representative of SARS-CoV-2. Figure 4 also illustrates the transmission of infection across the network, with a daily progression from panels (a) to (b) to (c). Note that the parameters related to infection transmission in the example in figure 4 are taken to be much larger than reasonable to model SARS-CoV-2 and have been chosen simply to illustrate the way the model updates. On the first day (figure 4a), there are two exposed individuals (pupils 1 and 5), and their contacts represent potential pathways for infection, with actual transmission determined by Bernoulli trials. In this example, the random trials result in no transmissions. However, on the second day (figure 4b), the two exposed pupils do transmit the infection across some of their contacts; of the five contacts with pupil 1, there is one transmission to pupil 3, whereas two contacts of pupil 5 result in transmission to pupils 4 and 6. Therefore, on the third day, pupils 3, 4 and 6 are progressed from the S into the E states. Further infection transmission occurs on day 3, with two contacts of pupil 1, but, by chance and there is no infection transmission in contacts of the other exposed individuals. Additionally, on day 3, pupil 5 has developed symptoms and is progressed into the U state. In this example, the U individual is Q, is away from the classroom and has no connections with other members of the classroom.

Vaccination uptake and effectiveness

As vaccination programmes continue, there is a need to include the effects of vaccine on SARS-CoV-2 infection, transmission and symptoms. In the UK, vaccination began in October 2020 with high-risk groups and has proceeded downwards through age groups. Therefore, in modelling school infection in term 1 2020, there are very low levels of vaccination for pupils and teachers. However, vaccine uptake has been high, so that by the end of the 2020–2021 school year 80% of working age people are estimated to have received two vaccine doses [16]. Primary aged children are unlikely to have received vaccine unless they have additional risk factors. To include vaccination in our model, on initialization of the class members, we perform Bernoulli trials using estimates of the vaccination uptake proportion. We model three effects of the vaccine: (i) a reduction of the probability of a vaccinated individual becoming infected; (ii) a reduction of the probability of an infected vaccinated individual transmitting the infection in a contact; and (iii) a reduction of the probability of an infected vaccinated individual developing symptoms. The different available vaccines have different effectiveness in each of these [17,18]. Here, we use fixed values for scaling factors applied for the reduction in probability of infection, reduction in probability of transmission, and reduction in probability of symptoms for vaccinated members of the population. Therefore, for susceptible vaccinated individuals, their probability of becoming infected by both community transmission and classroom contact is reduced by a ‘vaccine susceptibility factor’. For an infected vaccinated individual, their probability of onward transmission in a contact is reduced by a ‘vaccine transmission factor’, and their probability of developing symptoms is reduced by a ‘vaccine symptoms factor’. These are assigned plausible values based on [17-19] (table 1).

Mitigations

To understand the effectiveness of mitigation measures in reducing SARS-CoV-2 transmission and infection prevalence within the classroom population, we consider several ‘hard’ and ‘soft’ approaches. Soft measures include enhanced cleaning and hygiene, as well as the reduction of mixing of separate classrooms (e.g. through staggered entry, breaks and lunch times; multiple and separated entry points to the school, etc.). We do not model these explicitly as the efficacy of some of these measures is limited [26] and is not well known and would substantially increase the complexity of the model and its interpretation. Instead, we apply soft mitigation factors that reduce the absolute probability of transmission in a contact by a fixed factor and specify different values for adults and children. Hard mitigations correspond to isolation of classroom members away from the school and result either from the identification of symptoms or from testing surveillance to detect pre-symptomatic infection or their contacts. We have discussed above the removal of symptomatic individuals from the classroom as they reach the U stage of the SEUQR progression. In our model, we also allow for two other hard mitigations: ‘bubble quarantine’ of the entire class and ‘regular rapid testing’ of all individuals. The ‘bubble quarantine’ approach is severe, with the entire class placed into Q for a specified time period if there is a confirmed infection case in the class (e.g. if a classroom member becomes unwell). During this time, there are no classroom contacts. However, classroom members may still acquire infection through community transmission and continue to progress through the S, E, U, R states. For S members in quarantine, the probability of becoming infected through community transmission is likely to be reduced as their contacts will be substantially reduced. However, as the infection transmission within households is the primary pathway for infection [27], the reduction in community transmission is likely to be small, and for simplicity, we do not include it in our model as it would introduce a further highly uncertain parameter. Any individual who becomes U during Q restarts their quarantine period. On completion of Q, all classroom members (except those who are U) are returned to the classroom. An individual in Q can become U if they are infected, and then remain in Q until they move to the R state and return to the classroom. Note that there may be asymptomatic or pre-symptomatic infected individuals returned to the classroom following bubble quarantine. An alternative and less severe approach is ‘regular rapid testing’. This surveillance method adopts the regular lateral flow testing of all the classroom population on specified days. The rapid testing is capable of identifying asymptomatic and pre-symptomatic infection, allowing early quarantine by advancing members from the E state to Q. However, all tests have the potential to return misleading results, which may be particularly acute when administered by untrained people [28]. Therefore, we model the application of rapid testing using a stochastic approach. On each test day, each classroom member undergoes a Bernoulli trial: for E individuals who have the infection, the trial adopts the rapid test sensitivity (the true positive rate, i.e. the test probability of correctly detecting infection in an infected subject); for S individuals who are not infected, the trail adopts the rapid test false-positive rate (i.e. the test probability of incorrectly detecting infection in an uninfected subject ). A ‘success’ in each trial corresponds to a positive test result and leads to the individual moving into Q. However, for uninfected individuals, this is an incorrect test result, and the S member is unnecessarily removed from the classroom. The sensitivity of rapid testing in the UK can be high if administered correctly [28]. However, in studies of self-administered lateral flow tests by untrained members of the public, the sensitivity can be as low as 0.58 [14]. We adopt this value here. We have not found similar values for the test specificity for self-administered lateral flow tests, so adopt the reported value of 0.9968. We consider a further approach to reduce the number of classroom members unnecessarily quarantined by false-positive rapid tests through the use of confirmatory polymerase chain reaction (PCR) testing. In our model system, we only include PCR tests as confirmatory to a lateral flow test. The PCR tests are more accurate (high sensitivity and high specificity) and can detect infection sooner after transmission, but require laboratory analysis, so are not well-suited for regular testing of large populations. Here, we allow for them to be used to confirm (or overturn) a positive rapid test result. The PCR test results do not appear immediately; rather, there is a lag while the test is processed, during which time the test-positive subject remains in Q. The sensitivity and specificity of the PCR tests are reported to be much higher than rapid lateral flow tests [15].

Model parameters and stochastic ensemble

The parameters in the model are presented in table 1. Many of them are uncertain; where possible we have adopted published values as noted in table 1 (although many studies have not completed peer-review at the time of writing), and otherwise, we have taken reasonable values. We have not attempted to ‘fit’ parameters to data, but such a study would be valuable. Also, at this stage of model development, we have not performed a full range of detailed sensitivity tests with respect to input parameters, nor imposed uncertain distributions on them. Therefore, our reported values are best used as a comparison between the different scenarios we simulate rather than detailed hind- or forecasts of absolute numbers of infections, although, as we show below, our results are broadly comparable to observed infection levels. There are several stochastic components in the model, which we also summarize in table 1, and therefore, we must perform stochastic ensemble simulations of sufficient size to explore the space of possible trajectories of the infection dynamics within a classroom. In this study, we take ensembles with 100 000 realizations of isolated classrooms. As uncertainties are not included in the fixed parameters, variations in model outputs are owing to stochastic variation in transmission (including contact rates), illness progression and testing. Note, in the school year 2020–2021, there were 156 843 state primary school classrooms and 4.18 million pupils in England [29] (see also [8]). Ensembles of 100 000 samples of our hypothetical class of 30 pupils provide indicative traits and trends in distributional variability; if we invoke the ergodicity assumption, while admitting national schools classes, en masse, will have a variety of pupil numbers and will have other factors in play (inner city versus rural; economic differences, etc.), then our 30-pupil class infection distributional profile can be adopted as first order representative of all real classes faute de mieux. On this basis, we expect that findings from the realizations will scale to transmission at a national level.

Retrospective analysis

The beginning of the 2020–2021 school year in September was the first time that there was ‘full’ attendance in schools since the start of the SARS-CoV-2 epidemic in England. The prevalence of infection in England in September 2020 was relatively low after the aggressive mitigation measures put in place in spring and early summer, but relaxation of restrictions over the traditional school summer holiday period in August resulted in increasing incidence of infection across England as schools reopened in September, a trend that persisted through term 1 (Tuesday 1 September to Friday 23 October 2020) as shown in figure 5. Data from the Office for National Statistics [30] (available in [5]) gives infection prevalence of 0.09% in England for all ages, and 0.11% for children (age 2 to school year 6) on 1 September 2020, increasing by about an order of magnitude, to 1.10% for all ages and 0.87% for children, by 23 October 2020. The incidence rate (per 10 000 population per day) was 0.74 on 1 September, peaked at 8.44 on 21 October 2020, falling slightly to 8.41 on 23 October 2020. At this time, vaccine uptake was low (the national vaccination programme had not started in September and was targeted at high-risk groups through October 2020) so we do not include vaccination in our simulations.

Figure 5

Time series of COVID-19 infection prevalence for adults (blue line) and children (age 2 to school year 6; green line) and the community incidence rate (red line) over term 1 (Tuesday 1 September to Friday 23 October 2020). Data from Office for National Statistics [30].

School collections

Although we model SARS-CoV-2 in a single isolated classroom, the level of infection in larger collections of classrooms (e.g. classes in a school, and schools together in a local authority area) are often the most pertinent issues for decision makers. We, therefore, model a school as a collection of classrooms, and several schools collected together, through sampling with replacement from a stochastic ensemble of simulations of independent individual classes. Specifically, we generate an ensemble of simulations and randomly select realizations from this ensemble to produce a collection of classrooms to represent a single school, and a collection of such schools to model, e.g. a local authority area or academy network. An ensemble of such classroom collections is readily derived by resampling with replacement from the single-classroom ensemble so that statistically meaningful metrics can be derived. We take this approach when comparing mitigation strategies. Modelling transmission in schools as a collection of classrooms is a simplification as it does not include contacts during times where individuals from different classes mix together such as play and lunch breaks and at the beginning and end of the school day. However, at primary schools, children are within classroom settings for typically 80% of the time, while, in break periods, children from the same class will spend much of their time together. This expectation is consistent with the evidence of limited across class mixing [3]. We conclude that the model will capture the majority of transmission contacts. We model a ‘typical’ primary school classroom, consisting of 30 pupils and a teacher, over term 1 in an ensemble with 100 000 realizations. At this time, there were no ‘hard’ mitigation measures, and pupils were only asked to self-isolate if they (or a close contact) developed symptoms. However, soft mitigations including enhanced cleaning and reduction of classroom contacts were in place. We, therefore, do not incorporate hard mitigation measures in the model simulations but build our contact networks using the reduced contacts eCDFs from SEJ, while presuming pupils are quarantined if they become unwell. In figure 6, we illustrate the dynamics of classroom transmission over term 1 by plotting the time series of the number of infected pupils in a classroom for each of the ensemble realizations. The ensemble members have been ordered by the maximum number of infected pupils, which peaks at 14 across all ensemble members. A slight majority of realizations (54%) have no infected pupils across the whole of term 1. The numbers of infected pupils increase towards the end of term 1, following the trend in the community incidence rate and, we infer, reflects seeding of infection in the classroom largely from outside community transmission. Figure 7 shows the number of infections within the ensemble that results from either community (figure 7a) or classroom transmission (figure 7b), and their correlation with the total number of infections.

Figure 6

Figure 7

The number of infections caused by (a) community transmissions and (b) classroom contact transmissions in the ensemble simulations as functions of the total number of infections. The intensity of colour of the points indicates the frequency of occurrence within the ensemble on a logarithmic scale, and the ensemble has 100 000 members. Note that stochastic variations are apparent for low frequencies.

Time series of the number of infected pupils for each realization in an ensemble of 100 000 stochastic simulations of an isolated classroom in term 1 of 2020. Each row of the heatmap corresponds to an ensemble member, with colours denoting the number of pupils in the classroom with COVID-19 infection on each day. The ensemble realizations have been sorted by the sum of the number of infected pupils over all days. (a) All ensemble realizations are shown. (b) The 10 ensemble realizations with the most infections are shown, illustrating the dynamics of large outbreaks that occur with a frequency of 1/10 000. Colours denote the number of infected pupils in the classroom for each day of the simulation. The number of infections caused by (a) community transmissions and (b) classroom contact transmissions in the ensemble simulations as functions of the total number of infections. The intensity of colour of the points indicates the frequency of occurrence within the ensemble on a logarithmic scale, and the ensemble has 100 000 members. Note that stochastic variations are apparent for low frequencies. When the total number of infections is up to six, we find that most infection occurs because of community transmission (figure 7a), but beyond six infections this correlation is broken, and in all these simulations (with infections greater than 5), the number of classroom infections significantly exceeds the number of infections expected in the community. These cases could pragmatically be defined as ‘clusters’ or ‘outbreaks’ owing to in-class transmission being dominant. Current Department for Education (DfE) contingency operational guidance for schools [31, p. 10] defines two thresholds for ‘extra action’ under specific setting conditions and in relation to ‘close mixing’ groups within schools: ‘5 children, pupils, students or staff, who are likely to have mixed closely, test positive for COVID-19 within a 10-day period; or 10% of children, pupils, students or staff who are likely to have mixed closely test positive for COVID-19 within a 10-day period’. These criteria relate to two completed rounds of tests in an ‘asymptomatic test sites' regime, initiated only following full return to school; it is not clear if there are criteria for similar actions owing to in-school outbreaks under other circumstances. In our ensemble, there are relatively few realizations (approximately 1%) with total infections exceeding six, suggesting such outbreaks are relatively uncommon. Larger outbreaks of 10 infected pupils do not occur in our ensemble unless classroom transmission occurs. We find outbreaks with 10 or more infected pupils occur in approximately 0.3% of the simulations, while very large outbreaks of 15 or more pupils (i.e. at least half of our simulated class are infected) occur in only 25 (0.025%) of the realizations. Open data on which to compare our results are scarce, but some summary statistics on school attendance have been published through the pandemic by the UK DfE [32]. These statistics report that on 22 October 2020, about 20% of state-funded primary schools reported one or more pupils who were self-isolating. While it is difficult to compare this summary statistic with our stochastic simulations, as we do not know how many classes had pupils in self-isolation, the value compares favourably with our ensemble which has 18 028 (18%) of the realizations having one or more pupils quarantined on this day of the simulation. Based on the simulations, we also expect about 2665 outbreaks.

Assessing the effectiveness of mitigation approaches

To examine the effectiveness of mitigation approaches, we perform stochastic ensemble simulations with different infection control measures imposed. We model a collection of 10 primary schools, each consisting of four classrooms with 30 pupils and one teacher. The term 1 variation in community prevalence and incidence rate is adopted to model the changes over time in the external community epidemic and allows us to provide a comparison with the simulations above where there are no hard mitigation measures included. In figure 8, we compare the effect of the contact rates in classrooms and the mitigation approaches on the total number of pupils infected over the term 1 period within the 40 classrooms consisting of 1200 pupils in total. The box and whisker plots show the median, 25th and 75th percentiles, with the interquartile range (IQR) being their difference, the lower (25th percentile IQR) and upper (75th percentile IQR) Tukey fences, and outliers.

Figure 8

Comparison of the number of infected pupils in 40 primary classrooms with different mitigation measures applied. The total number includes those initially infected and those infected from community transmission. Box and whisker plots illustrate median, interquartile range, Tukey fences and outliers. Contact rate distributions for pre-COVID times and in small classes are derived from SEJ and used to construct distributions of expected contact rates for schools during term 1 in September–October 2020. Mitigation measures simulated are: (i) ‘no hard mitigations’ where only individuals who are unwell are quarantined (blue); (ii) ‘bubble quarantine’ where an unwell individual results in quarantine of the whole class (orange); (iii) ‘twice weekly rapid test (no PCR)’ where each individual is tested with a rapid lateral flow test twice each week, with positive test results resulting in quarantine of the individual (green); (iv) ‘twice weekly rapid test (with PCR)’ where each individual is tested with a rapid lateral flow test twice each week, with positive test results resulting in quarantine of the individual and a confirmatory PCR test is applied after a 2-day interval (red); (v) ‘daily rapid test (no PCR)’ where each individual is tested with a rapid lateral flow test daily, with positive test results resulting in quarantine of the individual (purple); and (vi) ‘daily rapid test (with PCR)’ where each individual is tested with a rapid lateral flow test daily, with positive test results resulting in quarantine of the individual and a confirmatory PCR test is applied after a 2-day interval (brown). We consider first the simulations where no ‘hard’ mitigation measures are applied (blue boxes in figure 8). Here, the contact rate distributions have a strong influence on the number of infections. With the contact rates of pre-COVID classrooms, the median number of pupils infected is 41 (75th percentile: 50). With reduced contacts, the median number of infections is reduced by 17% to 34, with decreases also in the spread of values, particularly on the upper tail (75th percentile: 40). Each of the mitigation measures reduces the number of infected pupils, but they are generally less effective than the changes in the contact rates. Where contact rates are highest (i.e. using the pre-COVID contact rate distributions), the mitigation measures each substantially reduce the number of infections, as seen in the decrease in the quartiles, Tukey fences and outliers. The extent of the decrease in these statistics diminishes as contact rates decrease. We compare each of the mitigation measures using the reduced contact rates adopted for primary schools in term 1 of 2020. Bubble quarantine lowers the upper quartile and upper fence values but does not substantially change the median number of infected pupils across the ensemble (figure 8), lowering the median slightly from 34 to 31. This result is expected as bubble quarantine is not very effective in guarding against the seeding of infection carried into the classrooms from the community, and the spreading of this infection in the incubation period before an infected individual has symptoms that trigger the quarantine period. However, the reduction of the upper tail of the distribution (figure 9) shows that bubble quarantine can produce a decrease in the occurrence of outbreaks. For example, small outbreaks of six of more infected pupils occur in 0.60% of isolated classroom ensemble realizations (reduced from 1.7% with no hard mitigations), outbreaks of 10 or more occur in 0.03% of realizations (cf. 0.3% with no hard mitigations), and large outbreaks of 15 or more infections occur in only 0.002% of the realizations (cf. 0.025% with no hard mitigations) (table 2).

Figure 9

Table 2

Proportion of realizations in ensemble of 100 000 isolated classroom simulations that produce outbreak with more than six infected pupils, large outbreaks with 10 or more infected pupils and very large outbreaks with 15 or more infected pupils. (Six mitigation measures are simulated, as detailed in the main text. A zero value indicates no realizations occur in the ensemble. Simulations were conducted using the reduced contact rates distribution.)

mitigation measure	proportion of outbreaks (>6 infected) (%)	proportion of large outbreaks (≥10 infected) (%)	proportion of very large outbreaks (≥15 infected) (%)
no hard mitigations	1.06	0.28	0.025
bubble quarantine	0.28	0.028	0.002
twice weekly rapid test (no PCR)	0.08	0.007	0
twice weekly rapid test (with PCR)	0.093	0.01	0
daily rapid test (no PCR)	0.003	0	0
daily rapid test (with PCR)	0.007	0	0

Comparison of the number of infected pupils in 40 primary classrooms with different mitigation measures applied. Estimates of the probability density functions (a) and cumulative distribution functions (b) using kernel density estimation for each of the mitigation strategies are shown. Proportion of realizations in ensemble of 100 000 isolated classroom simulations that produce outbreak with more than six infected pupils, large outbreaks with 10 or more infected pupils and very large outbreaks with 15 or more infected pupils. (Six mitigation measures are simulated, as detailed in the main text. A zero value indicates no realizations occur in the ensemble. Simulations were conducted using the reduced contact rates distribution.) Considering next the use of rapid testing, we simulate approaches where each member of the classroom is tested at regular intervals. Specifically, we model a twice weekly testing regime, such as that used in summer 2021, and a daily test regime. Furthermore, we simulate each with and without confirmatory PCR testing, with a 2-day time lag for conducting the PCR test. With twice weekly testing without PCR testing, there is a decrease in the median number of infections to 27, a narrowing of the IQR with a pronounced reduction in the upper quartile to 31 infections and a reduction in the upper fence value to 43 infections (figure 8). This suggests that a test-based surveillance approach is effective in reducing infections within school through the disruption of contact networks. Large outbreaks, in particular, are less common, occurring in approximately 0.007% of ensemble realizations (table 2), and very large outbreak did not occur in the simulations. Confirmatory PCR testing has minor effects on the distribution of the number of infections, increasing only the maximum value in this sample to 62 (noting that the outliers are rare events and fluctuate when resampling). This result can be rationalized as there is a small non-zero probability of a ‘false negative’ PCR test overturning a ‘true positive’ rapid test and thus returning an infectious individual into the class. The frequency of outbreaks slightly increases in the ensemble. Implementing daily rapid lateral flow testing frequency further shifts the distribution of the number of infections to lower values (figures 8 and 9), although the change is modest (median: 25; 75th percentile: 28 for simulations both with and without PCR testing), representing a change of only 7% in the median and 10% in the upper quartile values compared to the twice weekly rapid testing ensembles. Outbreaks are further reduced (albeit from small values for twice weekly testing), and no large outbreaks of 10 or more infected pupils occur in the ensemble (table 2). In this case, the effect of confirmatory PCR testing is barely discernible in the numbers of infected pupils, with only far outliers differing (the interpretation of which should be made cautiously without larger ensembles). This can also be explained: daily testing ensures that ‘false negative’ test results are rapidly re-tested, and sequences of repeated false negatives are unlikely even for the low sensitivity rapid test. There is an increase in the number of outbreaks within the ensemble, although the proportion remains small. Where infection occurs in the ensemble realizations, we track the number of secondary infections within the classroom from each infected individual over the duration of the simulation. The mitigation measures shrink the number of secondary infections by reducing contact transmission. This is illustrated in figure 10 which shows box and whisker plots for the proportion of infections that occur owing to classroom transmissions. The reduction of contact rates, without additional mitigation measures, has a substantial effect on the proportion of classroom transmissions, reducing the median from 0.41 with pre-COVID contacts to 0.28 with reduced contacts (an approximately 30% reduction), and similar relative reduction in the quartile values. The mitigation measures further reduce the proportion of classroom transmissions, with the testing-based approaches having greater effect than the bubble quarantine. Indeed, with reduced contact rates and daily rapid testing, the median proportion of classroom transmissions is zero, and the 99th percentile value is 0.12 when there is no confirmatory PCR testing (which increases to 0.15 when PCR testing is applied).

Figure 10

Box and whisker plots comparing the proportion of infections resulting from classroom transmission in 40 primary classrooms with different mitigation measures applied in schools during term 1 in September–October 2020. Box and whisker plots illustrate median, interquartile range, Tukey fences and outliers. Contact rate distributions and mitigation measures are as for figure 8. The ensemble results can be used to estimate the secondary infection rate in the classroom as the expected number of in-classroom infection transmissions. Figure 11 shows two summary statistics (the mean and 99th percentile values) to characterize the distribution of the number of secondary infections that occur in the ensemble simulations. To highlight the trends when curtailing contact rates, for figure 11 we have conducted simulations here with the contact rate distributions derived from SEJ for small classes and applied these rates to full classroom attendance; it is unlikely that such low contact rates could be maintained in a fully occupied classroom, but the simulations are illustrative hypothetical scenarios.

Figure 11

The number of secondary infections in classroom simulations with different contact rate distributions and mitigation measures applied. Two summary statistics to characterize the distributions are shown, with the 99th percentile value (a) and mean value (b) of the number of secondary infections in all 100 000 ensemble simulations calculated. ‘Small classes contacts' refers to simulations that adopt contact rate distributions derived from SEJ for small classes that are applied to full classroom attendance. Note that most of the infections in the simulations are not transmitted in classroom contacts, so mean values are below 1.0 in all cases. The mean value of the number of secondary infections is less than unity in all cases, indicating that the infection is not transmitted in the classroom contacts of most primary cases. Each mitigation measure reduces the mean value of the number of secondary infections, with more pronounced effects when the contact rate is high. Outbreaks in the classroom can be driven by relatively rare ‘super-spreaders’. Our model includes the influence of having some very gregarious individuals in a mixing group, as reported in the contact distributions in [2]. Here, we illustrate the impact of mitigation measures on super-spreaders by calculating the 99th percentile value for the number of secondary infections (i.e. 1% of infected individuals in the simulations have a secondary infection number larger than the 99th percentile value). The mitigation measures are again seen to reduce the 99th percentile values, although the ‘bubble quarantine’ approach has only modest (if any) affect. The testing-based surveillance approaches are more effective in eliminating super-spreading, reducing the 99th percentile value of the number of secondary infections to unity (or smaller) if daily testing is implemented. The mitigation measures have consequences on pupil absences. The ‘bubble quarantine’ is particularly severe, with whole classes being quarantined for a period of 10 days if there is a single symptomatic case. However, the testing-based surveillance approaches lead to the self-isolation of individual cases, so the impact on absences is greatly reduced. In figure 12, we illustrate the effects of the mitigation measures on the number of pupils present in the classroom on one day, 23 October 2020 which we took as the last day of term 1 in 2020. As the majority of the realizations in the ensemble have no infections, there is typically full attendance in each of the scenarios simulated. However, when ‘bubble quarantine’ is employed, there is full-class quarantine on this day in greater than 15% of the ensemble members. If this result is applied to all classrooms in England, this would suggest approximately 25 000 classrooms in quarantine on this day and, with an average class size of 27 in England [29], approximately 675 000 pupils absent from school. For the testing-based surveillance, there are fewer absences, with only approximately 0.1% of the ensemble simulations having more than five pupils in quarantine on the last day of the school term. The contact rate distribution is again seen to play a prominent role, with reduced contacts leading to reduced absences as there are fewer infections to trigger quarantine.

Figure 12

Number of pupils absent from school on 23 October 2020 (the last day of term 1 in 2020) in an ensemble of 100 000 simulations of a class of 30 pupils. Two contact rates are applied: pre-COVID contact rates (a) and reduced contact rates (b). Six mitigation measures are modelled. Colours indicate the proportion of ensemble realizations on a logarithmic scale (white indicating no occurrence in the ensemble). Apparent gaps in some of the rows occur owing to low probability events which may not occur in the finite ensemble. To assess the impact of the testing strategies, we focus here on the ‘reduced contacts’ distribution (figure 12b). Twice weekly rapid testing has similar proportions absent from the classroom as the simulations with no hard mitigations applied. Without confirmatory PCR testing, twice weekly rapid testing slightly increases the number of ensemble members for which there is a single pupil absent, while the application of a confirmatory PCR test returns this proportion to close to that found when no hard mitigations measures are applied. By contrast, daily testing without PCR confirmation has a substantially elevated proportion of simulations with classroom absences, with more than approximately 1% of simulations having five pupils absent. Confirmatory PCR testing reduces the absentee proportions to below that when no mitigation measures are applied. This confirms that many of those quarantined by daily rapid testing are done so unnecessarily and are promptly returned to the classroom when a PCR test overturns a false-positive rapid test. However, the disruption to education by missing some days at school is significant and the benefit from a public health perspective is negligible. We estimate the numbers absent on a national scale by performing a bootstrap resampling of our ensemble to produce a sample of 156 843 classes, matching the number of state primary schools in 2020–2021 [29]. As our nominal class size is 30 pupils, this gives a total number of pupils of 4 705 290 which is comparable to the national state primary pupil count of 4 177 058 [29] (a difference of approximately +13% in the simulated classes). The bootstrap ensemble is recomputed 10 000 times to construct a median value for the number of pupils absent on 23 October 2020 and confidence intervals on our estimates, which are reported in table 3.

Table 3

mitigation measure	no. pupils absent		percentage of pupils absent
mitigation measure	median	5th; 95th percentiles	median	5th; 95th percentiles
pre-COVID contacts
no hard mitigation	48 384	47 861; 48 908	1.03	1.02; 1.04
bubble quarantine	762 605	755 422; 769 729	16.21	16.05; 16.35
twice weekly rapid test (no PCR)	95 529	94 977; 96 069	2.03	2.02; 2.04
twice weekly rapid test (with PCR)	45 995	45 592; 46 416	0.98	0.97; 0.99
daily rapid test (no PCR)	202 887	202 152; 203 617	4.31	4.30; 4.33
daily rapid test (with PCR)	81 329	80 855; 81 811	1.73	1.72; 1.74
reduced contacts
no hard mitigation	40 734	40 312; 41 174	0.87	0.86; 0.88
bubble quarantine	748 469	741 461; 755 625	15.9	15.8; 16.1
twice weekly rapid test (no PCR)	93 517	92 980; 94 053	1.99	1.98; 2.00
twice weekly rapid test (with PCR)	42 991	42 615; 43 374	0.91	0.91; 0.92
daily rapid test (no PCR)	203 566	202 836; 204 305	4.35	4.31; 4.34
daily rapid test (with PCR)	80 725	80 248; 81 199	1.72	1.71; 1.73

Estimated national number of pupils absent from school on 23 October 2020 from ensemble simulations with bootstrap resampling. (Isolated classes are simulated with six mitigation strategies and two contact rate distributions. Ensembles of 100 000 classes are simulated and collections of 156 843 classes are formed by bootstrap resampling from the ensembles. Each bootstrap is constructed 10 000 times to obtain distributions on the number of absences, and we report the median, 5th and 95th percentile values. The percentage of the school pupil population is estimated assuming each class has 30 pupils, giving a school population of 4 705 290.) Table 3 shows that the different mitigation strategies have a strong effect on school absence, with the model adopting bubble quarantine resulting in more than 15% of pupils being absent from school on 23 October 2020. The use of rapid testing reduces the numbers of absent pupils substantially, decreasing the median value to 2% of the national primary school pupil population for twice weekly testing with no PCR confirmation. This is further decreased to around 1% of the pupil population if confirmatory PCR testing is used to return pupils with false-positive rapid test results to the classroom. However, daily rapid testing produces many more false-positive results, so the number of absent pupils increased markedly to over 200 000 (approx. 4% of the pupil population) when there is no confirmatory follow-up PCR test. With a PCR test to identify false-positive cases of daily rapid tests, the number of absent pupils decreases to approximately 80 000 (1.7% of the pupil population) which is comparable to the number absent with twice weekly rapid testing with no PCR follow-up.

Application of classroom infection model to autumn 2021 based on SARS-CoV-2 community prevalence projections

Our analysis in §3 adopted indicative estimates of SARS-CoV-2 community prevalence and incidence rates, derived directly from national testing campaigns and statistical modelling [30]. However, the model can also be driven by projections of anticipated future levels of community prevalence, such as those provided to the UK government through the Scientific Pandemic Influenza Group on Modelling and the Scientific Advisory Group for Emergencies. We illustrate this through application of our model to the autumn term of 2021. In August 2021 (when our simulations were performed), the safe return of pupils to schools was a national concern, with increasing prevalence over the summer holidays and emerging predominance of the more transmissible Delta variant. To estimate SARS-CoV-2 infection transmission and assess the effectiveness of mitigation measures, we performed simulations of our model for the first seven weeks of the new school year (i.e. assumed here to be Thursday 2 September to Friday 22 October 2021 of term 1), accepting projections of community prevalence determined by national-scale epidemiological models. These simulations should not be considered fully comprehensive forecasts of school infection levels through the autumn term of 2021: at the time, immediacy was judged important for policy support so we did not perform detailed calibration of our model parameters, nor quantify uncertainties in our model estimates. Thus, we were unable to report formally the forecast skill of our model. There were, in addition, potentially substantial uncertainties in the national-scale epidemiological models, particularly in response to changing social behaviour. We have also alluded to the simplification of treating schools as a collection of isolated classrooms and that transmission outside the classroom is assumed to be minor in agreement with mixing studies [3]. The design of our model framework allowed us to explore the potential impact of such behaviour and mitigation changes on infection levels in schools. Thus, our purpose here is to illustrate the effectiveness of coupling our classroom-level infection model with national-scale epidemiological models and, notwithstanding limitations just noted, to show that its application provides policy-relevant information for decision-makers.

Amendments to the model

To apply our model to term 1 of 2021, we altered our model parameters from those used for term 1 2020. In particular, we added factors for the impact of population vaccination by September 2021. To illustrate this additional model capability, here we assume that there is 95% uptake of vaccination among classroom staff, and that 1% of children have been vaccinated. Another crucial difference is the high prevalence of the Delta (B.1.617.2) variant in the UK population. Genomic surveillance [33] and modelling studies [17-19] indicate that the Delta variant is expected to be predominant in the UK in September 2021, accounting for more than 95% of COVID-19 infection cases [17-19]. The Delta variant has two major impacts on our updated model. Firstly, it has a substantial transmission advantage over the original wild-type SARS-CoV-2 [34,35]. This is estimated as a 61% transmission advantage over the Alpha (B.1.1.7) variant in [34], from which we estimate a transmission advantage of 140% over the original wild-type. We, therefore, increase our values of the probability of contact transmission from original wild-type SARS-CoV-2 values using this scaling factor. The probabilities of contact transmission we adopt for the Delta variant are given in table 4 (assuming, as before, that adults are twice as likely as children to transmit infection [8-10]).

Table 4

Probability of transmission of Delta variant SARS-CoV-2.

Delta variant transmission probability	adults	children
symptomatic	0.084	0.042
asymptomatic	0.059	0.029

Probability of transmission of Delta variant SARS-CoV-2. An additional characteristic of the Delta variant, which drives increased infection, is a shortening of the incubation period for infected individuals [36]. Delta variant infection is detectable (by PCR testing) in cases approximately 2 days earlier than for original wild-type SARS-CoV-2 infection [36]. We adopt this shorter interval in our model by reducing the median duration of the incubation period by 2 days. Thus, for each individual in our population, we assign an incubation time drawn from log-Normal distribution [7] with . In principle, the variance of the log-Normal distribution could also be amended, but in the absence of detailed information, we retain the previous value. The reduced incubation period affects the time-dependence of the relative transmission probability, with a more rapid increase to peak-infectiousness for the Delta variant model parameters. Therefore, the probability of infection transmission from pre-symptomatic individuals is enhanced when considering the Delta variant.

Projection scenarios

At the time our projection simulations were performed, there was significant uncertainty in the trajectory of the epidemic into the new school year. Projections from national-scale epidemiological models fitted to observations [17-19] provide a basis for our scenario estimations. Typically, the projections in [17-19] suggested a third wave of the epidemic was likely in late summer as the effects of easing of restrictions on 19 July on infection transmission were realized. The model projections also indicated that infection among younger age groups would become dominant [17]. The magnitude of a third wave varies across models and with model assumptions, particularly regarding social mixing over the summer, and peak incidence rates (per 10 000 population per day) range from approximately 12 [17] to approximately 300 [18] across the models, spanning values substantially higher than those for September 2020 (figure 5; [30]). To model the community incidence rate of new infections over term 1 of 2021, we digitize projection graphics produced by a national-scale epidemiological model in [17] (figure 13; data available from [5]). This provides two time series of incidence rates which differ in the magnitude and timing of the peak depending on how rapidly societal contacts returned to pre-COVID levels which we refer to as the ‘gradual-relaxation’ and ‘rapid-relaxation’ incidence rates (figure 13). For the gradual-relaxation scenario, a peak incidence rate of approximately 67 000 new infections per day was projected to occur in late July, with a gradual decrease through August and September, to approximately 20 000 per day on 1 October 2021, with incidence slightly increasing subsequently [17]. In this scenario, our simulations of the school term begin in a period of relatively steady incidence. By contrast, if societal contact rates rapidly return to pre-pandemic levels in summer 2021, then the projected peak incidence rate was approximately 145 000 new infections per day and occurs on 12 August 2021, with incidence rates elevated with respect to the gradual-relaxation until 1 October 2021, from when the incidence rate gradually decreases to low levels through October and November [17]. The incidence of new infection was not expected to be evenly distributed through the population, but rather skewed to lower age groups who were unvaccinated. Here, we adopt the values reported in [17] that attribute 27% and 25% of new infections to age groups 0–9 and 10–19, respectively, for the gradual-relaxation scenario, and combine these to produce a child-incidence rate. These proportions change but slightly for the rapid-relaxation scenario to 26% and 24% for age groups 0–9 and 10–19, respectively.

Figure 13

Projections of the incidence rate (a) and prevalence (b) of COVID-19 in the UK from June to December 2021. Incidence rates are obtained from a national-scale epidemiological model [17] with scenarios modelling a gradual or rapid return to pre-pandemic societal contact rates. The prevalence is modelled assuming R0 = 5 and infection duration of 40 days (see the electronic supplementary material, appendix A4). Data from the Office for National Statistics Coronavirus Infection Survey [30] for the estimated incidence and prevalence with credible intervals are illustrated for the data available at the time of writing (early December 2021, orange) and the data available at the time simulations were conducted (late August 2021, yellow). At the time our simulations were performed (mid-August 2021), the projections of community incidence rate were tracking well the estimates derived from the national Coronavirus Infection Survey compiled by the Office for National Statistics [30] (noting that the national scale projections were made in early July 2021), with the gradual-relaxation scenario in particular predicting quite well the timing and magnitude of the early summer ‘peak’ in incidence. Subsequently, however, observed incidence rates did not fall as substantially as the projections anticipated, and there are significant and increasing deviations in the projected incidence rate from observations from mid-September and into October. Therefore, as our classroom transmission model adopts the projected incidence rate for community transmission seeding of infection into schools, we expect our forecasts should differ from observed infection rates in schools. Our model also requires the community prevalence. This is not reported in the projections, so we estimate prevalence using a simple rescaling of incidence rates with comparison to the available data (figure 13), as described in the electronic supplementary material, appendix A4. The prevalence has less importance in our model than the incidence rate (as it is used only as a seeding of infection on model initiation and in cases where substitute teachers are added to the simulated classroom) and we judge that the estimate from incidence rate is sufficient in our forecasts. However, we note that, as with the incidence rate, from mid-September there are substantial deviations of the projection-derived prevalence from those subsequently observed. Guidance from the DfE issued in summer 2021 [37] removed many of the SARS-CoV-2 mitigation measures in school settings, but recommends ‘soft mitigation’ measures continue to be applied, such as enhanced cleaning and classroom ventilation. Therefore, in our forecast simulations, we do not model any of the possible hard mitigations that might be activated under certain conditions. Additionally, schools were able to reconvene in large groups, such as during lunch and break times and for school assemblies. This may increase transmission rates within school settings and particularly between classes. However, school managers were responsible for deciding whether to continue to ‘bubble’ classes to minimize interactions. As teachers and school leaders could also decide to continue to implement measures that reduce contacts within the classroom, we adopt the two contact rates distributions from [2] that model contact rates for pre-COVID classrooms and reduced contact rates during the epidemic.

Classroom transmission modelling results

We perform stochastic simulations for an isolated school classroom over the 50 days of term 1 in 2021, adopting the ‘reduced contacts’ distribution of contact rates (i.e. we assume that in this time period, schools will continue to enact measures to reduce the number of contacts in classrooms). Figure 14 illustrates the result for the ‘gradual-relaxation’ scenario, showing time series of the number of infected pupils in each realization from an ensemble of 100 000 stochastic simulations in which no hard mitigation measures are applied. Therefore, figure 14 is directly comparable to figure 6, and shows that there is potential for substantially increased infection within school classrooms in term 1 of 2021 when compared with term 1 of 2020. We find both (i) a potential substantial increase in the proportion of ensemble realizations that have infected pupils (62% for 2021, compared to 46% for 2020), and (ii) the maximum number of simultaneously infected pupils in a realization could be increased (22 in 2021, compared to 14 in 2020). There are also more realizations in the ensemble that have several simultaneously infected pupils, although most frequently there is only a single infected pupil in the class, as we found in the term 1 2020 simulations.

Figure 14

Time series of the number of infected pupils for each realization in an ensemble of 100 000 stochastic simulations of an isolated classroom in term 1 of 2021 under the gradual-relaxation scenario. Each row of the heatmap corresponds to an ensemble member, with colours denoting the number of pupils in the classroom with COVID-19 infection on each day. The ensemble realizations have been sorted by the sum of the number of infected pupils over all days. (a) All ensemble realizations are shown. (b) The 10 ensemble realizations with the most infections are shown, illustrating the dynamics of large outbreaks that occur with a frequency of 1/10 000. Colours denote the number of infected pupils in the classroom for each day of the simulation. The gradual-relaxation scenario can also be compared to forecasts under the rapid-relaxation scenario (figure 15) where the incidence rate is approximately three times larger at the start of term 1. In this case, there are infected pupils in more than 89% of the ensemble realizations, and more realizations have substantial numbers of pupils simultaneously infected. This is confirmed in figure 16 where histograms (empirical probability density functions) of the total number of infected pupils in each realization are shown for both scenarios for term 1 2021 and the equivalent simulation for term 1 2020.

Figure 15

Figure 16

Histograms (empirical probability density functions) of the total number of infected pupils in ensembles of 100 000 simulation of the rapid-relaxation and, gradual-relaxation scenarios for term 1 in 2021 and the ‘no hard mitigations’ simulation with reduced contacts for term 1 in 2020. Note the logarithmic scale on the proportion in the ensemble.

Time series of the number of infected pupils for each realization in an ensemble of 100 000 stochastic simulations of an isolated classroom in term 1 of 2021 under the rapid-relaxation scenario. Each row of the heatmap corresponds to an ensemble member, with colours denoting the number of pupils in the classroom with COVID-19 infection on each day. The ensemble realizations have been sorted by the sum of the number of infected pupils over all days. Colours denote the number of infected pupils in the classroom for each day of the simulation. (a) All ensemble realizations are shown. (b) The 10 ensemble realizations with the most infections are shown, illustrating the dynamics of large outbreaks that occur with a frequency of 1/10 000. Histograms (empirical probability density functions) of the total number of infected pupils in ensembles of 100 000 simulation of the rapid-relaxation and, gradual-relaxation scenarios for term 1 in 2021 and the ‘no hard mitigations’ simulation with reduced contacts for term 1 in 2020. Note the logarithmic scale on the proportion in the ensemble. The results in figure 16 show that for both scenarios for term 1 of 2021, there is a probability of greater than 0.01 of classes having 10 or more infected pupils over the course of the term. Indeed, the number of infected pupils at the 1% probability level increases from five for the 2020 simulation to seven for the gradual-relaxation scenario for 2021, and to 13 under the rapid-relaxation scenario. The change in the upper tail values from term 1 2020 (with no hard mitigations) to the 2021 scenarios is even more pronounced: the 99.9 percentile values on the numbers of infected pupils are 12 for 2020 and 22 for the 2021 scenarios. Thus, the simulations indicate a much greater potential for substantial outbreaks in term 1 of 2021. Simulations have also been performed for term 1 of 2021 scenarios assuming the classroom contact rates return to pre-COVID levels. To compare these simulations with those above, we consider again the collection of 10 schools each with four classrooms of 30 pupils. This better illustrates the level of infection that local authorities, city boroughs or academy networks may have needed to respond to in term 1 of 2021. In figure 17, we illustrate the differences that may occur in these scenarios by plotting histograms that show the probability that a given number of classes in the collection experiences infection at different levels. We find that the probabilities of classes having no infections (figure 17a) are independent of the classroom contact rates, but strongly depends on the assumed societal contact relaxation scenario, with a rapid-relaxation to pre-COVID societal contacts substantially reducing the number of classes with no infections. Similarly, the number of classes experiencing a single infection (figure 17b) has only slight variations with the classroom contact rates, but differ substantially under the two societal contacts forecast scenarios.

Figure 17

Histograms of the number of classes in a collection of 10 schools each with four classes of 30 pupils that are forecast to experience SARS-CoV-2 infections in term 1 of 2021 under different scenarios: gradual- and rapid-relaxation to pre-COVID societal contacts, and reduced or pre-COVID contact rates in the classroom. Six different infection levels are considered: (a) classes with no infections; (b) classes with a single infection; (c) classes with at least one infection; (d) classes with more than six infections; (e) classes with 10 or more infections; and (f) classes with 15 or more infections. Note, in (f), the reduced contacts gradual-relaxation scenario has a probability that no classes have 15 or more infections of 0.66 and is curtailed in these plots to better illustrate the lower probability but high consequence outbreaks. The effects of in-classroom contact rates increase if we consider outbreaks. Small clusters of three or more infections (figure 17c), which would be likely to trigger ‘extra action’ under DfE guidelines [31], are more likely for high (pre-COVID) in-class contact rates than for reduced contacts. However, differences in the community incidence rate in the two forecast scenarios have the greater effect; three or more infections are likely in around half of the 40 classes in the collection under the rapid-relaxation scenario, but in only 5–8 classes for the gradual-relaxation scenario. For outbreaks of more than six infections in a class (figure 17d), our model suggests that 16 classes in the collection are likely to experience such infection levels with pre-COVID classroom contact rates under the rapid-relaxation scenario, reducing to 12 classes under the gradual-relaxation scenario. Outbreaks of 10 or more infections (figure 17e) are less likely in all simulated conditions, but, in the worst-case modelled (pre-COVID classroom contact rates under a rapid-relaxation of societal contacts), it remains likely that eight classes in the collection would experience an outbreak of this size. Additionally, the number of classes with 10 or more infections is similar for reduced classroom contacts in the rapid-relaxation scenario and the pre-COVID contacts in the gradual-relaxation scenario. Considering a large outbreak of 15 or more pupils (i.e. at least half of the pupils in a class being infected), we find that this is likely in three classes, and may occur in more than 10 classes, under the worst-case modelled, but becomes unlikely in the gradual-relaxation scenario. However, even in the best-case modelled (reduced classroom contact rates and gradual-relaxation of societal contacts), there is an appreciable probability (approx. 0.0075) of three classes experiencing a large outbreak. Finally, we estimate the proportion of pupils absent through term 1 of 2021 and compare with estimates of pupil absences from reports to the DfE [32]. Table 5 collates the proportions of absences over six weeks in September and October 2021 as found from model ensemble simulations within the rapid- and gradual-relaxation scenarios and adopting pre-COVID or reduced contact rates. We model a twice weekly rapid testing regime, both with and without confirmatory PCR testing, as well as simulations with no hard mitigations applied for comparison. We note that regular testing of primary aged children was not mandated by the UK Government during this time, although staff were required to self-administer tests twice weekly and households of pupils were recommended to carry out tests. However, an internet search has found several examples of primary schools advising regular testing of children at home, so it is likely that ad hoc arrangements were in place in many schools.

Table 5

date	09/09 (%)	16/09 (%)	23/09 (%)	30/09 (%)	07/10 (%)	14/10 (%)
reported absences	1.1	1.4	2.0	1.9	1.8	2.2
rapid-relaxation scenario
pre-COVID contacts
no hard mitigations	0.2	0.6	1.3	2.2	3.4	4.8
twice weekly rapid test (no PCR)	0.9	1.5	2.1	2.8	3.7	4.6
twice weekly rapid test (with PCR)	0.2	0.5	1.1	1.8	2.6	3.5
reduced contacts
no hard mitigations	0.2	0.4	1.0	1.7	2.6	3.6
twice weekly rapid test (no PCR)	0.9	1.5	2.0	2.6	3.4	4.2
twice weekly rapid test (with PCR)	0.2	0.45	0.9	1.6	2.3	3.1
gradual-relaxation scenario
pre-COVID contacts
no hard mitigations	0.2	0.6	1.3	2.2	2.6	2.3
twice weekly rapid test (no PCR)	0.9	1.5	2.1	2.6	2.7	2.3
twice weekly rapid test (with PCR)	0.2	0.5	1.1	1.6	1.6	1.2
reduced contacts
no hard mitigations	0.2	0.4	1.0	1.6	1.8	1.5
twice weekly rapid test (no PCR)	0.9	1.4	1.9	2.4	2.5	2.1
twice weekly rapid test (with PCR)	0.2	0.5	1.0	1.4	1.4	1.1

Comparison of reported pupil absences in state-funded primary schools in England with modelled pupil absences under rapid-relaxation and gradual-relaxation scenarios over term 1 of 2021. (Model ensemble simulations with 100 000 realizations are performed adopting pre-COVID and reduced contact distributions. Reported absences obtained from [32].) Table 5 indicates an increasing level of reported absence over the first three weeks of term 1, almost doubling from approximately 1% of pupils on 9 September 2021 to 2% on 23 September 2021. Subsequently, the absence rate fluctuates at around 2% of pupils for the remaining weeks of term 1. Our modelling results are broadly consistent with these reported values. There are differences observed in the absences between the two community prevalence scenarios, which increase over time, with approximately twice as many absences under the rapid-relaxation scenario on 14 October 2021 than the gradual-relaxation scenario. As expected, reduced contact rates result in lower numbers absent. In the first week of term 1 (9 September 2021), our modelled values are substantially lower than reported (table 5), except in the case where twice weekly rapid testing without PCR confirmation is applied. We note here that pupils were asked to perform two self-administered lateral flow tests in the week prior to returning to school in September. This requirement was introduced after our simulations were performed, so was not included in our model, and is likely to result in model simulations under-estimating absences in the first week, but is somewhat corrected where we have modelled twice weekly rapid testing. The modelled absences generally increase monotonically over the weeks of term 1 under the rapid-relaxation scenario, but with gradual-relaxation the absence level shows a similar fluctuation for the final three weeks of term 1 as seen in the reported absences (table 5). The simulation results under the gradual-relaxation scenario with reduced contact rates and twice weekly rapid testing (with no confirmatory PCR tests) show particularly good matches to the observed rates.

Discussion

Main results

We have developed a stochastic discrete agent-based model of SARS-CoV-2 transmission in a primary school classroom and applied this to analyse infections over term 1 of 2020 using reported community infection levels, and to project trends in term 1 of 2021 using projections from national-scale epidemiological models. The model takes account of the known large variations in the contact patterns of individual pupils in primary schools [2]. This includes allowing for the possibility of having one or more very gregarious individuals in a mixing group, who may drive major in-class outbreaks, and allows the potential impact of such pupils to be investigated quantitatively. Our basic model is for in-classroom transmission, but individual schools can be modelled as collections of such classes such that the results can be applied to local authority areas and academy networks and provide estimates on a national scale. Our analysis of model findings for term 1 of 2020 has focussed on assessing alternative mitigation measures that have been proposed to manage infection risk in schools. While there are limited data with which to test our model, the reanalysis broadly aligns quantitatively with observations from 2020. Furthermore, in comparing mitigation measures, our model parameters are not varied, allowing relative changes to be compared and reducing the impact of uncertainties on parameters. We find the dynamics of infection in the classroom follow the external community incidence rates through the seeding of infection. Typically, there are few instances of classroom transmission and, in the majority of ensemble realizations, there are either no infections or a single pupil is infected. Our model results support the notion that outbreaks can be defined as infections exceeding about five persons in a class-sized mixing group. Below five cases, we find that the infections are largely driven by community prevalence, but this connection is broken for realizations numbering more than five infections with large outbreaks driven by classroom transmission. Our forecasts indicate that outbreaks may be substantially larger and more frequent in term 1 of 2021 compared to term 1 of 2020. This is owing to the higher transmission rates with variant Delta SARS-CoV-2 and its more rapid onset of infectivity. In the worst-case scenario, the modelling indicates that almost all school classes will experience at least one infection case in term 1 of 2021. A prominent result of our modelling is the essential role of contact rates within the classroom on infection transmission. In this study, we have compared two different contact rate distributions, derived from an elicitation of teachers in May 2020 [2], using our new agent-based stochastic transmission model. The reduction of contact rates from pre-COVID levels is shown to have a comparable reduction on pupils' infection as implementing bubble quarantine with pre-COVID contacts. The reduction of contacts also shrinks in-class secondary infections to a similar extent as the application of bubble quarantine. Following on from these retrospective analyses, we have applied the new model to estimate infection in schools for term 1 2021 using projections of community prevalence. Here, the model assumptions and parameters have been changed from the September 2020 situation to include a largely vaccinated adult population and the more virulent Delta variant of SARS-CoV-2. We have contrasted two different contact patterns, namely pre-COVID with no mitigation and then reduced contact levels, as ascribed to primary schools in term 1 of 2020. We then contrasted two different modelling projections, concerning community prevalence and incidence rate of infection, making different assumptions about how quickly the public would alter their community contact patterns back to pre-COVID rates. In comparing these two incipient scenarios for September–October 2021, we conclude, commensurate with [8], that prevalence levels in the community will remain the dominant control on infection in primary schools. Adopting ‘soft’ mitigation measures to reduce transmission has a modest effect compared to differences in projected community prevalence but significantly suppresses occurrence of larger outbreaks.

Implications for managing COVID-19 in schools

Our results have implications for the management of SARS-CoV-2 infection transmission in primary schools. We find that mitigation steps which reduce contacts between pupils have a tangible benefit in disrupting transmission. Schools are already experienced in organizing classrooms and other social interactions to reduce pupil-to-pupil and pupil-to-teacher contacts. Our model results suggest that, given practicalities of managing contacts within schools, continuation of these practices can serve to reduce in-school transmission by up to 30%. Of greater significance is the comparison of different strategies to deal with infection occurrences. The exclusion of whole classrooms and bubbles has led to self-isolation of large numbers of pupils from primary schools in the recent past, with major adverse effects on education. In addition, such absences have significant and unwelcome knock-on effects on families and the economy. Our model results show that, in terms of infection numbers and transmissions, there are no tangible benefits of this policy in comparison to a policy of simply removing a pupil who has been infected from school. Thus, excluding whole classes or bubbles is hard to justify from a public health perspective and in the context of its highly disruptive effects on education and society. Our modelling also shows that a regular rapid lateral flow testing regime has significant benefits in reducing transmission and is more effective than bubble or class exclusion. This results in much lower absence numbers, so is manifestly superior from an educational and societal perspective. Contrasts between different testing regimes with respect to frequency of testing and whether lateral flow testing is augmented by PCR tests are marginal. Our model results indeed suggest that augmented PCR tests have little benefit since a positive lateral flow result will, in most cases, simply be confirmed by a PCR test. Thus, a testing regime which meets criteria of practicality and cost should be chosen. From an epidemiological costs and benefits perspective, our model findings indicate that two lateral flow tests per week is a suitable—and justifiable—regime, with little or no need for PCR tests. This said, a number of other social and economic considerations can also influence policies that need to balance infection control, educational needs and costs of interventions. In terms of ability to forecast infection levels, the model relies on data for community incidence rates and prevalence, which limits forecast skill to some extent. At the time of conducting our simulations for the return to school in autumn 2021, there were very large uncertainties in future projections of incidence rate and prevalence. Projections over long timescales (several weeks) are unreliable. In particular, they are sensitive to changes in model parameters that are influenced by external forcings (e.g. weather conditions influencing the extent of indoor mixing in the summer) and changes in the characteristics of the virus that affect its transmissivity, as is being demonstrated by the arrival of the Omicron variant. Subsequent observations are substantially different from projected community infection levels (figure 13). Community incidence is the main driver of infection in school, seeding infection in the classroom, owing to the epidemiology of SARS-CoV-2 infection and transmission for children adopted in our model. In our simulations, this remains the case even with a large proportion of the adult population vaccinated. We caution though that—to some extent—community prevalence control is built into our models (see next section for fuller discussion).

Caveats, limitations and further developments

A number of assumptions underpin our model, with some contingent limitations. It considers only close contacts (n.b. defined here as per [2]) as the predominant infection transmission pathway among persons in a classroom. Unavoidably, the contact pattern data we have used, derived from teacher elicitations [2], are an approximation of complex varying patterns of human interactions. Our model also assumes each classroom is isolated and independent of other classrooms, as far as transmission likelihood is concerned. While there are some interactions between pupils and adult staff from different classes and across age groups, we judge this simplification is justified for primary schools because most of their time at school is in classroom, from evidence of limited cross age mixing [9] and due to the imposition of school-wide infection control measures that were put in place to minimize inter-class mixing. Our model also simplifies the complex social interactions by assuming that contacts are randomly distributed among all members of a class, whereas there will be friendship groups resulting in variations in the connectness of the individuals with one another. The in-classroom transmission of infection in our model is assumed to be predominantly through close contacts. This neglects the possibility of long-range airborne transmission by dispersed aerosol, which may be a contribution in situations where there are several infected people in the classroom, thereby elevating the background aerosol concentration to comparable levels to those near infectious individuals. While a model of such transmission (e.g. [38]) could be incorporated in addition to the close contact model, we note, however, that such embellishment of the model will introduce additional uncertain parameters that vary widely between schools, and is unlikely to alter the relative effectiveness of the mitigation measures. Our model does not consider other measures to mitigate transmission (e.g. masks, cleaning and ventilation) in detail (only adopting a lumped parameter to represent these ‘soft’ mitigations). Some or all of these might be taken into account in development of a more sophisticated, more complicated model; this said, we doubt there would be much benefit for such additional complexities, given that they would all introduce uncertainties and, it can be argued, would only introduce second order factors and effects. The scenarios we discuss here were relevant to mitigation measures in place or under consideration at the time of our study. Alternative testing strategies could be considered for any further extension of, or refinements to the model. For example, rather than testing all pupils on specified days, a staggered testing regime could be imposed and its effects analysed. This alternative was considered in the agent-based model of Asgary et al. [39] who find that testing as few as 20% of pupils in a class each day could provide testing coverage sufficient to prevent infection outbreaks. However, the model in [39] has simplified transmission dynamics, with a fixed number of contacts between pupils applied randomly and lacks a contact network, such as is implemented in our model. We have not performed extensive calibration of our model, in part owing to a lack of detail in published data with which to compare our model. While calibration is likely to improve our predictions of infection levels and resulting pupil absences, there is also a danger of over-fitting models as there are a number of parameters that can be tuned and data are scarce and uncertain. We have adopted published values of epidemiological parameters, where available, and have demonstrated that the model is able to reproduce observed levels of pupil absence. Even without detailed calibration, the model provides a useful tool with which to simulate potential scenarios and provide evidence to judge the effectiveness of alternative mitigation measures and the tension between controlling infection level and pupil absence. One major issue is that the model does not explicitly distinguish between community-related infection transmission outside and inside school. Thus, to a large extent, the current model is predicated intrinsically on community prevalence being dominant. The model is, however, able to identify transmission clusters and larger outbreaks of in-school infections. We think this aspect of modelling could be usefully developed to allow it to detect differences between infection rates inside and outside school separately. With much of the adult population vaccinated, it could be that schools become important reservoirs and drivers of overall community infection. In terms of ‘outbreaks’ of infection, at this stage, we have concentrated on projections for six or more infections in a class because this breakpoint clearly emerges from the data. In this regard, in §3, we mention current DfE contingency operational guidance for schools [31] where two thresholds are defined for ‘extra action’ under specific setting conditions and in relation to ‘close mixing’ groups within schools. If we take the second of these, i.e. ‘10% of children, pupils, students or staff who are likely to have mixed closely test positive for COVID-19 within a 10-day period’, for our hypothetical class of 30, this equates to three or more positives. En passant, in figure 15c, we present the projected number of classes—in a collection of 10 schools with four classes each—that could match this concentration of infections, under two mitigation scenarios. There is, therefore, a need to focus our modelling specifically on these stated DfE criteria, and to develop such projections—and, critically, their associated uncertainties—in more detail than has been possible thus far. Another useful potential development relates to the use of our model for forecasting purposes. Forecast reliability in such models is strongly influenced by associated uncertainties and requires a thorough calibration process to properly constrain input parameters, their uncertainties, and their influence on model predictions. This requires compilation of various sets of data, which are often scarce during the early stage of an epidemic and can be modulated by interventions and by ad hoc behavioural changes in the affected population. Furthermore, implementing our model in forecast mode requires additional ‘forcing’ data, in the form of community prevalence and incidence rates, which are likely to be sourced from other models with their own uncertainties. Our example of using established models of community transmission (figure 13) indicates that their own forecasting skill is limited to no more than a few weeks. Nonetheless, our model might usefully be enhanced for short-term forecasting by modifying it to assimilate near real-time data on observed incidence. It could then be applied to generate short-term forecasts that should be reasonably dependable, out to a few weeks ahead.

Conclusion

Our transmission modelling results, reported above, provide one objective basis for informing appraisals of the relative efficacies of different mitigation measures in classroom conditions in primary schools (and other education gatherings). In particular, the form of agent-based contact model we have developed is well-suited for characterizing group infection transmission risks when respiratory drops and aerosol dispersion are dominant vectors in enclosed spaces (fomite transmission is now recognized to be a low-risk infection route [40]). The emergence of SARS-CoV-2 variants with different transmissibility properties suggests that there could be distinct differences in the efficacies of bubbles and testing when it to comes to projected infection levels in schools, and hence potential absence rates; our model parameterization is easily adjusted for quantifying such effects and can, we believe, offer punctual and auditable information on incipient infection fluctuations for policy purposes. The results reported here indicate that refocusing on social distancing and enhanced ventilation in schools would present benefits for managing COVID-19 infections in schools. These, and related arguments from our findings, suggest that, following comprehensive calibration and validation, our stochastic contact network modelling approach should be of value to Directors of Public Health and Education Authorities charged with responsibility for prompt assessment and managing of infection transmission dynamics in school classes and in multiple schools in a given area. In terms of current or future DfE action criteria (e.g. [31,37]), our model could be tailored straightforwardly to schools' demographics and to the relevant local community incidence level. The numerical modelling code has been fine-tuned for rapid analytical computation and, therefore, could be used on a frequent, real-time basis under rapidly changing COVID-19 infection conditions—e.g. for the new, emerging Omicron variant; with suitable resources, daily data-adaptive updating of projections should be feasible for a group of schools in a region.

21 in total

1. Measuring social networks in British primary schools through scientific engagement.

Authors: A J K Conlan; K T D Eames; J A Gage; J C von Kirchbach; J V Ross; R A Saenz; J R Gog
Journal: Proc Biol Sci Date: 2010-11-03 Impact factor: 5.349

2. The relative transmissibility of asymptomatic COVID-19 infections among close contacts.

Authors: Daihai He; Shi Zhao; Qianying Lin; Zian Zhuang; Peihua Cao; Maggie H Wang; Lin Yang
Journal: Int J Infect Dis Date: 2020-04-18 Impact factor: 3.623

3. Ten scientific reasons in support of airborne transmission of SARS-CoV-2.

Authors: Trisha Greenhalgh; Jose L Jimenez; Kimberly A Prather; Zeynep Tufekci; David Fisman; Robert Schooley
Journal: Lancet Date: 2021-04-15 Impact factor: 79.321

4. A guideline to limit indoor airborne transmission of COVID-19.

Authors: Martin Z Bazant; John W M Bush
Journal: Proc Natl Acad Sci U S A Date: 2021-04-27 Impact factor: 11.205

5. Pupils returning to primary schools in England during 2020: rapid estimations of punctual COVID-19 infection rates.

Authors: W P Aspinall; R S J Sparks; M J Woodhouse; R M Cooke; J H Scarrow
Journal: R Soc Open Sci Date: 2021-09-15 Impact factor: 2.963

6. Low risk of SARS-CoV-2 transmission by fomites - a clinical observational study in highly infectious COVID-19 patients.

Authors: Toni Luise Meister; Marielen Dreismeier; Elena Vidal Blanco; Yannick Brüggemann; Natalie Heinen; Günter Kampf; Daniel Todt; Huu Phuc Nguyen; Jörg Steinmann; Wolfgang Ekkehard Schmidt; Eike Steinmann; Daniel Robert Quast; Stephanie Pfaender
Journal: J Infect Dis Date: 2022-05-05 Impact factor: 7.759

7. Age-dependent effects in the transmission and control of COVID-19 epidemics.

Authors: Petra Klepac; Yang Liu; Nicholas G Davies; Kiesha Prem; Mark Jit; Rosalind M Eggo
Journal: Nat Med Date: 2020-06-16 Impact factor: 53.440

8. Social contacts and mixing patterns relevant to the spread of infectious diseases.

Authors: Joël Mossong; Niel Hens; Mark Jit; Philippe Beutels; Kari Auranen; Rafael Mikolajczyk; Marco Massari; Stefania Salmaso; Gianpaolo Scalia Tomba; Jacco Wallinga; Janneke Heijne; Malgorzata Sadkowska-Todys; Magdalena Rosinska; W John Edmunds
Journal: PLoS Med Date: 2008-03-25 Impact factor: 11.069

9. Incubation period of COVID-19: a rapid systematic review and meta-analysis of observational research.

Authors: Conor McAloon; Áine Collins; Kevin Hunt; Ann Barber; Andrew W Byrne; Francis Butler; Miriam Casey; John Griffin; Elizabeth Lane; David McEvoy; Patrick Wall; Martin Green; Luke O'Grady; Simon J More
Journal: BMJ Open Date: 2020-08-16 Impact factor: 2.692

10. Accuracy of UK Rapid Test Consortium (UK-RTC) "AbC-19 Rapid Test" for detection of previous SARS-CoV-2 infection in key workers: test accuracy study.

Authors: Ranya Mulchandani; Hayley E Jones; Sian Taylor-Phillips; Justin Shute; Keith Perry; Shabnam Jamarani; Tim Brooks; Andre Charlett; Matthew Hickman; Isabel Oliver; Stephen Kaptoge; John Danesh; Emanuele Di Angelantonio; Anthony E Ades; David H Wyllie
Journal: BMJ Date: 2020-11-11