Elise J Kuylen1,2, Andrea Torneri1, Lander Willem1, Pieter J K Libin2,3,4, Steven Abrams2,5, Pietro Coletti2, Nicolas Franco2,6, Frederik Verelst1, Philippe Beutels1,7, Jori Liesenborgs8, Niel Hens1,2. 1. Centre for Health Economic Research and Modeling Infectious Diseases, University of Antwerp, Antwerp, Belgium. 2. Data Science Institute, I-BioStat, Hasselt University, Hasselt, Belgium. 3. Artificial Intelligence Lab, Vrije Universiteit Brussel, Brussels, Belgium. 4. Rega Institute for Medical Research, Clinical and Epidemiological Virology, University of Leuven, Leuven, Belgium. 5. Global Health Institute, University of Antwerp, Antwerp, Belgium. 6. Namur Institute for Complex Systems, Department of Mathematics, University of Namur, Namur, Belgium. 7. School of Public Health and Community Medicine, The University of New South Wales, Sydney, NSW, Australia. 8. Expertise Centre for Digital Media, Hasselt University - transnational University Limburg, Hasselt, Belgium.
Abstract
Superspreading events play an important role in the spread of several pathogens, such as SARS-CoV-2. While the basic reproduction number of the original Wuhan SARS-CoV-2 is estimated to be about 3 for Belgium, there is substantial inter-individual variation in the number of secondary cases each infected individual causes-with most infectious individuals generating no or only a few secondary cases, while about 20% of infectious individuals is responsible for 80% of new infections. Multiple factors contribute to the occurrence of superspreading events: heterogeneity in infectiousness, individual variations in susceptibility, differences in contact behavior, and the environment in which transmission takes place. While superspreading has been included in several infectious disease transmission models, research into the effects of different forms of superspreading on the spread of pathogens remains limited. To disentangle the effects of infectiousness-related heterogeneity on the one hand and contact-related heterogeneity on the other, we implemented both forms of superspreading in an individual-based model describing the transmission and spread of SARS-CoV-2 in a synthetic Belgian population. We considered its impact on viral spread as well as on epidemic resurgence after a period of social distancing. We found that the effects of superspreading driven by heterogeneity in infectiousness are different from the effects of superspreading driven by heterogeneity in contact behavior. On the one hand, a higher level of infectiousness-related heterogeneity results in a lower risk of an outbreak persisting following the introduction of one infected individual into the population. Outbreaks that did persist led to fewer total cases and were slower, with a lower peak which occurred at a later point in time, and a lower herd immunity threshold. Finally, the risk of resurgence of an outbreak following a period of lockdown decreased. On the other hand, when contact-related heterogeneity was high, this also led to fewer cases in total during persistent outbreaks, but caused outbreaks to be more explosive in regard to other aspects (such as higher peaks which occurred earlier, and a higher herd immunity threshold). Finally, the risk of resurgence of an outbreak following a period of lockdown increased. We found that these effects were conserved when testing combinations of infectiousness-related and contact-related heterogeneity.
Superspreading events play an important role in the spread of several pathogens, such as SARS-CoV-2. While the basic reproduction number of the original Wuhan SARS-CoV-2 is estimated to be about 3 for Belgium, there is substantial inter-individual variation in the number of secondary cases each infected individual causes-with most infectious individuals generating no or only a few secondary cases, while about 20% of infectious individuals is responsible for 80% of new infections. Multiple factors contribute to the occurrence of superspreading events: heterogeneity in infectiousness, individual variations in susceptibility, differences in contact behavior, and the environment in which transmission takes place. While superspreading has been included in several infectious disease transmission models, research into the effects of different forms of superspreading on the spread of pathogens remains limited. To disentangle the effects of infectiousness-related heterogeneity on the one hand and contact-related heterogeneity on the other, we implemented both forms of superspreading in an individual-based model describing the transmission and spread of SARS-CoV-2 in a synthetic Belgian population. We considered its impact on viral spread as well as on epidemic resurgence after a period of social distancing. We found that the effects of superspreading driven by heterogeneity in infectiousness are different from the effects of superspreading driven by heterogeneity in contact behavior. On the one hand, a higher level of infectiousness-related heterogeneity results in a lower risk of an outbreak persisting following the introduction of one infected individual into the population. Outbreaks that did persist led to fewer total cases and were slower, with a lower peak which occurred at a later point in time, and a lower herd immunity threshold. Finally, the risk of resurgence of an outbreak following a period of lockdown decreased. On the other hand, when contact-related heterogeneity was high, this also led to fewer cases in total during persistent outbreaks, but caused outbreaks to be more explosive in regard to other aspects (such as higher peaks which occurred earlier, and a higher herd immunity threshold). Finally, the risk of resurgence of an outbreak following a period of lockdown increased. We found that these effects were conserved when testing combinations of infectiousness-related and contact-related heterogeneity.
As of December 2021, the SARS-CoV-2 pandemic has led to over 300 million confirmed cases and more than 5 million confirmed deaths worldwide [1]. Mathematical modeling has been instrumental in understanding transmission dynamics, as well as in evaluating the impact of both pharmaceutical and non-pharmaceutical interventions [2-9]. A large number of different models were developed to account for the multitude of factors that were found to be important for the spread and control of SARS-CoV-2, including age [10, 11], seasonality [12, 13], and superspreading [14-20].Through the analysis of contact tracing data and the reconstruction of transmission clusters, superspreading events have been shown to be a driving factor in the spread of several pathogens, among which are SARS-CoV-1, MERS, and, more recently, SARS-CoV-2 [21-27]. This means that the number of secondary cases caused by a single infectious individual is subject to substantial inter-individual variation. In other words, a small number of infected persons generates the majority of new infections, while most infected individuals cause only very few to no secondary cases.In 2005, Lloyd-Smith et al. [28] proposed a framework to study superspreading dynamics in which the expected number of secondary cases caused by an infected individual, i.e., the individual reproduction number, is represented by a random variable, following a distribution—Lloyd-Smith et al. use a Negative Binomial distribution—on the positive real axis [28, 29]. As such, superspreading events can be characterized as occurrences in the right-hand tail of the aforementioned distribution. When such a superspreading event occurs, different factors, such as for example heterogeneity in infectiousness or heterogeneity in contact behavior, are at play [30]. Increased infectiousness—defined here as the likelihood that a social contact between an infectious and a susceptible individual leads to transmission—either due to behavioral, physical or biological reasons, may play a role. For example, more virus particles are shed when talking loudly or singing [31]. Furthermore, given the importance of aerosol transmission for the spread of SARS-CoV-2, differences in immune response among individuals may explain why some persons experience a more virulent infection than others, thereby excreting more virus particles and producing more secondary cases [32-34]. Additionally, timing is also important, i.e., individuals infected with SARS-CoV-2 are most infectious during a short interval [35], a period which is not necessarily accompanied by COVID-19 symptoms and associated behavioural change [36].Some individuals have a higher number of contacts, and thus more opportunity to infect others, which is especially important during the short interval in which they are most infectious [37]. Some persons might not have a higher number of total contacts, but meet more susceptible individuals than others—such as for example in nursing homes [38]. Finally, the environment also plays an important role in the genesis of superspreading events: the risk of transmission is much higher in enclosed spaces than it is outside, and ventilation works well to limit transmission indoors [38, 39].In the research presented here, we focus on inter-individual heterogeneity in infectiousness and heterogeneity in number of contacts. However, the other sources of heterogeneity mentioned above—intra-individual temporal variation in infectiousness, heterogeneity in susceptibility, and the environments in which contacts take place—remain an important topic for future research regarding superspreading.Some transmission models for SARS-CoV-2 [14–20, 40], as well as for other pathogens like SARS-CoV-1 [41-43] and MERS [44], have taken superspreading into account. These approaches include compartmental models [15, 16], that divide the population in different and exclusive subpopulations based on realistic disease states, and branching process models [14, 28], as well as network and individual-based models [17–20, 40–44], in which each individual in the population is represented as a separate entity. Some models include the superspreading potential of individuals as a general factor [14, 17, 28], while others implement superspreading events as the result of heterogeneity in infectiousness [16], contact behavior [18, 19, 43], or a combination of both [15, 20, 41, 42, 44].However, while some of these models do demonstrate that both factors contribute to the occurrence of superspreading events [20], it is still poorly understood how exactly these different forms of superspreading impact the spread of disease and the effectiveness of control measures. To further investigate these questions in detail, we tested the effect of different forms of superspreading on the spread of SARS-CoV-2 in the Belgian population. To this end, we implemented both infectiousness-related and contact-related heterogeneity into STRIDE, an individual-based model for the transmission of SARS-CoV-2 [9, 45].In an individual-based model, each individual is represented by a separate entity with a unique set of characteristics, such as age, health status, and behavioral traits. As such, individual-based models are particularly suited to model superspreading, as they allow for a direct integration of different sources of heterogeneity at the individual level.We investigated the effects of superspreading caused by variation in infectiousness versus heterogeneity in contact behaviour, in the absence of intervention measures. Additionally, we looked at the impact of different modes of superspreading on the effectiveness of social distancing—a non-pharmaceutical intervention which continues to play an important role in the control of SARS-CoV-2.
Materials and methods
Implementation
We used STRIDE [45], an individual-based, stochastic model, which was recently adapted to encompass the disease-specific features related to the transmission of SARS-CoV-2 in the Belgian population [9]. In this model, individuals are assigned to social contact pools representing their household, a workplace or school (depending on the individual’s age and employment status), and more general communities representing an individual’s leisure contacts, contacts on public transportation and contacts in other locations. These latter communities consist of on average 500 persons and differ for each person between week- and weekend days.Simulations move forward in discrete time-steps of one day. Each simulation day, two processes take place. First, the health state of each individual is updated. Individuals can be in one of the following states: Susceptible, Exposed, Infectious, Infectious and Symptomatic, Symptomatic, and Recovered. We assume that individuals obtain immunity after recovery from the disease, and maintain this immunity for the duration of the simulation. For more information on our implementation of the natural history of SARS-CoV-2, see S1 Text.Secondly, individuals can access the pools of which they are a member, depending on the day of the week and their health status—e.g., children do not visit their school pool during the weekend, and symptomatic individuals primarily remain within their household pool. The number of contacts an individual makes each day in each of the above locations is based on their age and the type of location (household, school, workplace, or community). More specifically, daily contact rates are based on a social contact survey conducted in Belgium in 2010–2011 [46]. When a contact occurs between an infectious individual and a susceptible individual, a ‘transmission probability’ determines whether a transmission event occurs. This transmission probability depends solely on the infectious individual in our model, and not on the location in which the contact takes place.We adapted the model to represent both infectiousness-related and contact-related heterogeneity. Both are implemented on the individual level: an individual will have a tendency to be more or less infectious, or have more or less daily contacts over the course of the entire simulation. Infectiousness-related heterogeneity was implemented as follows. When a person becomes infected, an ‘individual transmission probability’ is assigned to this person. This probability determines whether an actual transmission event takes place whenever a contact occurs between the infected individual and a susceptible individual. We assume that this probability remains the same for the entire duration of an individual’s infectious period—one of the limitations we elaborate on in the discussion.Following the work of Lloyd-Smith et al. [28], we chose to represent inter-individual variation by using a Gamma distribution. As such, each individual transmission probability is drawn from a (right-)truncated Gamma distribution on the interval (0, 1]. The truncated distribution is characterized by a shape parameter α, which determines the level of overdispersion of the distribution, and a mean—hereafter referred to as the mean transmission probability. A lower value of α entails more variation in the individual transmission probability, and thus a higher level of superspreading, as shown in Fig 1A.
Fig 1
Probability density functions of the Gamma distributions considered for the individual transmission probability (panel a) and the individual contact factor (panel b).
To account for contact-related heterogeneity, we multiply an individual’s contact rate in community and workplace pools by a factor, which is unique for each individual. This factor is drawn from a Gamma distribution with shape parameter α and mean 1, upon creation of the individual. Again, a lower value of α implies more variation in contact rates and thus a higher level of superspreading, which is shown in Fig 1B.The code of STRIDE is open-source and available at https://github.com/elisekaa/stride. All code and data used for the research in this paper has been assigned a DOI using Zenodo: 10.5281/zenodo.6669350.
Simulations
For an exhaustive overview of the population and parameters used in the simulations described below, we refer to S1 Text and earlier work conducted on SARS-CoV-2 transmission using the STRIDE simulator [9]. As there is no standard procedure to estimate the number of realisations necessary for the type of simulations we conduct, we chose to run 200 stochastic simulations for each scenario described below, which led to stable results.
Verification
We verified our implementation of superspreading in STRIDE in a number of ways. First, we confirmed that the mean R0 remained stable over the different scenarios regarding infectiousness-related and contacts-related heterogeneity, to ensure that the differences we observed were indeed due to variations in infectiousness- and contact-related heterogeneity. We found that the mean R0 was indeed consistent between the different scenarios, although the variance did differ—which we expected. This is shown in Fig A in S2 Text.We also checked that the baseline case—in which both the individual transmission probability and the individual contact factor are constant—corresponds to a scenario in which either the individual transmission probability or the individual contact probability follows a Gamma distribution with respectively α = 10 and α = 10. This should be the case, since, as the shape parameter α approaches infinity, the dispersion of the distribution decreases, and the distribution becomes more centered around the mean.Furthermore, we calculated P80: the minimal proportion of infected individuals that is responsible for 80% of cases—a measure commonly used in the superspreading literature [22, 28, 29]. We calculated P80 for each simulation by ordering individuals that were infectious during the simulation according to the number of secondary cases they caused, in decreasing order. Next, we compute the proportion of individuals that are responsible for 80% of the total number of infections caused over the entire simulation run. We used this measure to verify that as we decreased either α or α, this resulted in more heterogeneity in transmission—leading in turn to a lower P80. We observed that this was the case, but the effect was stronger with the same values for α compared to α. This is shown in Fig B in S2 Text.Finally, we constructed a theoretical description for the transmission process in STRIDE, which allowed us to calculate the expected number of secondary cases per infected individual, as well as the variance of this quantity. We then compared the results of this theoretical description with simulation results for different forms and levels of superspreading. We observed that the results of our analytical calculations were largely in agreement with the results obtained through simulations, which is shown in Fig C–N in S2 Text.For more details on the way in which these checks were conducted, and their results, see S2 Text.
Superspreading effects in the absence of interventions
First, we investigated the effect of different modes of superspreading on the unmitigated spread of SARS-CoV-2. To do this, we introduced one infected individual in a completely susceptible population at the beginning of a simulation and tracked transmission events over a period of 200 days. Typically, after this period, no more new infections were recorded for all considered scenarios, so we assumed we had observed most, if not all, of the epidemic curve. This is apparent from the plots depicting the evolution of the number of new cases and the cumulative number of cases per day, as shown in Fig B–E in S1 File.To compare the effect of infectiousness-related heterogeneity to the effect of contact-related heterogeneity, we varied the distribution of the individual transmission probability and the individual contact factor as described in Table 1. The values for α and α were chosen as follows: the value 10 represents a distribution in line with the baseline scenario (i.e., a distribution which approaches a degenerate one), with low levels of heterogeneity for either the individual transmission probability or the individual contact factor, a value of 1 was chosen as an intermediate scenario, and values 0.6, 0.4 and 0.2 were chosen to represent a high level of heterogeneity, in line with what we would expect to see in the transmission of SARS-CoV-2 [29]. The mean of the truncated Gamma distribution considered for the individual transmission probability (i.e., 0.08) was chosen so that it corresponds to an R0 value of about 3 in the baseline scenario. This is in line with initial estimates for R0 for SARS-CoV-2 [3, 9, 47, 48].
Table 1
Overview of scenarios.
Scenario
Individual transmission probability
Individual contact factor
Baseline
0.08
1.00
A
Trunc. Gamma (αi = 10.0, mean = 0.08)
1.00
B
Trunc. Gamma (αi = 1.0, mean = 0.08)
1.00
C
Trunc. Gamma (αi = 0.6, mean = 0.08)
1.00
D
Trunc. Gamma (αi = 0.4, mean = 0.08)
1.00
E
Trunc. Gamma (αi = 0.2, mean = 0.08)
1.00
F
0.08
Gamma (αc = 10.0, mean = 1.0)
G
0.08
Gamma (αc = 1.0, mean = 1.0)
H
0.08
Gamma (αc = 0.6, mean = 1.0)
I
0.08
Gamma (αc = 0.4, mean = 1.0)
J
0.08
Gamma (αc = 0.2, mean = 1.0)
Overview of scenarios based on the distribution considered for the individual transmission probability and the individual contact factor.
Overview of scenarios based on the distribution considered for the individual transmission probability and the individual contact factor.We then compared epidemiological metrics between the different scenarios. First, we looked at the probability of extinction, calculated as the fraction of simulation runs that produce no more new cases after only a few (or no) initial secondary infections [49]. We define what is regarded as extinction by looking at the final outbreak sizes after 200 simulation days, resulting from all simulations over all scenarios. After ordering these in descending order, a sharp drop-off can be observed, separating runs with persistent outbreaks from runs in which extinction occurs. As we found that outbreaks starting from one initial case either remain below 15 cases, or grow much larger, we set the threshold below which we will consider an outbreak to have gone extinct at 20 cases. In Fig A in S1 File, a histogram is shown depicting the frequency of final outbreak sizes per scenario.We also compared the attack rate of outbreaks, and looked at the peak size and the timing of the peak. Furthermore, we investigated the evolution of the daily effective reproduction number R. We approximated R by calculating the mean number of secondary cases caused by individuals that contracted infection on day t. However, since at the end of a simulation only a few infected individuals remain, thereby making inference about R prone to substantial oscillations in daily estimates, we applied a LOWESS smoothing approach for the time-varying R-values [50].To further gauge the effect of different forms of superspreading on epidemic spread, we estimated the herd immunity threshold, the day this threshold is reached, and the day on which the last infection is recorded. In the absence of a standard procedure to define the herd immunity threshold in an individual-based model, we estimated this measure as follows. We looked at the proportion of individuals that were no longer susceptible (i.e., recovered or currently infected) on the last day for which the smoothed R ≥ 1.Finally, we kept track of the type of location (household, school, workplace or community) in which infections occurred.
Superspreading effects in the presence of social distancing
We considered a scenario to investigate the impact of different forms of superspreading on the effectiveness of a period with strong social distancing followed by a period of mild relaxations.The scenario was implemented as follows: after the introduction of a single infected individual in the population we simulated 30 days without interventions, after which a ‘lockdown’ period began, in which schools were closed (primary, secondary, and tertiary education), contacts at the workplace were reduced by 94.51% and contacts in the communities were reduced by 88.74%. These contact reductions were inferred based on social contact data collected for Belgium from April to mid-May 2020 in the CoMix study [51].After 60 days of lockdown (i.e. on day 90 of the simulation), a partial release of the lockdown followed. Schools were re-opened and contacts in the workplace increased to 25.09% of pre-pandemic levels, while contacts in the community increased to 28.55% of pre-pandemic levels. Again, these contact reductions were estimated based on data collected in the CoMix study, from mid-May to August 2020. More information on how these and the above estimates were obtained can be found in S1 Text.We tested the same distributions for the individual contact probability and for the individual contact factor as listed in Table 1. We ran each simulation for 600 days (30 days pre-lockdown, 60 days lockdown and 510 days partial release), after which we typically did not observe any new infections. This can be seen in the plots in Fig N–Q in S1 File, showing the number of new cases and the number of cumulative cases per day.We compared the number of cases before, during, and after the lockdown between the different scenarios, as well as the attack rate over the entire simulation. We also looked at the evolution of the number of new cases, the cumulative number of cases and the effective R per day. Additionally, we calculated a ‘resurgence probability’, to represent the chance that a lockdown followed by a partial release would not be effective in stopping the epidemic in a particular scenario. This resurgence probability was calculated as follows. As the effectiveness of a lockdown can only be gauged when the epidemic is still ongoing, and we did not consider the importation of new cases, we only took into account those runs in which new cases were still being observed when the lockdown period started. As such, we excluded simulation runs in which 0 cases were recorded during the entire lockdown period. Then, to define resurgence, we looked at the distribution of the number of cases during the period of partial release.For all scenarios, we observed that either at most 309 cases occurred during the period of partial release, or a much larger number. As such we set the ‘resurgence threshold’ at 500 cases. In Fig L in S1 File, a histogram is shown depicting the frequency of numbers of cases during the release period per scenario.
Sensitivity analysis
We conducted a sensitivity analysis regarding the mean transmission probability and the number of infected cases introduced at the beginning of the simulation. More information on this can be found in S3 Text.To get an idea of the combined effect of infectiousness-related and contact-related heterogeneity on disease spread, we also ran simulations for combinations of α and α. We constructed a grid using Latin Hypercube Sampling, and ran 50 simulations for each combination of α and α in this grid. We describe this analysis in more detail in S4 Text.
Results
Superspreading effects in the absence of interventions
We investigated the spread of SARS-CoV-2 throughout the Belgian population, following the introduction of one infected individual. First, we looked at the probability of extinction. Using the extinction threshold of 20 cases, we calculated, for each scenario, the proportion of simulation runs that lead to extinction. We observed that as infectiousness-related heterogeneity increases, the extinction probability also increases (see Fig 2A). In the baseline scenario, where the transmission probability is the same for all individuals, extinction occurs in 12.5% of all simulation runs. As α decreases to 0.2, however, we see that as much as 72% of the simulation runs produce less than 20 cases.
Fig 2
Extinction probabilities for scenarios investigating infectiousness-related heterogeneity (in green, panel a) and contact-related heterogeneity (in blue, panel b), defined as the proportion of simulation runs (n = 200) that produces less than 20 cases.
Extinction probabilities for scenarios investigating infectiousness-related heterogeneity (in green, panel a) and contact-related heterogeneity (in blue, panel b), defined as the proportion of simulation runs (n = 200) that produces less than 20 cases.
Error bars represent 95% (Clopper-Pearson) confidence intervals.A different trend can be observed when varying contact-related heterogeneity (see Fig 2B). Here, varying α has very little effect on the fraction of runs that lead to extinction.We also looked at the attack rate of outbreaks. In the baseline scenario, around 92.5% of the population is infected 200 days after the introduction of an infected individual in the population, whenever extinction did not occur. When increasing infectiousness-related heterogeneity as well as when increasing contact-related heterogeneity, the final size of outbreaks lowers. However, this decrease is much more pronounced in the case of contact heterogeneity. This is shown in Fig 3.
Fig 3
Violin plots for the attack rate over 200 days for scenarios investigating the infectiousness-related heterogeneity (in green, panel a) and contact-related heterogeneity (in blue, panel b).
The orange dots represent the mean attack rate across the simulation runs without extinction, i.e., simulation runs in which extinction occurs (< 20 cases) were excluded.
Violin plots for the attack rate over 200 days for scenarios investigating the infectiousness-related heterogeneity (in green, panel a) and contact-related heterogeneity (in blue, panel b).
The orange dots represent the mean attack rate across the simulation runs without extinction, i.e., simulation runs in which extinction occurs (< 20 cases) were excluded.When looking at the size and timing of the peak of the epidemic curve of outbreaks when no extinction occurs (see Fig 4), opposite trends can be observed for increasing heterogeneity in infectiousness and contacts, respectively. When infectiousness-related heterogeneity increases, the mean size of the peak decreases, while no effect is seen on the time when the peak occurs. However, when contact-related heterogeneity increases, the mean peak size becomes higher and also occurs earlier. Furthermore, contact-related heterogeneity leads to more variability in peak size.
Fig 4
Violin plots for the size and day of peak across the different simulation runs for scenarios investigating the infectiousness-related heterogeneity (in green, panels a and c, respectively) and contact-related heterogeneity (in blue, panels b and d, respectively).
The orange dots represent the mean of the simulated peak sizes and days at which the peak is reached. Simulation runs in which extinction occurs (< 20 cases) were excluded.
Violin plots for the size and day of peak across the different simulation runs for scenarios investigating the infectiousness-related heterogeneity (in green, panels a and c, respectively) and contact-related heterogeneity (in blue, panels b and d, respectively).
The orange dots represent the mean of the simulated peak sizes and days at which the peak is reached. Simulation runs in which extinction occurs (< 20 cases) were excluded.These trends can also be observed when looking at the evolution of the number of new cases, the number of cumulative cases and the smoothed daily R values. The number of new cases per day, the cumulative number of cases per day, and the smoothed Rt per day for the different scenarios is shown in Fig B–G in S1 File.We estimated the impact of different modes of superspreading on the herd immunity threshold (see Fig 5). We observed that when infectiousness-related heterogeneity increases as α decreases from 10 to 0.2, the average herd immunity threshold decreases from 58.97% to 32.66%. This strong decline is in part due to the increase in the number of runs in which extinction occurs, but is still present when these runs are excluded, as can be seen in Fig 5C.
Fig 5
Violin plots of the estimated herd immunity threshold values over the different simulation runs including or excluding runs with extinction for scenarios investigating the infectiousness-related heterogeneity (in green, panels a and c, respectively) and contact-related heterogeneity (in blue, panels b and d, respectively).
The orange dots represent the mean of the simulated values.
Violin plots of the estimated herd immunity threshold values over the different simulation runs including or excluding runs with extinction for scenarios investigating the infectiousness-related heterogeneity (in green, panels a and c, respectively) and contact-related heterogeneity (in blue, panels b and d, respectively).
The orange dots represent the mean of the simulated values.However, when contact-related heterogeneity increases, an opposite trend can be observed: the herd immunity threshold increases to 64.28% when α decreases to 0.2.We also looked at the day on which the herd immunity threshold is reached (see Fig H in S1 File), and the day on which the last transmission event is observed (see Fig I in S1 File). We observed that, with higher infectiousness-related heterogeneity, the epidemic slows down, as the herd immunity threshold is reached later—even though it is lower—and the last infections are observed at a later time compared to the baseline scenario. Conversely, with higher contact-related heterogeneity, the herd immunity threshold is reached faster and infections stop occurring at an earlier point in the simulation.Finally, we looked at the type of locations in which transmissions occurred (see Fig J–K in S1 File). We found that neither increasing heterogeneity in infectiousness nor increasing heterogeneity in contact behavior changed the locations where most infections took place, which for all scenarios were communities and households.We also investigated the impact of different combinations of α and α. We found that the effects we observed when testing different values for α and α separately were largely conserved when we tested combinations of α and α, which is shown in Fig B in S4 Text. The extinction probability is still dominated by α (increasing for lower values of α), while there was no discernible effect of α. Although the attack rate decreased for both lower values of α and α, the impact of α on the attack rate was much larger. The effects of α and α on the size and timing of the peak, the herd immunity threshold and the day on which the last transmission event was observed remained opposite. However, α had a bigger impact on the size of the peak and the herd immunity threshold, while α dominated the timing of the peak and the day of the last transmission event.
Superspreading effects in the presence of social distancing
We investigated the effect of different forms of superspreading on the outcome of a social distancing scenario. We compared the number of cases before lockdown (day 0—day 30, see Fig 6A–6B), during lockdown (day 30—day 90, see Fig 6C–6D) and during the period of partial release (day 90—day 600, see Fig 6E–6F) between the different scenarios.
Fig 6
Violin plots for the number of cases before lockdown (day 0–day 30), during lockdown (day 30–day 90) and during partial release phase (day 90–day 600) for the different scenarios investigating infectiousness-related heterogeneity (in green, panels a, c, and e, respectively) and contact-related heterogeneity (in blue, panels b, d, and f, respectively).
The orange dots represent the mean of the simulated values.
Violin plots for the number of cases before lockdown (day 0–day 30), during lockdown (day 30–day 90) and during partial release phase (day 90–day 600) for the different scenarios investigating infectiousness-related heterogeneity (in green, panels a, c, and e, respectively) and contact-related heterogeneity (in blue, panels b, d, and f, respectively).
The orange dots represent the mean of the simulated values.The number of cases observed before lockdown confirmed what we observed when examining unmitigated transmission: when contact-related heterogeneity is high, the start of outbreaks is more explosive. On the contrary, when infectiousness-related heterogeneity is high, many outbreaks stop after only a few cases. During lockdown, we also observe more cases when there is higher contact-related heterogeneity. However, as infectiousness-related heterogeneity increases, the average number of cases observed during lockdown slightly decreases. Finally, the number of cases during the partial release phase decreases sharply as infectiousness-related heterogeneity increases, while changes in contact-related heterogeneity seem to have little impact on the number of cases during this period. The same trends can be observed when looking at the attack rate over the entire simulation period (see Fig M in S1 File).Furthermore, we looked at the evolution of the number of new cases per day, the cumulative number of cases and the smoothed daily R values (see Fig N–S in S1 File). We observed that with higher infectiousness-related heterogeneity, outbreaks were slower to fade out during the period of partial release. When contact-related heterogeneity was high outbreaks were more explosive and took off again faster once social distancing measures were relaxed. However, what is remarkable here as well is that, as contact-related heterogeneity increases, distinct waves can be observed during the period of partial relaxations (see Fig M and Q in S1 File).We calculated the resurgence probability for different forms and levels of superspreading (see Fig 7). As infectiousness-related heterogeneity increases, the resurgence probability decreases, meaning that it is less likely that the epidemic grows large in size again when relaxing measures after a period of lockdown. However, resurgence probabilities increase slightly as contact-related heterogeneity increases.
Fig 7
The proportion of simulation runs, in which the number of cases during lockdown is greater than 0, that produces more than 500 cases during the partial release phase for scenarios investigating infectiousness-related heterogeneity (in green, panel a) and contact-related heterogeneity (in blue, panel b).
The proportion of simulation runs, in which the number of cases during lockdown is greater than 0, that produces more than 500 cases during the partial release phase for scenarios investigating infectiousness-related heterogeneity (in green, panel a) and contact-related heterogeneity (in blue, panel b).
Error bars represent 95% (Clopper-Pearson) confidence intervals.We assessed how the herd immunity threshold changes when social distancing is applied (see Fig 8). For the baseline case, the estimated herd immunity threshold lowered from 56.89% without interventions to 18.18% on average in the social distancing scenario. When increasing infectiousness-related heterogeneity, the herd immunity threshold decreased even further: to 2.28% when α = 0.2. However, this is not the case when contact-related heterogeneity is high. Even though the average herd immunity threshold initially lowers from 16.94% to 13.89% when α decreases from 10 to 0.6, it again increases to 22.43% when α is further decreased to 0.2.
Fig 8
Violin plots for the herd immunity threshold as estimated for the social distancing scenario, when investigating infectiousness-related heterogeneity (in green, panel a) and contact-related heterogeneity (in blue, panel b).
The orange dots represent the mean of the simulated values.
Violin plots for the herd immunity threshold as estimated for the social distancing scenario, when investigating infectiousness-related heterogeneity (in green, panel a) and contact-related heterogeneity (in blue, panel b).
The orange dots represent the mean of the simulated values.Finally, we looked at the locations in which transmissions occurred (see Fig T–U in S1 File). We found that social distancing had little effect on where transmission happened: even though there were less transmissions overall, most transmissions still occurred in households and communities for all scenarios.
Discussion
Conclusion
We investigated different forms of superspreading in an individual-based model by considering both infectiousness-related heterogeneity and contact-related heterogeneity. We found that these two types of superspreading have very different effects, both on the unmitigated spread of SARS-CoV-2 as well as on the effectiveness of social distancing measures. In the absence of containment measures, we observed that with high infectiousness-related heterogeneity, the introduction of an infected individual in the population led to large outbreaks less frequently. Furthermore, peak sizes were smaller and occurred at a later time. The estimated herd immunity threshold also decreased. On the other hand, when contact-related heterogeneity is high, we observed that the introduction of an infected individual led to outbreaks slightly more often compared to the baseline scenario in which the mean transmission probability is 0.08 for all individuals and the individual contact factor is set to 1. Outbreaks that did not go extinct after only a few secondary cases were also more explosive, with higher peaks that occurred earlier. The herd immunity threshold also increased slightly.The difference between the effects caused by these two forms of superspreading might be explained by the fact that when infectiousness-based heterogeneity is high, superspreaders have to infect other superspreaders to keep the epidemic going—leading to more extinctions and a drawn-out, but less explosive, outbreak whenever extinction does not occur. Furthermore, not all infected individuals with a very high individual transmission probability will have a lot of contacts and thus the opportunity to realize their ‘transmission potential’. However, when a person has a large number of contacts, they not only have more opportunity to infect others once they are infected, but they also have a disproportionally high chance to be exposed to infection themselves [37], leading to faster, more explosive outbreaks.We also observed that the total attack rate of outbreaks decreased much faster when contact-related heterogeneity increased than when infectiousness-related heterogeneity increased. This seems to be in contrast with the results described above, but can be explained due to a faster depletion of superspreaders (and susceptible individuals in general). This is enhanced by the fact that STRIDE uses a structured population, in which individuals are limited in the contacts they can make by the constraints of the contact pools (household, school, workplace, communities) they belong to. As such, when a large fraction of the persons in their contact pools has already been infected, an infected individual cannot cause a large number of secondary cases, no matter how infectious they are, or how many contacts they have [52]. Furthermore, as contact-related heterogeneity increases, more individuals will have no or very few contacts, making them ‘unreachable’, thereby decreasing the maximum number of cases that can be generated during an outbreak.We should also note here that the same α and α parameters lead to different levels of overdispersion in transmissions (as shown in Fig A–B in S2 Text)—which might explain some of the differences between both forms of superspreading.Other models [16, 42, 44] that implement both heterogeneity in infectiousness and heterogeneity in contacts also conclude that superspreading driven by contact-related heterogeneity leads to more explosive outbreaks. These models (and others that only consider infectiousness-related heterogeneity), however, do not capture the slowdown of outbreaks due to infectiousness-related heterogeneity, since they do not implement a structured population, and hence every highly infectious person is able to realize their ‘transmission potential’.In this respect, some interesting parallels to research on network models can be drawn. A recent review considers the difference between ‘out-degree’ and ‘in-degree’ in partially directed networks [53]. Extinction probability is a function of out-degree, while epidemic size is a function of in-degree. This might explain why infectiousness-related heterogeneity has more impact on extinction while contact-related heterogeneity has more impact on attack rate: infectiousness-related heterogeneity mainly affects ‘out-degree’, while contact-related heterogeneity has more impact on ‘in-degree’. An important difference however, is that the number of secondary cases an infected individual can cause in STRIDE is always limited by the contact pools that individual belongs to, no matter how large the total population is.It is likely that in the current SARS-CoV-2 pandemic, a combination of both heterogeneity in infectiousness and heterogeneity in contacts—in addition to a multitude of other factors—has facilitated the occurrence of superspreading events [15, 20]. For example, for the original Wuhan SARS-CoV-2, P80 was estimated to be about 0.1 [22]. Based on the results depicted in Fig A1 in S4 Text, we can then estimate α to be between 0.25 and 0.25, depending on the value for α, which can be estimated from contact surveys. As circumstances change and new variants appear, the relative contribution of different forms of superspreading presumably also changes [54]. Calibrating our model of superspreading to the different waves and variants of SARS-CoV-2 would thus be a useful extension of our present work.We also investigated the impact of different forms of superspreading on the effectiveness of social distancing. We found that, when superspreading is driven by heterogeneity in infectiousness, a period of strict social distancing, followed by a partial release is most effective and extinguishes almost all outbreaks. This is in line with the conclusion of other models that also assume superspreading events are driven by heterogeneity in infectiousness [18, 19].However, when superspreading is driven by heterogeneity in contact behavior, we found that while social distancing measures might limit cases during a strict lockdown, the chance of resurgence of the epidemic following relaxations increases. Furthermore, with high infectiousness-related heterogeneity, few outbreaks grow large in size after a period of lockdown, but outbreaks that do not go extinct linger for a long time before completely disappearing.
Limitations
Some limitations need to be taken into account when interpreting these results.Each individual that is infected over the course of the simulation, eventually recovers and gains immunity for the remainder of the simulation. As such, neither deaths nor re-infections occur. We also did not model the waning of immunity, nor vaccination, making these results more representative of the initial wave of the SARS-CoV-2 pandemic. Additionally, we only introduce infected individuals at the beginning of the simulation, and do not take into account the continuous importation of infectious individuals into the population. Furthermore, we focus on the initial course of the infections, in which individuals can transmit the disease. We did not model hospitalizations with case isolation, since we assume the transmission dynamics are mostly shaped by the pre-symptomatic and mild symptomatic stages and individuals exhibit adapted social contact patterns once they become symptomatic. To fully capture the effect of superspreading on the spread of SARS-CoV-2, these factors would, however, need to be taken into account.To represent heterogeneity in infectiousness and contact behavior, we used a right-truncated and untruncated Gamma distribution respectively. However, other distributions could be used to obtain similar levels of heterogeneity, which we did not test in the current study [29].Finally, we only modeled heterogeneity caused by differences between individuals: heterogeneity in infectiousness and heterogeneity in number of daily contacts. However, several other factors might contribute to the occurrence of superspreading events. These can be attributable to differences between individuals as well, such as heterogeneity in susceptibility [55] or infectiousness that varies over time [35], or they can be attributable to differences in the environment, such as the differing spreading potential between indoor and outdoor venues or superspreading facilitated by mass events [25, 56, 57]. Although these factors may be important in the occurrence of superspreading events, we did not model them in our current study. We did however model two different community pools (for weekdays and weekend days, respectively), where individuals could potentially contact a larger number (up to 500) of people, which could serve as a proxy for attending a large event. However, large events also have a temporal aspect that is not modeled through community contact pools. As such, explicitly modeling mass gatherings might lead to new insights regarding the effect of different types of superspreading on disease spread and on the effectiveness of interventions. Furthermore, our implementation of contact heterogeneity implicitly constitutes a proxy for heterogeneity in contact environment as well.
Future work
Several extensions of this work could be useful in furthering our understanding of superspreading both within the context of the current COVID-19 pandemic and within a broader context.Further research is needed to disentangle effects of both forms of superspreading. As such, it would be possible to estimate the level of both infectiousness-related and contact-related heterogeneity at play in the current SARS-CoV-2 pandemic. We plan to use data collected during the CoMix study [51] to estimate the level of contact-related heterogeneity during different phases of the SARS-CoV-2 pandemic. Subsequently, assuming that the two forms of heterogeneity investigated in this paper are most important for the occurrence of superspreading events, it is possible to estimate the level of infectiousness-related heterogeneity from the overall observed heterogeneity in transmissions. Further insight could also be gained by comparing the importance of these two types of superspreading for the spread of different pathogens, as contact-related heterogeneity presumably remains largely the same for a population under the spread of different pathogens.We implemented heterogeneity in contact behavior by applying a distribution to the daily contact rate of individuals. There is however, in the context of social distancing, another possible type of contact heterogeneity, namely non-compliance to social distancing measures. It would be relevant to investigate how different forms of superspreading modulate the negative effect of this phenomenon on the effectiveness of social distancing measures.Furthermore, while we investigated the impact of superspreading on the effectiveness of social distancing, it is conceivable that superspreading also has an impact on other interventions, both pharmaceutical and non-pharmaceutical, such as vaccination, contact tracing and universal testing [14, 58].Finally, we did not take into account person characteristics that might make an individual more or less likely to be a superspreader, due to the lack of data about this subject. Instead, the individual transmission probability and individual contact factor were drawn at random from a distribution. However, some characteristics, such as age, profession, or perception of the severity of COVID-19, might have an impact on how likely an individual is to transmit the disease [17]. Taking such characteristics into account when distributing transmission potential in the population would provide a more accurate model of superspreading.
Population and parameters.
(PDF)Click here for additional data file.
Verification.
(PDF)Click here for additional data file.
Sensitivity analysis.
(PDF)Click here for additional data file.
Varying α and α together.
(PDF)Click here for additional data file.
Supplementary figures.
(PDF)Click here for additional data file.20 Apr 2022Dear Mrs Kuylen,Thank you very much for submitting your manuscript "Different forms of superspreading lead to different outcomes: heterogeneity in infectiousness and contact behavior relevant for the case of SARS-CoV-2" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. The reviewers appreciated the attention to an important topic. Based on the reviews, we are likely to accept this manuscript for publication, providing that you modify the manuscript according to the review recommendations.Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email.When you are ready to resubmit, please upload the following:[1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).Important additional instructions are given below your reviewer comments.Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.Sincerely,Miles P. Davenport, MB BS, D.PhilAssociate EditorPLOS Computational BiologyTom BrittonDeputy EditorPLOS Computational Biology***********************A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately:[LINK]Reviewer's Responses to QuestionsComments to the Authors:Please note here if the review is uploaded as an attachment.Reviewer #1: Thank you for the opportunity to review this manuscript. In this study the authors applied an agent-based model for SARS-CoV-2 transmission, with two distinct mechanisms for superspreading implemented in the model: heterogeneity in the infectiousness of individuals (implemented by drawing the infectiousness of an infected person from a Gamma distribution); and heterogeneity in contact rates (implemented by drawing a multiplicative contact parameter for each infectious person again from a Gamma distribution). The authors compared the impact of allowing for each type of superspreading event in two types of transmission scenarios (one with unconstrained social mixing, and one with restrictions gradually being lifted). They found that each of these types of superspreading produces different effects when considering the probability of importation generating an epidemic; the size of the resulting peak; the overall attack rate; timing of the peak; and the herd immunity threshold. I thought that the authors reflection was interesting that the reason contact-based heterogeneity causes a larger peak and reduced chance of extinction is due to more initial opportunities for the epidemic to be seeded and agree that this seems to be the case.I found this to be a well-written study, that was clearly explained, with well-presented results. I think it is a useful contribution to the literature on models that account for superspreading, and some of those points identified in the limitations would be interesting to see explored in future work. I have only minor comments and requests for clarification.1. Abstract. “On the one hand, a higher level of infectiousness-related heterogeneity results in less outbreaks occurring following the introduction of one infected individual.”Just to be very specific here – by fewer outbreaks, do you mean that the risk of an outbreak occurring following introduction of an infected individual into a susceptible population was lower? (From reading the methods I now see that this is the case but worth being clear in the Abstract).2. How did you confirm that the overall level of transmission within each of the two superspreading scenarios is the same, when you were adjusting the infectiousness parameter versus the contact rate parameter (i.e. to ensure that you are comparing like-with-like)?3. It took me a little while to identify in your paper what you assumed about acquisition and waning of immunity (as my immediate response to the results was to ponder the connection with immunity). I see in the first paragraph of your limitations section you do fully describe your assumptions but I think this should be made clearer earlier in your manuscript.Reviewer #2: Kuylen et al. have used a preexisting COVID-19 simulation package to study 2 different "forms" of superspreading and determine the epidemiological consequences of the overdispersion in both factors in the population. My summary of the present results is: In a theoretical scenario with baseline contact related heterogeneity (CrH) but increased infectiousness-related heterogeneity (IrH), a simulation initiated by a single infected individual had a higher extinction probability, a delayed and decreased peak of infections, and a lower herd immunity threshold. In a parallel scenario with baseline IrH but increased CrH, peaks were earlier and higher, and herd immunity thresholds were higher. Resurgence after a lockdown increased with CrH but decreased with IrH.The manuscript is very well written, and I found the results intuitive and the conclusions mostly justified by the results. I personally think the authors should do a bit more to take this work to a really high level -- it is already good but just misses out on some exciting opportunities and I don't think should be too much extra work.My major comments are:1) Most importantly, the conclusions could be strengthened and made much more generalizable by using a global sensitivity analysis. I think the current version teaches us how each mechanism works independently, but I think you really miss out on the opportunity to comment on what happens if both change in a scenario. If you can run some grid beyond Table 1 and try to see if these effects can cancel out (ie are not identifiable if like IrH goes up but CrH goes down), or if they can amplify in any surprising ways, this would really bring it to the next level. I know the means are varied in the supplement, but I think the heterogeneities and means should all be simultaneously varied and a typical global approach applied (see Sobol/Morris or more simple LHS protocols for example).2) I thought there were some nice results in the supplement including agreement between simulations and analytical calculations. I thought maybe this is worth noting in the main body so that others could use your expressions3)The definitions of infectiousness-related and contact-related still kind of miss the point you make about contact situation. Perhaps you should frame this as per-contact transmission probability heterogeneity, and allow this to contain the notion of more indoor gatherings?4) I know you are trying to avoid "disentangling" these two effects, but I really think this misses an opportunity to tell a more applicable message. To me you have data in here that already help disentangle, for example: you could speculate on what you think happened between Alpha and Beta vs Omicron waves based on case data. Did the virus get more heterogeneous in terms of infection probability? or were people just tired of restritctions and more superspreading heterogeneity occurred? I think the case curves look pretty different, omicron was a lot sharper both up and down in a lot of places.5) The figures were kind of unusual with the a) and b) legends above the actual legends. I think it could be a lot clearer (different colors or something) that left panels are always varying IrH and right are always varying CrH too6) I didn't write much on text because in general it was clear and excellent, however, the abstract could be much stronger, see my comments below:Abstract:Superspreading events play an important role in the spread of SARS-CoV-2 and several other pathogens.-Ok, but you don't really mention the other pathogens for the rest of the paperHence, while the basic reproduction number of the original Wuhan SARS-CoV-2 is estimated to be about 3 for Belgium, there is substantial inter-individual variation in the number of secondary cases each infected individual causes.-Ok, this 3 seems oddly specific given that you don't talk about the magnitude of variationMultiple factors contribute to the occurrence of superspreading events: heterogeneity in infectiousness and susceptibility, variations in contact behavior, and the environment in which transmission takes place.-Why bring up 3 if you only study 2?While superspreading has been included in several infectious disease transmission models, our understanding of the effect that these different forms of superspreading have on the spread of pathogens and the effectiveness of control measures remains limited.-What do you mean? Need to be more specific about what you think is missing, why do we need to know more and what precisely is not known?To disentangle the effects of infectiousness-related heterogeneity on the one hand and contact-related heterogeneity on the other, we implemented both forms of superspreading in an individual-based model describing the transmission and spread of SARS-CoV-2 in the Belgian population.-"the Belgian population" is not definedWe considered its impact on viral spread as well as on the effectiveness of social distancing.-I'm not sure you really studied superspreading vs social distancing, more like resurgence probabilities after social distancing restrictions are lifted. If you wanted to talk about effectiveness of social distancing, you might want to study how often you see case loads of X level despite social distancing.We found that the effects of superspreading driven by heterogeneity in infectiousness are very different from the effects of superspreading driven by heterogeneity in contact behavior.-"very" is not specific enough, since you don't present absolute numbers (which I think you should), you should at least present percent increases or somethingOn the one hand, a higher level of infectiousness-related heterogeneity results in less outbreaks occurring following the introduction of one infected individual. Outbreaks were also slower, with a lower peak which occurred at a later point in time, and a lower herd immunity threshold. Finally, the risk of resurgence of an outbreak following a period of lockdown decreased. On the other hand, when contact-related heterogeneity was high, this also led to smaller final sizes, but caused outbreaks to be more explosive in regard to other aspects (such as higher peaks which occurred earlier, and a higher herd immunity threshold). Finally, the risk of resurgence of an outbreak following a period of lockdown increased.-fix less vs fewer-is "outbreaks" really right? I'm not an epi specialist, but I think this word is kind of dangerous-"also led to smaller final sizes" do you mean peaks? or is this prevalence? was prevalence lower after a simulatino with increased IrHDetermining the contribution of both source of heterogeneity is therefore important but left to be explored further.-Like I said above, this just doesn't feel right, I think you can do a bit of this here instead of saving for later.-Whatever you decide, this should not be the last sentence in your abstract, end on something strong about the utility of your study (which is really nice!) rather than a lame concession of its limitationsReviewer #3: I’m a theorist and I have never used STRIDE, so I’m trying to understand these results in terms of known results in network epidemiology. As recently discussed in a review https://arxiv.org/abs/2005.11283, it’s important to distinguish forms of superspreading that affect the distribution of “out-degree” or “spread” - the number of secondary cases generated by a given infected individual - from those which affect to the “in-degree” or “risk” - the number of individuals from which a given individual can receive the infection. In some cases these are the same, since infections can spread in either direction along a social contact. But especially when different kinds of heterogeneity and interventions are taken into account, it’s important to think about the network of contacts as partially directed. For instance, an individual with high viral load would have a high out-degree, while an immunocompromised and highly susceptible person would have high in-degree. As another example, masking is often claimed to reduce out-degree more than in-degree (i.e. protect others from the masked individual more than the other way around).In particular, in networks where short loops are rare, the extinction probability is a function only of the out-degree distribution, while the epidemic size (and therefore the attack rate if it is measured over the entire epidemic) is a function only of the in-degree distribution. As a result, models of superspreading where individuals generate differing numbers of secondary cases, but where these secondary infections fall uniformly on all individuals in a given compartment, don’t affect the epidemic size at all as long as R_0 is held constant. But they do affect the extinction probability, which increases as the variance in out-degree increases. Similarly, models where individuals have differing susceptibilities, but similar distributions of how many others they infect if they become infected, affect the epidemic size but not the extinction probability or epidemic threshold (again assuming R_0 is held constant).So I’m trying to interpret Figures 2 and 3 in that context. Based on the analysis in https://arxiv.org/abs/2005.11283, it looks to me as if your infectiousness heterogeneity affects the “out-degree” or “spread”, while your contact heterogeneity mainly affects the “in-degree” or “risk”. This would explain why the extinction probability increases with infectiousness heterogeneity alpha_i but is essentially constant with respect to contact heterogeneity alpha_c. (In the text you claim a slight increase and then a downward trend, but the error bars in Fig. 2b are consistent with no dependence on alpha_c at all.)Similarly, in Fig. 3 the attack rate depends strongly on alpha_c, as we would expect if alpha_c affects the variance of the in-degree, but much less strongly on alpha_i. Is it possible that if you waited longer, so that you observed the entire epidemic curve — making the attack rate equal to the epidemic size — that this dependence would disappear completely?On the other hand, in the Conclusion you say “However, when a person has a large number of contacts, they not only have more opportunity to infect others once they are infected, but they also have a disproportionally high chance to be exposed to infection themselves.” So does your contact heterogeneity affect both risk and spread? I.e. does it increase the number of bidirectional contacts, along which the disease can spread in either direction? If so, this seems inconsistent with some of the experimental results.As for the size and timing of the peak, I would argue that — if we condition on extinction not occurring — in the limit of large population size neither alpha_i nor alpha_c should make a difference, since once there are O(n) infected individuals the usual differential equation analysis should take over. In that case, the height and timing of the peak should depend only on R_0 and the matrix of mean contact rates between compartments, not on the distributions of contacts among individuals. Thus my guess is that the dependence in Fig. 4 is a finite-population effect.That said, it is also true in branching process models that, if the variance in the degree distribution is large and we condition on the event that the process does not become extinct, then it has a higher average branching ratio at first (although this disappears after just a few generaitons if the underlying process is supercritical). This may explain your observation that the epidemic is more explosive at first — although I’m confused because I would expect this to depend more on alpha_i than alpha_c.So to conclude: you have observed interesting distinctions between two types of superspreading in a popular epidemic simulation, and how they affect extinction probabilities, attack rates, and peaks in different ways. But since we know a lot about how epidemics depend on the structure of directed networks, it would be good to add a little more theoretical insight (which I think is available). That way the paper would be more convincingly about the effects of different mechanisms for superspreading, as opposed to just an observation about a particular simulation (i.e. STRIDE).minor comments:Using a right-truncated Gamma distribution for the individual transmission probability seems like a strange choice. Why not use, say, a Beta distribution, which also lets you choose both the mean and the variance while staying within the unit interval?“as the overdispersion parameter approaches infinity, the distribution becomes more centered around the mean.” This phrase strikes me as strange, since as \\alpha increases the distribution has less dispersion, not more. Perhaps “shape parameter” is clearer.**********Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.Reviewer #1: YesReviewer #2: YesReviewer #3: Yes**********PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.If you choose “no”, your identity will remain anonymous but your review may still be made public.Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.Reviewer #1: NoReviewer #2: Yes: Daniel B ReevesReviewer #3: Yes: Cristopher MooreFigure Files:While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.Data Requirements:Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.Reproducibility:To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocolsReferences:Review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript.If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.21 Jun 2022Submitted filename: reply_to_reviewers_20_06_2022.pdfClick here for additional data file.29 Jun 2022Dear Mrs Kuylen,We are pleased to inform you that your manuscript 'Different forms of superspreading lead to different outcomes: heterogeneity in infectiousness and contact behavior relevant for the case of SARS-CoV-2' has been provisionally accepted for publication in PLOS Computational Biology.Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology.Best regards,Miles P. Davenport, MB BS, D.PhilAssociate EditorPLOS Computational BiologyTom BrittonDeputy EditorPLOS Computational Biology***********************************************************17 Aug 2022PCOMPBIOL-D-22-00314R1Different forms of superspreading lead to different outcomes: heterogeneity in infectiousness and contact behavior relevant for the case of SARS-CoV-2Dear Dr Kuylen,I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!With kind regards,Andrea SzaboPLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol
Authors: Laura Cooper; Su Yun Kang; Donal Bisanzio; Kilama Maxwell; Isabel Rodriguez-Barraquer; Bryan Greenhouse; Chris Drakeley; Emmanuel Arinaitwe; Sarah G Staedke; Peter W Gething; Philip Eckhoff; Robert C Reiner; Simon I Hay; Grant Dorsey; Moses R Kamya; Steven W Lindsay; Bryan T Grenfell; David L Smith Journal: Nat Commun Date: 2019-09-02 Impact factor: 14.919
Authors: Sima Asadi; Anthony S Wexler; Christopher D Cappa; Santiago Barreda; Nicole M Bouvier; William D Ristenpart Journal: Sci Rep Date: 2019-02-20 Impact factor: 4.379
Authors: Thang Van Hoang; Pietro Coletti; Yimer Wasihun Kifle; Kim Van Kerckhove; Sarah Vercruysse; Lander Willem; Philippe Beutels; Niel Hens Journal: BMC Infect Dis Date: 2021-03-18 Impact factor: 3.090
Authors: Akira Endo; Quentin J Leclerc; Gwenan M Knight; Graham F Medley; Katherine E Atkins; Sebastian Funk; Adam J Kucharski Journal: Wellcome Open Res Date: 2021-03-31