| Literature DB >> 33143594 |
Laurent Hébert-Dufresne1,2,3, Benjamin M Althouse4,5,6, Samuel V Scarpino7,8,9,10,11,12, Antoine Allard3,13.
Abstract
The basic reproductive number, R0, is one of the most common and most commonly misapplied numbers in public health. Often used to compare outbreaks and forecast pandemic risk, this single number belies the complexity that different epidemics can exhibit, even when they have the same R0. Here, we reformulate and extend a classic result from random network theory to forecast the size of an epidemic using estimates of the distribution of secondary infections, leveraging both its average R0 and the underlying heterogeneity. Importantly, epidemics with lower R0 can be larger if they spread more homogeneously (and are therefore more robust to stochastic fluctuations). We illustrate the potential of this approach using different real epidemics with known estimates for R0, heterogeneity and epidemic size in the absence of significant intervention. Further, we discuss the different ways in which this framework can be implemented in the data-scarce reality of emerging pathogens. Lastly, we demonstrate that without data on the heterogeneity in secondary infections for emerging infectious diseases like COVID-19 the uncertainty in outbreak size ranges dramatically. Taken together, our work highlights the critical need for contact tracing during emerging infectious disease outbreaks and the need to look beyond R0.Entities:
Keywords: branching processes; complex networks; epidemiology
Mesh:
Year: 2020 PMID: 33143594 PMCID: PMC7729039 DOI: 10.1098/rsif.2020.0393
Source DB: PubMed Journal: J R Soc Interface ISSN: 1742-5662 Impact factor: 4.118
Figure 1.Final size of outbreaks with different average R0 and heterogeneity k in the distribution of secondary cases. We use a negative binomial distribution of secondary cases and scan a realistic range of parameters. The range of parameters corresponding to estimates for COVID-19 based on a binomial negative distribution in large populations is highlighted by a red box (see [25] and table 1). Most importantly, with fixed average, the dispersion parameter is inversely proportional to the variance of the underlying distribution of secondary cases. The degree of freedom, p0, is here set by setting the average number of infections around patient zero to be less than or equal to R0. The Kermack–McKendrick solution would correspond to the limit k → ∞, and could be more appropriate in some dense and well-mixed settings.
Estimates for R0 and for the negative binomial distribution dispersion parameter, k, used in figure 2 (a and b, respectively, denote 95% and 90% confidence intervals). The proportion of susceptible individuals infected as reported either in the literature or by the US Centers for Disease Control and Prevention. For severe acute respiratory syndrome (SARS) the proportion of infected was taken from serosurveys among wild animal handlers (15%) and among healthcare workers (<1%) [27]. For influenza (2009), we took data on school-aged children. For COVID-19, we present emerging evidence surrounding the final proportion of infected individuals after the first outbreak waves at the level of large communities [28,29] and a school [30], which all fall around 15%, and at the level of dense groups like a fishing vessel with a value around 86% [31]. Note that the estimates of the proportion of infected individuals, for R0 and for k, were not necessarily inferred from the same populations. Such information is rarely, if ever, available for the same outbreak, unfortunately. COVID-19, coronavirus disease 2019; MERS, Middle East respiratory syndrome.
| disease | location | year | prop. infect. | reference | ||
|---|---|---|---|---|---|---|
| MERS | global | 2013 | 0% | 0.47 (0.29–0.80)a | 0.26 (0.09–1.24)a | [ |
| SARS | global | 2003 | 0–15% | 1.63 (0.54–2.65)b | 0.16 (0.11–0.64)b | [ |
| smallpox | Europe | 1958–1973 | 55% | 3.19 (1.66–4.62)b | 0.37 (0.26–0.69)b | [ |
| influenza | Baltimore (USA) | 1918 | 40% | 1.77 (1.61–1.95)a | 0.94 (0.59–1.72)a | [ |
| influenza | Italy | 2009 | 39% | 1.321 (1.299–1.343)a | 8.092 (5.170–11.794)a | [ |
| COVID-19 | global | 2020 | 13–16% and 86% | 2.5 (1.4–12)a | 0.1 (0.04–1)a | [ |
Figure 2.Using published estimates of R0 and the dispersion parameter k, we estimated the total outbreak size for six different diseases using three versions of the network approach and compared them with the classic Kermack–McKendrick solution. The confidence intervals span the range of uncertainty reported for R0 and k. The black markers show reported total outbreak sizes (total proportion of susceptible individuals infected) for each disease. For influenza, we report the estimated proportion of school-aged children infected. For COVID-19, we use tentative markers showing the range of attack rates measured in different contexts as there is currently no consensus for what constitutes a typical COVID-19 outbreak. We highlight though the differences between the final size estimates for COVID-19: most typify the observed over-dispersed nature of transmission, except for the outbreak on a fishing vessel (right side point) where contacts are more well mixed and thus better characterized by a Kermack–McKendrick transmission process. The red circles are the estimated proportion infected using the method developed by Kermack and McKendrick, i.e. equation (1.1). The other markers show the estimated proportion infected obtained with equation (3.17) under different assumptions about patient zero: the model described in the main text, which ensures that the expected number of secondary infections caused by patient zero is at most R0 (blue squares); the same model but assuming p0 = 0 such that no individuals have exactly zero contact (cyan stars); and a network version of [13], where G0(x) ≡ G1(x) such that patient zero is no different from subsequent patients (green triangles). See table 1 for data and additional information.