| Literature DB >> 32693748 |
Sang Woo Park1, Benjamin M Bolker2,3,4, David Champredon5, David J D Earn3,4, Michael Li2, Joshua S Weitz6,7, Bryan T Grenfell1,8,9, Jonathan Dushoff2,3,4.
Abstract
A novel coronavirus (SARS-CoV-2) emerged as a global threat in December 2019. As the epidemic progresses, disease modellers continue to focus on estimating the basic reproductive number [Formula: see text]-the average number of secondary cases caused by a primary case in an otherwise susceptible population. The modelling approaches and resulting estimates of [Formula: see text] during the beginning of the outbreak vary widely, despite relying on similar data sources. Here, we present a statistical framework for comparing and combining different estimates of [Formula: see text] across a wide range of models by decomposing the basic reproductive number into three key quantities: the exponential growth rate, the mean generation interval and the generation-interval dispersion. We apply our framework to early estimates of [Formula: see text] for the SARS-CoV-2 outbreak, showing that many [Formula: see text] estimates are overly confident. Our results emphasize the importance of propagating uncertainties in all components of [Formula: see text], including the shape of the generation-interval distribution, in efforts to estimate [Formula: see text] at the outset of an epidemic.Entities:
Keywords: Bayesian multilevel model; COVID-19; SARS-CoV-2; basic reproductive number; generation interval; novel coronavirus
Mesh:
Year: 2020 PMID: 32693748 PMCID: PMC7423425 DOI: 10.1098/rsif.2020.0144
Source DB: PubMed Journal: J R Soc Interface ISSN: 1742-5662 Impact factor: 4.118
Summary of the models, analysed data, reported estimates of the basic reproductive number, and the assumptions about the generation-interval distributions. Model details, estimates of and their assumptions about the shape of the generation interval distributions were collected from seven studies. Generation-interval dispersion represent the squared coefficients of variation in generation intervals.
| model | data (study period) | data source | basic reproductive number | mean generation interval | generation-interval dispersion | reference | |
|---|---|---|---|---|---|---|---|
| study 1 | deterministic branching process model | total number of cases in Wuhan City, China (through 18 Jan 2020) | estimated by Imai | 1.5–3.5 | 10 | 1 | Bedford |
| study 2 | stochastic branching process model | total number of cases in Wuhan City, China (through 18 Jan 2020) | estimated by Imai | 2.6 (1.5–3.5)a | 8.4 | not reportedb | Imai |
| study 3 | Poisson offspring distribution model | confirmed cases from China and other countries (29 Dec 2019–23 Jan 2020) | medical records and epidemiological investigations from Guangdong Province, China, and official websites of other regions in China | 2.92 (95% CI: 2.28–3.67) | 8.4 | 0.2 | Liu |
| study 4 | deterministic metapopulation susceptible–exposed–infected–recovered (SEIR) model | confirmed cases from China and other countries (1–21 Jan 2020) | not reported | 3.8 (95% CI: 3.6–4.0) | 7.6 | 0.5 | Read |
| study 5 | stochastic branching process model | total number of cases in Wuhan City, China (through 18 Jan 2020) | estimated by Imai | 2.2 (90% CI: 1.4–3.8) | 7–14 | 0.5 | Riou & Althaus [ |
| study 6 | exponential growth model | confirmed cases from China (10–22 Jan 2020) | Wuhan Municipal Health Commission, China and National Health Commission of China | 5.47 (95% CI: 4.16–7.10)c | 7.6–8.4 | 0.2 | Zhao |
| study 7 | Incidence Decay and Exponential Adjustment (IDEA) model | reported cases from Wuhan City, China (1 Dec 2019–26 Jan 2020) | World Health Organization, National Health Commission of China, Wuhan Municipal Health Commission, and Huang | 2.0–3.1 | 6–10 | 0 | Majumder & Mandl [ |
aThese intervals reflect values for best and worst scenarios. We treat these intervals as a 90% confidence/credible interval in our analysis.
bWe assume κ = 0.5 in our analysis.
cThe authors presented estimates under different assumptions regarding the reporting rate; we use their baseline scenario in our analysis to remain consistent with other studies, which do not account for changes in the reporting rate.
Probability distributions for , and κ. We use these probability distributions to obtain a probability distribution for the exponential growth rate r. The gamma distribution is parametrized by its mean and shape. Constant values are fixed according to table 1.
| basic reproductive number | mean generation interval | generation-interval dispersion | |
|---|---|---|---|
| study 1 | Uniform (1.5, 3.5) | 10 | 1 |
| study 2 | Gamma (mean = 2.6, shape = 18) | 8.4 | 0.5 |
| study 3 | Gamma (mean = 2.92, shape = 67) | 8.4 | 0.2 |
| study 4 | Gamma (mean = 3.8, shape = 1400) | 7.6 | 0.5 |
| study 5 | Gamma (mean = 2.2, shape = 12) | Uniform (7, 14) | 0.5 |
| study 6 | Gamma (mean = 5.47, shape = 54) | Uniform (7.6, 8.4)a | 0.2 |
| study 7 | Uniform (6, 10) | 0 |
aWe do not account for this uncertainty during our re-estimation of the exponential growth rate r because the reported estimate of and its uncertainty assumes . We still account for this uncertainty in our pooled estimates (μ).
bInstead of modeling with a probability distribution and re-estimating r, we use r = 0.114 days−1 (see text).
Figure 1.Comparisons of the reported parameter values with our pooled estimates. We inferred point estimates (black), uniform distributions (orange) or confidence/credible intervals (purple) for each parameter from each study, and combined them into pooled estimates using a Bayesian multilevel model (red). Points represent medians calculated from the parameter set for each study i (orange and purple). Error bars represent 95% equi-tailed quantiles calculated from the parameter set for each study i. Red density plots represent distributions of 2000 posterior samples. Open triangle: we assumed κ = 0.5 for study 2, which does not report generation-interval assumptions.
Figure 2.Effects of the exponential growth rate r, mean generation interval and generation-interval dispersion κ on the estimates of the basic reproductive number . We compare estimates of under nine scenarios that propagate different parameter uncertainties (a) based on our pooled estimates (μ, μ and ) and (b) assuming a fourfold reduction in uncertainty of our pooled estimate of the exponential growth rate (using instead of μ). Each uncertainty type represents estimates based the posterior distributions of one of three parameters (μ, μ and ) while using median estimates of two other parameters. The ‘none’ type represents estimate based on the median estimates of μ, μ and . The ‘all’ type represents estimates based on the joint posterior distributions of μ, μ and (also corresponds to ). Points represent the median estimates. Vertical error bars represent the 95% credible intervals.
Figure 3.Sensitivity of the reported estimates with respect to our pooled estimates of the underlying parameters. We calculate substitute estimates by replacing the reported parameter values (growth rate r, mean generation interval and generation-interval dispersion κ) with our corresponding pooled estimates (μ, μ and ) one at a time and recalculating . The pooled estimate represents , which is calculated from the joint posterior distribution of μ, μ and ; this corresponds to replacing all reported parameter values with our pooled estimates, which gives identical results across all studies. The reported estimates refer to estimates listed in table 1. Points represent the medians of the reported, base, substitute and pooled estimates. Vertical error bars represent the 95% credible intervals of our base, substitute and pooled estimates (based on 2000 posterior samples). Horizontal dashed lines represent the 95% credible intervals of our pooled estimate.