| Literature DB >> 33301457 |
Katelyn M Gostic1, Lauren McGough1, Edward B Baskerville1, Sam Abbott2, Keya Joshi3, Christine Tedijanto3, Rebecca Kahn3, Rene Niehus3, James A Hay3, Pablo M De Salazar3, Joel Hellewell2, Sophie Meakin2, James D Munday2, Nikos I Bosse2, Katharine Sherrat2, Robin N Thompson2,4, Laura F White5, Jana S Huisman6,7, Jérémie Scire7,8, Sebastian Bonhoeffer6, Tanja Stadler7,8, Jacco Wallinga9,10, Sebastian Funk2, Marc Lipsitch3, Sarah Cobey1.
Abstract
Estimation of the effective reproductive number Rt is important for detecting changes in disease transmission over time. During the Coronavirus Disease 2019 (COVID-19) pandemic, policy makers and public health officials are using Rt to assess the effectiveness of interventions and to inform policy. However, estimation of Rt from available data presents several challenges, with critical implications for the interpretation of the course of the pandemic. The purpose of this document is to summarize these challenges, illustrate them with examples from synthetic data, and, where possible, make recommendations. For near real-time estimation of Rt, we recommend the approach of Cori and colleagues, which uses data from before time t and empirical estimates of the distribution of time between infections. Methods that require data from after time t, such as Wallinga and Teunis, are conceptually and methodologically less suited for near real-time estimation, but may be appropriate for retrospective analyses of how individuals infected at different time points contributed to the spread. We advise caution when using methods derived from the approach of Bettencourt and Ribeiro, as the resulting Rt estimates may be biased if the underlying structural assumptions are not met. Two key challenges common to all approaches are accurate specification of the generation interval and reconstruction of the time series of new infections from observations occurring long after the moment of transmission. Naive approaches for dealing with observation delays, such as subtracting delays sampled from a distribution, can introduce bias. We provide suggestions for how to mitigate this and other technical challenges and highlight open problems in Rt estimation.Entities:
Mesh:
Year: 2020 PMID: 33301457 PMCID: PMC7728287 DOI: 10.1371/journal.pcbi.1008409
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 1Instantaneous reproductive number as estimated by the method of Cori et al. vs. cohort reproductive number estimated by Wallinga and Teunis.
For each definition of R, arrows show the times at which infectors (upwards) and their infectees (downwards) appear in the data. Curves show the generation interval distribution (A, B), or serial interval distribution (C). (A) The instantaneous reproductive number quantifies the number of new infections incident at a single point in time (ti, blue arrow), relative to the number of infections in the previous generation (green arrows) and their current infectiousness (green curve). The methods of Cori et al. and of Bettencourt and Ribeiro estimate the case reproductive number. This figure illustrates the Cori method. (B-C) The case reproductive number is defined as the average number of new infections that an individual who becomes infected on day ti (green arrows in B) or symptomatic on day ts (yellow arrows in C) will eventually go on to cause (blue downward arrows in B and C). The first definition applies when estimating the case reproductive number using inferred times of infection, and the second applies when using data on times of symptom onset. The method of Wallinga and Teunis estimates the case reproductive number.
Fig 2Accuracy of R estimation methods given ideal, synthetic data.
Solid and dashed black lines show the instantaneous and case reproductive numbers, respectively, calculated from synthetic data. Colored lines show estimates and confidence or credible intervals. To mimic an epidemic progressing in real time, the time series of infections or symptom onset events up to t = 150 was input into each estimation method (inset). Terminating the time series while R is falling or rising produces similar results as in S1 Fig. (A) By assuming an SIR model (rather than SEIR, the source of the synthetic data), the method of Bettencourt and Ribeiro systematically underestimates R when the true value is substantially higher than 1. The method is also biased as transmission shifts. (B) The Cori method accurately measures the instantaneous reproductive number. (C) The Wallinga and Teunis method estimates the cohort reproductive number, which incorporates future changes in transmission. Thus, the method produces R estimates that lead the instantaneous effective reproductive number and becomes unreliable for real-time estimation at the end of the observed time series without adjustment for right truncation [4,29]. In panels A and B, the colored lines show the posterior mean and the shaded region the 95% credible interval. In panel C, the colored line shows the maximum likelihood estimate and the shaded region the 95% confidence interval.
Fig 3Biases from misspecification of the generation interval mean (A) or variance (B). Demonstrated using the method of Cori et al. GI, generation interval.
Fig 4R is a measure of transmission at time t.
Observations after time t must be adjusted. ICU, intensive care unit.
Fig 5Pitfalls of simple methods to adjust for delays to observation when estimating R.
Infections back calculated from (A) observed cases or (B) observed deaths either by shifting the observed curve back in time by the mean observation delay (shift), by subtracting a random sample from the delay distribution from each individual time of observation (convolve), or by deconvolution (deconvolve). Only the deconvolved time series is adjusted for right truncation. Deconvolution most accurately recovers peaks or valleys in the true infection curve. Shifting is less accurate, and convolution is least accurate. Errors from back-calculation increase with the variance of the delay distribution (B vs. A). (C) Posterior mean and credible interval of R estimates from the Cori et al. method. Inaccuracies in the inferred incidence curves affect R estimates, especially when R is changing (here, R was estimated using shifted values from panels A and B). Finally, we note that shifting the observed curves back in time without adjustment for right truncation leads to a gap between the last date in the inferred time series of infection and the last date in the observed data, as shown by the dashed lines and horizontal arrows in panels A–C.
Fig 6Accuracy of R estimates given smoothing window width and location of t within the smoothing window.
Estimates were obtained using synthetic data drawn from the S→E transition of a stochastic SEIR model (inset) as an input to the method of Cori et al. Colored estimates show the posterior mean and 95% credible interval. Black line shows the exact instantaneous R calculated from synthetic data.