| Literature DB >> 33163738 |
Eve Armstrong1,2, Manuela Runge3, Jaline Gerardin3.
Abstract
We demonstrate the ability of statistical data assimilation (SDA) to identify the measurements required for accurate state and parameter estimation in an epidemiological model for the novel coronavirus disease COVID-19. Our context is an effort to inform policy regarding social behavior, to mitigate strain on hospital capacity. The model unknowns are taken to be: the time-varying transmission rate, the fraction of exposed cases that require hospitalization, and the time-varying detection probabilities of new asymptomatic and symptomatic cases. In simulations, we obtain estimates of undetected (that is, unmeasured) infectious populations, by measuring the detected cases together with the recovered and dead - and without assumed knowledge of the detection rates. Given a noiseless measurement of the recovered population, excellent estimates of all quantities are obtained using a temporal baseline of 101 days, with the exception of the time-varying transmission rate at times prior to the implementation of social distancing. With low noise added to the recovered population, accurate state estimates require a lengthening of the temporal baseline of measurements. Estimates of all parameters are sensitive to the contamination, highlighting the need for accurate and uniform methods of reporting. The aim of this paper is to exemplify the power of SDA to determine what properties of measurements will yield estimates of unknown parameters to a desired precision, in a model with the complexity required to capture important features of the COVID-19 pandemic.Entities:
Keywords: COVID-19; Epidemiology; Inference; Measurement noise
Year: 2020 PMID: 33163738 PMCID: PMC7605798 DOI: 10.1016/j.idm.2020.10.010
Source DB: PubMed Journal: Infect Dis Model ISSN: 2468-0427
Fig. 1Schematic of the model. Each rectangle represents a population. Note the distinction of asymptomatic cases, undetected cases, and the two tiers of hospitalized care: H and C. The aim of including this degree of resolution is to inform policy on social behavior so as to minimize strain on hospital capacity. The red ovals indicate the variables that correspond to measured quantities in the inference experiments.
Unknown parameters to be estimated. Ki, dSym, and dSys are taken to be time-varying. Parameters fsympt and fsevere are constant numbers, as they are assumed to reflect an intrinsic property of the disease. The detection probability of asymptomatic cases is taken to be known and zero.
| Parameter | Description |
|---|---|
| Ki(t) | Time-varying transmission rate |
| dSym(t) | Time-varying detection probability of mild symptomatics |
| dSys(t) | Time-varying detection probability of symptomatics requiring hospitalization |
| fsympt | Fraction of positive cases that produce symptoms |
| fsevere | Fraction of symptomatics that are severe |
Fig. 2Schematic of the four simulated experiments.
Estimates of static parameters fsympt and fsevere over all simulated experiments. The established values are taken from Refs (Oran and Topol, 2020) and (Salje et al., 2020). For Experiments i and iv, the reported numbers are taken from the annealing iteration with a value of parameter β of 32 and 40, respectively: once the deterministic limit has been reached (see text). For Experiment ii, an attempt was made to retrieve parameter estimates at β = 2; that is: before the solution grows unstable exponentially (see Fig. 5). See specific subsections for details of each experiment.
| Experiment | fsympt | (established: 0.6) | fsevere | (established: 0.07) |
|---|---|---|---|---|
| Mean | Variance | Mean | Variance | |
| 0.59 | 2 × 10−4 | 0.07 | 4 × 10−6 | |
| – | ||||
| – | ||||
| 0.39 | 0.8 | 0.19 | 0.2 |
Fig. 5Cost versus β for Experiment ii: R is not measured. As β increases, the cost increases indefinitely, indicating that no solution has been found that is consistent with both measurements and model dynamics.
Fig. 3Cost function plotted at each annealing step β for the base experiment i, for twenty paths in state space, where β scales the rigidity of the imposed model constraint. At low β the procedure endeavours to fit the measured variables to the simulated measurements. As β increases, the cost increases until it approaches a plateau (around β = 30), indicating that a solution has been found that is consistent with both measurements and model.
Fig. 4Estimates of the state - measured and unmeasured - variables, and the time-varying parameters Ki, dSym, and dSys, for the base experiment i. Excellent estimates are obtained of all states and parameters, except early values of Ki prior to the implementation of social distancing; see text. The dotted blue lines are the simulated data. Solid red, black, and green lines are SDA estimates of measured variables, unmeasured variables, and parameters, respectively. These conventions also hold for Fig. 6, Fig. 7. Results are taken at a value for annealing parameter β of 32.
Fig. 6Estimates for Experiment ii: without a measurement of Population R. This result is taken at β = 2, prior to the exponential runaway in the cost. Estimates of unmeasured states and time-varying parameters are poor.
Fig. 7Estimates for Experiment iv: low noise added to Population R and with a doubled temporal baseline of 201 days. The noise added to R propagates to some unmeasured States (S, E, As, and Asdet), but the overall evolution is captured well. The noise precludes an estimate of the time-varying parameters (not shown). Results are reported using a value for β of 40.
State variables of the COVID-19 transmission model. The “detected” qualifier signifies that the population has been tested and is positive for COVID-19.
| Variable | Description |
|---|---|
| S | Susceptible |
| E | Exposed |
| Asdet | Asymptomatic, detected |
| As | Asymptomatic, undetected |
| Symdet | Symptomatic mild, detected |
| Sym | Symptomatic mild, undetected |
| Sysdet | Symptomatic severe, detected |
| Sys | Symptomatic severe, undetected |
| H1,det | Hospitalized and will recover, detected |
| H2,det | Hospitalized and will go to critical care and recover, detected |
| H3,det | Hospitalized and will go to critical care and die, detected |
| H1 | Hospitalized and will recover, undetected |
| H2 | Hospitalized and will go to critical care and recover, undetected |
| H3 | Hospitalized and will go to critical care and die, undetected |
| C2,det | In critical care and will recover, detected |
| C3,det | In critical care and will die, detected |
| C2 | In critical care and will recover, undetected |
| C3 | In critical care and will die, undetected |
| R | Recovered |
| D | Dead |
The model parameters, with the unknown parameters to be estimated denoted in boldface. The unknown parameters Ki, Sym, and dSys are taken to be time-varying. The unknown parameters fsympt and fsevere are taken to be intrinsic properties of the disease and therefore constant numbers. The detection probability of asymptomatic cases is taken to be known and zero. Units of time are days.
| Parameter | Description | Value |
|---|---|---|
| N | Total population | 9,000,000 |
| Reduced | The property that a detected case is likely to transmit less, via successful quarantine) | 0.2 |
| Ki(t) | Transmission rate | See |
| dAs(t) | Detection probability of asymptomatic cases | 0.0 |
| fsympt | Fraction of positive cases that produce symptoms | 0.6 ( |
| tinfection | Time from exposure to infection | 4.0 ( |
| tR,a | Time to recovery for asymptomatics | 8.0 Assumed to be same as tR,m |
| dSym(t) | Detection probability of mild symptomatics | See |
| dSys(t) | Detection probability of severe symptomatics | See |
| fsevere | Fraction of symptomatics that are severe | 0.07 ( |
| tsympt | Time to symptoms, for symptomatics | 4.0 ( |
| tR,m | Time from symptoms to recovery, for mild symptomatics | 8.0 ( |
| fH | Fraction of severe cases that are hospitalized and then recover: fH = 1.0 − fC − fD | 0.66 |
| fC | Fraction of severe cases that require critical care and then recover | 0.3 ( |
| fD | Fraction of severe cases that die | 0.04 ( |
| tH | Time from symptoms to hospital, for severe symptomatics | 5.0 ( |
| tR,h | Time from entering hospital to recovery, for severe symptomatics that do not require critical care | 10.0 ( |
| tC | Time from entering hospital to critical care, for severe symptomatics | 5.0 ( |
| tR,c | Time from entering critical care to recovery for severe symptomatics | 10.0 ( |
| tD | Time from entering critical care to death, for severe symptomatics | 5.0 ( |
aAs described in (Roman et al., 2020), viral load can be high and detectable for up to 20 days. We choose a shorter duration of infectiousness to capture the time during which transmissibility is highest.