Simon Loertscher1, Ellen V Muir2. 1. Department of Economics & Centre for Market Design, Level 4, FBE Building, 111 Barry Street, University of Melbourne, Victoria 3010, Australia. 2. Department of Economics, Stanford University, United States.
Abstract
Without widespread immunization, the road to recovery from the current COVID-19 lockdowns will optimally follow a path that finds the difficult balance between the social and economic benefits of liberty and the toll from the disease. We provide an approach that combines epidemiology and economic models, taking as given that the maximum capacity of the healthcare system imposes a constraint that must not be exceeded. Treating the transmission rate as a decreasing function of the severity of the lockdown, we first determine the minimal lockdown that satisfies this constraint using an epidemiology model with a homogeneous population to predict future demand for healthcare. Allowing for a heterogeneous population, we then derive the optimal lockdown policy under the assumption of homogeneous mixing and show that it is characterized by a bang-bang solution. Possibilities such as the capacity of the healthcare system increasing or a vaccine arriving at some point in the future do not substantively impact the dynamically optimal policy until such an event actually occurs.
Without widespread immunization, the road to recovery from the current COVID-19 lockdowns will optimally follow a path that finds the difficult balance between the social and economic benefits of liberty and the toll from the disease. We provide an approach that combines epidemiology and economic models, taking as given that the maximum capacity of the healthcare system imposes a constraint that must not be exceeded. Treating the transmission rate as a decreasing function of the severity of the lockdown, we first determine the minimal lockdown that satisfies this constraint using an epidemiology model with a homogeneous population to predict future demand for healthcare. Allowing for a heterogeneous population, we then derive the optimal lockdown policy under the assumption of homogeneous mixing and show that it is characterized by a bang-bang solution. Possibilities such as the capacity of the healthcare system increasing or a vaccine arriving at some point in the future do not substantively impact the dynamically optimal policy until such an event actually occurs.
Without widespread immunization of the population, the road to recovery from pandemic-induced lockdowns requires sustained vigilance to ensure that the spread of the disease remains at a level that is manageable for a country’s or region’s healthcare system. At the same time, recovery ought to start as soon as possible to limit the reduction in liberty that such lockdowns impose, the mental and other health issues associated with social distancing and isolation, and to minimize the economic cost. If eradication is impossible or possible only at tremendous costs, keeping the pandemic under control without inducing economic and social hardship at a catastrophic scale requires finding a path through territory that is uncharted for both epidemiologists and economists. From a public health perspective, recovery requires the transition from a paradigm in which eradication of an epidemic is the goal to one in which the epidemic is managed. For economists, recovery requires plowing a path through a system whose dynamics are non-linear.In this paper, we show how this can be done by providing a methodology that permits the return to some kind of normalcy, while keeping the spread of the disease at a level that even at the peak of the epidemic does not exceed the capacity constraint of the healthcare system. Specifically, we use a standard epidemiology model – a simple SIR model – to predict the peak of the epidemic and treat the rate of transmission of the disease as the variable that the policymaker can influence by choosing the severity of a lockdown. We treat as a hard constraint the capacity of the healthcare system, that is, the maximum number of patients that it can handle at the peak of the crisis.1
Of course, this capacity constraint will need to be defined in such a way that patients with other – but no less severe – needs for care are still able to access treatment.2Our framework resonates with recent policy, as the capacity constraints-based approach allows policymakers to avoid explicitly trading off dollars against lives.3
In the U.S., as they approach the winter peak of the pandemic, the states of California and New York have both adopted policies that make lockdowns – and more generally restrictions on public life – contingent on hospital capacity utilization. For example, starting in December 2020, any region in California goes into lockdown as soon as the available ICU hospital capacity dips below 15%.4
In New York, the “state’s new approach focuses on maintaining sufficient hospital capacity instead of shutting down economic activity”, according to the New York Times article ‘‘Cuomo Tries to Jolt Public by Warning of Overwhelmed Hospitals’’ (December 12, 2020). Moreover, this new policy involves an element of prediction-based restrictions, precisely as implied by our approach: According to the same article, the “most complex element, which could prompt regionwide shutdowns, involves taking the rate of increase in an area’s hospitalizations and projecting forward to determine whether it would top 90% of capacity in three weeks. If so, restrictions will be introduced that include the closing of nonessential businesses, the limiting restaurants [sic] to takeout and delivery and a prohibition on nearly all gatherings”.The main contribution of this paper is to formulate an operational constraint that provides policymakers with guidance for how to manage an epidemic which is too costly to eradicate, to incorporate this constraint into a standard epidemiology model, and to determine the severity of the lockdown that is necessary to respect the constraint. Allowing for heterogeneity in the population, we show that managing an epidemic subject to a capacity constraint involves a non-trivial economic optimization problem without requiring the policymaker to take a stance on the value of life because optimality requires satisfying the constraint at minimum economic cost. Extending our model to consider dynamically optimal policies, we show that possibilities such as the capacity of the healthcare system increasing or a vaccine arriving at some point in the future do not substantively impact the optimal policy until such an event actually occurs. While the purpose of our model is to serve as a proof of concept that would need to be refined if applied, many of the key insights – such as the need to use epidemiology models to predict future healthcare demand and the non-trivial economic optimization problem when faced with a capacity constraint – will extend well beyond the confines of the specific setups we study.The economic toll from the lockdowns implemented due to the COVID-19 pandemic has little parallel in living memory. To convey a sense of the magnitude of the potential economic and social costs, consider the unemployment rate during the Great Depression in the U.S., then and now the world’s largest economy, and the unemployment rates before and in the wake of the ongoing lockdowns in the U.S. in 2020. The immediate consequences of the Great Depression were mass poverty and economic devastation, and at least indirectly, the rise of fascism in Europe. As Table 1 shows, the unemployment rate in the U.S. rose sharply from 5.2% in March 2020 to 19.5% in April 2020 as the nationwide lockdown hit the country and much of the rest of the world economy, and then steadily declined as the economy began to reopen. This steep incline and swift partial recovery reflects the peculiarity of the present economic downturn, which was not caused by a bad state of the economy. This is at the same time a source of hope and of concern: while the healthy underlying state of the economy at the onset may make for a relatively fast recovery, extended or repeated complete lockdowns can turn a public health shock into a deep and prolonged economic crisis. The firms that workers could return to in May and June may simply go out of business after further or extended lockdowns. Thus, the problem of finding a smooth path to recovery is particularly salient.
Table 1
Upper table: unemployment rates during the Great Depression in the U.S. Lower table: weekly unemployment filings (in thousands) in the U.S. in 2020.
Great D.
1929
1930
1931
1932
1933
Unemployment ratea
3.2
8.7
15.9
23.6
24.9
Sources: thebalance.com and Reinhart and Rogoff (2009).
Source: US Bureau of Labor. For March, April and May, the table displays the rates that are adjusted for a counting error. With the adjustments, the respective rates would be 4.4, 14.7 and 13.3. For June, the official rate is displayed as an adjusted rate was not available.
As documented by Brodeur et al. (2020), who counted 106 NBER publications over a ten week period in March and April of 2020, there has been an upsurge of interest in the economics of COVID-19 that coincided with the first wave of the pandemic hitting Europe and North-America.5
Atkeson (2020) provides an introduction for economists to the SIR modeling approach, which is standard in mathematical biology (see, for example, Murray, 2002). As our paper utilizes the SIR framework, it is most closely related to other SIR-based economics papers, including Alvarez et al. (2021), Acemoglu et al. (2021), and Farboodi et al. (2020), who derive optimal lockdown policies for a planner that assigns some weight to economic output and some weight to human life.6
Alvarez et al. (2021) apply an optimal control approach to an SIR model to derive the optimal lockdown policy that trades off the cost of death against economic output. In this framework, the intensity of the optimal lockdown naturally depends on how the fatality rate varies with the number of infected individuals and the value of a statistical life. These authors also find that the possibility that the pandemic will become easier to manage in the future creates a dynamic complementarity, where the planner has an incentive to induce a stronger lockdown, delaying the spread of the disease until the planner is better equipped to handle it. Acemoglu et al. (2021) provide a multi-group SIR model in which infection, hospitalization and fatality rates vary between groups and find that optimal policies that differentially target these groups outperform non-discriminatory policies. Farboodi et al. (2020) develop a quantitative framework for exploring how individuals trade off the benefits of social activity against the health costs of social activity. They find that the expected cost of COVID-19 in the US is $12,700 per person in a laissez-faire equilibrium and $8,100 per person under an optimal policy. Akbarpour et al. (2020) depart from a classic SIR modeling approach by simulating an agent-based model calibrated to a rich set of micro-level data to analyze the social, economic and health impacts of alternative policies.Upper table: unemployment rates during the Great Depression in the U.S. Lower table: weekly unemployment filings (in thousands) in the U.S. in 2020.Sources: thebalance.com and Reinhart and Rogoff (2009).Source: US Bureau of Labor. For March, April and May, the table displays the rates that are adjusted for a counting error. With the adjustments, the respective rates would be 4.4, 14.7 and 13.3. For June, the official rate is displayed as an adjusted rate was not available.Our paper differs from the aforementioned SIR-based models in that the starting point of our analysis is not that the planner puts one weight on economic output and another on human life. Rather, the optimal policy is derived subject to a capacity constraint, so that the policymaker does not have to take an explicit stance on the value of human life at the outset of the analysis. As mentioned, ensuring that the capacity of the healthcare system is not exceeded is a key factor in preventing catastrophic health outcomes such as those that occurred in the first half of 2020 in Italy’s Lombardy region and in New York City. We also find that, under a capacity constraint, when the planner faces the possibilities that the capacity of the healthcare system will increase or a vaccine will arrive at some point in the future there are no dynamic complementarities of the nature explored in Alvarez et al. (2021), as well as many other papers that utilize the same objective function involving lost output and deaths. While we make use of numerical methods to derive optimal dynamic policies, our paper differs methodologically from the previously discussed papers by also deriving analytical results.The remainder of this paper is organized as follows. Section 2 describes our setup. Section 3 derives the dynamics of an epidemic and the optimal lockdown necessary to keep it at a level that respects the capacity constraint in a homogeneous population model. In Section 4 we derive the optimal lockdown policy for a model with a heterogeneous population and show that the optimal policy takes a bang–bang form under homogeneous mixing. Section 5 extends the analysis by deriving the dynamically optimal policy in a discrete-time version of the model. It also augments the model by allowing for stochastic capacity increases and stochastic arrival of a vaccine. Section 6 concludes the paper. All proofs omitted from the body of the paper can be found in the appendix.
Setup
Consider a basic susceptible–infectious–recovered (SIR) model with a population whose constant size we normalize to . This is a classic model in epidemiology (see, for example, Murray, 2002 p.320), in which the population is divided into three compartments consisting of susceptible individuals, infected individuals and recovered individuals, respectively denoted by , and at time . Note that because of the assumption of a constant population of size , for all , we have We let , and and assume that only two types of transitions are possible: susceptible individuals can become infected and infected individuals recover.7
(As is standard, “recovered” simply means the individuals are no longer infectious, which occurs either because they gained immunity or died following infection.) Let denote the rate of infection: the average number of contracts per individual per unit time, multiplied by the probability that the infection is transmitted in a given contact between a susceptible and an infectious individual. We assume that we have a well-mixed or homogeneous population so that is the fraction of contact occurrences that involve an infectious individual and is the fraction of contract occurrences that involve a susceptible individual. The rate of transition between the susceptible compartment and the infectious compartment is thus given by and is the fraction of the population that is newly infected per unit time.8We denote by the severity of the lockdown, with meaning no lockdown and meaning complete lockdown. We assume that is the choice variable of the policymaker, and with regards to the epidemic, its impact is that it affects the transmission rate as follows: where is a fixed component of the transmission rate, is a constant, and makes the dependence of on explicit.We further assume that individuals recover at rate .9
In SIR models, the parameter plays an important role in governing the dynamics of an epidemic. Suppose that . Then in this simple model, whenever the number of infected individuals will increase from time , resulting in an epidemic. If then the number of infected individuals will decrease from time and an epidemic does not occur (alternatively, we can think of the “peak” of the epidemic as occurring at time ).The proportion of those who are infected need treatment, so that, given and , the number of people requiring treatment at time is Letting denote the maximum capacity of the healthcare system to treat COVID-19patients without reducing the care given to other patients in need, the constraint for managing the epidemic is, for all ,In the following section, we augment the epidemiology model by an economic production function to analyze tradeoffs involving economics. Specifically, we assume that GDP, denoted , is produced using labor according to the production function , where is a parameter that measures labor’s productivity, which can be calibrated using labor’s income share in national accounts data.10
Letting denote the amount of labor that is not affected by the lockdown variable , the amount of labor that is productive given is where is the part of the labor that is affected by the lockdown variable . Thus, is the pre-lockdown labor supply.11In Section 4, we will also consider a heterogeneous agent version of the model.12
Specifically, we assume that there is a continuum of types in the population.13
We denote by the absolutely continuous distribution of types in the population and by , and the density of susceptible, infected and recovered individuals, respectively, of type at time . The density of individuals of type thus satisfies, for all , We let , and .We assume that the type of any given individual is observable and that the policymaker can implement a type-dependent lockdown policy , where denotes the severity of the lockdown for individuals of type , with meaning no lockdown and meaning complete lockdown. Similarly to the basic SIR model, we impose a homogeneous mixing assumption. The transmission rate for individuals of type under the lockdown policy is then given by For simplicity, we assume that the parameters and are independent of . Note that in the heterogeneous agent model we make the dependence of the evolution of the epidemic on the lockdown policy explicit by introducing a subscript to every variable in the model that depends on .We assume that is the proportion of infected individuals of type that require treatment. Given a lockdown policy , the total number of individuals that require treatment at time is given by Similarly to the homogeneous agent model, we augment the heterogeneous agent model by an economic production function to analyze tradeoffs involving economics. Specifically, if denotes labor supplied by individuals of type under lockdown policy , then output produced by individuals of type is given by where denotes the productivity of individuals of type . Total output under lockdown policy is thus given by Letting denote the proportion of labor that is not affected by the lockdown variable , the amount of labor of type that is productive given is For simplicity, we again assume that and are independent of .
Homogeneous agent model
We now analyze the dynamics of an epidemic and then derive the minimal lockdown policy necessary to satisfy the constraint at all times in the model with homogeneous agents.
Dynamics of an epidemic
In this subsection we treat as a parameter and study the dynamics of an epidemic in our simple homogeneous SIR model. We also characterize the epidemic peak and derive some useful comparative statics. The dynamics of an epidemic in our simple homogeneous agent SIR model are governed by the following system of non-linear differential equations: with initial conditions , and . Harko et al. (2014) provided an analytic solution to this system of equations by parameterizing time by a parameter . In particular, introducing the integration constants we have Notice that when we have and that decreases as increases.14
We can then back out using .The basic dynamics of an epidemic are as follows. As susceptible individuals become infected and then recover, the stock of susceptible individuals decreases over time and the stock of recovered individuals increases over time. The number of infected individuals initially increases before reaching an epidemic peak and then gradually decreasing. The number of infected individuals stops increasing once the population of susceptible individuals is sufficiently small. An example of a typical epidemic path is shown in Fig. 1. Note that unless stated otherwise all figures are drawn for the parameterization , , , and ; Fig. 1 assumes .15
Fig. 1
The evolution of a typical epidemic .
Assuming (so that the peak of the epidemic does not occur at ), the maximal number of infected individuals during the epidemic is characterized by where Notice that we have where the inequality follows from the fact that since by assumption . Not surprisingly, the peak number of infected individuals, , increases in the rate of infection . This means that any policy intervention that decreases , such as wearing masks, decreases . Intuitively, and as shown next, this will make it easier to meet a given capacity constraint in the sense that, all else equal, a less severe lockdown is required to satisfy that constraint. Put differently, anyone who dislikes catastrophic health outcomes and lockdowns should be in favor of wearing masks according to this framework.The evolution of a typical epidemic .
Capacity constraints
We now return to the problem faced by our policymaker, as introduced in Section 2, where is endogenous and depends on the chosen lockdown policy . Notice that the parameters , , and , as well as the initial conditions, impose restrictions on the lower feasible bound for . Specifically, denote by the number of infectious at time given policy , by the maximum number of infected individuals given , and by the maximum number of people needing treatment per time given policy . From (2) we have that and are continuously decreasing in since is decreasing in . From this and continuity it follows that is feasible if and only if If , then the capacity constraint is so tight that it can never be satisfied at the peak of the epidemic, not even with the most severe lockdown policy. If , then no lockdown is required to satisfy the constraint.Conversely, for any satisfying (3) there is a minimal lockdown policy, denoted , that satisfies the constraint that the number of individuals requiring treatment at time never exceeds . Formally, Because is a decreasing function of , it follows that is a decreasing function of . Intuitively, as the capacity constraint increases, the severity of the required lockdown decreases.As for policy implications, this means that, all else equal, states or countries with larger capacities can afford less stringent lockdowns. For a given lockdown policy the transmission rate parameter can also vary substantively between states and countries as the value of this parameter varies with factors such as population density and household composition. Since the maximum number of patients requiring treatment is given by and is increasing in (see (2)), it follows that, all else equal, states or countries with larger transmission rates require more stringent lockdowns. Formally, compare two regions, each with capacity , with transmission rates parameterized by and satisfying for , where at least one of these inequalities is strict. Denoting the respective minimal lockdown policies by and , we then have In other words, regions with lower transmission rates can afford slacker lockdown policies as is illustrated in Panel (a) of Fig. 2. This figure uses the same parameters values as Fig. 1 but with and . As noted at the end of Section 3.1, this also means that wearing masks affords slacker lockdown policies in the presence of an airborne, transmittable disease.
Fig. 2
Panel (a) illustrates that a higher schedule of values necessitates a more severe lockdown for a given value. Panel (b) illustrates the relationship between and for a range of values. As was shown analytically, for a given value, the severity of the lockdown decreases as the capacity of the healthcare system increases. This figure also shows that a more severe lockdown is required if a higher proportion of the population is initially infected .
The relationship between lockdown and capacity.
We now look in slightly more detail at the relationship between and . Panel (b) of Fig. 2 plots this relationship, assuming and .16
As before we set and but we now create plots for three different values of : (in which case ), (in which case ) and (in which case ). Panel (b) of Fig. 2 shows that the lockdown policy needed to achieve a given increases in the proportion of the population that is initially infected. This figure also shows how the proportion of the population that requires treatment at the height of the pandemic, for a given lockdown policy , increases in the proportion of individuals that are initially infected. Consequently, for a given cap , a more severe lockdown is required as increases. This result highlights the high cost of a delayed policy response.17Panel (a) illustrates that a higher schedule of values necessitates a more severe lockdown for a given value. Panel (b) illustrates the relationship between and for a range of values. As was shown analytically, for a given value, the severity of the lockdown decreases as the capacity of the healthcare system increases. This figure also shows that a more severe lockdown is required if a higher proportion of the population is initially infected .Fig. 3 provides some additional comparative statics showing how the severity of the required lockdown decreases as increases and as decreases. Panel (a) uses the same parameter values as Panel (b) of Fig. 2 but with (in which case since we set ) and , and . Panel (b) also uses the same parameter values as Panel (b) of Fig. 2 but with (in which case since we set ) and , 0.0875 and 0.055. One interpretation of these comparative statics is that as superior treatments become available, individuals both require less overall treatment and recover from the disease more quickly and hence a less severe lockdown is required.
Fig. 3
The severity of the required lockdown decreases as increases and as decreases .
The severity of the required lockdown decreases as increases and as decreases .
Economic impact.
By mapping the severity of the lockdown to gross domestic product (GDP), one can trace out the relationship between the capacity constraint and economic output. Substituting into the production function yields output It follows that, given the minimal lockdown policy for the constraint , output, denoted , is given by Because decreases in and decreases in , it follows that A plot illustrating how increases in for a given set of parameters can be found in Fig. 4, which assumes , and (and as before , , , , ).18
This plot shows that longer delay in the initial policy response – which leads to a higher number of infected individuals in the population prior to any lockdown intervention – results in policymakers facing a more severe economic impact of the pandemic in order to satisfy the binding constraint .
Fig. 4
Output increases in and decreases in .
Output increases in and decreases in .
Sensitivity analysis and confidence intervals
The very nature of contagious diseases is that their dynamics are inherently non-linear. Epidemiology models are needed to predict the spread of the disease and future demand for healthcare. Consequently, a policy that only adapts to current data without accounting for future states will fail to satisfy the capacity constraints. Of course, as with any predictive model, predictions are subject to uncertainty and errors that can result both from model misspecification and from uncertainty about parameters within the model. We now briefly discuss how the latter can be accounted for within the homogeneous population model.Up to this point we have treated the parameters of the model as given and known. As just mentioned, in practice, there may be considerable uncertainty and measurement error associated with these parameter values, implying that the predictions of the model are not deterministic. We now discuss how the distribution of the predicted peak of the epidemic can be derived from the, by assumption, known distributions of the uncertain parameters , , , and .After some tedious algebra, the proportion of the population requiring treatment at the peak of the epidemic is given by
The density of is thus where denotes the Dirac delta function and denotes the distribution of the random variable . From here one can construct a confidence interval for the maximum number of individuals requiring treatment at the peak of the epidemic.For example, suppose that , and are known parameters and that . That is, is normally distributed with mean 0.11 and standard deviation 0.01. Then is normally distributed with mean and standard deviation A 95% confidence interval for the value of is then given by . Therefore, if we use then we can say that with 97.5% confidence, the constraint will not be violated at the peak of the epidemic. An illustration is provided in Fig. 5. This figure uses precisely the same parameters as those shown in Panel (b) of Fig. 2 and sets (in which case ).
Fig. 5
Assume . If is deterministic and equal to 0.11, we have . In contrast, if is normally distributed with mean 0.11 and a standard deviation of 0.01, then is required to satisfy the constraint with probability 0.975 .
An alternative to the approach adopted here would be to perform a conditional worst-case analysis. That is, if one had estimates of moments of a particular parameter distribution (such as its mean and variance), then one could compute confidence intervals with respect to the worst-case distribution.Assume . If is deterministic and equal to 0.11, we have . In contrast, if is normally distributed with mean 0.11 and a standard deviation of 0.01, then is required to satisfy the constraint with probability 0.975 .
Heterogeneous agent model
We now turn our attention to the heterogeneous agent model. Without loss of generality, one can normalize for all . Note that this implies that Under our assumption of homogeneous mixing among all type cohorts, the time rate of transition between the compartment of susceptible individuals of type and the compartment of infected individuals of type is The dynamics of an epidemic in this SIR model are then governed by the following system of non-linear differential equations with initial conditions , and , where . Letting , and and integrating this system of differential equations yields The social planner then selects the lockdown policy that maximizes output subject to the constraint that total hospitalizations not exceed at any point during the epidemic. That is, for all , To ensure that we have an interesting problem, we assume that this constraint is violated under the lockdown policy and is slack under lockdown policy . Under homogeneous mixing this model reduces to a simple homogeneous agent SIR model with a transmission rate that depends on the type-dependent lockdown policy. Exploiting this fact yields the following proposition.Assume
and
, where
are constants, and for all
,
. Then the optimal policy is bang–bang, that is, there is an
such thatHaving a bang–bang solution is not only analytically convenient but also useful in practice: even though the planner might want to consider a continuum of lockdown policies, which would pose practical difficulties, in this case it is without loss of generality to only consider minimal and maximal lockdown policies across type cohorts. Fig. 6 indicates that similar comparative statics to those illustrated in Fig. 2, Fig. 3 hold for the cutoff type that characterizes the optimal lockdown policy under the heterogeneous agent model.
Fig. 6
The type cutoff associated with the optimal bang–bang lockdown policy increases in and and decreases in and . These figures use the same parameter values as Fig. 2, Fig. 3 where types are distributed uniformly over the interval and is constant across types .
The type cutoff associated with the optimal bang–bang lockdown policy increases in and and decreases in and . These figures use the same parameter values as Fig. 2, Fig. 3 where types are distributed uniformly over the interval and is constant across types .Interestingly, the heterogeneous agent model induces a non-trivial economic optimization problem that does not require taking a stance on how economic activity is traded off against the number of deaths caused by the disease. Indeed, the optimal lockdown policy in this setup maximizes economic output subject to a given capacity constraint. In this sense, the dollars-death tradeoff is not the starting point of the analysis but rather a result of the analysis.19
Of course, many of the specific policy implications derived from our setup need not carry over to richer models but the basic feature that models with heterogeneous agents and a capacity constraint induce an economic optimization problem without specifying the value of life remains valid.
Policy-dependent mixing.
Another notable feature of Proposition 1 is that it does not include an assumption concerning how varies with . This is a direct consequence of the homogeneous mixing assumption. However, in practice, one might also expect a lockdown policy to impact how the type cohorts mix. In this case, the structure of the optimal policy will also depend on how hospitalization rates vary across types. We now relax the assumption of homogeneous mixing and consider type-dependent mixing. Motivated by the form of the optimal policy under homogeneous mixing and for purposes of tractability we restrict attention to lockdown policies such that for all . We further assume that only types subject to the same lockdown policy mix (and mix in a homogeneous fashion). We then have the following proposition.Assume that for all
we have
,
,
and
, where
are constants. Then under policy-dependent mixing with
for all
the optimal policy is monotone. That is, there exists a
such that
Moreover,
.Suppose, for illustrative purposes, that agent types correspond to age cohorts and that younger age cohorts are more productive than older age cohorts. Then Proposition 1 shows that under homogeneous mixing the optimal lockdown policy allows younger individuals who are more productive to return to work, while older individuals are subject to a strict lockdown in order to combat the spread of the epidemic. Proposition 2 shows that if older age cohorts are also more vulnerable and more likely to require hospitalization, then the optimal lockdown policy under policy-dependent mixing takes a similar form. Moreover, a bang–bang lockdown policy is more effective, in the sense that a smaller proportion of the population is subjected to a lockdown (and hence economic output is higher) with policy-dependent mixing than without it.
Discussion
We now discuss several natural extensions of our modeling approach, with a particular focus on optimal dynamic policies. This allows us to compare our results to those of Alvarez et al. (2021) and Acemoglu et al. (2021).
Optimal dynamic lockdown policies
We now extend our baseline analysis by allowing the lockdown policy to vary over time and determine the optimal dynamic lockdown policy, subject to the capacity constraint. Since we will be solving this model numerically, for simplicity we now consider a discrete-time version of the model. We let , and denote the respective number of individuals that are susceptible, infected and recovered at time . We again assume that we have a population of a constant size and normalize the size of the population to so that . Given that , we have a model with only two state variables: and . We let denote the policy function, where specifies the lockdown policy in state . All other variables in the dynamic model are defined as they were in the static case. We assume that the policymaker discounts future output according to the discount factor and thus solves the following optimization problem: where the objective function in (4) is the time-discounted sum of output, (5) specifies the laws of motion of the state variables and (6) specifies the capacity constraint. As we did in Section 3, we simply take The Bellman equation associated with this problem thus satisfies where denotes the value function.20
Panels (a) and (b) of Fig. 7 provide an illustrative numerical solution for the parameters , , , , , , , , , and .21
Fig. 7
An illustrative solution (Panel (a)) and trajectories (Panel (b)) for the optimal dynamic lockdown policy. Panel (c) displays the optimal cutoff type under a heterogeneous agent model with homogeneous mixing. The cutoff varies over time but at each point in time a bang–bang policy is optimal .
As is illustrated in Panel (a) of Fig. 7, the optimal dynamic lockdown policy imposes a short, sharp lockdown. The policymaker allows the pandemic to progress to the point where the constraint binds before implementing a strict lockdown that prevents the constraint from being violated. The constraint then binds for an extended period while the policymaker swiftly eases the lockdown. Once the constraint becomes slack following the peak of the pandemic we have . The qualitative features of the optimal dynamic lockdown policy differs substantively from those derived (Alvarez et al., 2021).22
Their optimal dynamic policies have a “hump-shaped” appearance (see their Fig. 1, Fig. 2, Fig. 3, Fig. 4, Fig. 5, Fig. 6, Fig. 7, Fig. 8), with the policymaker gradually easing into and out of the lockdown.
Fig. 8
An illustration of the optimal policies (Panel (a)) and pandemic trajectories (Panels (b) and (c)) under the stochastic arrival of a vaccine. Displayed here are two cases involving late arrival of the vaccine (Panel (b), ) and early arrival of the vaccine (Panel (c), ) .
An illustrative solution (Panel (a)) and trajectories (Panel (b)) for the optimal dynamic lockdown policy. Panel (c) displays the optimal cutoff type under a heterogeneous agent model with homogeneous mixing. The cutoff varies over time but at each point in time a bang–bang policy is optimal .An interesting feature of the dynamically optimal lockdown policy is that once the capacity constraint becomes slack, no future policy interventions are required. Intuitively, the optimal dynamic policy leads to the shortest possible duration of the lockdown by decreasing the population of susceptible individuals as efficiently as possible, subject to the capacity constraint. Consequently, once the capacity constraint is slack we always have under the dynamically optimal policy and no future policy interventions are required. This means that a second wave of infections cannot occur unless (for reasons outside the scope of the model) there is a sufficiently large increase in the population of susceptible individuals. Relative to this optimal dynamic policy derived here, it appears that during the COVID-19 pandemic many US states (such as California) have adopted a policy more akin to a statically optimal policy. Under these policies a longer and less severe lockdown occurs, resulting in an extended period of depressed output. Even after the peak of the pandemic has passed, if a policymaker cancels a statically optimal lockdown too soon, this can result in a large second wave of infections occurring (particularly if the capacity constraint is tight and a large population of susceptible individuals remain after the peak of the pandemic passes). In this sense, statically optimal policies are less robust to future mistakes on the part of policymakers.In principle, a calibrated version of this dynamic model also allows us to predict total output loss. For example, for the parameters used to construct Panels (a) and (b) in Fig. 7, output falls by at most 13% over the course of the pandemic, while the corresponding decrease in the policymaker’s objective function (the time-discounted sum of output) is 3.4%. Of course, these predictions are highly sensitive to the choice of parameters and constraint. The purpose of our framework is to illustrate what we believe to be a fruitful approach to policy rather than produce a precisely calibrated model for which agent-based models à la Akbarpour et al. (2020) are much better suited.23
That said, with the exception of and , all the parameter values used for this exercise are, as mentioned, taken from the epidemiology and macroeconomics literature.The model introduced here can also be extended to account for heterogeneous agents. In particular, if we assume homogeneous mixing among all type cohorts, a bang–bang lockdown will continue to be optimal at every point in time under the same conditions as those stated in Proposition 1. What will change over time is the cutoff such that types with are subjected to the strictest lockdown possible and types with are not subjected to any lockdown. The optimal policy can then be represented by a function , which specifies the cutoff type that characterizes the optimal bang–bang lockdown policy in state . Panel (c) in Fig. 7 illustrates the optimal policy for the same parameters used to construct Panels (a) and (b), with types uniformly distributed over the interval (with constant across types). These results are directly applicable to those presented in Section 5.1 of Acemoglu et al. (2021), which considers homogeneous mixing of age cohorts and a “semi-targeted” lockdown.24
Under a “semi-targeted” lockdown policy, the policymaker can specify one lockdown policy for the oldest age group and another for the young and middle-aged age groups. Since these age groups are fixed, the optimal policies are not bang–bang (see Figures 5.4 and 5.5 of Acemoglu et al. (2021)). In contrast, in our framework, the age groups are endogenous and optimal – an agent is either below or above the threshold implied by the optimal dynamic policy – and the groups vary over time. This shows that when faced with the choice between having (i) a bang–bang policy and time-varying groups or (ii) a rich menu of policies but fixed groups the social planner would prefer (i).
Arrival of a vaccine
Another important concern for policymakers is the possibility that a vaccine may arrive at some point in the future. We now investigate the implications of this for optimal lockdown policies. Since the arrival of a vaccine is stochastic in nature, it is most natural to consider this in our setting with dynamically optimal policies.We extend the homogeneous agent model from Section 5.1 by allowing for the stochastic arrival of a vaccine. Specifically, we consider a simple model in which, prior to the arrival of a vaccine, the probability that a vaccine arrives in any given period is . This implies that the arrival period of the vaccine is geometrically distributed. We assume that when the vaccine arrives, all susceptible individuals are inoculated, which we can represent by moving them to the compartment for recovered individuals.25
The Bellman equation corresponding to the policymaker’s dynamic optimization problem then simply becomes An illustrative numerical solution for the same parameters used in Fig. 7 and is shown in Fig. 8. Naturally, if the vaccine arrives too late (i.e. once the capacity constraint is slack), this does not impact the optimal lockdown policy. However, if the vaccine arrives earlier, this allows the policymaker to immediately ease the lockdown policy, allowing for an earlier economic recovery. The possibility that a vaccine will arrive at some point in the future does not substantively impact the dynamically optimal policy until the arrival actually occurs. Intuitively, this is because the policymaker cannot allow the capacity of the healthcare system to be violated for any realization of the stochastic arrival process. Thus, under a capacity constraint, there are no dynamic complementarities of the form considered in Alvarez et al. (2021), where the possibility that a vaccine will arrive in the future incentivizes the planner to implement a stricter lockdown today.An illustration of the optimal policies (Panel (a)) and pandemic trajectories (Panels (b) and (c)) under the stochastic arrival of a vaccine. Displayed here are two cases involving late arrival of the vaccine (Panel (b), ) and early arrival of the vaccine (Panel (c), ) .
Variable healthcare capacity
In all of the models we have considered up to this point, the policymaker knows in advance what the capacity of the healthcare system will be at the peak of the pandemic and chooses its lockdown policy accordingly. However, as we have seen during the recent COVID-19 crisis, many countries expanded the capacity of their healthcare systems over the course of the pandemic. We now analyze the impact of stochastic increases in the capacity of the healthcare system, which is best done in a model with dynamic, rather than static, lockdown policies.To this end, we now augment the homogeneous agent model from Section 5.1 with the stochastic arrival of increased healthcare capacity. Specifically, we consider a model in which the initial capacity of the healthcare system is and at some point the capacity of the healthcare system increases from to , where . In any given period in which the healthcare capacity is , the probability that the healthcare capacity increases to in period is . The arrival period of the increase in healthcare capacity is thus geometrically distributed. Let
() denote the value function for states in which the capacity is
(). The Bellman equations corresponding to the policymaker’s dynamic optimization problem are now and An illustrative numerical solution for the same parameters used in Fig. 7 (but with , and ) is shown in Fig. 9. Naturally, if the increase in the capacity of the healthcare system arrives too late (i.e. once the capacity constraint is slack), it has no impact on the course of the pandemic. If the increased capacity of the healthcare system arrives earlier, then this allows the policymaker to temporarily ease the lockdown until the capacity constraint binds. This ensures that the overall lockdown is both shorter and less strict, which results in higher output from the period in which the increased capacity arrives. Similarly to what we saw with the random arrival of a vaccine, there are no dynamic complementarities and the possibility that the capacity of the healthcare system will increase at some point in the future does not substantively impact the dynamically optimal policy until this increase is actually realized.
Fig. 9
An illustration of the optimal policies (Panel (a)) and pandemic trajectories (Panels (b) and (c)) under stochastic arrival of increased healthcare capacity. Displayed here are two cases involving late arrival of the increased capacity (Panel (b), ) and early arrival of the increased capacity (Panel (c), ) .
An illustration of the optimal policies (Panel (a)) and pandemic trajectories (Panels (b) and (c)) under stochastic arrival of increased healthcare capacity. Displayed here are two cases involving late arrival of the increased capacity (Panel (b), ) and early arrival of the increased capacity (Panel (c), ) .
Conclusions
This time is different.26
The cause of the economic downturn associated with COVID-19 (a pandemic rather than the burst of a financial bubble or any other structural issue with the economy), its scope (universal, hitting all countries more or less within the same quarter) and magnitude (record increases in unemployment filings in the United States) are unprecedented. While there are good reasons to be confident that, informed by the in-depth analyses of past mistakes, the policy response to a severe economic downturn will be better and swifter than at the onset and during the Great Depression, the unparalleled nature of the current shock makes recovery a perilous and winding road. Although policymakers may be ready to act swiftly, the ongoing virulence of the disease may prevent them from so doing. Without widespread immunization, return to normalcy would be difficult if not impossible even if there were no inertia in rebooting economies that have come to a standstill. We will have to find the path to recovery by learning on the go, and learning quickly.During the COVID-19 pandemic, arguments have been put forth that policymakers should first take care of the public health aspect of the pandemic and only tackle the economic fallout once the health crisis has been dealt with. Generally speaking, it is not clear what it means to only turn to the economic aspects down the track nor whether the two dimensions can be really separated. Effective COVID-19 vaccines have now been developed, which is a much more comfortable situation than the world was in before. However, as the onslaught of the winter wave in Europe and the United States makes painstakingly clear, there is a time delay between having an effective vaccine and a vaccine becoming effective. Tough months and decisions lie ahead for policymakers in these and many other regions of the world, and catastrophic health outcomes like those New York City or Lombardy experienced in the first half of 2020 remain a lurking threat. In future pandemics, developing an effective vaccine may prove elusive, in which case the health crisis and the economic crisis cannot be separated. Our approach provides a way of formalizing the notion of dealing with the health crisis first – avoiding health catastrophes by satisfying the capacity constraints at all times – while minimizing the economic fallout of satisfying these constraints.Continuum SIR models, such as the ones analyzed here, provide good approximations for large populations. However, for smaller populations or more refined targets – such as ensuring that ICU beds do not run out – this family of models does not necessarily provide a good approximation. For these kinds of applications, models using agent-based simulations are more appropriate tools, and if calibrated to rich micro-level data, provide more reliable estimates of the outcomes of interest. A promising avenue for future research would be to develop agent-based models that account for the healthcare system’s capacity constraint. While it is true that, looking backwards, pandemics of the nature of COVID-19 are once-in-a-century events, given the growth in the world’s population and the globalization of trade and travel there is no reason to believe that this will continue to be the case going forward. Policies that are based on the backward-looking perspective will not be sustainable when the next pandemic comes around. Having frameworks at hand to guide policy in this contingency will be invaluable.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.