Tanujit Dey1, Jaechoul Lee2, Sounak Chakraborty3, Jay Chandra4, Anushka Bhaskar4, Kenneth Zhang5, Anchal Bhaskar6,7, Francesca Dominici8. 1. Center for Surgery and Public Health, Department of Surgery, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02120, USA. 2. Department of Mathematics, Boise State University, Boise, ID 83725, USA. 3. Department of Statistics, University of Missouri, Columbia, MO 65211, USA. 4. Harvard College, Cambridge, MA 02138, USA. 5. Ancaster High School, Ancaster, ON L9G 3K8, Canada. 6. Aliso Niguel High School, Aliso Viejo, CA, USA. 7. Williams College, Williamstown, MA 01267, USA. 8. Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA.
Abstract
State-level policy interventions have been critical in managing the spread of the new coronavirus. Here, we study the lag time between policy interventions and change in COVID-19 outcome trajectory in the United States. We develop a stepwise drifts random walk model to account for non-stationarity and strong temporal correlation and subsequently apply a change-point detection algorithm to estimate the number and times of change points in the COVID-19 outcome data. Furthermore, we harmonize data on the estimated change points with non-pharmaceutical interventions adopted by each state of the United States, which provides us insights regarding the lag time between the enactment of a policy and its effect on COVID-19 outcomes. We present the estimated change points for each state and the District of Columbia and find five different emerging trajectory patterns. We also provide insight into the lag time between the enactment of a policy and its effect on COVID-19 outcomes.
State-level policy interventions have been critical in managing the spread of the new coronavirus. Here, we study the lag time between policy interventions and change in COVID-19 outcome trajectory in the United States. We develop a stepwise drifts random walk model to account for non-stationarity and strong temporal correlation and subsequently apply a change-point detection algorithm to estimate the number and times of change points in the COVID-19 outcome data. Furthermore, we harmonize data on the estimated change points with non-pharmaceutical interventions adopted by each state of the United States, which provides us insights regarding the lag time between the enactment of a policy and its effect on COVID-19 outcomes. We present the estimated change points for each state and the District of Columbia and find five different emerging trajectory patterns. We also provide insight into the lag time between the enactment of a policy and its effect on COVID-19 outcomes.
In December of 2019, a new strand of coronavirus was identified in Wuhan, China. The first case of COVID-19 in the United States of America was identified on January 19–20, 2020, in the state of Washington, after a man returned from Wuhan, China, on January 15, 2020. The first non-travel-related COVID-19 case was confirmed on February 26, 2020, which raised concerns related to community transmission of the new coronavirus in the United States.On March 11, 2020, after noticing patterns of international spread, the World Health Organization declared COVID-19 a pandemic. With the number of COVID-19 cases rising steadily in the United States and around the world, concerted intervention by states and the US federal government was required in order to effectively monitor and prevent the spread of the virus. The goal of these interventions was to ensure and maintain access to testing for as many people as required. On March 13, 2020, President Donald Trump declared a state of emergency in an attempt to strengthen the response of the federal government to the pandemic. As commented on by the Kaiser Family Foundation (KFF), US states took slow action to contain the spread of the virus, especially those states hit hardest by the outbreak, suggesting that effective policy responses were delayed. In one such example, the first patient to have diagnosed COVID-19died on February 29, 2020, whereas the declaration of a state of emergency by national and state-level governments was announced nearly a full month later. According to KFF, every state in the United States had made an emergency declaration by March 16, 2020, with most of these declarations listed as either a State of Emergency or a Public Health Emergency. Although these emergency declarations were made, the United States had already reported 4,507 cases on March 16 and 188,461 by March 31, 2020. In response, several states issued stay-at-home orders, such as California issuing its stay-at-home order on March 19, 2020.
Importance of state-wise policies compared with federal-level policies
According to KFF, many states have implemented policies in order to increase access to COVID-19 testing and treatment in addition to improving the management of other health conditions. Governments across the world, at either the national or the subnational level, have similar methods for implementing public policies. Many nations implemented policies after considering the challenges faced by other nations when implementing a similar policy. This allowed legislators to draw valuable lessons during implementation and alter policies in the aim of achieving the best possible outcome for their constituents. Policy decisions are further conditioned by “contextual factors, including institutional (e.g., constitutional and legalistic structures) factors, cultural orientations, economies, and political styles (among others).” Policy networks, which are entities that seek to influence policy, relationships with other legislations, and the related outcomes, both respond and contribute to the shifting of attention to policy issues and the change of government agendas. Examples of policy networks include political parties, elected offices, non-governmental organizations, and public entities, which communicate across numerous connections vital to the policy-making process.The COVID-19 pandemic has encouraged a rapid and dramatic shift in the priorities of policy networks and, correspondingly, shifts in the priorities of many government decision-making venues, such as legislatures and parliaments. The Swiss parliament, for instance, interrupted its usual slate for the spring session and adopted other issues, such as climate change and pension changes. Since the start of COVID-19, many policy networks, like those at the state level, have begun to focus more on the fundamental purpose of particular policy areas, whether that be education or to provide food for struggling families. Other examples include the implementation of California's state-wide law to prohibit the eviction of tenants on commercial property, or initiatives put in place, such as Connecticut providing $95.5 million in SNAP benefits to families of children eligible for free and reduced-price school meals through the Pandemic Electronic Benefits Transfer program.
Literature review on previous studies
Social distancing measures were either fully or partially relaxed by all US states and the District of Columbia weeks after they were first issued in order to suppress the transmission of COVID-19 and reduce the growth in cases of severe coronavirus disease. Using segmented linear regression, one study measured the extent to which the relaxation of social distancing measures influenced the control of the epidemic in the United States. Following the gradual easing back of social distancing measures across the United States, Tsai et al. observed an immediate and significant turnaround in the suppression of the epidemic as the country's ability to monitor the disease burden associated with COVID-19 was compromised by the premature relaxation of social distancing steps. However, another study suggested a slightly dissimilar conclusion when applying reduced-form econometric methods to empirically evaluate the effect of anti-contagion policies on the growth rate of infections. When reduced-form econometric methods were used, it was found that anti-contagion policies have significantly decelerated the growth rate of infections. It was estimated that among the six countries (China, South Korea, Italy, Iran, France, and the United States) analyzed, anti-contagion policies have prevented or delayed approximately 61 million confirmed cases and averted 495 million total infections.In another study, a group of researchers used a stochastic individual-based model for the transmission of COVID-19 to describe individual contact networks stratified into household, school, community, and workplace layers, using demographic and epidemiological data from the United Kingdom. The authors found that if social distancing measures were relaxed, including the reopening of schools, they must be accompanied by large-scale, population-wide testing for symptomatic individuals alongside contact tracing. Another rapid review was also conducted to assess the effects of quarantine, alone or in combination with other measures, on individuals who have come in contact with confirmed cases of COVID-19. The review concluded that evidence of COVID-19 was limited to modeling studies, but consistently indicated that quarantine is crucial to the reduction of infection and mortality during the pandemic.Another study used generalized linear mixed-effects models with state-level clustering in order to estimate county-level associations with an overall social vulnerability index (SVI) as well as SVI subcomponent scores with COVID-19 case fatality rate (CFR). The authors found no significant association between overall SVI and COVID-19 incidence but found that the social status and minority status subcomponents of the SVI were both predictors of higher incidence and CFR. Other papers have been suggestive of the existence of social inequities in the United States for COVID-19 outcomes in regard to national, state, and local public health data. The analysis of US county-level COVID-19deaths, confirmed cases, and positive tests in Illinois and New York City, which is stratified by zip codes, area percent crowding, poverty, and population of color, revealed that socioeconomic disparities for COVID-19 outcomes exist in the United States. To provide an evidence base for policy and resource allocation, the paper suggests the use of straightforward cost-effective methods to report on social disparities in COVID-19 results. In addition to social disparities, the association of public health interventions with epidemiological characteristics of the COVID-19 pandemic was evaluated using individual-level data on 32,583 laboratory-confirmed COVID-19 cases in Wuhan, China. The rates of laboratory-confirmed COVID-19 cases were calculated across five periods: first, from December 8, 2019, to January 9, 2020, when no public health interventions were executed; then, January 10–22, 2020, during the Chinese New Year Holiday; January 23 to February 1, 2020, during which travel restrictions and quarantine were enforced; February 2–16, 2020; and February 17 to March 8, 2020, when universal symptom surveys were completed. Over the series of public health interventions, it was identified that there was a temporal association between the interventions and the improved control of the COVID-19 pandemic in Wuhan, China. This indicates that public health interventions were associated with improved control of the COVID-19 pandemic in the earlier stages of the outbreak in Wuhan, China, which may inform public health policy in other countries and regions.In addition to public health interventions, non-pharmaceutical interventions (NPIs) appeared to be effective in containing the COVID-19 outbreak in China. The study constructed a travel-network-based susceptible-exposed-infectious-removed model using epidemiological parameters estimated for the early stages of the outbreak in Wuhan before NPIs were implemented, in order to simulate the outbreak of the pandemic across cities in mainland China. Based on the model's results, without the implementation of NPIs, the number of confirmed COVID-19 cases would possibly have shown a 67-fold increase, indicating that the early detection and isolation of cases were forecast to prevent more infections than travel restrictions and reductions in contact. Also, it was proposed that integrated NPIs would have achieved the strongest, most robust, and most rapid effect of preventing more infections. However, if these NPIs had been implemented 1–3 weeks earlier in China, it is estimated that the total number of infections could have been reduced by 66%–95%. In conclusion, to mitigate health, social, and economic impacts in affected regions around the world, integrated NPI strategies should be planned, implemented, and modified earlier than they were.
COVID-19 infections and the effect of policies over time: A change-point perspective
As the data related to this pandemic are recorded over time, a natural choice for the development of methodological research could be time-series analysis. Also, because the infection rate of COVID-19 changes based on several reasons, one of the most important research goals could be the association of those changes in infection rates with the interventions imposed to stop or delay the growth of the pandemic. Therefore, a change-point analysis of the COVID-19 data associated with the growth rate of the pandemic should be rigorously studied. In the short period of time from the start of the pandemic until now, we are indeed witnessing an increasing number of contributions in this area of research.Using data from Germany, Dehning et al. applied a Bayesian epidemiological model to analyze the time dependence of the growth rate of COVID-19infections from a change-point perspective. Their research found that the growth rate of the pandemic is indeed correlated with the time points at which public interventions (policies) were decided. Jiang et al. performed a change-point analysis based on COVID-19 health outcomes (cases, deaths) using a piecewise linear trend model. They analyzed the trajectory of cumulative COVID-19 cases and deaths across 30 countries. In addition, they developed a forecasting model for predicting cumulative deaths in the United States. In yet another study, based on data from European countries, researchers identified the change points in the COVID-19 epidemic. A mixed-effects Poisson regression model was used to assess the relationship between the level of social distancing and the observed decay in the national epidemic. Wagner et al. performed an interrupted time-series analysis to evaluate the association between the interventions taken by the US states and the reduction in the spread of COVID-19.The current gaps in knowledge on this topic include four critical issues that need to be further considered altogether. First, the models for COVID-19 outcomes need to be flexible to better adapt to continuously evolving changes in the pandemic. Restrictive models such as model-based regression could be limited in modeling diverse patterns pertinent to COVID-19 outcomes. Second, the COVID-19 outcomes in a day are heavily dependent on the previous day. To be more effective at accounting for this temporal correlation, one will need a model that effectively models this strong temporal correlation. Third, the COVID-19 outcomes typically show multiple changing trajectories due to policy changes adopted by the states. The COVID-19 models need to consider such multiple structural changes in the trajectory. Fourth, as the time lag between a newly adopted policy and its resulting change in COVID-19 outcomes can vary, a probabilistic approach could be relevant to model the lag time. In short, to combine all of these four issues, one will need to develop a unified approach that rigorously finds change points in COVID-19 outcomes and links these change points to their associated policies via a probabilistic approach.
Goals of this study
In this paper we study the lag time between policy interventions and a change in daily COVID-19 outcome trajectory in the United States, a critical topic that has been less studied in the literature. First, we modeled the COVID-19 time series data using a stepwise drifts random walk to account for non-stationarity, strong temporal correlation, and multiple changes in the rate of change for the daily COVID-19 outcome (confirmed cases and deaths). Second, because a rigorous estimation of change points in the COVID-19 outcomes is very important, we apply a genetic algorithm (GA) with the minimum description length criterion to estimate the number and times of change points in the US COVID-19 outcome. GAs are a data-driven search technique based on the natural selection principle and, in particular, have been very effective in detecting multiple unknown change points in time-series data,19, 20, 21, 22 unlike some previous studies, that considered at most only one change point. We estimated the change points separately for each state and the District of Columbia. Third, we estimated the time-dependent growth rate for each US state based on its estimated change points and then identified five different emerging trajectory patterns in COVID-19 outcomes. Fourth, to link our estimated change points with population-level NPIs adopted by each US state, we created a random variable that assesses the relationship between the change points in COVID-19 health outcomes and the dates of NPIs and then tested the hypothesis of whether there is any impact of the NPIs on the COVID-19 outcomes within several selected days. For this, the underlying probabilistic model associated with this random variable is also identified.
Data used for this study
Data for COVID-19 outcomes
We used the US COVID-19 dataset from the New York Times data repository at https://github.com/nytimes/covid-19-data for our analysis of the US pandemic change points. For the data analysis, we selected the daily confirmed cases (from March 8, 2020, to February 28, 2021) and daily deaths (from March 18, 2020, to February 28, 2021) for each of the 50 states and the District of Columbia. We performed all the statistical analysis using the statistical software R version 4.0.5.
Data for non-pharmaceutical interventions
We extracted the dates marking policy implementations, available until July 30, 2020, for each of the 50 states and the District of Columbia, from the KFF GitHub (https://github.com/KFFData/COVID-19-Data/tree/kff_master/State%20Policy%20Actions/State%20Social%20Distancing%20Actions). We then manually updated the policy dates until February 22, 2020, using the current status of the different types of policies from August 3, 2019, to February 22, 2020. We made available the current status of each policy in the KFF GitHub (here). If from August to February there was a change in the state of any of the below-mentioned policy types, it was added to the original spreadsheet with policy dates until July 30, 2020. Policies were sorted into seven different categories, including stay-at-home orders, mandatory quarantines, non-essential business closures, bans on large gatherings, school closures, limitations on restaurants, and declarations of a state of emergency. The date of rollback for five of these policies (stay-at-home orders, mandatory quarantines, non-essential business closures, bans on large gatherings, and limitations on restaurant limits) was also recorded. Each state could have multiple policies of the same type if, for example, the policy was implemented, rolled back, and reimplemented. In this scenario, two separate policy implementation dates would be recorded. The updated policy data spreadsheet can be found at our GitHub website. Based on the updated policy data spreadsheet, we visualized the number of lockdown (Figure 1) and the number of reopening (Figure 2) policy implementations for each US state.
Figure 1
Number of lockdown policy implementations for the 50 US states
The number of lockdown policy implementations as recorded by the KFF is displayed for each of the 50 states. For each state, there are seven different types of lockdown policies, including stay-at-home orders, mandatory quarantine, non-essential business closures, large gathering bans, school closures, restaurant limits, and state-of-emergency declarations.
Figure 2
Number of reopening policy implementations for the 50 US states
The number of reopening policy implementations as recorded by the KFF is displayed for each of the 50 states. For each state, there are five different dates for different reopening policies, including the rollback of stay-at-home orders, mandatory quarantine, non-essential business closures, large gathering bans, and restaurant limits.
Number of lockdown policy implementations for the 50 US statesThe number of lockdown policy implementations as recorded by the KFF is displayed for each of the 50 states. For each state, there are seven different types of lockdown policies, including stay-at-home orders, mandatory quarantine, non-essential business closures, large gathering bans, school closures, restaurant limits, and state-of-emergency declarations.Number of reopening policy implementations for the 50 US statesThe number of reopening policy implementations as recorded by the KFF is displayed for each of the 50 states. For each state, there are five different dates for different reopening policies, including the rollback of stay-at-home orders, mandatory quarantine, non-essential business closures, large gathering bans, and restaurant limits.
Results
Estimated change points for COVID-19 outcomes in Florida
We applied our change-point method to each US state and the District of Columbia. To depict a detailed picture of the entire change-point results, we selected the COVID-19 data from Florida and thoroughly illustrated the analysis method and interpretation. Figure 3 summarizes our GA change-point estimation results for the log-transformed 7-day moving average series of daily new COVID-19 confirmed cases in Florida from March 8, 2020, to February 28, 2021. On the left, each vertical line denotes the daily number of new COVID-19 confirmed cases, the blue line represents the 7-day moving average case series, and the red vertical dotted lines show the GA estimated change-point times. As shown in the figure, the GA method estimates 10 change points on March 19, March 27, April 5, May 30, July 16, August 1, August 21, August 29, September 28, 2020, and January 9, 2021, segmenting the study period into 11 different regimes. These change-point times are also superimposed on the daily cumulative COVID-19 cases as displayed on the right of the figure. The GA appears to describe well the changes in day-to-day growth rate of the number of new cases (on the left) and the changes in the growth rate for day-to-day cumulative cases (on the right), as well.
Figure 3
Estimated change points in the Florida daily COVID-19 new confirmed cases and cumulative confirmed cases for the period from March 8, 2020, to February 28, 2021
New confirmed (left) and cumulative confirmed (right) cases are shown. The blue line represents the 7-day moving average of daily new confirmed cases. The red vertical dashed lines indicate the GA-estimated change points for the confirmed cases.
Estimated change points in the Florida daily COVID-19 new confirmed cases and cumulative confirmed cases for the period from March 8, 2020, to February 28, 2021New confirmed (left) and cumulative confirmed (right) cases are shown. The blue line represents the 7-day moving average of daily new confirmed cases. The red vertical dashed lines indicate the GA-estimated change points for the confirmed cases.Table 1 summarizes the estimated values of the means and standard deviations for day-to-day changes in the Florida log-transformed 7-day moving average case series with the 10 GA-estimated change points. The mean and standard deviation estimates for the day-to-day changes (or growth rates) experienced 11 different regimes, starting from a daily increase of 41.79% in the initial period of March 8–March 18, 2020, and ending with −2.09% in the last period of January 9–February 28, 2021. The results also indicate that, since March 8, 2020, the 7-day moving average of new cases increased, on average, at a rate of 41.79% day−1 until March 18, and after 10 changes, decreased at a rate of 2.09% day−1 in the period of January 9–February 28, 2021.
Table 1
Estimated means (with their standard errors in parentheses) and standard deviations for day-to-day changes in the Florida 7-day moving average COVID-19 log-transformed confirmed cases
The regime for COVID-19 cases
Estimated δj
Estimated ωj
March 8–March 18
0.4179 (0.0974)
0.3230
March 19–March 26
0.2172 (0.0131)
0.0371
March 27–April 4
0.0786 (0.0177)
0.0530
April 5–May 29
−0.0082 (0.0082)
0.0608
May 30–July 15
0.0594 (0.0068)
0.0466
July 16–July 31
−0.0212 (0.0062)
0.0248
August 1–August 20
−0.0389 (0.0076)
0.0338
August 21–August 28
−0.0343 (0.0051)
0.0143
August 29–September 27
−0.0085 (0.0125)
0.0685
September 28–January 8
0.0189 (0.0065)
0.0661
January 9–February 28
−0.0209 (0.0043)
0.0309
The parameters and are interpreted as the expected value and standard deviation, respectively, of day-to-day changes (or growth rate) in the moving average during the period in regime j, as expressed in Equations 3 and 4.
Estimated means (with their standard errors in parentheses) and standard deviations for day-to-day changes in the Florida 7-day moving average COVID-19 log-transformed confirmed casesThe parameters and are interpreted as the expected value and standard deviation, respectively, of day-to-day changes (or growth rate) in the moving average during the period in regime j, as expressed in Equations 3 and 4.Next, we applied the GA method to the log-transformed 7-day moving average series of Florida daily COVID-19deaths from March 18, 2020, to February 28, 2021. The GA detected seven change points on April 1, April 8, June 15, July 29, September 2, October 24, 2020, and January 24, 2021, partitioning the study period into eight regimes. Figure 4 depicts our GA change-point estimation results. The left of the figure shows the number of daily deaths as vertical lines, the 7-day moving average of daily deaths as the blue line, and the GA-estimated change-point times as red vertical lines. The right shows the daily cumulative deaths with these four GA change-point times overlaid. The GA change-point times appear to detect well the changing mean of day-to-day differences in the deaths series (on the left) and also the changes in the rate of mean changes in day-to-day cumulative cases (on the right).
Figure 4
Estimated change points in the Florida daily COVID-19 new deaths and cumulative deaths for the period from March 8, 2020, to February 28, 2021
New (left) and cumulative (right) deaths are shown. The blue line represents the 7-day moving average of daily new deaths. The red vertical dashed lines indicate the GA-estimated change points for the new deaths.
Estimated change points in the Florida daily COVID-19 new deaths and cumulative deaths for the period from March 8, 2020, to February 28, 2021New (left) and cumulative (right) deaths are shown. The blue line represents the 7-day moving average of daily new deaths. The red vertical dashed lines indicate the GA-estimated change points for the new deaths.Table 2 summarizes the estimated values of the means and standard deviations for day-to-day changes in the Florida log-transformed 7-day moving average death series with the 10 GA-estimated change points. The estimated mean for the day-to-day growth rate in Florida COVID-19deaths was 20.47% in the initial regime for March 18–31, and after seven changes, was decreased to −0.80% in the last regime for January 24–February 28, 2021. Equivalently, this result implies that the 7-day moving average of COVID-19deaths changed, starting from a rate of 20.47% day−1 until March 31 and ending with a rate of −0.80% day−1 for the period of January 24–February 28, 2021.
Table 2
Estimated means (with their standard errors in parentheses) and standard deviations for day-to-day changes in Florida 7-day moving average COVID-19 log-transformed deaths
The regime for COVID-19 deaths
Estimated δj
Estimated ωj
March 18–March 31
0.2047 (0.0474)
0.1773
April 1–April 7
0.1008 (0.0170)
0.0450
April 8–June 14
−0.0020 (0.0081)
0.0666
June 15–July 28
0.0387 (0.0075)
0.0495
July 29–September 1
−0.0116 (0.0070)
0.0413
September 2–October 23
−0.0119 (0.0116)
0.0838
October 24–January 23
0.0115 (0.0063)
0.0607
January 24–February 28
−0.0080 (0.0058)
0.0348
The parameters and are interpreted as the expected value and standard deviation, respectively, of day-to-day changes (or growth rate) in the moving average during the period in regime j, as expressed in Equations 3 and 4.
Estimated means (with their standard errors in parentheses) and standard deviations for day-to-day changes in Florida 7-day moving average COVID-19 log-transformed deathsThe parameters and are interpreted as the expected value and standard deviation, respectively, of day-to-day changes (or growth rate) in the moving average during the period in regime j, as expressed in Equations 3 and 4.
Estimated change points for COVID-19 outcomes in the United States
We now summarize our results of GA-estimated change points in the United States for two COVID-19 outcomes: (1) 7-day moving average series of daily new COVID-19 confirmed cases from March 8, 2020, to February 28, 2021, and (2) 7-day moving average series of daily COVID-19-related deaths from March 18, 2020, to February 28, 2021. For the new confirmed cases, we considered all 50 states and the District of Columbia in the analysis. For the death data, only those states that exceeded 720 cumulative deaths as of February 28, 2021, were considered in this study, resulting in Alaska, Hawaii, Maine, Vermont, and Wyoming being excluded from the analysis.Figure 5 depicts our GA change-point results for the daily new cases. Except for Hawaii, with its highest peak occurring in August 2020, all of the US states experienced their highest peaks during November 2020 and January 2021. Focusing on the continental United States, we identified five emerging patterns of the trajectory of US confirmed cases. The first pattern, denoted by +−+, is observed in Connecticut, Delaware, District of Columbia, Massachusetts, Michigan, New Hampshire, New Jersey, New York, Rhode Island, and Vermont, showing an early peak during April and May, a fast decrease after then, and the highest peak in mid-November 2020 and January 2021. The second pattern, denoted by +++, occurs in Illinois, Indiana, Louisiana, Maryland, Nebraska, Pennsylvania, Virginia, and Washington, characterized by the first peak in late March and May, another peak during the summer, and then the highest peak between mid-November 2020 and January 2021. The third pattern, −++, which shows two substantial peaks, the first substantial peak around July and the highest peak occurring between mid-November 2020 and January 2021, is observed in Alabama, Alaska, Arizona, California, Florida, Georgia, Idaho, Mississippi, Nevada, Oregon, South Carolina, Tennessee, Texas, and Utah. The fourth pattern, −/+, shows low constant trends until August and then a sharp increase, with the highest peak occurring between November 2020 and January 2021, and appears in Colorado, Maine, Minnesota, Montana, New Mexico, North Dakota, Ohio, South Dakota, West Virginia, Wisconsin, and Wyoming. The rest of the states experienced a steady increasing trend and then a fast increase, with the highest peak during the winter, the fifth pattern, //+, in daily case series.
Figure 5
Estimated change points in US COVID-19 confirmed cases
The blue line represents the 7-day moving average of daily confirmed cases. The red vertical dashed lines indicate the GA-estimated change points.
Estimated change points in US COVID-19 confirmed casesThe blue line represents the 7-day moving average of daily confirmed cases. The red vertical dashed lines indicate the GA-estimated change points.Table 3 summarizes the continental US states and the District of Columbia according to their corresponding patterns. We find geographical similarities within these five patterns, especially in patterns 1, 3, 4, and 5. Specifically, we note several northeastern states in pattern 1, many southern states in pattern 3, northern states in pattern 4, and a few central states in pattern 5.
Table 3
Five patterns of changes in daily COVID-19 new cases for the continental United States
Pattern 1 (+−+)
Pattern 2 (+++)
Pattern 3 (−++)
Pattern 4 (−/+)
Pattern 5 (//+)
Connecticut
Illinois
Alabama
Colorado
Arkansas
Delaware
Indiana
Alaska
Maine
Iowa
District of Columbia
Louisiana
Arizona
Minnesota
Kansas
Massachusetts
Maryland
California
Montana
Kentucky
Michigan
Nebraska
Florida
New Mexico
Missouri
New Hampshire
Pennsylvania
Georgia
North Dakota
North Carolina
New Jersey
Virginia
Idaho
Ohio
Oklahoma
New York
Washington
Mississippi
South Dakota
Rhode Island
Nevada
West Virginia
Vermont
Oregon
Wisconsin
South Carolina
Wyoming
Tennessee
Texas
Utah
Five patterns of changes in daily COVID-19 new cases for the continental United StatesFive patterns are also found in the daily deaths as displayed in Figure 6. The first pattern (+−+), an early considerable peak during April and May, a fast decrease after then, and another peak from December 2020 to February 2021, is observed in Colorado, Connecticut, Delaware, District of Columbia, Illinois, Indiana, Maryland, Massachusetts, Michigan, Minnesota, New Hampshire, New Jersey, New York, Pennsylvania, and Rhode Island. The second pattern (+++) of a first peak in April, another substantial peak in the summer, and then a third peak in the winter appears in Florida, Louisiana, and Washington. The third pattern (+//), with three incremental peaks, one peak during April and May, a larger peak during July and August, and the highest peak between December 2020 and February 2021, is found in Alabama, Arizona, California, Georgia, Idaho, Mississippi, Nevada, New Mexico, Oregon, South Carolina, Texas, and Utah. The fourth pattern (−−+), a low steady trend and then one large peak from November 2020 to February 2021, can be found in Iowa, Kansas, Montana, Nebraska, North Dakota, South Dakota, West Virginia, and Wisconsin. The fifth pattern (//+), a steadily increasing trend, is observed in Arkansas, Kentucky, Missouri, North Carolina, Ohio, Oklahoma, Tennessee, and Virginia.
Figure 6
Estimated change points in US COVID-19 deaths
The blue line represents the 7-day moving average of daily deaths. The red vertical dashed lines indicate the GA-estimated change points.
Estimated change points in US COVID-19deathsThe blue line represents the 7-day moving average of daily deaths. The red vertical dashed lines indicate the GA-estimated change points.Table 4 summarizes these 45 states and the District of Columbia according to their similar patterns. Geographical similarities are also identified: several northeastern states in pattern 1, southern states in pattern 3, and a few central states in pattern 4.
Table 4
Five different patterns of changes in daily COVID-19 related deaths in the United States
Pattern 1 (+−+)
Pattern 2 (+++)
Pattern 3 (+//)
Pattern 4 (−−+)
Pattern 5 (//+)
Colorado
Florida
Alabama
Iowa
Arkansas
Connecticut
Louisiana
Arizona
Kansas
Kentucky
Delaware
Washington
California
Montana
Missouri
District of Columbia
Georgia
Nebraska
North Carolina
Illinois
Idaho
North Dakota
Ohio
Indiana
Mississippi
South Dakota
Oklahoma
Maryland
Nevada
West Virginia
Tennessee
Massachusetts
New Mexico
Wisconsin
Virginia
Michigan
Oregon
Minnesota
South Carolina
New Hampshire
Texas
New Jersey
Utah
New York
Pennsylvania
Rhode Island
Five different patterns of changes in daily COVID-19 related deaths in the United States
Lag time between NPIs and change points for confirmed cases
In Table 5, we report the results of the hypothesis test using the Wilcoxon signed-rank test for test 1. At a 5% level of significance, the null hypotheses with d = 3 and d = 7 are rejected, but the null hypotheses with d = 10 and 14 are not. This result shows that the effect of a policy on a change point in COVID-19 confirmed cases may occur within 10 days of an NPI.
Table 5
Results from the Wilcoxon signed-rank test with four different choices of d days
d (days)
P
3
<0.001
7
<0.001
10
0.312
14
∼0.999
Results from the Wilcoxon signed-rank test with four different choices of d daysFigure 7 is the visual representation of the goodness-of-fit test to the log-normal distribution. All plots in the figure indicate that the log-normal model is well justified, as the candidate distribution of our random variable Y with change point is based on daily confirmed cases. Table 6 represents the numerical goodness-of-fit results based on the Akaike information criterion (AIC) and the Bayesian information criterion (BIC). It shows that the log-normal model gives the best fit to the data compared with other competing models. To be more specific, both AIC and BIC values for the log-normal are at least 50 points below their closest competitor (Weibull model).
Figure 7
Visual diagnostics of the goodness of fit for the random variable Y to the log-normal model for the confirmed cases
CDF, cumulative distribution function.
Table 6
Goodness-of-fit criteria for the log-normal model for the confirmed cases
Log-normal
Weibull
Gamma
Gumbel
AIC
2,664.28
2,721.94
2,737.68
3,015.42
BIC
2,672.05
2,729.71
2,745.46
3,023.19
Visual diagnostics of the goodness of fit for the random variable Y to the log-normal model for the confirmed casesCDF, cumulative distribution function.Goodness-of-fit criteria for the log-normal model for the confirmed casesNow, we compute the log-normal parameter estimates and their standard errors for our fitted log-normal probability model and summarize the results in Table 7. The fitted log-normal cumulative distribution function plot for our random variable Y is displayed in Figure 8. This summarizes how our model can be used to figure with a certain degree of certainty within how many days we will see an impact of a policy or NPI on the change point in daily confirmed cases. For example, based on the red dotted horizontal line, we find that there is a 50% chance that we will see the effect of an NPI on changing the positivity rate within 8 days. Similarly, based on the blue-dotted horizontal line, there is a 95% chance that we will see the effect of an NPI on changing the positivity rate within 61 days.
Table 7
Parameter estimates for the log-normal model with change point for the confirmed cases
Parameter
Estimate (in log scale)
Standard error
μ
2.0369
0.0662
Σ
1.2569
0.0467
Figure 8
Cumulative density plot from the log-normal model to showcase days to decide as a function of cumulative probability for the confirmed cases
Parameter estimates for the log-normal model with change point for the confirmed casesCumulative density plot from the log-normal model to showcase days to decide as a function of cumulative probability for the confirmed cases
Lag time between NPIs and change points for death cases
The Wilcoxon signed-rank test results for the lag time between NPIs and change points for the death cases (test 2) are summarized in Table 8. At a 5% level of significance, the null hypotheses with d = 3 and d = 7 are rejected, but the null hypotheses with d = 10 and 14 are not. It shows that the effect of the policy on change points in the COVID-19 daily death counts may occur within 10 days of an NPI.
Table 8
Results from the Wilcoxon signed-rank test with four different choices of d days for the death counts
d (days)
P
3
<0.001
7
<0.001
10
0.081
14
0.933
Results from the Wilcoxon signed-rank test with four different choices of d days for the death countsFigure 9 displays the goodness-of-fit to the log-normal distribution. All plots in the figure indicate that the log-normal model is satisfactorily justified as the candidate distribution of our random variable Y with change point based on daily deaths. Table 9 represents the goodness-of-fit results based on AIC and BIC. The result numerically confirms that the log-normal model gives the best fit to the data compared with other competing models. To be more specific, both AIC and BIC values for the log-normal are at least 40 points below their closest competitor (Weibull model).
Figure 9
Visual diagnostics of the goodness of fit of the random variable Y to the log-normal model for the death cases
CDF, cumulative distribution function.
Table 9
Goodness-of-fit criteria for the log-normal model for death counts
Log-normal
Weibull
Gamma
Gumbel
AIC
2,130.96
2,173.89
2,186.68
2,421.36
BIC
2,138.21
2,181.13
2,193.93
2,428.61
Visual diagnostics of the goodness of fit of the random variable Y to the log-normal model for the death casesCDF, cumulative distribution function.Goodness-of-fit criteria for the log-normal model for death countsTable 10 shows the log-normal parameter estimates and their standard errors for our fitted log-normal model. Using these estimates, we estimated the cumulative distribution function of our random variable Y for the death cases. Figure 10 depicts this result and summarizes how our model can be used to assess within how many days we will see an impact of a policy or NPI on the change point in daily death counts. For instance, from the red dotted horizontal line, there is a 50% chance that we will see the effect of an NPI on changing the daily death counts within 9 days. Similarly, from the blue-dotted horizontal line, we can conjecture that there is a 95% chance that we will see the effect of an NPI on changing the daily death counts within 74 days.
Table 10
Parameter estimates for the log-normal model with change point based on death counts
Parameter
Estimate (in log scale)
Standard error
μ
2.1549
0.0784
Σ
1.3041
0.0554
Figure 10
Cumulative density plot from the log-normal model to showcase days to decide as a function of cumulative probability for the death counts
Parameter estimates for the log-normal model with change point based on death countsCumulative density plot from the log-normal model to showcase days to decide as a function of cumulative probability for the death counts
Association of policy types with COVID-19 growth rate changes
For each state and each dataset of COVID-19 cases and deaths, we calculated the number of positive and negative changes in the estimated COVID-19 growth rates that followed lockdown policies and also the number of positive and negative changes in the estimated COVID-19 growth rates that followed reopening policies. The change in growth rates at the current regime was determined by subtracting the previous growth rate from the current growth rate. A positive change in the growth rate represents an increase in the growth rate of COVID-19 cases or deaths, while a negative change in the growth rate indicates a decrease in the growth rate of COVID-19 cases or deaths. These results are summarized in Table 11. Overall, lockdown policies are more associated with a decrease in growth rates (∼87.6% for confirmed cases and ∼69.2% for deaths), and reopening policies are more likely associated with an increase in growth rates (∼68.9% for confirmed cases and ∼53.4% for deaths). Considering that many other factors can influence growth rates, this identified association between policy type and growth rate change is notable.
Table 11
Contingency table for the relationship between policy type and growth rate change for the confirmed cases and death counts
Growth rate for confirmed cases
Growth rate for deaths
Policy type
Increase
Decrease
Increase
Decrease
Lockdown
28
198
49
110
Opening
91
44
63
55
Contingency table for the relationship between policy type and growth rate change for the confirmed cases and death countsWe then conducted a chi-square test for independence between the policy type and the growth rate change using the results in Table 11 and found that there was a significant association between policy type and growth rate change for cases (χ2 = 113.29, p < 0.001) and for deaths (χ2 = 13.41, p < 0.001). We see a less prominent association between policy type and growth rate change for COVID-19deaths. This is likely because death is a more distal measure of pandemic trajectory, and therefore there are more confounders between the policy and its change point that will affect the strength of the relationship.To further understand this association, in Figure 11 we visualize the distribution of the magnitude of the growth rate changes associated with lockdown and reopening policies for confirmed cases (top) and deaths (bottom). As expected, lockdown policies have overall lower growth rate change for both confirmed cases and deaths, whereas reopening policies tend to have increased growth rate changes for both outcomes. We find that the separation for growth rate change between lockdown policies and reopening policies is greater for confirmed cases than for deaths.
Figure 11
Change in the growth rates for COVID-19 outcomes by lockdown and reopening policies
Shown are confirmed cases (top) and deaths (bottom).
Change in the growth rates for COVID-19 outcomes by lockdown and reopening policiesShown are confirmed cases (top) and deaths (bottom).
Discussion
As the pandemic becomes more apparent and the COVID-19 virus spreads more widely, governments and public health agencies have responded to this evolving situation, intending to reduce COVID-19 transmission by implementing several health policies. The US COVID-19 data show that the confirmed cases and deaths have had multiple changes in their growth rates according to the implemented policies. However, how many changes there are and when those changes have occurred are not clear for many states and should be rigorously estimated by using a reasonable approach. Otherwise, a naive use of the existing epidemiological models with these changes ignored could result in an unrealistic prediction with spurious patterns. To better understand the actual COVID-19 data with these changing patterns considered, we applied several statistical approaches to study such different patterns of change.We used two different sources of data and incorporated their separate analytical results to develop a unified method from a statistical data science perspective. For the first part of our unified method, we used two major outcomes (confirmed cases and deaths) associated with the COVID-19 pandemic. By using a stepwise drifts random walk model and a GA technique for those outcomes presented in the COVID-19 data, we assessed whether there existed changes in the growth rates of the outcomes over time for all the US states. We applied the GA change-point method to estimate change points in each state. We found that there were similarities in the trajectory of US COVID-19 outcomes among the states and then categorized the continental US states and the District of Columbia into five groups based on the underlying changing patterns associated with the outcomes. As summarized in Tables 3 and 4, we found strong geographical similarities within the five pattern groups, especially among the northeastern, northern, southern, and central states.In the second part of our unified method, we connected the findings from the change-point analysis to test the hypothesis to see if there exists any connection or impact between the policies (NPI) adapted by several states over time and the change points we have estimated from the change-point analysis of the COVID-19 outcomes. By using a time-to-event modeling approach for this context, we created a random variable, which can represent the lag times between the policies and their associated change points. A graphical example is provided for Florida to illustrate how this random variable can be interpreted as illustrated in Figure 12. Once this random variable is observed, our hypothesis testing problem is then well defined. We tested the hypothesis on the short-term and long-term impact of policies on the change of growth rate for COVID-19 confirmed cases and deaths by choosing several combinations of the lag time in days. We found that a policy implementation takes on average about 10–14 days to be effective in changing the growth rate of COVID-19 outcomes.
Figure 12
Example of how the variable Y was computed based on the COVID-19 confirmed cases for the state of Florida
Each red line represents the date of the associated change point and the blue lines represent the date of the policy taken by the state of Florida.
Example of how the variable Y was computed based on the COVID-19 confirmed cases for the state of FloridaEach red line represents the date of the associated change point and the blue lines represent the date of the policy taken by the state of Florida.As the realizations of a random variable can be characterized by a probability distribution, we further found that our defined random variable can be modeled by the two-parameter log-normal distribution as it is commonly used for the time-to-event data. Once we fitted the log-normal distribution to the data associated with our random variable, we were able to visualize how our log-normal model can be used to estimate, with certain degree of certainty, within how many days we will see an impact of a policy or NPI on the change point based on the outcomes of the COVID-19 pandemic. Also, our fitted log-normal model can be further used as a generative model to illustrate behavioral patterns of general public policy measures (NPI) on changing the course of public health in a pandemic situation. For example, our generative model can be used as a tool to help local governments decide when to put a policy into effect and how long it will take its course to run before they can see any significant impact on public health outcomes.There are some directions to be considered for future research. First, our lag-time random variable, when evaluating the time gap between policy changes and their subsequent change points, considers only the latest policy change and the first subsequent change point. This approach could underestimate longer-term associations between policy changes and their change points. By extending our model, one could create multiple lag-time random variables for different policies and therefore could more rigorously assess the long-term effects of each policy. This extended approach could also be useful to assess potential impacts of repeated implementations of the sample policy on COVID-19 outcomes. Second, our lag-time log-normal model does not consider covariates that could have an impact on lag-time distribution. Identifying such influential covariates and incorporating them into a relevant parametric or semiparametric model would seem useful. Third, one could be interested in incorporating heterogeneity of policy changes among different groups. To be specific, school closure, for example, could differently affect primary, secondary, and high schools and colleges and also could be different across regions and locations within the same state. In addition, the impact of state-level policies versus local policies in long-term care facilities could be further studied, as a substantial portion of COVID-19deaths in the United States have been reported in long-term care facilities. Finally, although we identified five patterns of the US COVID-19 trajectory and found geographical similarities within the patterns, a further study is anticipated to investigate what might be driving the five patterns.
Experimental procedures
Resource availability
Lead contact
Further information regarding this study should be requested from the lead contact, Francesca Dominici (fdominic@hsph.harvard.edu).
Materials availability
This study did not generate any physical materials.
Data and code availability
All codes and data are available at the GitHub repository: https://github.com/jaechoullee/COVID-19-Policy-and-Changepoints.
Change-point detection for COVID-19 outcomes
Suppose denotes a 7-day moving average series of daily new COVID-19 confirmed cases over a study period of n days for a given state. Because the 7-day moving average COVID-19 case series is non-stationary with a very strong temporal correlation, we use a random walk process to model these features presented in the COVID-19 data. In general, random walk processes are a stochastic process with a Markov property: the current state is dependent on the previous state only among all other past states. However, since adopted health policies could influence the rate of change in the number of new COVID-19 cases, as described below we develop a model that takes these possible rate changes into account to avoid bias in the model parameter estimation.More specifically, we assume that the rate of change in the 7-day moving average new COVID-19 case series has changed m times on days . To incorporate these rate changes into the COVID-19 data with non-stationary and strong autocorrelation features, we consider a random walk model with varying stepwise drifts:where denotes the 7-day moving average of the COVID-19 outcome (either daily confirmed cases or daily deaths) on day t, represents a time-dependent stepwise drift on day t with multiple changes occurring at times , modeled asand is a mean-zero Gaussian white noise process with a variance of varying at times :Note that since m change points occurred at times , the random walk model in Equation (1) experiences a total different regimes. The model parameters and are interpreted as the expected value and variance of day-to-day changes in , respectively, during the period in regime j, for . To elaborate, we reexpress the model in Equation (1) asand, by taking expectation and variance on both sides of Equation (2), we obtain:if day t is in regime j.We note that the difference in log-transformed daily outcomes, , approximates the daily growth rate . To elaborate, the growth rate can be reexpressed as , which in turn is equivalent to . If the growth rate is relatively small in magnitude as the COVID-19 case, the right-sided term is approximated as . Therefore, the use of log-transformed outcomes to the stepwise drifts random walk model can be effective to model the changes in growth rates.If the change-point number m and the change-point times are a priori known, we can estimate the random walk model parameters using the maximum likelihood method. However, because m and are unknown in many practices as COVID-19 outcomes, we treat these change points as unknown parameters and estimate them from the data. In addition, although the starting dates of newly implemented health policies are known, we do not know the exact dates of actual changes in day-to-day growth rate because of many factors that are inducing the changes with unknown time lags. Due to these reasons, we propose estimating the change-point number and times based on the COVID-19 data.To estimate the change-point number m and change-point times in the model (Equation 1), we use a GA. The GA is a data-driven search algorithm that finds an optimal solution for a given target function by implementing the principle of natural selection. Similar to the GA methods developed by Davis et al., Li and Lund, and Lee et al.,19, 20, 21, 22 our GA method uses a penalized likelihood approach with a penalty based on the minimum description length criterion. Our GA method can successfully detect multiple change points, unlike some other at-most one-change-point methods used in recent COVID-19 change-point results., Further, our GA method is distinct from the existing GA methods in that our GA uses the likelihood function derived from the stepwise drifts random walk model in Equation (1) to specifically adapt to the non-stationary and strong temporal correlation features in the COVID-19 data. By applying the GA method with a stepwise drifts random walk model, we estimate the change points in the 7-day moving average of daily positive cases for March 8, 2020, to February 28, 2021, and the change points in the 7-day moving average of daily deaths for March 18, 2020, to February 28, 2021.
Probabilistic inference on association between NPIs and change points
To gather evidence regarding a potential association between the impact of a policy (NPI) on the change point in COVID-19 outcomes, we have created a random variable Y by using the following definition:Note that we ignore the change point without any policy before the previous change point. Furthermore, we calculate the Y separately from those change points for the number of daily positive cases and the number of daily deaths. To be specific, assume a state policy (NPI) is denoted by P and a change point by C and suppose we find the following sequence of events: P1 P2 P3 C1 C2 P4 C3. Then, the first value of Y is calculated by the number of days between P3 and C1; the second value of Y is the number of days between P4 and C3, and so on. Figure 12 is an example of how the values of this random variable Y were computed based on the daily confirmed COVID-19 case data of the state of Florida. We did this computation for all 50 states and the District of Columbia to obtain the set of observed values for our defined random variable Y.Once the Y values are computed, our first hypothesis testing is as follows:
Test 1
H0 (null hypothesis): after a policy is implemented, a day-to-day growth rate change in COVID-19 confirmed cases occurs on average in at most d days.H1 (alternative hypothesis): after a policy is implemented, a day-to-day growth rate change in COVID-19 confirmed cases occurs on average in more than d days.Here, we test the above hypotheses for four different values of d (in the number of days): 3 (for immediate impact), 7, 10 (for moderate impact), and 14 (for long-term impact). Overall, the above hypothesis testing scheme will reveal to us “if and how long it takes for an NPI to cause significant impact (change point) on the daily positive case counts.”Next, we formulate similar statistical hypotheses but with Y defined based on the change point for the daily count of COVID-19 related deaths as follows:
Test 2
H0 (null hypothesis): after a policy is implemented, a day-to-day growth rate change in the number of COVID-19-related deaths occurs on average in at most d days.H1 (alternative hypothesis): after a policy is implemented, a day-to-day growth rate change in the number of COVID-19-related deaths occurs on average in more than d days.We also test these hypotheses for four different values of d (in the number of days): 3 (for immediate impact), 7, 10 (for moderate impact), and 14 (for long-term impact). This hypothesis testing will help us better understand “if and how long it takes for an NPI to cause significant impact (change point) on the daily number of COVID-19 related death counts.”Because the data on our Y (the number of days from the last policy to the next change point for daily positive cases and daily deaths) do not exactly follow a normal distribution and the sample size is not large enough for the asymptotic normality, we use a non-parametric test, Wilcoxon test, to test the hypotheses in tests 1 and 2.To quantify the uncertainties for our random variable Y, we need to identify which probability distribution fits the data best so that we can calculate the probability of impact of a policy decision on the change point of outcome (positivity and death counts) from the pandemic. It is important to use the probability distribution that accurately reflects the nature of the data. There are a few choices in this situation. From the way we have defined the random variable Y, the Y values can be treated as time-to-event data. For time-to-event or lifetime data, a common choice is a log-normal distribution; in fact, later we will justify why the “log-normal” model is more suitable for this problem compared with other competing models. Therefore, we assume that Y follows a log-normal distribution with parameters μ and σ > 0, respectively. The probability density function (PDF) of Y is hence expressed asfor y > 0.Note that we have two sets of data on Y, one when we use the positivity numbers for change point and the other when we use the death counts for change point. Therefore, we fit two separate log-normal distributions to each of these separately calculated values of Y. We visually assess the goodness of fit by overlaying the histogram with the PDF. We also produce Q-Q and P-P plots. For a numerical verification, we compare the log-normal model to a few other competing models (Gamma, Gumbel, and Weibull) using the two model selection indices AIC and BIC. The model with the smallest AIC and BIC values is selected as our best model. In both cases, the log-normal model gives us the smallest AIC and BIC values compared with all the others. All our hypothesis testing was performed with a 5% level of significance.
Authors: Michelle A Jorden; Sarah L Rudman; Elsa Villarino; Stacey Hoferka; Megan T Patel; Kelley Bemis; Cristal R Simmons; Megan Jespersen; Jenna Iberg Johnson; Elizabeth Mytty; Katherine D Arends; Justin J Henderson; Robert W Mathes; Charlene X Weng; Jeffrey Duchin; Jennifer Lenahan; Natasha Close; Trevor Bedford; Michael Boeckh; Helen Y Chu; Janet A Englund; Michael Famulare; Deborah A Nickerson; Mark J Rieder; Jay Shendure; Lea M Starita Journal: MMWR Morb Mortal Wkly Rep Date: 2020-06-05 Impact factor: 17.586
Authors: Jasmina Panovska-Griffiths; Cliff C Kerr; Robyn M Stuart; Dina Mistry; Daniel J Klein; Russell M Viner; Chris Bonell Journal: Lancet Child Adolesc Health Date: 2020-08-03
Authors: Christopher M Weible; Daniel Nohrstedt; Paul Cairney; David P Carter; Deserai A Crow; Anna P Durnová; Tanya Heikkila; Karin Ingold; Allan McConnell; Diane Stone Journal: Policy Sci Date: 2020-04-18
Authors: Aaron B Wagner; Elaine L Hill; Sean E Ryan; Ziteng Sun; Grace Deng; Sourbh Bhadane; Victor Hernandez Martinez; Peter Wu; Dongmei Li; Ajay Anand; Jayadev Acharya; David S Matteson Journal: Stat (Int Stat Inst) Date: 2020-10-05