Literature DB >> 34334839

JUE Insight: The Geography of Travel Behavior in the Early Phase of the COVID-19 Pandemic.

Abstract

We use U.S. county-level location data derived from smartphones to examine travel behavior and its relationship with COVID-19 cases in the early stages of the outbreak. People traveled less overall and notably avoided areas with relatively larger outbreaks. A doubling of new cases in a county led to a 3 to 4 percent decrease in trips to and from that county. Without this change in travel activity, exposure to out-of-county virus cases could have been twice as high at the end of April 2020. Limiting travel-induced exposure was important because such exposure generated new cases locally. We find a one percent increase in case exposure from travel led to a 0.21 percent increase in new cases added within a county. This suggests the outbreak would have spread faster and to a greater degree had travel activity not dropped accordingly. Our findings imply that the scale and geographic network of travel activity and the travel response of individuals are important for understanding the spread of COVID-19 and for policies that seek to control it.

Entities: Chemical

Keywords: COVID-19 pandemic; Mobility; Smartphone location; Spatial dynamics; Spatial networks; Travel behavior

Year: 2021 PMID： 34334839 PMCID： PMC8313794 DOI： 10.1016/j.jue.2021.103384

Source DB: PubMed Journal: J Urban Econ ISSN： 0094-1190

Introduction

In the early stages of the COVID-19 outbreak, people drastically reduced their travel. Governments enacted numerous policies including stay-at-home orders, business closures, and limits on mass gatherings to reduce exposure and slow the spread of the virus. The change in travel behavior may reflect the implementation of these policies but also may be attributed to people responding to information about the number of virus cases in their proximity. How did people reduce their travel behavior during the onset of the outbreak? Did they avoid places with larger outbreaks? And how did this response affect exposure and slow the spread of the disease? In this paper, we use data on the movement of smartphones between U.S. counties to study the change in travel behavior and virus exposure in the early stages of the outbreak. The data provide daily measures of the network of bilateral travel flows between counties.1 Aggregate patterns in the data confirm that travel between counties declined as COVID-19 cases rose. People not only traveled less, they avoided locations that had higher numbers of cases. Using gravity regressions of bilateral travel flows on case counts, we show that flows between locations declined in response to increased cases in both the origin and destination. During a period of explosive growth in cases, the results suggest that a doubling of cases in either end of a trip led to roughly a 3.5 percent decline in travel flows. This result holds even when controlling for government orders, suggesting that people adjusted travel behavior based on available information about the geography of the outbreak. A policy implication of this result is the importance of providing timely, accurate information about the geography of an outbreak. Changes in mobility had large effects on overall virus exposure. We construct a measure of nonlocal (out-of-county) exposure as a sum of flows between counties weighted by the number of confirmed cases in the counties visited. In counterfactual experiments, we find exposure would have been twice as high at the end of April 2020 had people not changed their travel behavior. Furthermore, a decomposition shows that roughly one third of the difference in exposure came from changes in the travel network, as opposed to overall declines in travel. The reduction in out-of-county exposure matters because such exposure led to increases in new COVID-19 cases. Under our preferred instrumental variable method, we find that a 1 percent increase in the exposure measure led to a 0.21 percent increase in new cases. Therefore, changes in travel patterns likely had significant benefits in reducing the spread of the disease by decreasing exposure. Finally, we provide a simple model of the spatial dynamics of an outbreak. The model is used to illustrate the importance of the connectedness of locations and the mobility response of individuals to the geographic spread of new cases. The important takeaway from the model is that travel can both speed the spread in the short run and amplify the outbreak over the longer run, while a mobility response mitigates both of these effects. The model does not include important features of an epidemiological model such as recovery rates, deaths, or immunity. However, it demonstrates the concept of how reductions in mobility reduce aggregate infections. Our findings on travel complement other recent research on declines in local activity during the outbreak. Gupta et al. (2020) find that government policies led to significant declines in mobility, while Engle et al. (2020) find that policy as well as local case levels reduced mobility. There is also evidence that reductions in mobility and government policies mitigated the outbreak, including work by Chinazzi et al. (2020), Courtemanche et al. (2020), Fang et al. (2020), Fenichel et al. (2020), Glaeser et al. (2020), Kraemer et al. (2020), and Wilson (2020). In addition Coven and Gupta (2020) show that migration out of urban areas drove the spread of the outbreak. In contrast to these studies, our research explicitly considers changes to the travel network in addition to declines in mobility levels. We also construct a measure of case exposure in addition to generic trip rates, which we find is an important determinant of case growth. Other researchers have looked at the role of networks during the pandemic following work by Christakis and Fowler (2010) and Bailey et al. (2018). Kuchler et al. (2021) show that social networks in New York and Lodi, Italy predict the spread of COVID-19, while Coven et al. (2020) perform a similar analysis for New York, but also consider differences in mobility among demographic groups. In contrast to these papers, we consider how the observed travel network changed in response to the outbreak, and how this affected the spread of the disease. Monte (2020) also shows how the connectedness of counties shrank during the pandemic, but does not explicitly study the effects on exposure or case growth. Several papers have used quantitative urban and trade models to study spatial health and economic outcomes during a pandemic. Fajgelbaum et al. (2020) examine optimal commuting restrictions in an epidemiology and trade model calibrated to several cities. Giannone et al. (2020) use cross-state flow data to understand optimal mobility restrictions. Relative to these papers, we focus on empirical identification of mobility responses, exposure, and disease spread using a geographically richer data set. Lastly, our research connects other work that seeks to inform policies that restrict mobility. For example, Atalay et al. (2020) and Dingel and Neiman (2020) study the ability of workers to work from home in different occupations and industries. By providing insights into spatial dynamics, our work can also help inform current theoretical research that seeks to understand the tradeoff between health and economic welfare, including work by Farboodi et al. (2020), Guerrieri et al. (2020), and Kaplan et al. (2020).

Declining travel at the onset of the pandemic

We briefly introduce the data and describe the key features. More detailed discussion and summary of the data are provided in Appendix A. National Mobility Index, Mobility Restriction, and Case Growth. NOTES: The figures plot the median composite mobility index against: the fraction of counties under government restrictions (A), the log of the national case count (B), and the number of new cases reported nationally in the preceding two weeks (C). In A, “Close NE Business” means a mandated site closure of businesses deemed “nonessential.” Sources: Couture et al. (2021), all panels; healthdata.org (2020), panel A; Johns Hopkins University Coronavirus Resource Center (2020), panels B and C. There are two main data sets used in our analysis. The first is the record of COVID-19 daily case diagnoses by county as reported by Johns Hopkins University.2 We combine this with a listing of state-level activity restrictions including stay-at-home orders and closure of “nonessential” businesses.3 The second data set is an anonymized summary of movement between counties derived from a microdata record of smartphone locations. The measure was constructed and generously made publicly available by Couture et al. (2021) (hereafter, CDGHW) using data provided by vendor PlaceIQ. The individual device locations are collected when an application requests GPS location data. These “pings” are aggregated at the county level. The data set consists of a time-consistent list of 2018 counties in the U.S covering 97 percent of the U.S. population. In our analysis, we use data from January 20, 2020 to May 25, 2020.4 Specifically, the data report: (i) the number of devices registering in a county each day, and (ii) the fraction of those devices that registered in each county (of the 2018) sometime in the preceding 14 days. The product of (i) and (ii) is a measure of the number of trips between two counties.5 Trips are best viewed as indicators of connectedness. There is no definitive notion of origin or destination, and the reported statistic is the probability of a binary event, not a transition from a starting place to ending place. In correspondence with the timing in the data construction procedure, we refer to the current location as “focal” county and the previous location as the “visit” county. The data construction also induces a moving average quality that we will account for in the analyses that follow.6 These data depict travel between counties as opposed to within counties.7 By studying this form of mobility, our focus is on trips that are more likely to create contact between regions, rather than those that create contacts between neighbors (such as visits to a store or restaurant). We will refer to this out-of-county travel from here on as “travel activity” or “mobility.” To set the stage for analysis, we first show the dynamics of travel activity in the early phase of the pandemic. To construct a consolidated measure of mobility () for a county on date , we summarize the out-of-county trips as the product of active devices in the county, , and trip probabilities between and other counties (within the lag window of two weeks) as reported on day , :We then index the series as , where is the mean of the county’s index in the pre-pandemic period (January 20, to February 23). Additional details about the components of the index can be found in Appendix B. Figure 1 overlays the median county daily mobility index with government orders, cumulative cases, and new cases. Panel A plots the index versus the share of counties under the mobility restrictions of bans on mass gatherings, closures of “nonessential” businesses, and stay-at-home orders. Panel B compares the index to log national cumulative cases, and Panel C compare it to new cases added. A vertical line marks the national emergency declaration on March 13.

Fig. 1

National Mobility Index, Mobility Restriction, and Case Growth. NOTES: The figures plot the median composite mobility index against: the fraction of counties under government restrictions (A), the log of the national case count (B), and the number of new cases reported nationally in the preceding two weeks (C). In A, “Close NE Business” means a mandated site closure of businesses deemed “nonessential.” Sources: Couture et al. (2021), all panels; healthdata.org (2020), panel A; Johns Hopkins University Coronavirus Resource Center (2020), panels B and C.

One notable feature is that the first drop in mobility occurred immediately following the initial run-up in cases–and before mobility restrictions were enacted–as it became clear that the U.S. was experiencing community spread and not just isolated cases due to foreign travel. From March 1, to March 14, though no county was yet under stay-at-home order, mobility dropped by 20 percent as cases rose 500%. Travel activity continued its downward trend from that point into April as case counts continued an exponential rise and stay-at-home orders and other mobility restrictions were more widely enacted. Mobility reached a bottom in mid-April at 56 percentage points below its pre-virus average but recovered as the level of new cases tapered in May. These patterns suggest that households may have been responding to information about virus prevalence as well as formal emergency declarations and restrictions.8

The changing geography of travel activity

Did travel activity drop in a uniform way, or were some locations affected more than others? We next exploit the full geographic structure of the data to see which sets of visits changed to produce the decline in mobility. To study the geography of the change in activity, we use a gravity regression of travel flows on local case counts. Specifically, we regress recorded visits between county pairs on the case counts on each side of a trip. The model iswhere is the number of visits (number of active devices observed in focal county times the probability of a visit to county in the lag window before ), and , are new cases reported9 in the focal and visited counties, respectively, in the preceding two weeks (the travel window).10 and represent mobility restrictions (stay-at-home orders) in the focal and visit counties, respectively. This model recovers, via parameters and , the observed relationship between visits and cases in the locations on each side of a trip (the focal and visited place). This is to test whether the pullback in overall mobility shown in the last section is associated with the locations’ severity of outbreak. The specifications include fixed effects for each dimension of the panel: time () and directed county pair ().11 Therefore, the identifying variation is within a given trip route over time, relative to the national average change in trips. The effect measured is how the visits on a route change with case counts compared with the baseline period, pre-pandemic. Because of the moving-average nature of the visit rate definition, the daily data have a mechanical degree of serial correlation. To reduce this but still account for the fast-moving dynamics of the outbreaks, our main specifications use one observation per week (Wednesdays). Standard errors are clustered by county pair and time. Table 1 reports the results of the gravity regressions from Eq. (2). Column 1 shows the coefficients on the two-week new case count in the focal and visit counties. Cases in the focal county reduce trips outside the county (). A doubling of new cases in the focal county (an increase of about 69 log points) reduces recorded trips by 3.7 percent (). New cases in the visit county also limit the visit probability (). That is, conditional on making a trip, devices are less likely to visit counties with relatively higher infection rates. A doubling of new cases in the visit county reduces trips by 3.5 percent ().

Table 1

Changes in Mobility - Gravity Regressions.

	1	2	3	4	5	6
Cases in Focal County	−0.0540	−0.0535	−0.0541	−0.0544	−0.0540	−0.0504
	(0.0054)	(0.0053)	(0.0054)	(0.0054)	(0.0053)	(0.007)
Cases in Visited County	−0.0498	−0.0491	−0.0486	−0.0478	−0.0471	−0.0430
	(0.0056)	(0.0055)	(0.0055)	(0.0055)	(0.0053)	(0.0074)
Stay at Home in Focal County		−0.0153			−0.0152	−0.0186
		(0.0064)			(0.0064)	(0.0084)
Stay at Home in Visited County		−0.0308			−0.0310	−0.0320
		(0.0062)			(0.0063)	(0.0086)
Cases in Visited X Baseline Visit Rate			−0.2700	−0.5420	−0.5426	−0.5139
			(0.0308)	(0.0411)	(0.0414)	(0.051)
Cases in Visited X Neighbors				0.1266	0.1271	0.1158
				(0.0092)	(0.0093)	(0.011)
Constant	1.5693	1.5814	1.5691	1.5690	1.5811	1.6014
	(0.0161)	(0.0179)	(0.016)	(0.016)	(0.0178)	(0.0263)
R2	0.875	0.875	0.875	0.875	0.875	0.871
NT	41,253,269	41,253,269	41,253,269	41,253,269	20,344,813	20,308,351
Pairs	3,564,207	3,564,207	3,564,207	3,564,207	3,564,207	3,188,031
Weeks	18	18	18	18	18	9

NOTES: The table reports results from a gravity regression of log visits in the two weeks preceding observation date on new cases and stay-at-home orders; see Eq. (2). The observation level is a weekly observation of a directed county pair (i.e., ). All specifications include directed county pair and week of year fixed effects. Standard errors are clustered by directed county pair and time of observation. Source: Authors’ calculations using data retrieved as described in Section 2.

Column 2 adds controls for shut-down orders on either side of a trip. The estimates show that stay-at-home orders reduced travel, but conditioning on case counts, the magnitude of these effects was relatively small. Stay-at-home orders in the focal county reduced trips by 1.5 percent, and orders in the visit county reduced trips by 3 percent. Notably, the inclusion of the shut-down orders does not change the marginal effects of new cases. The distribution of visit frequency is highly skewed and distance-dependent, and perhaps not all trips were affected by cases in the same way. In column 3 we add an interaction of cases in the visit county with pre-pandemic visit probability to allow the case elasticity to depend on the base rate. The negative coefficient indicates that visits declined more (in proportional terms) to places that were visited regularly (as opposed to episodically) prior to cases arising. We add in column 4 an indicator for whether the counties are neighbors, allowing the nearest places to be affected at different rates. The coefficient on neighbors is positive, and the coefficient on the baseline visit rate interaction increases. Together, these results show that the trips declining the most were those to regularly visited counties, but with some persistence in the most proximate places. Column 5 includes all controls together and the results are consistent with previous specifications. Finally, in column 6, we test the robustness of the model to using biweekly observations instead of weekly to correspond with the two-week lookback in the visit construction. We find the results very much consistent with the weekly data, even in the standard errors. A natural question is whether these estimated effects can be interpreted causally. Two potential threats to causal interpretation are omitted variables and reverse causality. On omitted variables, a two-way fixed effects design accounts for a host of potential problems. In this design, we are comparing trip frequency within a focal-visit county pair relative to the national average change for all pairs in a given week. Results are driven not by cross sectional differences in mobility rates but by visits decreasing in proportionally greater amounts along routes with relatively more cases on either side of the trip. Any remaining threat would have to be a local, time-varying omitted factor driving cases and mobility in opposite ways. Likely more relevant, given the evidence in Section 2, is the potential for reverse causality. The extant evidence is that more mobility leads to more cases, while here we find that more cases lead to less mobility, suggesting our coefficients are if anything biased downwards.12 In summary, the results indicate households were not only traveling less, they were avoiding places with more severe outbreaks. This suggests that households were less exposed to virus cases than if they had continued travel activity as in the days before the pandemic, a topic we treat in more detail in the next section.

Case avoidance and the effect on exposure

Travel between counties likely results in people coming in contact with outbreaks outside their local area. Are these encounters consequential for case growth? To examine this question, we begin by defining nonlocal case exposure and then consider how the pattern of case avoidance shown above affected exposure and altered the trajectory of virus spread. To summarize the case contacts a county is incurring via out-of-county travel, we construct a measure of nonlocal case exposure aswhere represents new cases in the visit county at time , and is a pairwise mobility measure as in Eq. (1). The index is a summary of contacts with cases encountered outside the focal county: a case-weighted sum of the travel flows. We refer to this index simply as “exposure.” The exposure index could be high for a given county because of some combination of (i) high frequency of travel and (ii) travel to high caseload areas. In Appendix E.1, we decompose the sources of exposure. The general pattern is that more exposed counties have greater contact with high caseload areas and not necessarily higher levels of overall mobility. That is, the severity of the outbreak within the geography of a county’s network is far more consequential for case exposure than the level of trips. For example, early in the U.S. outbreak, places connected to the New York metro area exhibited high levels of exposure, irrespective of their overall mobility. Following from the results in Section 3, we examine the importance of case avoidance for exposure, comparing realized exposure to counterfactual exposure measures that assume travel activity did not change despite the increase in cases. Specifically, we calculate the exposure measure in Eq. (3), letting the number of cases evolve as in the data, but holding mobility constant at pre-pandemic averages (as if ). Table 2 shows the ratio of counterfactual exposures to actual exposures at month-end checkpoints. Column 1 shows the total effect declining mobility had on exposure. Had travel activity continued as usual, the median county would have had exposure to 54 percent more cases at the end of March, 109 percent more cases at the end of April, and 40 percent more cases at the end of May. Thus, at the springtime height of the pandemic, the median county would have been exposed to twice as many cases had mobility not adjusted.

Table 2

Decomposition of Actual Exposure Relative to “Business As Usual,” By Mobility Component.

		Partial Effect Of:
Time	Combined	Device Count	Visit Rate	Visit Geo. Network
	1	2	3	4
Last Week of March	1.54	1.17	1.16	1.13
Last Week of April	2.09	1.24	1.34	1.22
Last Week of May	1.40	1.14	1.02	1.19

NOTES: The table reports the median ratio of counterfactual exposure, projected using pre-pandemic period mobility rates, relative to actual exposure for each listed point in time. Nonlocal case exposure is defined in Eq. (3). Column 1 is the combined exposure index, and columns 2 through 4 are its components. Column 2 holds fixed total active devices, column 3 holds fixed out-of-county pings per device, and column 4 holds fixed the visit county share in the focal county’s travel network. Source: Authors’ calculations using data retrieved as described in Section 2.

Simulated Viral Spread Across Locations. NOTES: The figures report the time path of the variables in the dynamic system represented by Eqs. (5a), (5b), and (5c). Each line refers to a separate scenario using different assumptions about the reaction of mobility to local and nonlocal cases. Source: Authors’ calculations using estimates from Tables 1 and 3.

Table 3

Nonlocal Case Exposure and Local New Cases.

	1	2	3	4	5	6
Model	OLS	OLS	OLS	OLS FE	IV	IV FE
Case Exposure	0.111	0.127		0.162	0.213	0.417
	(0.028)	(0.030)		(0.051)	(0.035)	(0.148)
Case Expo., Neighbors			0.036
			(0.006)
Case Expo., Non-Neighbors			0.070
			(0.025)
Lagged Local Case Growth	0.744	0.736	0.730	0.630	0.731	0.620
	(0.029)	(0.029)	(0.021)	(0.071)	(0.031)	(0.074)
Within-County Device Expo.	0.119	0.159	0.125	0.185	0.098	0.048
	(0.047)	(0.041)	(0.043)	(0.111)	(0.052)	(0.142)
Mobility Index		−0.003
		(0.000)
Population	0.157	0.148	0.178		0.068
	(0.035)	(0.036)	(0.035)		(0.042)
Pop. Density	0.046	0.043	0.036		0.035
	(0.010)	(0.010)	(0.011)		(0.010)
Implied Marginal Effect to Case Growth Rate
90-50 Expo Gap	0.246	0.286	0.139	0.380	0.527	1.110
	(0.070)	(0.077)	(0.025)	(0.140)	(0.107)	(0.505)
Non-neighbor			1.119
			(0.045)
Fixed Effects
Level(s)	Week	Week	Week	County;	Week	County;
				Week		Week
Number	12	12	12	2018; 12	12	2018; 12
Instruments:
Projected Exposure					y	y
R2	0.8609	0.8612	0.8627	0.8672	0.8018	0.873
NT	24,038	24,038	24,023	24,038	24,038	24,038

NOTES: The table reports regression results of the model represented by Eq. (4); “Expo” is shorthand for out-of-county case exposure. The outcome variable is the natural log of one plus the number of new cases in the county. The observation level is county by week. Standard errors are double clustered by county and week. Source: Authors’ calculations using data retrieved as described in Section 2.

Changes in Mobility - Gravity Regressions. NOTES: The table reports results from a gravity regression of log visits in the two weeks preceding observation date on new cases and stay-at-home orders; see Eq. (2). The observation level is a weekly observation of a directed county pair (i.e., ). All specifications include directed county pair and week of year fixed effects. Standard errors are clustered by directed county pair and time of observation. Source: Authors’ calculations using data retrieved as described in Section 2. Decomposition of Actual Exposure Relative to “Business As Usual,” By Mobility Component. NOTES: The table reports the median ratio of counterfactual exposure, projected using pre-pandemic period mobility rates, relative to actual exposure for each listed point in time. Nonlocal case exposure is defined in Eq. (3). Column 1 is the combined exposure index, and columns 2 through 4 are its components. Column 2 holds fixed total active devices, column 3 holds fixed out-of-county pings per device, and column 4 holds fixed the visit county share in the focal county’s travel network. Source: Authors’ calculations using data retrieved as described in Section 2. Columns 2 through 4 show decompositions of the effects of the components of mobility on exposure.13 From Eqs. (1) and (3), there are three components to mobility and therefore three ways the contact intensity could change. First, the number of devices registering as active could change.14 Second, the total frequency of out-of-county visits could change. Third, for a fixed amount of mobility, the network of visited places could change.15 We find that each of the three components of the exposure measure contributed to the decrease in exposure. For example, in April, had active devices counts continued as usual (column 2), case exposure would have been 24 percent higher. Had total visit frequency continued as usual (column 3), case exposure would have been 34 percent higher. Had the network of visited counties remained as usual (column 4), case exposure would have been 22 percent higher. Nonlocal Case Exposure and Local New Cases. NOTES: The table reports regression results of the model represented by Eq. (4); “Expo” is shorthand for out-of-county case exposure. The outcome variable is the natural log of one plus the number of new cases in the county. The observation level is county by week. Standard errors are double clustered by county and week. Source: Authors’ calculations using data retrieved as described in Section 2. The last column is especially interesting because it shows a substantial amount of the change in exposure resulted not just from staying home, but from avoiding places with higher levels of cases when traveling. Notably, even as the level of total mobility edged higher in May, a reduction in exposure resulted from people avoiding counties with high caseloads.16

The effect of exposure on new case growth

The remaining question is whether out-of-county exposure causes increases in new cases. To test this, we regress new cases in a county on our index of exposure to out-of-county cases, controlling for lagged cases and other county attributes. The baseline model iswhere county at time is the unit of analysis, denotes new cases, is the out-of-county exposure from Eq. (3), and the ’s are county-level controls. In this specification, time is measured in weeks. The s are parameters of interest, and principally, the exposure parameter . is a set of controls for time-varying county characteristics–mainly, a within-county device exposure index, and in some specifications, the mobility index from Eq. (1). The within-county device exposure index (also provided by CDHGW) is a measure of the number of other devices a typical device encounters at points of interest (e.g., stores) within the focal county.17 This is distinct from the out-of-county travel activity in focus in our study, but it is similar to other measures of device activity in the literature.18 is a set of controls for fixed county characteristics, such as population size and density, or fixed effects to capture attributes nonparametrically. Specifications include time fixed effects, . The is the error term. The outcome variable is the natural logarithm of one plus the number of new cases reported in the last week. The observation level is county by week beginning the first week of March, when community-spread cases began to emerge in the U.S. The exposure index is lagged one week (representing activity one to three weeks prior to the observation date) so as not to overlap with the new case period in the outcome variable, and lagged new cases are measured over the same window as exposure. Because the model includes time fixed effects, estimates are identified off of spatial-temporal variation relative to the national average. Standard errors are clustered by week and state.19 Column 1 of Table 3 presents results using ordinary least squares (OLS) regression. The regression shows two features of viral spread. First, and unsurprisingly, lagged cases in the county create new cases. A one-percent rise in past cases is associated with a 0.74 percent rise in new case growth. Second, and more novel, exposure to out-of-county cases increases local new cases. A one percent rise in outside exposure is associated with a 0.11 percent increase in new case growth. Moving from the median to 90th percentile county in terms of network exposure (roughly, from Ohio to New Jersey) would mean a 24 percent increase in new cases added in a given week. The control variables indicate that larger and denser counties, and places with more within-county device exposure (i.e., fewer people staying at home), have higher case growth. Next, we consider some alternative explanations to the causal effect of exposure. One hypothesis is that the exposure measure is picking up something about overall mobility that is predictive of new cases.20 Column 2 adds the county’s mobility index directly, and its coefficient is marginally negative.21 In light of the results of Sections 2 and 3, we attribute this to reverse causality–the pullback in mobility during the periods of higher case growth. The results suggest that any effect of mobility on new cases is operable via exposure to outside cases. There was regional heterogeneity in the severity of the outbreak and a predictable geographic component to the observed travel network, and thus another alternative explanation to exposure is spatial correlation in travel and case outcomes. As one way to address the possibility,22 in column 3 we split exposure by nearby (neighboring county) and farther-away (non-neighboring county) exposure. (Together, these sum to the county’s total exposure.) If all the exposure effect were coming from nearby counties, the exposure result may actually be spurious and due to spatial correlation. Instead, we find significant effects for each source of exposure independently. Another potential concern is that unobserved local attributes were driving both exposure and local virus spread. In column 4 we add county fixed effects in order to sweep out time-invariant features and focus on exposure variance within a county over time. The coefficient estimate rises relative to column 1, showing that even within a county, periods of greater exposure are followed by periods of greater increase in cases. These results indicate case exposure through travel creates new cases within a focal county. However, the preceding sections showed that mobility dropped, and especially to and from counties with higher levels of new cases, which reduced the amount of exposure a county would experience. Hence, there is potential for reverse causality that may downward bias the estimated effects. With this concern, we seek an instrument correlated with exposure but not itself generating new cases. Our strategy is to build a predicted exposure measure based on pre-determined features of a county. Using a gravity regression of trips on a flexible county-pair distance function (detailed in Appendix A), we recover a predicted county-pair visit rate, , based on proximity of counties. The predicted mobility then enters an expected exposure index, , which is used an instrument for actual exposure. The exclusion restriction is that the distance to other county’s cases affect the focal county’s case rate only through potential travel-related exposure.23 Column 5 reports the results of the IV regression. The coefficient on exposure rises to 0.21, indicating attenuation from reverse causality is indeed present in the OLS specification. Column 6 uses the IV with county fixed effects. Because the predicted visit weight used in constructing the instrument is distance-based and hence invariant for each county pair, the instrument loses power when adding fixed effects. The point estimate with fixed effects rises to 0.41 but is less precise.24 While these are broadly consistent, the IV model without fixed effects is our preferred specification because it mostly relies on pre-determined variation coming from the way a county’s point in space would affect its travel network. In summary, we find consistent evidence that out-of-county exposure via the travel network affects new case diagnoses. Appendix F provides a number of additional robustness checks.

Geographic connectedness and virus spread

We have marshaled evidence for three important facts: (i) Travel activity dropped significantly as case counts rose, with a particular avoidance of areas with relatively larger outbreaks; (ii) Such a drop in activity limited exposure to out-of-county virus cases; (iii) Out-of-county exposure affects the rate of new cases added. Together, these facts suggest cases would have been higher had travel activity not dropped in response to cases. Our last exercise is to combine these insights into a single model in order to evaluate conjectures about spread of the virus in alternative travel scenarios. We construct the following spatial vector autoregressive model of mobility, case exposure, and case growth. The primary outcome of interest is new cases added, in Eq. (5a), which is affected by own-county and out-of-county exposure. For the rate of transmission from local and nonlocal case exposure, we take point estimates from our preferred model in Table 3, column 5. Nonlocal case exposure, in (5b), is a function of outside cases and mobility, which is itself affected by the path of cases locally and nonlocally (Eq. (5c)). To calibrate the responsiveness of mobility to cases, we take point estimates of Eq. (2) from Table 1, column 3.25 Appendix G shows sensitivity of the model to alternative calibrations. We emphasize that this is an autoregressive process and not an epidemiological model. There are no notions of recovery, death, or immunity among the population. (Indeed, our unit of analysis is a spatial area, not a person.) We will note the values the model produces for the sake of exposition, but we intend this exercise to be more illustrative than empirical.26 Accordingly, to keep the model simple, we illustrate a three location system. Two locations are calibrated with symmetric mobility rates to represent two closely connected counties and another more distant county. We set the baseline visit rate to 7.5 percent for the closely connected locations and 0.55 percent for the distant one.27 The model is used for the following thought experiment: if an outbreak of new cases exogenously appears in one of the two connected locations, what happens to the spread of the disease locally and throughout the system? To illustrate the importance of endogenous travel for the rate of disease spread, we simulate the model in three scenarios: (i) a default without mobility (i.e., a purely autoregressive process, (5a) alone), (ii) with mobility but without the feedback effect of cases on travel ((5a) and (5b)), and (iii) with mobility and endogenous feedback ((5a), (5b), and (5c)). Plots the impulse responses for an experiment of 10 new cases dropped into the “treated” location Fig. 2.

Fig. 2

The path of new cases added is depicted in the first row of figures. The rate of own-location spread is below one, so that if there were no mobility (and consequently no exposure), the virus would asymptotically die out in the treated location, as illustrated by the “isolated/no mobility” lines. In scenarios with mobility and exposure, the outbreak jumps locations, which themselves grow through local spread. Exposure then leads to the subsequent re-infection of other places in the system, keeping the disease alive. The impact of exposure then depends on the degree of mobility. In the treated location, the path of cases shows an initially oscillating pattern, as own-location case contribution slows but exposure to outside cases rises. New case rates then rise to a steady state. In the initially virus-free connected location, nonlocal exposure seeded the local outbreak, and it eventually reaches the same steady-state level as the treated location. The distant location experiences its own outbreak, although its lower connectivity translates into a lower long run average rate of exposure, so its steady state is lower than the two closely integrated counties. Among the two steady states with mobility, the level of new cases is about 60 percent higher in the scenario without response to cases. To see why, the second row of figures shows the mobility rate. The “unresponsive” scenario is fixed to have no endogenous change in mobility and mechanically results in the flat lines. In the endogenous mobility scenario, we see travel fall as the outbreak occurs. In the two connected counties, mobility falls by 42 percent. There is consequently a reduction in exposure, shown in the third row of figures, which is only 45 percent as high in the responsive scenario as the unresponsive, because of a combination of less travel and a lower level of cases. The difference in exposure alters the total rate of disease transmission, creating the gap in new cases among scenarios shown in the first row of figures. Thus, when mobility does not decline in response to the outbreak, the rate of new cases added is faster and steady-state level is higher. In summary, the model shows why spatial connectedness matters for both the spread and the perpetuation of the virus. Most directly, nonlocal exposure allows the virus to jump from one area to another. Perhaps less obvious, however, is how travel also affects the rate of growth of cases and the steady state level. Connectedness generates higher caseloads as travel compounds local transmission through reinfection across areas.

Conclusion

This paper has used county level location data from smartphones to document the change in travel activity during the early phase of the COVID-19 pandemic in the U.S. We find that mobility across counties dropped substantially as case counts rose. Relatively larger case counts decreased spatial activity on both sides of a trip: Mobility decreased more in counties with more cases, and the activity that did occur tended to avoid areas with higher caseloads. Understanding the nature of the change in activity is important because mobility across county lines produces contact with nonlocal cases. Such case exposure contributes to local case growth which in turn has a feedback effect on nonlocal case growth, creating exposure for other localities in a continuing loop. Our findings have several implications for policy and practice. First, public information about the spread of the virus is important. We find people responding to such information by restricting their activity in rational ways–both in level and in direction. In a sense, a “healthy fear” of the virus appears to provide motivation for social distancing and similar behavioral interventions, perhaps even more so than government mandates. Second, because spatial activity never entirely disappears, localities could benefit from coordinated responses and shared information. Connectedness means there are spatial externalities. A policy that suits one area may inadvertently produce a threat to a connected area. Fragmented policy across regions could inhibit society’s ability to control the spread of COVID-19.

Credit Author Statement

All authors contributed to all aspects of the research.

Table A1

Summary of Case Prevalence.

Time	Mean	SD	25th	50th	75th	90th	99th
Cases per 1k Residents
Last Week of March	0.150	0.459	0.022	0.059	0.135	0.278	1.413
Last Week of April	1.582	2.991	0.351	0.716	1.548	3.280	16.499
Last Week of May	3.000	5.087	0.685	1.414	3.252	6.757	26.790
New Cases in Preceding 2 Weeks
Last Week of March	65.99	578.89	0.71	3.00	13.00	60.00	843.00
Last Week of April	192.08	893.21	4.29	15.93	70.14	267.43	4,318.71
Last Week of May	150.99	644.21	4.71	17.79	78.43	278.57	2,350.00

NOTES: The table reports summary statistics of COVID-19 cases per capita and new cases for selected months in the spring of 2020, as reported by Johns Hopkins University.

Table A2

Summary Statistics of Device Ping Rates by Geography.

	Pairs	NT	Mean	SD	P10	P50	p90
Same County	na	70,630	89.08	4.78	82.69	90.01	94.19
Other Counties:
Neighbor	9366	327,653	23.14	17.33	5.44	18.32	49.42
Within Commuting Zone	8562	299,670	19.77	19.02	1.95	12.81	49.98
Within State	128,351	4,429,513	3.08	7.82	0.09	0.62	6.97
Within Division	573,239	17,131,215	1.00	4.29	0.02	0.12	1.43
Within Region	1,388,653	35,309,674	0.55	3.06	0.03	0.06	0.65
Any	3,903,314	84,612,867	0.27	2.01	0.01	0.04	0.30
Share to Top 10 Connections	na	70,630	51.18	11.03	36.28	52.07	64.71

NOTES: The table reports summary statistics of device ping activity occurring over a 14 day window in the pre-pandemic period (January 20, to February 23, 2020). Statistics are taken over the visit rate and count only observations with nonzero ping rates; The last row reports a share of total visits. There are 2018 counties included in the dataset. The column reports the number of pair-day observations. Source: Couture et al. (2021).

Table A3

Gravity Model of Visit Rates Between County Pairs, Pre-Pandemic Period.

Day of Week (Sunday excluded)
Monday	0.036
	(0.000)
Tuesday	0.065
	(0.000)
Wednesday	0.068
	(0.000)
Thursday	0.060
	(0.000)
Friday	0.025
	(0.000)
Saturday	−0.010
	(0.000)
Distance Between Counties
Log Miles Between Centroids	−0.807
	(0.000)
Indicators for County Groupings
Neighbors	1.883
	(0.002)
Same CBSA	1.036
	(0.002)
Same State	1.243
	(0.000)
Same Division	0.211
	(0.000)
Same Region	−0.008
	(0.000)
Constant	−2.769
	(0.001)
R2	0.368

NOTES: The table reports coefficient estimates of Eq. (6), a gravity model of daily visit rates using pre-pandemic period travel data (January 20, to February 23, 2020). Source: Couture et al. (2021).

Table A4

Summary Statistics of Device Exposure Index.

County Type	Counties	NT	Mean	SD	P10	P50	p90
Large CZ: Central Counties	383	13,405	181.2	132.8	73.6	147.9	310.5
Large CZ: Outlying Counties	150	5250	127.0	77.6	52.7	110.9	216.5
Small CZ	1485	51,975	105.1	73.1	42.3	87.5	184.5

NOTES: The table reports summary statistics of the local device exposure index for counties by commuting zone (CZ) size; large and small CZs are respectively above and below a population of one million residents. The device exposure index is the average over active devices of the number of other devices present in a point of interest. Source: Couture et al. (2021).

Table C1

Mobility Index: The Effect of State Restrictions and Observed Cases.

	1	2	3	4	5	6
Case Activity (2 wk lag)
Log Case Growth, County	−8.550	−3.473	−3.340
	(2.213)	(0.454)	(0.438)
Log Cases Per Capita, County					−6.721	−5.721
					(1.675)	(1.244)
Log Cases Per Capita, State						−3.455
						(1.229)
Log Cases Per Capita, Division						−0.783
						(0.967)
Closure Orders in County:
Nonessential Services			−1.995	−2.993	−2.609	−2.167
			(0.901)	(1.014)	(1.024)	(0.963)
Stay Home			−2.430	−3.587	−3.505	−3.093
			(0.975)	(0.922)	(0.935)	(0.975)
Constant	93.908	88.622	89.751	86.890	87.555	88.031
	(5.355)	(0.495)	(0.670)	(0.293)	(0.343)	(0.332)
Time Effects		y	y	y	y	y
R2	0.397	0.829	0.831	0.806	0.8098	0.8107
NT	18,153	18,153	18,153	18,153	18153	18153

NOTES: The outcome variable is the county-level index of mobility as defined in Eq. (1), and indexed by the pre-pandemic average for each county. Units are percentage points. Standard errors are clustered by county and time of observation. Each regression contains 2018 counties and 9 weeks for a total of 18,153 observations. Source: Authors’ calculations using data retrieved as described in Section 2.

Table E1

Summary of Case Exposure.

Time	Mean	SD	25th	50th	75th	90th	99th
Last Week of March	296.2	553.1	122.8	191.5	301.7	494.1	2,375.5
Last Week of April	798.3	937.2	321.1	506.8	897.2	1,554.2	5,316.3
Last Week of May	1,035.1	942.1	483.9	751.2	1,258.7	2,030.4	4,811.1

NOTES: The table reports summary statistics of exposure to nonlocal cases as defined in Eq. (3) for each listed point in time. Source: Authors’ calculations using data retrieved as described in Section 2.

Table E2

Sources of Case Exposure.

Panel A: Highly and Lesser Exposed Counties

	1	2	3	4	5	6	7	8	9	10	11
				Destinations in Top 2 Pct. Of Cases				Top 50 Destinations
Counties, N	Average	Total	Cases Per	Visits	Visit	Exposure	Exposure	Visits	Visit	Exposure	Exposure
	Exposure	Visits	1k Residents	To	Share	In	Share	To	Share	In	Share

All, 2018	464.0	3.16	0.58	0.14	0.05	216.6	0.53	2.30	0.73	293.6	0.67
Lesser Exposed, 1968	451.6	3.17	0.56	0.14	0.04	209.7	0.52	2.31	0.73	285.9	0.67
Top 50 Most Exposed, 50	4,448.5	2.87	7.54	1.60	0.58	4,289.2	0.96	2.26	0.81	4,357.7	0.98
Panel B: Selected Cities

	1	2	3	4	5	6	7	8	9	10	11
				Destinations in Top 2 Pct. Of Cases				New York Metro
Location, N Counties	Average	Total	Cases Per	Visits	Visit	Exposure	Exposure	Visits	Visit	Exposure	Exposure
	Exposure	Visits	1k Residents	To	Share	In	Share	To	Share	In	Share

Philadelphia, 11	2,456.2	3.04	3.48	1.21	0.41	1,901.1	0.77	0.26	0.09	518.6	0.20
Pittsburgh, 7	453.8	2.61	0.88	0.12	0.05	218.0	0.48	0.03	0.01	92.9	0.20
Chicago, 13	4,052.6	3.19	1.85	0.63	0.22	3,512.2	0.82	0.02	0.01	43.2	0.01
Miami, 3	959.4	1.42	2.50	0.70	0.49	917.3	0.95	0.05	0.04	25.5	0.03
Houston, 10	1,761.5	2.84	0.76	0.72	0.27	1,562.2	0.83	0.01	0.00	13.6	0.01
Los Angeles, 2	1,406.3	1.10	1.10	0.49	0.42	1,263.4	0.69	0.01	0.01	8.1	0.02
San Francisco, 5	410.0	2.19	0.97	0.15	0.07	104.6	0.26	0.02	0.01	13.5	0.03

NOTES: The table reports summary statistics of the visit rates, cases per capita, and exposure to nonlocal cases as defined in Eq. (3) for each listed point in time. Source: Authors’ calculations using data retrieved as described in Section 2.

Table F1

Case Exposure and New Cases: Robustness to Spatial Definitions .

	1	2	3	4	5	6	7
Model	OLS	OLS FE	OLS FE	OLS FE	OLS FE	OLS	OLS
Case Expo.: Visits Out		0.099	0.107	0.088	0.042		0.083
		(0.024)	(0.027)	(0.025)	(0.018)		(0.045)
Case Expo.: Visits In						0.098	0.026
						(0.023)	(0.022)
Case Expo: Same State	0.075
	(0.018)
Case Expo.: Other States	0.041
	(0.011)
Lagged Local Case Growth	0.740	0.746	0.744	0.714	0.677	0.745	0.744
	(0.024)	(0.028)	(0.029)	(0.032)	(0.038)	(0.029)	(0.029)
Within-County Device Expo.	0.123	0.143	0.142	0.190	0.203	0.118	0.118
	(0.050)	(0.058)	(0.062)	(0.056)	(0.048)	(0.048)	(0.047)
Population	0.172	0.188	0.190	0.246	0.321	0.173	0.159
	(0.043)	(0.035)	(0.034)	(0.036)	(0.045)	(0.039)	(0.035)
Pop. Density	0.040	0.033	0.029	0.025	0.021	0.048	0.047
	(0.011)	(0.011)	(0.012)	(0.013)	(0.013)	(0.011)	(0.011)
Constant	−1.674	−1.677	−1.802	−1.779	−1.473	−1.565	−1.645
	(0.289)	(0.361)	(0.357)	(0.337)	(0.292)	(0.358)	(0.401)
Level(s)	Week	Region	Division	State X	CZ X	Week	Week
Number	12	48	108	600	4427	12	12
R2	0.861	0.863	0.865	0.863	0.865	0.861	0.861
NT	24,038	24,038	24,038	24,038	24,038	24,038	24,038

NOTES: The table reports regression results of the model represented by Eq. (4). The outcome variable is the natural log of one plus the number of new cases in the county. The observation level is county by week. Standard errors are double clustered by state and week. Source: Authors’ calculations using data retrieved as described in Section 2.

Table F2

Case Exposure and New Case: IV Specifications for Mobility .

	1	2	3	4	5
Model	IV	IV	IV	IV	IV
Case Exposure	0.142		0.286	0.382	0.165
	(0.032)		(0.040)	(0.040)	(0.076)
Mobility Index		−0.008	−0.023	−0.028	0.011
		(0.005)	(0.006)	(0.005)	(0.015)
Lagged Local Case Growth	0.740	0.720	0.648	0.624	0.759
	(0.030)	(0.027)	(0.036)	(0.036)	(0.040)
Within-County Device Expo.	0.112	0.289	0.465	0.524	−0.043
	(0.048)	(0.095)	(0.106)	(0.106)	(0.194)
Population	0.129	0.297	0.108	0.044	0.094
	(0.036)	(0.059)	(0.051)	(0.053)	(0.069)
Pop. Density	0.043	0.073	0.022	0.005	0.045
	(0.009)	(0.011)	(0.021)	(0.025)	(0.011)
Fixed Effects
Level(s)	Week	Week	Week	Week	Week
Number	12	12	12	12	12
Instrumented:
Exposure	y			y	y
Mobility		y	y	y	y
Instruments:
Projected Exposure (Pre-period)	y
Projected Exposure (Model)				y	y
Weather Conditions		y	y	y
Shutdown Orders					y
R2	0.805	0.806	0.802	0.794	0.793
NT	24,038	18,205	18,205	18,205	24,038

NOTES: The table reports regression results of the model represented by Eq. (4). The outcome variable is the log number of new cases in the county. The observation level is county by week. Standard errors are double clustered by state and week. Source: Authors’ calculations using data retrieved as described in Section 2.

Table F3

Case Exposure and New Cases: Robustness to Functional Form .

	1	2	3	4
α2=	12	1	32	2
α1=12
[mean]	[8.331]	[10.86]	[14.07]	[17.63]
Coef.	0.125	0.145	0.124	0.101
(se)	(0.032)	(0.032)	(0.023)	(0.018)
R2	0.859	0.860	0.860	0.860
α1=1
[mean]	[10.11]	[12.48]	[15.50]	[18.93]
Coef.	0.103	0.111	0.097	0.083
(se)	(0.033)	(0.028)	(0.021)	(0.017)
R2	0.860	0.861	0.861	0.861
α1=32
[mean]	[12.76]	[14.87]	[17.54]	[20.68]
Coef.	0.059	0.068	0.066	0.060
(se)	(0.019)	(0.019)	(0.016)	(0.013)
R2	0.860	0.861	0.861	0.861
α1=2
[mean]	[15.84]	[17.75]	[20.12]	[22.91]
Coef.	0.037	0.044	0.046	0.044
(se)	(0.012)	(0.012)	(0.012)	(0.010)
R2	0.859	0.860	0.860	0.861

NOTES: The table reports robustness results for the model represented by Eq. (4). The outcome variable is the log number of new cases in the county. The observation level is county by week. The table runs through different calibrations for the exponents in the exposure measure, Eq. (3), as indicated by column and row of the table. The means of the exposure metric are reported in each specification block in brackets. The boxed specification is the preferred model in Table 3. Source: Authors’ calculations using data retrieved as described in Section 2.

11 in total

1. Social Connectedness: Measurement, Determinants, and Effects.

Authors: Michael Bailey; Rachel Cao; Theresa Kuchler; Johannes Stroebel; Arlene Wong
Journal: J Econ Perspect Date: 2018

2. Strong Social Distancing Measures In The United States Reduced The COVID-19 Growth Rate.

Authors: Charles Courtemanche; Joseph Garuccio; Anh Le; Joshua Pinkston; Aaron Yelowitz
Journal: Health Aff (Millwood) Date: 2020-05-14 Impact factor: 6.301

3. Social network sensors for early detection of contagious outbreaks.

Authors: Nicholas A Christakis; James H Fowler
Journal: PLoS One Date: 2010-09-15 Impact factor: 3.240

4. Human mobility restrictions and the spread of the Novel Coronavirus (2019-nCoV) in China.

Authors: Hanming Fang; Long Wang; Yang Yang
Journal: J Public Econ Date: 2020-09-08

5. JUE Insight: The geographic spread of COVID-19 correlates with the structure of social networks as measured by Facebook.

Authors: Theresa Kuchler; Dominic Russel; Johannes Stroebel
Journal: J Urban Econ Date: 2021-01-09

6. JUE Insight: Measuring movement and social contact with smartphone data: a real-time application to COVID-19.

Authors: Victor Couture; Jonathan I Dingel; Allison Green; Jessie Handbury; Kevin R Williams
Journal: J Urban Econ Date: 2021-02-12

7. Urban Flight Seeded the COVID-19 Pandemic Across the United States.

Authors: Joshua Coven; Arpit Gupta; Iris Yao
Journal: J Urban Econ Date: 2022-08-02

8. The effect of human mobility and control measures on the COVID-19 epidemic in China.

Authors: Moritz U G Kraemer; Chia-Hung Yang; Bernardo Gutierrez; Chieh-Hsi Wu; Brennan Klein; David M Pigott; Louis du Plessis; Nuno R Faria; Ruoran Li; William P Hanage; John S Brownstein; Maylis Layan; Alessandro Vespignani; Huaiyu Tian; Christopher Dye; Oliver G Pybus; Samuel V Scarpino
Journal: Science Date: 2020-03-25 Impact factor: 47.728

9. JUE Insight: How much does COVID-19 increase with mobility? Evidence from New York and four other U.S. cities.

Authors: Edward L Glaeser; Caitlin Gorback; Stephen J Redding
Journal: J Urban Econ Date: 2020-10-21

5 in total

1. Regional and sectorial impacts of the Covid-19 crisis: Evidence from electronic payments.

Authors: Bruno P Carvalho; Susana Peralta; João Pereira Dos Santos
Journal: J Reg Sci Date: 2021-12-05

2. Key links in network interactions: Assessing route-specific travel restrictions in China during the Covid-19 pandemic.

Authors: Xi Chen; Yun Qiu; Wei Shi; Pei Yu
Journal: China Econ Rev Date: 2022-04-20