Literature DB >> 35965603

High-Speed railways and the spread of Covid-19.

Jindong Pang1, Youle He2, Shulin Shen3.   

Abstract

High-speed railways (HSRs) greatly decrease transportation costs and facilitate the movement of goods, services, and passengers across cities. In the context of the Covid-19 pandemic, however, HSRs may contribute to the cross-regional spread of the new coronavirus. This paper evaluates the role of HSRs in spreading Covid-19 from Wuhan to other Chinese cities. We use train frequencies in 1971 and 1990 as instrumental variables. Empirical results from gravity models demonstrate that one more HSR train originating from Wuhan each day before the Wuhan lockdown increases the cumulative number of Covid-19 cases in a city by about 10 percent. The empirical analysis suggests that other transportation modes, including normal-speed trains and airline flights, also contribute to the spread of Covid-19, but their effects are smaller than the effect of HSRs. This paper's findings indicate that transportation infrastructures, especially HSR trains originating from a city where a pandemic broke out, can be important factors promoting the spread of an infectious disease.
© 2022 Hong Kong Society for Transportation Studies. Published by Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Covid-19; High-speed rails; Pandemic; Transportation infrastructure

Year:  2022        PMID: 35965603      PMCID: PMC9359484          DOI: 10.1016/j.tbs.2022.08.001

Source DB:  PubMed          Journal:  Travel Behav Soc        ISSN: 2214-367X


Introduction

More than two-thirds of the high-speed railways (HSRs) in the world are built in China. According to the report of China’s Ministry of Transport, HSRs accomplished a ridership of 3.66 billion passengers in 2019, accounting for about 70 percent of the total railway passenger traffic. The passenger volume of HSRs was about four times that of civil aviation in 2019. With a much higher speed, HSRs greatly decrease transportation costs and facilitate the movement of goods, services, and passengers across cities (Chen et al., 2016, Heuermann and Schmieder, 2019, Lawrence et al., 2019). In the context of the Covid-19 pandemic, however, HSRs may contribute to the cross-regional spread of the new coronavirus. The Wuhan lockdown on January 23, 2020, two days before the Spring Festival, marked the outbreak of the Covid-19 pandemic in China. All trains and flights from and to Wuhan were canceled as the Wuhan lockdown began. The railway and airline services in Wuhan were not resumed until April 8, 2020. Epidemiological investigations show that most confirmed cases in other cities had traveled to Wuhan or had close contact with individuals returning from Wuhan at the early stage of the pandemic (Han et al., 2020). Thus, HSR trains can play an important role in spreading Covid-19 from Wuhan to other cities before the Wuhan lockdown. This is especially true because of the Spring Festival travel season (Chunyun), which is the largest annual human movement in the world and usually starts 15 days before the Spring Festival. The purpose of this paper is to evaluate the effect of HSRs and other transportation infrastructures on spreading Covid-19 from Wuhan to other Chinese cities at the early stage of the pandemic. Specifically, we rely on a gravity equation to estimate the elasticity of confirmed Covid-19 cases in a city with respect to the number of HSR trains from Wuhan. To solve potential endogeneity issues, we use historical train connections to Wuhan as instrumental variables (IVs). HSRs can contribute to the spread of Covid-19 in two ways. First, HSRs can directly transport passengers carrying the new coronavirus from Wuhan to other cities (Zheng et al., 2020). The second channel is that healthy passengers may get infected if they ride on the same HSR train with Covid-19 patients (Morawska et al., 2020, Hu et al., 2021). This paper studies the aggregate effect of the above two mechanisms. The estimates in this paper reflect the HSRs’ effect on spreading Covid-19 before the Wuhan lockdown when most social-distancing measures were not enforced. HSRs’ effect can be smaller if passengers follow the current social-distancing measures, including keeping distance from each other and wearing a mask. Thus, this paper’s estimates should be interpreted with care. This paper has the following contributions. First, this paper proposes using historical train connections to Wuhan as IVs for the HSR connections with Wuhan in 2020. This identification strategy complements existing studies that usually use OLS models to study the effect of HSRs on spreading Covid-19. Second, this paper complements the literature on HSRs by demonstrating that in addition to affecting all kinds of economic outcomes, HSRs also contribute to the spread of infectious diseases. Third, this paper’s results suggest the importance of studying the number of HSR train frequencies (the “intensive margin”). Most current studies on HSRs focus on whether a city had an HSR station (the “extensive margin”) and few studies consider the intensity of HSR connections among cities. A more detailed discussion of this paper’s relationship to existing studies is presented in Section 2. The rest of the paper consists of four sections. Section 2 reviews the current literature that is closely related to this paper. Section 3 introduces data and empirical strategies. Empirical results are shown in Section 4 and Section 5 concludes the paper.

Literature review

This paper is mainly related to two strands of literature. First, this paper contributes to a rising literature investigating factors and policies that accelerate or slow the transmission of Covid-19. The current studies based on China and other countries have found that weather conditions and air pollution affect the spread of Covid-19 and the fatality rate of the coronavirus (Conticini et al., 2020, Sajadi et al., 2020, Qi et al., 2020, Shi et al., 2020, Tosepu et al., 2020, Wu et al., 2020, Cartenì et al., 2021a). Specifically, temperature is found to be negatively associated with the infection risk of Covid-19 while the results on humidity are mixed. Please see McClymont and Hu (2021) for a more detailed review. In addition to natural factors, human activities, including social activities, college student travel, inter-regional migration, in-person elections, and the concentration of population in public facilities, can also increase the spread of the novel coronavirus (Kuchler et al., 2020, Mangrum and Niekamp, 2020, Li and Ma, 2021, Mu et al., 2021, Loo et al., 2021, Palguta et al., 2021). These studies generally conclude that activities that encourage the concentration of population increase the infection risk of Covid-19. More relevant to this paper, some studies investigate the probability of Covid-19 infection within transportation facilities. Due to the closed nature of transport vehicles, current research shows that transportation and the use of transportation facilities also promote the transmission of Covid-19. The usage of public transit, including buses, subways, and school buses, is found to contribute to the spread of Covid-19 in Italy and the US (Cartenì et al., 2021b, Ramirez et al., 2021, Thomas et al., 2022). Related studies on China are few. Better access to railroads increases the number of Covid-19 cases in Italy (Cartenì et al., 2021c). Air travel can also increase the infection risk of the novel coronavirus worldwide (Nakamura and Managi, 2020, Pombal et al., 2020). In addition to public transit, railroads, and airlines, several studies have evaluated the effect of HSRs on spreading Covid-19 (Zhang et al., 2020, Li et al., 2021, Zhu and Guo, 2021). These studies focus on China and investigate the transmission of Covid-19 from Wuhan to other Chinese cities. Studies on the same topic in other countries are limited. The existing studies rely on OLS models and do not account for the potential endogeneity of HSR connections to Wuhan. In this paper, we complement the OLS estimates by using historical train connections to Wuhan as IVs. The estimates from using IVs are found to be larger than the OLS estimates, suggesting potential endogeneity issues may bias the OLS estimates towards zero. Another extension beyond existing studies is that this paper highlights the effect of HSR trains originating from Wuhan. As HSR trains originating from Wuhan carry much more Wuhan passengers than HSR trains originating from other cities and having a stop at Wuhan, HSR trains originating from Wuhan can contribute more to the spread of Covid-19 from Wuhan to other cities. To contain the spread of Covid-19, all kinds of quarantine measures and social-distancing policies are implemented. Lockdown policies, mainly adopted by Chinese cities, have been found to effectively reduce or completely eliminate Covid-19 cases (Chinazzi et al., 2020, Fang et al., 2020, Lau et al., 2020, Qiu et al., 2020, Tian et al., 2020). Some more modest mobility-control policies taken by many cities in the US and Europe, including shelter-in-place orders, international travel restrictions, and work-from-home measures, are also able to decrease the infection risk of Covid-19 (Dave et al., 2020, Kraemer et al., 2020, Chiba, 2021). In addition to social-distancing policies, vaccination is another important measure to contain the spread of Covid-19 (Shinde et al., 2021, Thomas et al., 2021, Eyre et al., 2022). Second, this study adds to the literature evaluating the economic and social consequences of HSRs. There has been an emerging literature studying the effect of HSRs on different types of economic outcomes in the past five years. As China experienced the largest wave of HSR construction in the past twenty years, many studies investigated HSRs’ effect by using Chinese datasets. HSRs are found to affect airline traffic, increase employment, affect GDP and income, promote academic cooperation, and encourage innovation by decreasing transportation costs in China (Chen et al., 2016, Lin, 2017, Ke et al., 2017, Dai et al., 2018, Yu et al., 2019, Zhang et al., 2019, Dong et al., 2020, Gao and Zheng, 2020, Hanley et al., 2022). In other countries, existing studies also confirm that HSRs increase the GDP per capita, improve productivity, and affect airline passenger flows (Cascetta et al., 2020, Avogadro et al., 2021, Bhatt and Kato, 2021, Miwa et al., 2022). This paper complements existing studies by demonstrating that HSRs also contribute to the spread of a contagious disease. This paper’s results suggest the importance of studying the number of HSR train frequencies (the “intensive margin”). Most current studies focus on whether a city had an HSR station (the “extensive margin”) and analyze its impact on economic activities (Lin, 2017, Ke et al., 2017, Yu et al., 2019, Cascetta et al., 2020, Dong et al., 2020, Gao and Zheng, 2020, Bhatt and Kato, 2021). Few studies consider the intensity of HSR connections among cities. This paper finds that whether a city has HSRs or not has no significant effect on the number of Covid-19 cases while the number of HSR trains originating from Wuhan does have a significant impact. Given most cities in China have already connected to the national HSR network, this paper’s findings indicate that it is important to evaluate the effect of HSR train frequencies in future studies.

Empirical strategies

Data and summary statistics

We construct a cross-sectional dataset with 275 prefecture cities (including Beijing, Shanghai, Tianjin, and Chongqing) in China. Due to data availability, the cities in four western provinces or autonomous regions (Xinjiang, Tibet, Inner Mongolia, and Qinghai) are excluded from the analysis. The cumulative number of confirmed Covid-19 cases in each city by February 6, 2020, two weeks after the Wuhan lockdown, is obtained from China Data Lab.1 We focus on the number of confirmed Covid-19 cases within two weeks after the Wuhan lockdown because the incubation period of Covid-19 can be up to two weeks. The confirmed cases before February 6, 2020 are more likely to originate from Wuhan. The GDP, the population, the number of medical institutions, and the number of medical staff in each city in 2019 are assembled from local governments’ statistical reports or announcements. Each city’s straight-line distance to Wuhan and the geographic coordinates of city centers are calculated by ArcGIS using a map of prefecture cities. The weather conditions of each city in January 2020 are assembled from weather observatory stations. For each city in the sample, we calculate the average daily number of HSR trains from Wuhan during the third week of January 2020 (from January 13, 2020 to January 19, 2020) from the “12306 China Railway” website.2 We focus on the third week of January 2020 for two reasons. The first reason is that the HSR train schedules in the third week of January 2020 can be different from the schedules in the earlier weeks because of the Spring Festival travel season. The government sometimes opens additional HSR trains in the Spring Festival travel season to accommodate the soaring travel demand. The second reason is that this week is right before the outbreak of the pandemic (Wuhan lockdown) and HSR trains this week may have the largest impact on spreading Covid-19. We use two variables to measure the level of HSR connections to Wuhan: the daily number of HSR trains from Wuhan and the daily number of HSR trains originating from Wuhan. The number of HSR trains from Wuhan includes HSR trains originating from Wuhan and HSR trains starting from other cities but having a stop at Wuhan. As most of the tickets for a train are reserved for passengers in the departure city, the HSR trains originating from Wuhan carry much more passengers from Wuhan and may have a larger effect on spreading the new coronavirus to other cities. For example, for an HSR train from Wuhan to Shanghai, most of the passengers on the train usually get on board in Wuhan. For an HSR train that departs from Chengdu to Shanghai but stops in Wuhan, the number of passengers boarding the train in Wuhan is much fewer. Note that passengers know the departure station of each train when they buy a train ticket, and thus know whether a train departs from Wuhan. We calculate the daily number of normal-speed trains from Wuhan and the daily number of normal-speed trains originating from Wuhan by exploring the train schedules between January 13, 2020 and January 19, 2020, from the “12306 China Railway” website. The average daily number of flights from Wuhan to other cities during the same period is obtained from VariFlight, a professional aviation data provider.3 We use train connections to Wuhan in the past as IVs. Specifically, we assemble the daily number of trains from (and originating from) Wuhan to each city in 1990 and 1971 by exploring the “National Railway Train Timetable”, which publishes the schedules for all trains in China in selected years. Similarly, whether a city had an airport in 1980 is used as an instrument for the number of air flights from Wuhan. We searched government announcements, official documents, and other information on the internet to find out whether a city had an airport in 1980. The summary statistics for the variables mentioned above are shown in Table 1 . On average, each city had 70 cumulative Covid-19 cases by February 6, 2020. About 85 percent of the cities are connected to the HSR network. Each city on average had 8.73 HSR trains from Wuhan and 3.47 HSR trains originating from Wuhan in a day. The distributions for the number of Covid-19 cases and the number of HSR trains from (and originating from) Wuhan are shown in Fig. 1 .
Table 1

Summary statistics.

VariablesMeanStandard DeviationMinMax
Key variables:
The cumulative number of Covid-19 cases by February 6, 20207020512141
The cumulative number of Covid-19 cases by January 30, 202025630573
Has HSR or not (=1 if has HSR)0.850.3601
No. of HSR trains from Wuhan8.7617.11098
No. of HSR trains originating from Wuhan3.487.00040
No. of normal-speed trains from Wuhan4.297.51051
No. of normal-speed trains originating from Wuhan0.571.0706
No. of air flights from Wuhan0.952.69017
Population (in millions)4.683.540.4231.24
GDP (in billions of RMB)332.42469.7113.863815.53
No. of medical institutions2955236911121,058
No. of medical staff (in thousands)33.4733.314.00282.00



The instrumental variables:
No. of trains from Wuhan in 19901.323.48025
No. of trains originating from Wuhan in 19900.561.71017
No. of trains from Wuhan in 19710.601.83013
No. of trains originating from Wuhan in 19710.261.01010
Had an airport in 1980 or not (=1 if had an airport)0.150.3601
Fig. 1

The distribution of the number of Covid-19 cases and the number of HSR trains.

Summary statistics. The distribution of the number of Covid-19 cases and the number of HSR trains. Each city had 4.29 and 0.57 normal-speed trains from and originating from Wuhan in a day, respectively. The number of HSR trains from Wuhan greatly exceeds the number of normal-speed trains from Wuhan. On average, each city had 0.95 flights from Wuhan in a day. Each city had 1.32 trains from Wuhan in 1990 and this number decreased to 0.60 trains in 1971. The daily number of trains originating from Wuhan was 0.56 and 0.26 trains in 1990 and 1971, respectively. The share of cities that had an airport in 1980 is 15 percent.

Empirical specifications

Our empirical specification follows a gravity model, which has been widely used in modeling trade and population flows between countries or regions (Head and Mayer, 2014). The gravity model states that the volume of trade or population flows between two countries or cities is proportional to their economic mass and a measure of their relative trade or flow frictions. One friction often considered in the literature is transportation cost (Mátyás, 1997, Anderson, 2011). Following the setting of a gravity model, the empirical specification is set as follows: is the cumulative number of confirmed Covid-19 cases in city . is a constant term. and represent the characteristics of Wuhan and city , respectively. We use subscript to represent Wuhan. City characteristics include population, GDP, the number of medical institutions, the number of medical staff, the longitude and latitude coordinates of the city center, and weather conditions. The population and GDP are used to account for potential economic or trade links between Wuhan and city . The numbers of medical institutions and medical staff measure the levels of medical services in city . The coordinates of city centers and weather conditions (including the average temperature and precipitation in January 2020) are used to account for the effect of latitude and weather on the spread of Covid-19. is the product of city characteristics for city and Wuhan. and represent the exponents of powers and need to be estimated. is the transportation cost between Wuhan and city . It depends on the straight-line distance () and the number of HSR trains from (originating from) Wuhan (). With a correlation coefficient of 0.99, the straight-line distance is highly correlated with the highway distance between Wuhan and city .4 Thus, the straight-line distance variable can largely capture the effect of highway trips using private cars. For simplicity, we assume the functional form of the transportation cost function is: . and are parameters that need to be estimated. To get semi-elasticity estimates, avoid the effect of extreme values on estimation, and have a linear-regression form, we take equivalent logarithm transformations of Equation (1) and rearrange to get Equation (2): In Equation (2), a constant term is denoted by , which equals . An error term is denoted by . The parameter represents the effect of one more HSR train from (originating from) Wuhan on the number of Covid-19 cases in city . The Covid-19 outbreak in Wuhan is unexpected and we do not have reverse causality problems in estimating Equation (2) by OLS. However, we may suffer from omitted-variable biases, as some uncontrolled factors can simultaneously affect the HSR connections to Wuhan and the number of Covid-19 cases. For example, the government officials of a city may be more passionate about building HSRs and this city thus has more HSR trains from Wuhan. If this city’s government officials are also more competent in implementing social-distancing measures, this city is likely to have fewer confirmed Covid-19 cases. Thus, the government officials’ personalities and competence can lead to more HSR trains from Wuhan and fewer Covid-19 cases. In this case, the omitted-variable problem will bias the OLS estimates towards zero. Following the literature (Baum-Snow, 2007, Duranton and Turner, 2011, Duranton and Turner, 2012, Garcia-López et al., 2015, Baum-Snow et al., 2017, Dong et al., 2020), we choose to use historical variables as instruments to solve this identification problem. As summarized by Redding and Turner (2015), using historical variables as instruments is one of the major identification strategies in evaluating the impact of transportation infrastructures. For example, Baum-Snow et al. (2017) uses the railways in 1964 as an instrument to study the effect of highways, railroads, and ring roads on city growth in China. In this paper, we use the number of trains from (and originating from) Wuhan in 1990 and the number of trains from (and originating from) Wuhan in 1971 as IVs. These instruments are good predictors of the HSR connections in 2020, as cities that had lots of train connections to Wuhan in the past are more likely to have more HSR trains from Wuhan before the Wuhan lockdown. Thus, the relevance requirement of these IVs is satisfied. The validity of these IVs hinges on the assumption that factors determining the train connections to Wuhan in the past are not correlated with potential unobserved variables that affect the transmission of Covid-19 at the early stage of the pandemic. The length of railroads in China started to grow rapidly in the 2000s and China had no HSRs before 2008. About half of Chinese railroads before 1990 were built before the People’s Republic’s establishment in 1949. Much of the railroad investment before 1990 was under Soviet influence and served to connect resource-rich regions of the west with manufacturing centers in the east (Baum-Snow et al., 2017). Therefore, it is plausible that the construction of rail networks before 1990, and thus the number of trains between Wuhan and a city in 1990 or 1971, was determined by factors unrelated to the transmission of Covid-19 in 2020. Another argument for the validity of the instruments is that the growth of cities and the current distribution of economic activities in China are largely shaped by the “reform and opening-up” policy. China did not adopt this policy in 1971 and had just started the “reform and opening-up” process in 1990. Much of the economic and trade links among cities before 1990 were from the planned economy regime. After thirty years of rapid growth, China has established a market-oriented economy. Thus, the transmission of Covid-19 in 2020 was not likely to be determined by the train connections in 1990 and 1971. In evaluating the effect of airline flights, we are unable to find the flight schedules between Wuhan and other cities before 1990. Therefore, we cannot use the number of flights from Wuhan in the past as an IV. Instead, we use whether a city had an airport in 1980 as an instrument for the number of flights from Wuhan in January 2020. The rapid development of China's aviation industry began in the 1990s. The number of airline passengers increased from 17 million in 1990 to 552 million in 2017. Most airports in China in 1980 were dual-use airports, i.e., the airports were used for both military and civilian purposes. Thus, whether a city had an airport in 1980 was mostly determined by factors unrelated to the spread of Covid-19. Whether a city had an airport in 1980 can strongly predict the number of flights from Wuhan. These arguments support the validity and strength of this IV.

Results

The effect of HSR trains on the cumulative number of confirmed Covid-19 cases

The major empirical results are summarized in Table 2 . All city characteristics are included in the regressions but omitted in the text to save space. The cumulative number of confirmed Covid-19 cases in a city is found to significantly decrease with the straight-line distance to Wuhan and increase with this city’s GDP. These effects are consistent with expectations. Cities that are farther away from Wuhan have higher transportation costs to Wuhan and thus have fewer Covid-19 cases. Cities with a higher GDP tend to have more economic and trade connections with Wuhan and receive more passenger flows and Covid-19 cases from Wuhan. The effect of other city characteristics is not statistically significant. Robust standard errors are in parentheses.
Table 2

The effect of HSR trains on the number of Covid-19 cases.

Variables(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
OLS resultsIV-Train connections in 1990IV-Train connections in 1971IV-Both IVs
Has HSR or not0.0778
(0.206)
No. of HSR trains from WH0.0111**0.01140.01810.00948
(0.00455)(0.00704)(0.0113)(0.00644)
No. of HSR trains originating0.0404***0.121***0.103**0.103**
from WH(0.0112)(0.0470)(0.0481)(0.0481)
First-stage F statistics44.4817.5118.8112.6024.2212.60
City CharacteristicsYesYesYesYesYesYesYesYesYes
Observations275275275275275275275275275
R-squared0.6350.6440.6550.6440.5750.6410.6070.6440.607

The dependent variable is the logarithm form of the cumulative number of confirmed Covid-19 cases in a city by February 6, 2020 (two weeks after the Wuhan lockdown). The number of trains from Wuhan in 1990 is used as an IV in column (4). The number of trains originating from Wuhan in 1990 is used as an IV in column (5). The number of trains from and originating from Wuhan in 1971 are used as instruments in columns (6) and (7), respectively. In columns (8), we use the numbers of trains from Wuhan in 1990 and 1971 as IVs. The numbers of trains originating from Wuhan in 1990 and 1971 are used as IVs in column (9). p-values of over-identification tests are 0.26 and 0.23 in columns (8) and (9), respectively. Controlled but not listed city covariates include population, GDP, the straight-line distance to Wuhan, the number of medical institutions, the number of medical staff, the geographic coordinates of city centers, the average temperature in January 2020, and the average precipitation in January 2020. WH represents Wuhan. Robust standard errors are in parenthesis. *** p < 0.01, ** p < 0.05, * p < 0.1.

The effect of HSR trains on the number of Covid-19 cases. The dependent variable is the logarithm form of the cumulative number of confirmed Covid-19 cases in a city by February 6, 2020 (two weeks after the Wuhan lockdown). The number of trains from Wuhan in 1990 is used as an IV in column (4). The number of trains originating from Wuhan in 1990 is used as an IV in column (5). The number of trains from and originating from Wuhan in 1971 are used as instruments in columns (6) and (7), respectively. In columns (8), we use the numbers of trains from Wuhan in 1990 and 1971 as IVs. The numbers of trains originating from Wuhan in 1990 and 1971 are used as IVs in column (9). p-values of over-identification tests are 0.26 and 0.23 in columns (8) and (9), respectively. Controlled but not listed city covariates include population, GDP, the straight-line distance to Wuhan, the number of medical institutions, the number of medical staff, the geographic coordinates of city centers, the average temperature in January 2020, and the average precipitation in January 2020. WH represents Wuhan. Robust standard errors are in parenthesis. *** p < 0.01, ** p < 0.05, * p < 0.1. The OLS results following Equation (2) are shown in columns (1)-(3). The results in column (1) suggest whether a city is connected to the HSR network or not has no significant effect on the dependent variable. However, the number of HSR trains from Wuhan does have a significantly positive effect on the cumulative number of confirmed Covid-19 cases. The results in column (2) suggest one more HSR train from Wuhan each day can increase the Covid-19 cases in a city by one percent. In column (3), we find that one more HSR train originating from Wuhan each day significantly increases a city’s Covid-19 cases by four percent. The effect of HSR trains originating from Wuhan is much larger, consistent with the expectation that trains originating from Wuhan carry much more Wuhan passengers and tend to contribute more to the spread of the Covid-19 from Wuhan to other cities. Columns (4)-(9) display the two-stage least-squares (2SLS) regression results. The first-stage regression results are displayed in Table A.1. The daily number of trains from Wuhan in 1990 and 1971 are good predictors for the daily number of HSR trains from Wuhan. In column (4) of Table 2, after instrumenting the number of HSR trains from Wuhan with the number of trains from Wuhan in 1990, the empirical results demonstrate that one more HSR train from Wuhan each day will increase the Covid-19 cases by one percent. This estimate, however, becomes statistically insignificant. In column (5), the number of trains originating from Wuhan in 1990 is used as an instrument and the results suggest one more HSR train originating from Wuhan each day significantly increases the Covid-19 cases by 12 percent. This IV estimate is statistically significant and larger than the OLS estimate in column (3), suggesting potential confounding factors tend to bias the OLS estimates toward zero.
Table A1

First-stage regression results.

(1)(2)(3)(4)(5)(6)
VariablesHSR trains from WHHSR trains originating from WH
Trains from WH in2.430***3.191***
1990(0.364)(0.752)
Trains from WH in3.883***−1.528
1971(0.895)(1.250)
Trains originating from1.138***0.672
WH in 1990(0.272)(0.912)
Trains originating from1.864***0.822
WH in 1971(0.525)(1.598)
City CharacteristicsYesYesYesYesYesYes
Observations276276276276276276
R-squared0.6180.5730.6210.5290.5280.530

The dependent variable is the number of HSR trains from Wuhan or originating from Wuhan. Controlled but not listed city covariates include population, GDP, the straight-line distance to Wuhan, the number of medical institutions, the number of medical staff, the geographic coordinates of city centers, the average temperature in January 2020, and the average precipitation in January 2020. WH denotes Wuhan. Robust standard errors are in parenthesis. *** p < 0.01, ** p < 0.05, * p < 0.1.

In columns (6)-(7), we use the number of trains from Wuhan in 1971 and the number of trains originating from Wuhan in 1971 as instruments. Compared with the 1990 IV, the number of trains from Wuhan in 1971 can be more valid, as the latter has a much longer lag. Nevertheless, the empirical results from the 1971 IVs are qualitatively and quantitively similar to the results when the 1990 IVs are used. The regression results by using both the 1990 and the 1971 IVs are reported in columns (8)-(9). The empirical estimates are still robust. All IV regressions have a strong first stage.5 The null hypothesis of the over-identification tests is not rejected when we include both IVs in columns (8)-(9). This suggests the validity of the 1990 IVs is not statistically different from the validity of the 1971 IVs. The empirical estimates in Table 2 are generally consistent with existing studies that also find HSRs contribute to the spread of Covid-19. However, the IV estimates in this paper are larger than the estimates in other studies that usually rely on OLS models. For example, Zhang et al. (2020) find that one more HSR train from Wuhan increases the number of Covid-19 cases by about 0.004 cases, which is much smaller than the IV estimates in this paper. Zhu and Guo (2021) conclude that cities connected to Wuhan by HSRs have 25 percent more Covid-19 cases than other cities, but they did not estimate the effect of one additional HSR train on the number of Covid-19 cases. Li et al. (2021) focus on provinces instead of cities, though they also find that HSRs promote the spread of Covid-19. Thus, this paper complements existing studies by offering new estimates from a more valid identification strategy.

Comparing the effect of HSRs with other transportation modes

In addition to HSRs, we also evaluate the role of normal-speed trains and air flights in spreading Covid-19 from Wuhan to other cities. The empirical specification still follows Equation (2). Similarly, historical variables are used as IVs to ease concerns about endogeneity problems. The OLS results in column (1) of Table 3 indicate that one more normal-speed train from Wuhan each day can increase the number of Covid-19 cases by 2.2 percent. This estimate increases to 19.5 percent for one more normal-speed train originating from Wuhan each day in column (2). Note that the capacity of a normal-speed train is up to two thousand passengers while the capacity of an HSR train is usually smaller than one thousand passengers. Thus, a normal-speed train can carry much more passengers than an HSR train. This may explain why one more normal-speed train originating from Wuhan has a larger effect than an HSR train originating from Wuhan. The 2SLS results are qualitatively similar in columns (3) and (4) where we use the number of trains from Wuhan in 1990 and 1971 as instruments. The estimated effect of normal-speed trains originating from Wuhan in column (4), however, is only statistically significant at the 10 % significance level.
Table 3

The effects of normal-speed trains and airline flights.


(1)
(2)
(3)
(4)
(5)
(6)
VariablesNormal-speed trains
Air flights
OLSIVOLSIV
No. of normal-speed trains from WH0.0225**0.0169*
(0.00875)(0.0100)
No. of normal-speed trains originating0.195***0.283*
from WH(0.0563)(0.166)
No. of air flights from WH0.125***0.175***
(0.0225)(0.0634)
First-stage F statistics76.7315.0920.07
City CharacteristicsYesYesYesYesYesYes
Observations275275275275275275
R-squared0.6440.6490.6430.6460.6670.662

The dependent variable is the logarithm form of the cumulative number of confirmed Covid-19 cases in a city by February 6, 2020. The numbers of trains from Wuhan in 1990 and 1971 are used as IVs in column (3). The numbers of trains originating from Wuhan in 1990 and 1971 are used as IVs in column (4). The p-values of over-identification tests are 0.26 and 0.21 in columns (3) and (4). Whether a city has an airport in 1980 is used as an IV in column (6). Controlled but not listed city covariates include population, GDP, the straight-line distance to Wuhan, the number of medical institutions, the number of medical staff, the geographic coordinates of city centers, the average temperature in January 2020, and the average precipitation in January 2020. WH represents Wuhan. Robust standard errors are in parenthesis. *** p < 0.01, ** p < 0.05, * p < 0.1.

The effects of normal-speed trains and airline flights. The dependent variable is the logarithm form of the cumulative number of confirmed Covid-19 cases in a city by February 6, 2020. The numbers of trains from Wuhan in 1990 and 1971 are used as IVs in column (3). The numbers of trains originating from Wuhan in 1990 and 1971 are used as IVs in column (4). The p-values of over-identification tests are 0.26 and 0.21 in columns (3) and (4). Whether a city has an airport in 1980 is used as an IV in column (6). Controlled but not listed city covariates include population, GDP, the straight-line distance to Wuhan, the number of medical institutions, the number of medical staff, the geographic coordinates of city centers, the average temperature in January 2020, and the average precipitation in January 2020. WH represents Wuhan. Robust standard errors are in parenthesis. *** p < 0.01, ** p < 0.05, * p < 0.1. Air flights are also found to have a significant positive effect on the cumulative number of Covid-19 cases in columns (5) and (6). The OLS results suggest one more daily flight from Wuhan before the outbreak of the pandemic increases the number of Covid-19 cases in the destination city by 12.5 percent. This effect increases to 17.5 percent when we use whether a city has an airport in 1980 as an instrument. Note that the infection risk in a plane cabin is larger than the infection risk in a train car (Hu et al., 2020). This might explain why a plane usually carries fewer passengers than a train but the effect of one more flight from Wuhan is similar to the effect of one more normal-speed train originating from Wuhan. To compare the effect of different transportation modes, we further convert the semi-elasticity estimates in Table 2 and Table 3 to elasticity estimates. We multiply the estimated effect of one more train or flight from Wuhan by the average level of train or flight connections to Wuhan to get the elasticity estimate.6 This conversion accounts for the fact that the number of HSR trains, normal-speed trains, and airline flights from Wuhan is different across cities. The elasticity estimates in Table 4 suggest the HSR trains originating from Wuhan have the largest impact on spreading Covid-19. A ten percent increase in the daily number of HSR trains originating from Wuhan can increase the number of Covid-19 cases in the destination city by 3.5 percent. A ten percent increase in the daily number of flights from Wuhan and normal-speed trains originating from Wuhan can increase the number of Covid-19 cases in the destination city by 1.7 and 1.6 percent, respectively.7 The elasticity estimates for HSR trains from Wuhan and normal-speed trains from Wuhan are all below 0.1.
Table 4

Elasticity estimates.

The estimated effect of one more train or flight from WHThe average number of trains or flights from WHConverted elasticities
HSR trains from WH0.00958.760.083
HSR trains originating from WH0.103.480.35
Normal-speed trains from WH0.0174.290.073
Normal-speed trains originating from WH0.280.570.16
Air flights from WH0.180.950.17

The estimated effect of one more train or flight from Wuhan is from the IV estimates in Table 2 and Table 3.

Elasticity estimates. The estimated effect of one more train or flight from Wuhan is from the IV estimates in Table 2 and Table 3.

Robustness checks

We conduct a series of robustness checks for the major result in Table 2 that HSRs significantly improve the spread of Covid-19 from Wuhan to other cities. The first exercise is conducting a placebo test where we evaluate the effect of HSR trains from an alternative city on the number of Covid-19 cases. Specifically, we choose four regional center cities that are geographically close to Wuhan: Zhengzhou, Hefei, Changsha, and Nanchang. These four provincial capital cities are similar to Wuhan in GDP, population, and geographical locations. The regression results in columns (1)-(4) in Table 5 suggest the number of HSR trains from these four alternative cities have small and insignificant effects on the number of Covid-19 cases. This result is consistent with the expectation that the Covid-19 spreads from Wuhan to other cities and the number of HSR trains from these four alternative cities should have a zero effect on the transmission of Covid-19.
Table 5

Robustness checks.


(1)
(2)
(3)
(4)
ZhengzhouHefeiChangshaNanchang
No. of HSR trains from−0.00510−0.005300.001010.00246
alternative cities(0.00380)(0.00477)(0.00322)(0.00419)
City CharacteristicsYesYesYesYes
Observations274274274274
R-squared0.6340.6340.6310.632



(5)(6)(7)(8)
Exclude cities in the Hubei provinceUse the number of Covid-19 cases before January 30th, 2020
No. of HSR trains from WH0.0120***0.0107*
(0.00459)(0.00627)
No. of HSR trains originating0.0675***0.106**
from WH(0.0245)(0.0430)
First-stage F statistics42.9626.5342.8213.41
City CharacteristicsYesYesYesYes
Observations260260275275
R-squared0.6130.5770.5970.554

The dependent variable is the logarithm form of the cumulative number of confirmed Covid-19 cases in a city by February 6, 2020, for columns (1)-(6). The dependent variable is the logarithm form of the cumulative number of Covid-19 cases in a city by January 30, 2020, in columns (7)-(8). Columns (1)-(4) show OLS regression results. The numbers of trains from Wuhan in 1990 and 1971 are used as IVs in columns (5) and (7). The numbers of trains originating from Wuhan in 1990 and 1971 are used as IVs in columns (6) and (8). The over-identification tests cannot reject the null that both IVs are valid in columns (5)-(8). Controlled but not listed city covariates include population, GDP, the straight-line distance to Wuhan, the number of medical institutions, the number of medical staff, the geographic coordinates of city centers, the average temperature in January 2020, and the average precipitation in January 2020. WH represents Wuhan. Robust standard errors are in parenthesis. *** p < 0.01, ** p < 0.05, * p < 0.1.

Robustness checks. The dependent variable is the logarithm form of the cumulative number of confirmed Covid-19 cases in a city by February 6, 2020, for columns (1)-(6). The dependent variable is the logarithm form of the cumulative number of Covid-19 cases in a city by January 30, 2020, in columns (7)-(8). Columns (1)-(4) show OLS regression results. The numbers of trains from Wuhan in 1990 and 1971 are used as IVs in columns (5) and (7). The numbers of trains originating from Wuhan in 1990 and 1971 are used as IVs in columns (6) and (8). The over-identification tests cannot reject the null that both IVs are valid in columns (5)-(8). Controlled but not listed city covariates include population, GDP, the straight-line distance to Wuhan, the number of medical institutions, the number of medical staff, the geographic coordinates of city centers, the average temperature in January 2020, and the average precipitation in January 2020. WH represents Wuhan. Robust standard errors are in parenthesis. *** p < 0.01, ** p < 0.05, * p < 0.1. The second robustness check is excluding the cities in Hubei province. The 2SLS results in columns (5)-(6) of Table 5 indicate the effect of HSR trains is robust after excluding the cities in Hubei province. In the third exercise, we test the robustness of the empirical results by focusing on the number of cumulative Covid-19 cases within one week after the Wuhan lockdown (before January 30, 2020). The results in columns (7)-(8) are qualitatively and quantitatively similar to those in Table 2.8 In the fourth robustness check, we test the sensitivity of the empirical results in Table 2 by including other modes of transportation in the regression. Columns (1), (2) of Table A2 display the OLS results when all three types of transportation modes are included in the regression. After controlling for the frequencies of normal-speed trains and the number of air flights from Wuhan, the number of HSR trains from Wuhan has an insignificant positive effect on the dependent variable in column (1). The effect of HSR trains originating from Wuhan, however, remains statistically significant in column (2). The magnitude of the estimate in column (2) is close to that of the estimate in column (3) of Table 2. The corresponding 2SLS results shown in columns (3) and (4) of Table A2 display similar patterns.
Table A2

More robustness checks.

Variables(1)
(2)
(3)
(4)
OLSInstrumenting HSRs from Wuhan by both IVs
No. of HSR trains from WH0.00751−0.00792
(0.00661)(0.00952)
No. of HSR trains originating0.0327***0.0941**
from WH(0.0121)(0.0374)
No. of normal-speed trains0.01130.0285**
from WH(0.0121)(0.0116)
No. of normal-speed trains0.114*0.0991
originating from WH(0.0654)(0.0642)
No. of air flights from WH0.123***0.113***0.120***0.119***
(0.0218)(0.0228)(0.0223)(0.0230)



First-stage F statistics25.318.77
City CharacteristicsYesYesYesYes
Observations275275275275
R-squared0.6780.6890.6760.687

The dependent variable is the logarithm form of the cumulative number of confirmed Covid-19 cases in a city by February 6, 2020. The numbers of trains from Wuhan in 1990 and 1971 are used as IVs for the number of HSR trains from Wuhan in column (3). The numbers of trains originating from Wuhan in 1990 and 1971 are used as IVs for the number of HSR trains originating from Wuhan in column (4). The over-identification tests cannot reject the null that both IVs are valid in columns (3)-(4). Controlled but not listed city covariates include population, GDP, the straight-line distance to Wuhan, the number of medical institutions, the number of medical staff, the geographic coordinates of city centers, the average temperature in January 2020, and the average precipitation in January 2020. WH represents Wuhan. Robust standard errors are in parenthesis. *** p < 0.01, ** p < 0.05, * p < 0.1.

Potential heterogeneities

Several potential heterogeneities are explored, and the empirical results are displayed in Table 6 . This subsection focuses on the potential heterogeneous effects of HSR trains originating from Wuhan. We interact the number of HSR trains originating from Wuhan with a dummy variable to identify possible heterogeneous effects. For example, to study the heterogeneity over GDP, we define a dummy variable that equals one for cities with GDP more than 400 billion RMB in 2019, and equals zero for other cities.9 All regressions are estimated by 2SLS and the interactions of the historical variables with the heterogeneity dummy variable are used as instruments for the interactions of the HSR variables with the heterogeneity dummy variable.
Table 6

Potential heterogeneities.

Variables(1)(2)(3)(4)(5)
No. of HSR trains originating0.111**0.117**0.109**−0.003820.0731*
from WH (HSRWH)(0.0517)(0.0472)(0.0465)(0.0537)(0.0406)
HSRWH×HighGDP−0.00741
(0.0532)
HSRWH×MorePopulation−0.0353
(0.0387)
HSRWH×LongDistance−0.0249
(0.0731)
HSRWH×MoreWHMigrants0.0959***
(0.0362)
HSRWH×Lockdown0.0646***
(0.0246)
City CharacteristicsYesYesYesYesYes
Observations275275275275275
R-squared0.6000.6060.5950.6430.578

The dependent variable is the logarithm form of the cumulative number of Covid-19 cases in a city by February 6, 2020. All results are from 2SLS regressions by using the numbers of trains originating from Wuhan in 1990 and 1971 as IVs. is a dummy variable that equals one for about 25 percent of cities whose GDP is larger than 400 billion RMB. is a dummy that equals one for 25 percent of cities whose population is larger than five million. is a dummy that equals one for 25 percent of cities whose distance to Wuhan is longer than one thousand kilometers. is a dummy that equals one for the top 25 % of cities that received the most travelers from Wuhan between January 1, 2020, and January 7, 2020 (measured by the Baidu migration index). is a dummy variable that equals one for cities (31 percent) that adopt lockdown policies after the Covid-19 outbreak (He et al., 2020). WH represents Wuhan. Robust standard errors are in parenthesis. *** p < 0.01, ** p < 0.05, * p < 0.1.

Potential heterogeneities. The dependent variable is the logarithm form of the cumulative number of Covid-19 cases in a city by February 6, 2020. All results are from 2SLS regressions by using the numbers of trains originating from Wuhan in 1990 and 1971 as IVs. is a dummy variable that equals one for about 25 percent of cities whose GDP is larger than 400 billion RMB. is a dummy that equals one for 25 percent of cities whose population is larger than five million. is a dummy that equals one for 25 percent of cities whose distance to Wuhan is longer than one thousand kilometers. is a dummy that equals one for the top 25 % of cities that received the most travelers from Wuhan between January 1, 2020, and January 7, 2020 (measured by the Baidu migration index). is a dummy variable that equals one for cities (31 percent) that adopt lockdown policies after the Covid-19 outbreak (He et al., 2020). WH represents Wuhan. Robust standard errors are in parenthesis. *** p < 0.01, ** p < 0.05, * p < 0.1. The results in column (1) of Table 6 indicate that there are no significant heterogeneous effect across GDP, i.e., the effect of HSR trains originating from Wuhan is homogeneous for cities with different levels of GDP. As shown in columns (2) and (3), similar homogeneous results are found for the city population and the straight-line distance to Wuhan. In column (4), the Baidu migration index is used to measure the number of Wuhan travelers to a city.10 The empirical results suggest that HSRs’ effect in cities that received more Wuhan passengers between January 1, 2020 and January 7, 2020 is significantly larger than the effect in cities that have fewer visitors from Wuhan. This result is consistent with the expectation that HSR trains promote the spread of Covid-19 by facilitating the inter-city flow of passengers. The estimates in column (5) demonstrate that HSR trains originating from Wuhan have a larger effect in cities that adopt lockdown policies after the outbreak of the Covid-19 pandemic.11 This result suggests that cities that have more HSR trains originating from Wuhan tend to have more Covid-19 cases at the early stage of the pandemic and the local governments of these cities are more likely to adopt lockdown policies to slow the spread of the coronavirus.

Conclusions

This paper investigates the role of HSR trains in spreading Covid-19 from Wuhan to other Chinese cities. The historical number of train frequencies from Wuhan are used as instrumental variables. The empirical results demonstrate one more HSR train from Wuhan to a city each day increases the cumulative number of confirmed Covid-19 cases in this city by about one percent. One more HSR train originating from Wuhan each day, however, can increase the Covid-19 cases by about 10 percent. The empirical analysis suggests other transportation modes, including normal-speed trains and air flights, also contribute to the spread of Covid-19, but their effects (elasticity estimates) tend to be smaller. This paper’s findings suggest transportation infrastructures, especially HSR trains originating from a city where a pandemic broke out, can be important factors promoting the spread of an infectious disease. The estimates in this paper should be considered in making social distancing and reopening policies. In addition to HSR trains, normal-speed trains, and air flights, public transportation can also greatly affect the transmission of Covid-19, especially within cities. This topic merits future research.

CRediT authorship contribution statement

Jindong Pang: Conceptualization, Methodology, Project administration, Writing – original draft. Youle He: Data curation, Software, Formal analysis, Visualization. Shulin Shen: Investigation, Resources, Validation, Writing – review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  36 in total

1.  The Role of High-speed Rail and Air Travel in the Spread of COVID-19 in China.

Authors:  Pengyu Zhu; Yuqing Guo
Journal:  Travel Med Infect Dis       Date:  2021-05-31       Impact factor: 6.211

2.  How can airborne transmission of COVID-19 indoors be minimised?

Authors:  Lidia Morawska; Julian W Tang; William Bahnfleth; Philomena M Bluyssen; Atze Boerstra; Giorgio Buonanno; Junji Cao; Stephanie Dancer; Andres Floto; Francesco Franchimon; Charles Haworth; Jaap Hogeling; Christina Isaxon; Jose L Jimenez; Jarek Kurnitski; Yuguo Li; Marcel Loomans; Guy Marks; Linsey C Marr; Livio Mazzarella; Arsen Krikor Melikov; Shelly Miller; Donald K Milton; William Nazaroff; Peter V Nielsen; Catherine Noakes; Jordan Peccia; Xavier Querol; Chandra Sekhar; Olli Seppänen; Shin-Ichi Tanabe; Raymond Tellier; Kwok Wai Tham; Pawel Wargocki; Aneta Wierzbicka; Maosheng Yao
Journal:  Environ Int       Date:  2020-05-27       Impact factor: 9.621

3.  Human mobility restrictions and the spread of the Novel Coronavirus (2019-nCoV) in China.

Authors:  Hanming Fang; Long Wang; Yang Yang
Journal:  J Public Econ       Date:  2020-09-08

4.  JUE Insight: College student travel contributed to local COVID-19 spread.

Authors:  Daniel Mangrum; Paul Niekamp
Journal:  J Urban Econ       Date:  2020-12-04

5.  Assessing regional risk of COVID-19 infection from Wuhan via high-speed rail.

Authors:  Tao Li; Lili Rong; Anming Zhang
Journal:  Transp Policy (Oxf)       Date:  2021-04-14

6.  JUE insight: Migration, transportation infrastructure, and the spatial transmission of COVID-19 in China.

Authors:  Bingjing Li; Lin Ma
Journal:  J Urban Econ       Date:  2021-04-15

7.  Spatial transmission of COVID-19 via public and private transportation in China.

Authors:  Ruizhi Zheng; Yu Xu; Weiqing Wang; Guang Ning; Yufang Bi
Journal:  Travel Med Infect Dis       Date:  2020-03-14       Impact factor: 6.211

8.  Impacts of social and economic factors on the transmission of coronavirus disease 2019 (COVID-19) in China.

Authors:  Yun Qiu; Xi Chen; Wei Shi
Journal:  J Popul Econ       Date:  2020-05-09

9.  The effect of human mobility and control measures on the COVID-19 epidemic in China.

Authors:  Moritz U G Kraemer; Chia-Hung Yang; Bernardo Gutierrez; Chieh-Hsi Wu; Brennan Klein; David M Pigott; Louis du Plessis; Nuno R Faria; Ruoran Li; William P Hanage; John S Brownstein; Maylis Layan; Alessandro Vespignani; Huaiyu Tian; Christopher Dye; Oliver G Pybus; Samuel V Scarpino
Journal:  Science       Date:  2020-03-25       Impact factor: 47.728

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.