| Literature DB >> 32817693 |
Xuwen Gao1, Xinjie Shi1, Hongdong Guo1, Yehong Liu1.
Abstract
Drawing on a recent online survey combined with city-level data, this paper examines the impact of the COVID-19 on consumers' online food purchase behavior in the short term. To address the potential endogeneity issues, we adopt an instrumental variable (IV) strategy, using the distance from the surveyed city to Wuhan as the instrumental variable. We show that our IV method is effective in minimizing potential bias. It is found that the share of confirmed COVID-19 cases increases the possibility of consumers purchasing food online. This is more likely to be the case for young people having a lower perceived risk of online purchases and living in large cities. Despite some limitations, this paper has policy implications for China and other countries that have been influenced by the COVID-19 epidemic. Specifically, government support and regulation should focus on (i) ensuring the safety of food sold on the internet, (ii) protecting the carrier from becoming infected, and (iii) providing financial support to the poor since they may have difficulties in obtaining access to food living in small cities. Moreover, how to help those who are unable to purchase food online because of their technical skills (e.g., the elderly who are not familiar with smart phones or the internet) also deserves more attention for the government and the public.Entities:
Mesh:
Year: 2020 PMID: 32817693 PMCID: PMC7440641 DOI: 10.1371/journal.pone.0237900
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1The geographical locations of the respondents.
Descriptive statistics.
| (1) | (2) | (3) | (4) | (5) | |
|---|---|---|---|---|---|
| N | mean | sd | min | max | |
| Male | 770 | 0.497 | 0.500 | 0 | 1 |
| Head | 770 | 0.549 | 0.498 | 0 | 1 |
| Household size | 770 | 3.914 | 1.342 | 1 | 10 |
| Number of old (>60) | 770 | 0.648 | 0.855 | 0 | 4 |
| Number of children (<18) | 770 | 0.712 | 0.763 | 0 | 4 |
| Dummy for college graduates | 770 | 0.578 | 0.494 | 0 | 1 |
| Dummy for college | 770 | 0.323 | 0.468 | 0 | 1 |
| Dummy for under college | 770 | 0.0987 | 0.298 | 0 | 1 |
| Age: Under 35 years old | 770 | 0.457 | 0.498 | 0 | 1 |
| Age: 36 to 45 years old | 770 | 0.319 | 0.467 | 0 | 1 |
| Age: Above 46 years old | 770 | 0.223 | 0.417 | 0 | 1 |
| HH annual income: under 100,000 RMB (Yuan) | 770 | 0.291 | 0.454 | 0 | 1 |
| HH annual income: 100,000 to 200,000 RMB (Yuan) | 770 | 0.290 | 0.454 | 0 | 1 |
| HH annual income: Above 200,000 RMB (Yuan) | 770 | 0.419 | 0.494 | 0 | 1 |
| Food expenditure from e-commerce/Food expenditure > 0.5 | 770 | 0.400 | 0.490 | 0 | 1 |
| Population (Unit: 10,000) | 150 | 508.2 | 391.0 | 14.80 | 3,391 |
| Number of COVID-19 cases | 150 | 107 | 295 | 0 | 2,791 |
| Share of coronavirus cases (every 10,000) | 150 | 0.206 | 0.525 | 0 | 3.751 |
| Distance to Wuhan city (km) | 150 | 960.892 | 716.980 | 56.564 | 3,602.756 |
Effect of COVID-19 on e-commerce.
| Online shopping after the pandemic | ||||||
|---|---|---|---|---|---|---|
| (1) | (2) | (3) | (4) | (5) | (6) | |
| OLS | OLS | OLS | 2SLS | 2SLS | 2SLS | |
| Share of COVID-19 cases | -0.0784 | -0.0722 | -0.222 | 0.0535 | 0.134 | 0.704 |
| (0.0597) | (0.0604) | (0.149) | (0.214) | (0.302) | (0.709) | |
| [0.220] | [0.255] | [0.285] | [0.816] | [0.681] | [0.260] | |
| Control variables | Yes | Yes | Yes | Yes | Yes | Yes |
| Regional fixed effects | No | Yes | No | No | Yes | No |
| Provincial fixed effects | No | No | Yes | No | No | Yes |
| First-stage F-stat | - | - | - | 100.3 | 62.91 | 67.91 |
| Observations | 770 | 770 | 770 | 770 | 770 | 770 |
The dependent variable is a dummy for online shopping after the pandemic. The share of COVID-19 cases is calculated as the number of confirmed COVID-19 cases on the survey day/city population. The instrumental variable for the share of COVID-19 cases is the distance between the city and Wuhan, which is transformed using the log function. Control variables include gender, age, education levels, income, dummy for household head, household size, share of children and share of the elderly. The region refers to the east, center and west. Robust standard errors clustered at the city level are reported in parentheses. P-values from wild bootstrap clustering are reported in brackets. We use Rademacher weights and 1000 replications. *** significant at the 1% level; significant at the 5% level; * significant at the 10% level.
Effect of distance to Wuhan on cities’ share of COVID-19 cases.
| Share of COVID-19 cases | ||||
|---|---|---|---|---|
| (1) | (2) | (3) | (4) | |
| Distance | -0.389 | -0.506 | -0.520 | -0.582** |
| (0.127) | (0.149) | (0.167) | (0.230) | |
| [0.0000] | [0.0000] | [0.0000] | [0.0000] | |
| Control variables | No | Yes | Yes | Yes |
| Regional fixed effects | No | No | Yes | No |
| Provincial fixed effects | No | No | No | Yes |
| Observations | 150 | 150 | 150 | 150 |
The dependent variable is calculated as the number of confirmed COVID-19 cases on the survey day/city population. The distance is the distance between the city and Wuhan, which is transformed using the log function. Control variables include log(population) and city administrative level. Region refers to east, center and west. Robust standard errors clustered at the city level are reported in parentheses. P-values from wild bootstrap clustering are reported in brackets. We use Rademacher weights and 1000 replications.
*** significant at the 1% level; significant at the 5% level; * significant at the 10% level.
Effect of COVID-19 on online-food purchase: By type.
| (1) | (2) | (3) | (4) | (5) | |
|---|---|---|---|---|---|
| Fruit | Vegetable | Meat | Seafood | Grain | |
| OLS | OLS | OLS | 2SLS | 2SLS | |
| Share of COVID-19 cases | 0.0109 | 0.390 | 0.409 | 0.253 | -0.0605 |
| (0.270) | (0.315) | (0.243) | (0.186) | (0.169) | |
| [0.970] | [0.199] | [0.056] | [0.139] | [0.726] | |
| Control variables | Yes | Yes | Yes | Yes | Yes |
| Regional fixed effects | Yes | Yes | Yes | Yes | Yes |
| First-stage F-stat | 62.91 | 62.91 | 62.91 | 62.91 | 62.91 |
| Observations | 770 | 770 | 770 | 770 | 770 |
The dependent variable is a dummy for online shopping by food items after the pandemic. The share of COVID-19 cases is calculated as the number of confirmed COVID-19 cases on the survey day/city population. The instrumental variable for the share of COVID-19 cases is the distance between the city and Wuhan, which is transformed using the log function. Control variables include gender, age, education levels, income, a dummy for household head, household size, the share of children and share of the elderly. The region refers to the east, center and west. Robust standard errors clustered at the city level are reported in parentheses. P-values from wild bootstrap clustering are reported in brackets. We use Rademacher weights and 1000 replications. *** significant at the 1% level; significant at the 5% level
* significant at the 10% level.
Heterogeneous effect of COVID-19 on e-commerce: By city administrative level.
| Online shopping after the pandemic | |||
|---|---|---|---|
| (1) | (2) | (3) | |
| 2SLS | 2SLS | 2SLS | |
| Above prefecture-level city * share | 1.062 | 1.224 | 1.612** |
| (0.308) | (0.352) | (0.741) | |
| 0.00100 | 0.00100 | 0.0260 | |
| Prefecture-level city * share | 0.425 | 0.650 | 1.127 |
| (0.344) | (0.436) | (0.730) | |
| 0.182 | 0.0880 | 0.0800 | |
| Below prefecture-level city * share | -0.297 | -0.172 | 0.352 |
| (0.247) | (0.294) | (0.669) | |
| 0.148 | 0.578 | 0.563 | |
| Control variables | Yes | Yes | Yes |
| Regional fixed effects | No | Yes | No |
| Provincial fixed effects | No | No | Yes |
| First-stage F-stat | 28.91 | 17.52 | 20.38 |
| Observations | 770 | 770 | 770 |
The dependent variable is a dummy for online shopping after the pandemic. The above prefecture-level cities include sub-provincial and provincial cities. Below prefecture-level cities includes counties and below. The share of COVID-19 cases is calculated as the number of confirmed COVID-19 cases on the survey day/city population. The instrumental variable for the share of COVID-19 cases is the distance between the city and Wuhan, which is transformed using the log function. Control variables include gender, age, education levels, income, dummy for the household head, household size, share of children and share of elderly. The region refers to the east, center and west. Robust standard errors clustered at the city level are reported in parentheses. P-values from wild bootstrap clustering are reported in brackets. We use Rademacher weights and 1000 replications.
*** significant at the 1% level; significant at the 5% level
* significant at the 10% level.
Reason for the heterogeneous effect of COVID-19 on e-commerce: By city level.
| Online shopping after the pandemic | |||
|---|---|---|---|
| (1) | (2) | (3) | |
| 2SLS | 2SLS | 2SLS | |
| Above prefecture-level city * share | 3.880 | 4.300 | 5.733** |
| (0.545) | (0.625) | (1.317) | |
| [0.000] | [0.000] | [0.016] | |
| Prefecture-level city * share | 4.506 | 5.166 | 6.084 |
| (1.019) | (1.147) | (1.499) | |
| [0.000] | [0.000] | [0.006] | |
| Below prefecture-level city * share | 3.456 | 3.804 | 5.102 |
| (0.734) | (0.796) | (1.250) | |
| [0.000] | [0.000] | [0.001] | |
| Above prefecture-level city * share * unable to deliver | -4.429 | -4.553 | -4.509 |
| (0.326) | (0.355) | (0.407) | |
| [0.000] | [0.000] | [0.000] | |
| Prefecture-level city * share * unable to deliver | -5.207 | -5.389 | -5.423 |
| (0.957) | (1.014) | (0.913) | |
| [0.000] | [0.000] | [0.000] | |
| Below prefecture-level city * share * unable to deliver | -3.954 | -3.975 | -4.058 |
| (0.656) | (0.673) | (0.678) | |
| [0.000] | [0.000] | [0.000] | |
| Control variables | Yes | Yes | Yes |
| Regional fixed effects | No | Yes | No |
| Provincial fixed effects | No | No | Yes |
| Observations | 770 | 770 | 770 |
The dependent variable is a dummy for online shopping after the pandemic. The above prefecture-level cities include sub-provincial and provincial cities. Below prefecture-level cities includes counties and below. The share of COVID-19 cases is calculated as the number of confirmed COVID-19 cases on the survey day/city population. The unable to deliver variable is a dummy for living out of delivery range. The instrumental variable for the share of COVID-19 cases is the distance between the city and Wuhan, which is transformed using the log function. Control variables include gender, age, education levels, income, dummy for household head, household size, share of children and share of elderly. The region refers to the east, center and west. Robust standard errors clustered at the city level are reported in parentheses. P-values from wild bootstrap clustering are reported in brackets. We use Rademacher weights and 1000 replications.
*** significant at the 1% level; significant at the 5% level; * significant at the 10% level.
Heterogeneous effects of COVID-19 on e-commerce: By perceived risk.
| Online shopping after the pandemic | |||
|---|---|---|---|
| (1) | (2) | (3) | |
| 2SLS | 2SLS | 2SLS | |
| High risk* share | -1.303 | -1.197** | -0.593 |
| (0.429) | (0.538) | (0.932) | |
| [0.00800] | [0.0210] | [0.445] | |
| Moderate risk * share | -0.359 | -0.261 | 0.360 |
| (0.258) | (0.316) | (0.749) | |
| [0.121] | [0.345] | [0.582] | |
| Low risk * share | 0.606** | 0.702** | 1.258* |
| (0.297) | (0.386) | (0.825) | |
| [0.0170] | [0.0300] | [0.0870] | |
| Control variables | Yes | Yes | Yes |
| Regional fixed effects | No | Yes | No |
| Provincial fixed effects | No | No | Yes |
| First-stage F-stat | 33.93 | 20.83 | 22.21 |
| Observations | 770 | 770 | 770 |
The dependent variable is a dummy for online shopping after the pandemic. The perceived risk level is operationalized by asking, what do you think about the risk of becoming infected through online shopping? The share of COVID-19 cases is calculated as the number of confirmed COVID-19 cases on the survey day/city population. The instrumental variable for the share of COVID-19 cases is the distance between the city and Wuhan, which is transformed using the log function. Control variables include gender, age, education levels, income, dummy for household head, household size, share of children and share of elderly. The region refers to the east, center and west. Robust standard errors clustered at the city level are reported in parentheses. P-values from wild bootstrap clustering are reported in brackets. We use Rademacher weights and 1000 replications.
*** significant at the 1% level; significant at the 5% level; * significant at the 10% level.
Heterogeneous effects of COVID-19 on e-commerce: By age of the household head.
| Online shopping after the pandemic | |||
|---|---|---|---|
| (1) | (2) | (3) | |
| 2SLS | 2SLS | 2SLS | |
| Share of coronavirus cases | 0.010 | 0.097 | 0.824 |
| (0.221) | (0.301) | (0.772) | |
| 0.969 | 0.748 | 0.227 | |
| Share of coronavirus cases | 0.547 | 0.469 | 0.512 |
| (0.211) | (0.206) | (0.223) | |
| [0.008] | [0.021] | [0.018] | |
| Control variables | Yes | Yes | Yes |
| Regional fixed effects | No | Yes | No |
| Provincial fixed effects | No | No | Yes |
| First-stage F-stat | 50.84 | 32.82 | 33.46 |
| Observations | 770 | 770 | 770 |
| Test Share of coronavirus cases + Share of coronavirus cases | 0.557* | 0.566* | 1.336* |
| [0.051] | [0.076] | [0.077] | |
The dependent variable is a dummy for online shopping after the pandemic. Household head age below 35 is a dummy for household head age less than 35. Above prefecture-level cities include sub-provincial and provincial cities. Below prefecture-level cities include counties and below. The share of COVID-19 cases is calculated as the number of confirmed COVID-19 cases on the survey day/city population. The instrumental variable for the share of COVID-19 cases is the distance between the city and Wuhan, which is transformed using the log function. Control variables include gender, age, education levels, income, household size, share of children and share of the elderly. The region refers to the east, center and west. Robust standard errors clustered at the city level are reported in parentheses. P-values from wild bootstrap clustering are reported in brackets. We use Rademacher weights and 1000 replications.
*** significant at the 1% level
** significant at the 5% level
* significant at the 10% level.