Literature DB >> 32289090

Investigating the cases of novel coronavirus disease (COVID-19) in China using dynamic statistical techniques.

Samuel Asumadu Sarkodie1, Phebe Asantewaa Owusu1.   

Abstract

The initial investigation by local hospital attributed the outbreak of the novel coronavirus disease (COVID-19) to pneumonia with unknown cause that appeared like the 2003 severe acute respiratory syndrome (SARS). The World Health Organization declared COVID-19 as public health emergency after it spread outside China to several countries. Thus, an assessment of the novel coronavirus disease (COVID-19) with novel estimation approaches is essential to the global debate. This study is the first to develop both time series and panel data models to construct conceptual tools that examine the nexus between death from COVID-19 and confirmed cases. We collected daily data on four health indicators namely deaths, confirmed cases, suspected cases, and recovered cases across 31 Provinces/States in China. Due to the complexities of the COVID-19, we investigated the unobserved factors including environmental exposures accounting for the spread of the disease through human-to-human transmission. We used estimation methods capable of controlling for cross-sectional dependence, endogeneity, and unobserved heterogeneity. We predicted the impulse-response between confirmed cases of COVID-19 and COVID-19-attributable deaths. Our study revealed that the effect of confirmed cases on the novel coronavirus attributable deaths is heterogeneous across Provinces/States in China. We found a linear relationship between COVID-19 attributable deaths and confirmed cases whereas a nonlinear relationship was confirmed for the nexus between recovery cases and confirmed cases. The empirical evidence revealed that an increase in confirmed cases by 1% increases coronavirus attributable deaths by ~0.10%-~1.71% (95% CI). Our empirical results confirmed the presence of unobserved heterogeneity and common factors that facilitates the novel coronavirus attributable deaths caused by increased levels of confirmed cases. Yet, the role of such a medium that facilitates the transmission of COVID-19 remains unclear. We highlight safety precaution and preventive measures to circumvent the human-to-human transmission.
© 2020 The Author(s).

Entities:  

Keywords:  COVID-19; Cases of novel coronavirus; China; Econometrics; Economics; Environmental economics; Environmental science; Health economics; Modelling COVID-19; Novel coronavirus disease; Public health

Year:  2020        PMID: 32289090      PMCID: PMC7128585          DOI: 10.1016/j.heliyon.2020.e03747

Source DB:  PubMed          Journal:  Heliyon        ISSN: 2405-8440


Introduction

On 31 December 2020, the World Health Organization (WHO) received information on an outbreak with unknown aetiology detected in a seafood market located in the city of Wuhan, Hubei Province, China. The 2019 novel coronavirus was detected in 44 case-patients with pneumonia with unknown cause between 31 December 2019 to 3 January 2020 by the Chinese authorities [1]. On 11 February 2020, WHO named the novel coronavirus disease as COVID-19 and declared the infectious disease as a public health emergency, after spreading from China to other 24 countries [2]. As of 20 February 2020 (04:00 GMT), 76,498 cases had been reported globally including from China (75,245), “Diamond Princess” cruise ship and others (634), South Korea (104), Japan (94), Singapore (84), Hong Kong (67), Thailand (35), Taiwan (24), Malaysia (22), Germany (16), Vietnam (16), Australia (15), the US (15), France (12), Macau (10), United Arab Emirates (9), UK (9), Canada (8), Italy (3), Philippines (3), India (3), Iran (2), Russia (2), Spain (2), Nepal (1), Cambodia (1), Belgium (1), Finland (1), Sweden (1), Egypt (1), and Sri Lanka (1). Following the emergence of COVID-19, several studies have examined the transmission dynamics of the infectious disease [3]. While clinical, epidemiological, laboratory, and radiological features of COVID-19 [4] have been reported, phenomenological models using statistical methods have been used to examine epidemiological data [5, 6]. The COVID-19 is reported to have spread through human-to-human transmission [3]. However, it might be possible that other unobserved environmental exposures may have facilitated the rate the disease spreads through human-to-human transmission. Earlier studies based on phenomenological models fail to capture unobserved factors and heterogeneity, which are useful in understanding cases with limited epidemiological data. The complexities of the unobserved factors accounting for COVID-19 underpin this study. Using publicly available data for 31 Provinces/States across China, this study is the first to develop both time series and panel data models to examine the nexus between the novel coronavirus attributable deaths and confirmed cases of COVID-19. We use novel estimation methods capable of accounting for Provinces/States-specific fixed-effects and unobserved heterogeneity of the human-to-human transmission.

Materials & method

Data description

Data were collated on 20 February 2020 from the Center for Systems Science and Engineering at John Hopkins University1. The data spans from 21 January 2020 to 20 February 2020 and were preprocessed from wide to long, a replica of panel data and time series setting. The data consist of four health indicators such as deaths, confirmed cases, suspected cases, and recovered cases across 31 Provinces/States in China namely Anhui, Beijing, Chongqing, Fujian, Gansu, Guangdong, Guangxi, Guizhou, Hainan, Hebei, Heilongjiang, Henan, Hubei, Hunan, Inner Mongolia, Jiangsu, Jiangxi, Liaoning, Jilin, Ningxia, Qinghai, Shaanxi, Shandong, Shanxi, Shanghai, Tianjin, Tibet, Sichuan, Zhejiang, Yunnan and Xinjiang. Our intial observation of data available and presented in Figure 1 shows a widespread of case-patients in Hubei Province compared to other locations (Figure 2). This validates the exact location, the city of Wuhan, where the outbreak was first reported. We observe a daily average of about 1000 confirmed cases, 60 deaths and 161 recovered cases.
Figure 1

Descriptive statistics of COVID-19 across Provinces/States in China.

Figure 2

Provinces/States distribution of COVID-19 across China (a) deaths (b) Confirmed cases (c) Recovery cases (d) Suspected cases.

Descriptive statistics of COVID-19 across Provinces/States in China. Provinces/States distribution of COVID-19 across China (a) deaths (b) Confirmed cases (c) Recovery cases (d) Suspected cases. To use appropriate estimation methods, we examined the characteristics of the data series. We assessed whether the relationship between the novel coronavirus attributable deaths, recovery cases and confirmed cases of COVID-19 was linear or nonlinear. The plot presented in Figure 3 shows that the nexus between deaths and confirmed cases is perfectly linear, with a predictive power (R-squared) of almost 100% whereas the relationship between recovery cases and confirmed cases is nonlinear, with an R-squared of ~97%.
Figure 3

Relationship between (a) death and confirmed cases (b) recovery cases and confirmed cases.

Relationship between (a) death and confirmed cases (b) recovery cases and confirmed cases.

Model estimation

We developed 7 models comprising of 5 panel data setting and 2 time series. The selection of estimation methods was based on real-time reporting of COVID-19 used as a priori expectation. By confirming a perfectly linear relationship between deaths and confirmed cases, our models were constructed on such tangent. Model 1 was developed using the fixed-effects linear model with first-order autoregressive [AR(1)] disturbances to accommodate for the unevenly spaced data across China, rendering the panel setting unbalanced. Model 2 was estimated based on a fixed-effects model with Driscoll-Kraay standard errors to account for possible heteroskedasticity, autocorrelation and cross-sectional dependence amid missing data and unbalanced panel setting [7]. Model 3 was estimated using a fixed-effects model with modified Wald (MWALD) statistic to examine heteroskedasticity in the residuals. Our model of interest with fixed-effects can be expressed as [8]:Where denotes logarithmic transformation to give the variable a constant variance, denotes the novel coronavirus attributable deaths, represents confirmed cases, and are the constant and coefficient to be estimated, is the Provinces/States-specific fixed-effects and is the independent and identically distributed error term across individual Provinces/States in time . Models 4 and 5 were estimated to account for heterogeneous slopes, after the parameters of Model 3 violated the normality assumption, hence, confirming the presence of heteroskedasticity. The common correlated effects mean group estimation can be specified as [9]:Where and . denotes Provinces/States-specific slopes on confirmed cases and has unobservables and error term , denotes the standard group fixed-effects that account for time-invariant heterogeneity across Provinces/States. represents the unobserved common factor, , and are the white noise. For brevity, the time series models follow a standard equation expressed as:The specification of Eqn. (3) follows the dynamic simulations of Autoregressive Distributed Lag model expounded in Ref. [[11], [12]].

Results and discussion

The parameter estimation of the relationship between novel coronavirus attributable deaths and confirmed cases of COVID-19 is presented in Table 1. The estimated models are statistically significant at 5% level (95% CI) and a corresponding predictive power (R-squared) between 68%-100%. The modified wald statistic (MWALD) of Model 3 rejects the null hypothesis of homoskedasticity. Meaning that the effect of confirmed cases on the novel coronavirus attributable deaths is heterogeneous across Provinces/States in China. In both panel and time series models presented, the lagged-dependent variable (LDV) of coronavirus attributable deaths (lnDeathst-1) is positive and statistically significant at 1% level except Model 5 which shows a significant (99% CI) negative coefficient. LDV was introduced in the models to control for omitted variable bias and account for the inertia effects of the reported coronavirus attributable deaths. The positive coefficient of lnDeathst-1 in almost all the models reveals that the historical factors of coronavirus attributable deaths are persistent and likely to affect future reported deaths. On the contrary, when unobserved common factors affecting coronavirus attributable deaths are controlled in Model 5, the coefficient on LDV turns negative. Meaning that the inertia effect of historical deaths is curtailed, hence, reducing the impact of confirmed cases.
Table 1

Parameter estimation of the nexus between novel coronavirus attributable deaths and confirmed cases of COVID-19.

VariableModel 1aModel 2aModel 3aModel 4aModel 5aModel 6bModel 7b
lnDeathst-10.8487∗∗∗ [0.0274]0.8617∗∗∗ [0.0381]0.8617∗∗∗ [0.0230]0.8054∗∗∗ [0.2906]-0.3121∗∗∗ [0.0703]0.8080∗∗∗ [0.0271]
lnConfirmedCases0.1091∗∗∗ [0.0273]0.0961∗∗∗ [0.0346]0.0961∗∗∗ [0.0209]1.7075∗∗ [0.6739]1.0252∗∗∗ [0.3378]0.9149∗∗∗ [0.0384]0.1329∗∗∗ [0.0166]
constant-0.4061∗∗∗ [0.1260]-0.3425∗∗ [0.1616]-0.3425∗∗ [0.1054]-6.1673 [4.8809]-2.820∗∗∗ [0.3843]
Prob > F0.0000∗∗∗0.0000∗∗∗0.0000∗∗∗0.0113∗∗0.0000∗∗∗0.0000∗∗∗0.0000∗∗∗
RMSE0.16990.16000.05460.0877
R-squared0.98770.92970.98650.68000.80910.9998
Obs3193403403613402928
No of groups2121212121
F-test0.0032∗∗∗0.0007∗∗∗
MWALD0.0000∗∗∗
CD test0.7075

Notes: Where [.] is the standard error; a denotes model estimation based on panel data setting; b represents modelling based on time series techniques; ∗∗∗,∗∗ represent statistical significance at 1% and 5% level. lnDeathst-1 is the lagged dependent variable, RMSE is the Root Mean Square Error, R-squared explains the predictive power of the estimated model, Obs represents observations. MWALD is the modified wald statistic and CD test examines the independence of the residuals.

Parameter estimation of the nexus between novel coronavirus attributable deaths and confirmed cases of COVID-19. Notes: Where [.] is the standard error; a denotes model estimation based on panel data setting; b represents modelling based on time series techniques; ∗∗∗,∗∗ represent statistical significance at 1% and 5% level. lnDeathst-1 is the lagged dependent variable, RMSE is the Root Mean Square Error, R-squared explains the predictive power of the estimated model, Obs represents observations. MWALD is the modified wald statistic and CD test examines the independence of the residuals. The coefficient on the estimated confirmed cases in Table 1 is positive and statistically significant (95% CI) in both estimated panel and time series models. The empirical evidence reveals that an increase in confirmed cases by 1% increases coronavirus attributable deaths by ~0.10%~1.71% (95% CI). Using the dynamic ARDL simulations estimation technique [[11], [12]], we predicted the counterfactual change in COVID-19 attributable deaths in case of positive or negative shocks in confirmed cases. The plot presented in Figure 4 reveals that a positive shock (1%) in confirmed COVID-19-case-patients will increase attributable deaths from 0.2% to around 0.8% over the horizon. On the contrary, a 1% negative shock in confirmed cases of COVID-19 will decline death rates from 0.1% to 0.6%.
Figure 4

Impulse-Response of confirmed cases of COVID-19 attributable deaths. Note: The light blue spikes represent the 95% confidence interval.

Impulse-Response of confirmed cases of COVID-19 attributable deaths. Note: The light blue spikes represent the 95% confidence interval. Several novel protocols for clinical and epidemiologic investigations have been outlined to ascertain the clinical features, the pattern of transmission, severity and risk factors of the novel coronavirus disease [10]. Our estimated results confirm the presence of unobserved heterogeneity and common factors that facilitates the novel coronavirus attributable deaths caused by increased levels of confirmed cases. However, the role of the unobserved heterogeneity and common factors that facilitate the transmission of COVID-19 remains unclear. This corroborates the findings of the Situation Report – 33 released by WHO. According to the report [10], the role of environmental risk factors in the COVID-19 transmission process is uncertain. However, confirms the human-to-human transmission through community spread, household, health facilities and environmental surfaces [3, 10]. In such a transmission process, our study reveals a perfectly linear relationship between confirmed cases and novel coronavirus attributable deaths, as such, safety precaution and preventive measures are required to circumvent human-to-human transmission.

Conclusions

Our study presented is based on phenomenological models but not a clinical procedure, hence, care should be taken in the interpretation of the outcome. We demonstrated that the effect of confirmed cases on COVID-19 attributable-deaths is perfectly linear whereas the impact of confirmed cases on recovery cases follows a nonlinear path. Our study suffers from the limitation of early case investigation and historical data, hence, our estimation results may change at the latter stage of the novel coronavirus disease (COVID-19). In view of this, we utilized a battery of estimation approach to increase the sensitivity and robustness of the models.

Declarations

Author contribution statement

S.A. Sarkodie: Conceived and designed the experiments; Analyzed and interpreted the data; Wrote the paper. P.A. Owusu: Contributed reagents, materials, analysis tools or data; Wrote the paper.

Funding statement

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Competing interest statement

The authors declare no conflict of interest.

Additional information

No additional information is available for this paper.
  1 in total

1.  Environmental sustainability assessment using dynamic Autoregressive-Distributed Lag simulations-Nexus between greenhouse gas emissions, biomass energy, food and economic growth.

Authors:  Samuel Asumadu Sarkodie; Vladimir Strezov; Haftom Weldekidan; Ernest Frimpong Asamoah; Phebe Asantewaa Owusu; Israel Nutifafa Yawo Doyi
Journal:  Sci Total Environ       Date:  2019-03-01       Impact factor: 7.963

  1 in total
  31 in total

1.  Impact of COVID-19 pandemic on waste management.

Authors:  Samuel Asumadu Sarkodie; Phebe Asantewaa Owusu
Journal:  Environ Dev Sustain       Date:  2020-08-26       Impact factor: 3.219

2.  Estimating the Prevalence and Mortality of Coronavirus Disease 2019 (COVID-19) in the USA, the UK, Russia, and India.

Authors:  Yongbin Wang; Chunjie Xu; Sanqiao Yao; Yingzheng Zhao; Yuchun Li; Lei Wang; Xiangmei Zhao
Journal:  Infect Drug Resist       Date:  2020-09-29       Impact factor: 4.003

3.  Environmental determinants of COVID-19 transmission across a wide climatic gradient in Chile.

Authors:  Francisco Correa-Araneda; Alfredo Ulloa-Yáñez; Daniela Núñez; Luz Boyero; Alan M Tonin; Aydeé Cornejo; Mauricio A Urbina; María Elisa Díaz; Guillermo Figueroa-Muñoz; Carlos Esse
Journal:  Sci Rep       Date:  2021-05-10       Impact factor: 4.379

4.  Modeling and Forecasting the COVID-19 Temporal Spread in Greece: An Exploratory Approach based on Complex Network Defined Splines.

Authors:  Konstantinos Demertzis; Dimitrios Tsiotas; Lykourgos Magafas
Journal:  Int J Environ Res Public Health       Date:  2020-06-30       Impact factor: 3.390

5.  Studying the trend of the novel coronavirus series in Mauritius and its implications.

Authors:  Naushad Mamode Khan; Ashwinee Devi Soobhug; Maleika Heenaye-Mamode Khan
Journal:  PLoS One       Date:  2020-07-10       Impact factor: 3.240

6.  Prediction model for the spread of the COVID-19 outbreak in the global environment.

Authors:  Ron S Hirschprung; Chen Hajaj
Journal:  Heliyon       Date:  2021-06-29

7.  Weather indicators and improving air quality in association with COVID-19 pandemic in India.

Authors:  Rabin Chakrabortty; Subodh Chandra Pal; Manoranjan Ghosh; Alireza Arabameri; Asish Saha; Paramita Roy; Biswajeet Pradhan; Ayan Mondal; Phuong Thao Thi Ngo; Indrajit Chowdhuri; Ali P Yunus; Mehebub Sahana; Sadhan Malik; Biswajit Das
Journal:  Soft comput       Date:  2021-07-13       Impact factor: 3.732

8.  How COVID-19 pandemic may hamper sustainable economic development.

Authors:  Maruf Yakubu Ahmed; Samuel Asumadu Sarkodie
Journal:  J Public Aff       Date:  2021-03-25

9.  Estimation of time-varying reproduction numbers underlying epidemiological processes: A new statistical tool for the COVID-19 pandemic.

Authors:  Hyokyoung G Hong; Yi Li
Journal:  PLoS One       Date:  2020-07-21       Impact factor: 3.240

Review 10.  Microstructure, pathophysiology, and potential therapeutics of COVID-19: A comprehensive review.

Authors:  Satarudra Prakash Singh; Manisha Pritam; Brijesh Pandey; Thakur Prasad Yadav
Journal:  J Med Virol       Date:  2020-07-15       Impact factor: 20.693

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.