Literature DB >> 32305271

COVID-19 virus outbreak forecasting of registered and recovered cases after sixty day lockdown in Italy: A data driven model approach.

Nalini Chintalapudi1, Gopi Battineni2, Francesco Amenta3.   

Abstract

BACKGROUND: Till 31 March 2020, 105,792 COVID-19 cases were confirmed in Italy including 15,726 deaths which explains how worst the epidemic has affected the country. After the announcement of lockdown in Italy on 9 March 2020, situation was becoming stable since last days of March. In view of this, it is important to forecast the COVID-19 evaluation of Italy condition and the possible effects, if this lock down could continue for another 60 days.
METHODS: COVID-19 infected patient data has extracted from the Italian Health Ministry website includes registered and recovered cases from mid February to end March. Adoption of seasonal ARIMA forecasting package with R statistical model was done.
RESULTS: Predictions were done with 93.75% of accuracy for registered case models and 84.4% of accuracy for recovered case models. The forecasting of infected patients could be reach the value of 182,757, and recovered cases could be registered value of 81,635 at end of May.
CONCLUSIONS: This study highlights the importance of country lockdown and self isolation in control the disease transmissibility among Italian population through data driven model analysis. Our findings suggest that nearly 35% decrement of registered cases and 66% growth of recovered cases will be possible.
Copyright © 2020. Published by Elsevier B.V.

Entities:  

Keywords:  ARIMA; COVID-19 outbreak; Forecasting; Italian population; Lock down

Mesh:

Year:  2020        PMID: 32305271      PMCID: PMC7152918          DOI: 10.1016/j.jmii.2020.04.004

Source DB:  PubMed          Journal:  J Microbiol Immunol Infect        ISSN: 1684-1182            Impact factor:   4.399


Introduction

In the last weeks of 2019, when the world was ready to welcome 2020, many local hospitals in Wuhan, China, were reported unusual number of patients who comes with severe pneumonia without knowing cause and not responds to any kind of vaccine or medicine. Besides, these cases were further increased because of human to human transmission, and doctors confirmed that this unknown disease had similar epidemic of Severe Acute Respiratory Syndrome (SARS) in 2002 and the agent causing this disease was recognized as a corona virus. Sooner or later World Health Organization (WHO) named this virus as novel corona virus(nCOV-19) or COVID-19. By early January 2020, about 59 suspected cases were identified in province of Wuhan. At the beginning, the disease started as a local epidemic of China, but subsequently it quickly escalated all over the world, being transmitted by international travelers. At the present, there is no any scientific evidence for where it has originated. Currently it is confirmed as a global pandemic and dozens of western countries are alarmed by this severe outbreak of corona virus. Today (31 March 2020), 854,307 COVID-19 confirmed cases are including of 42,016 deaths were reported worldwide. More than 190 countries had been affected, with major outbreaks in the United States (US), Italy, Spain, China, Iran, France, and others. We can imagine the gravity of this pandemic situation by looking into these facts. In Italy, death toll from corona virus jumped over 15,000 deaths since end of February and is still ongoing, whereas the number of infected cases from USA surpass about more than half Million population. Due to easy spreading of COVID-19, most national governments including Italy announced lock down and people are not allowed to come out from their homes. As of this, nearly 3.5 Billion global population went into self isolation. The World Health Organization (WHO) confirms that the incubation period (i.e., time elapse between exposure of pathogenic organism to symptom first appearance) of COVID-19 outbreak is 14 days. The basic reproduction number “R naught” or R0 is a contagiousness indicator or infectious transmissibility of parasite agents. In epidemic sciences and health literature R0 is highly encountered to understand a slow outbreak of disease. For instance, if R0 is equal to one, this means that average person who got disease could transmit over single individual. According to WHO, R0 for COVID-19 is confirmed around 2.0–2.5. Recent modeling of R0 from Italy confirmed by Lombardy researchers at early outbreak in between 2.76 to 3.25. Lombardy region is considered as epicenter of corona virus outbreak in Italy. , Most people were died here than anywhere else in the world and later virus spread all over country with more than 98,000 confirmed cases. On 9 March 2020, the Italian prime minister Mr. G Conte had given announcement of imposed national quarantine, restricting the people movement unless for health emergency or unavoidable work needs. Statistics become consciously optimistic and daily number of new registered are becoming constantly stable since last week of March. However, because of both human-to-human and asymptomatic transmission of COVID-19, it is important to understand virus reproduction cases after this Italy lock down. Therefore, we developed a data driven model to forecast COVID-19 outbreak daily registered cases and recovered cases, also estimated the chance of low infected patient cases for next 60 days of Italy quarantine.

Methods

Data

Patient data were obtained from the official website of the Italian Health Ministry (http://www.salute.gov.it/nuovocoronavirus) that reports latest information of COVID-19 infection in Italy. The data model development was done based on the update of 31 March 2020. Patient data consisted of three groups, namely registered cases, recovered cases and death cases. In this study, we excluded the death cases information and forecasted possible number of register and recovered cases in next two months. Rather than observing entire data, we only considered observation from 15 February 2020 because after the first two cases registered on 31 January 2020, no more epidemic was reported in Italy till mid of February. Fig. 1 is the plot of total number of registered and recovered cases trend varied on daily basis.
Figure 1

Total registered cases progression (left) and Total recovered case progression (right) of COVID-19 in Italy (from mid of February to end of March).

Total registered cases progression (left) and Total recovered case progression (right) of COVID-19 in Italy (from mid of February to end of March).

ARIMA model development in R

R is one of the tools that has relevant importance for epidemiologists, and had quick search function can enable users to get many R libraries devoted to outbreak management and analysis. Auto-regressive integrated moving average (ARIMA) and specified by three orderly parameters: (p, d, q); where ‘p’ is an auto aggressive referred to use of ancient values in model,‘d’ is the difference degree of integrated I(d) component, and ‘q’ is model error which is combination of last error terms et. By summing above parameters with non-seasonal ARIMA model can be written as linear equation mentioned in equation (1). The model equation above mentioned as assumed to be a non-seasonal series. In this study, model specified by two sets of parameter order: (p, d, q) and (P, D, Q) m (i.e., describes the seasonal component of m time intervals). The mathematical equations of ARIMA model were explained in appendix section.

Model evolution and statistical analysis

To calculate the COVID-19 re-production cases among Italy patients, we imported ‘AUTOARIMA’ packages in R. After model exported, simple time series analysis was conducted to understand trends of corona epidemic in Italy. The data available from Italian health ministry website is obtained as day-to-day statistics. Past 45-days patient data were recorded on excel sheet. The command read_excel (“data”) was used to read the excel sheet. When working with time series in R, the data were converted in a time series (ts) for the number of registered cases per day from 15 February 2020 to 31 March 2020 mentioned as: install. Packages("forecast") library(forecast) library(readxl) worldcovid19 <- read_excel("Italycovid19.xlsx") View(worldcovid19) tsworldcovid19 <-ts (Italycovid19$`daily registered Cases’, frequency = 1,start = c(15/02/2020,1)) tsworldcovid19 <-ts (Italycovid19$`daily recovered Cases’, frequency = 1,start = c(15/02/2020,1)) plot(tsworldcovid19) 45 days patient data from 15 February 2020 (i.e., where serious outbreak was about to originated) to 31 March 2020 with one day frequency was considered (Fig. 2 ).
Figure 2

45-day plot diagram of COVID-19 daily registered cases in Italy (ts = 1).

45-day plot diagram of COVID-19 daily registered cases in Italy (ts = 1). The plots revealed that the trend in case registered at Italian hospitals was going upwards and peak number of corona cases was registered in the last two weeks of March (Fig. 3 ). This might be caused because of most people are traveled to home lands through public transports before lockdown was officially announced. Through this migration of people, virus could spreads through and expose the symptoms on or after incubation period. In view of this, we conducted simple forecasting of COVID cases if the same trend has been continued for two months. We applied ‘AUTOARIMA’ package in R to evaluate the values of (p, d, q) and forecaste the reproduction of infected cases. Two ARIMA models of COVID-19 daily registered and recovered cases were designed. The possible residuals for these two models to understand the case variance were plotted and statistical analysis was performed using ‘R’ version 1.2.5.
Figure 3

Weekly box plot diagram of infected Italians of COVID-19.

Weekly box plot diagram of infected Italians of COVID-19.

Results

For data fitting in ARIMA model to develop a model for COVID-19 for both registered and recovered cases, we performed the commands mentioned below. The 60-days COVID-19 forecasting graphs of register along recovery cases (Fig. 4 ), and normalized QQ plots were computed (Fig. 5 ). Table 1 presents the model outcomes and accuracy parameters.
Figure 4

Predictive and confidence intervals (CI) of registered case model (graph A), and recovered case model (graph B) (Black line: actual data, Blue line:60-day forecast, Gray zone: 80% of CI, White zone: 95% of CI).

Figure 5

Probability plots of registered cases (left), and recovered cases (right).

Table 1

ARIMA model comparison.

Modelar1ar2ar3AICAICcBICMERMSEMAEMPEMAPEMASEACF1
ARIMA(1,2,0)10 Registered cases0.3694680.41680.7683.9817.84514.74324.464.266.250.14030.0113
s.e.0.1575
ARIMA(3,2,0)10 Recovered cases−1.14−0.74−0.48597.18598.21604.3280.80186.85112.2710.5715.600.3293−0.081
s.e.0.12960.18260.1357

∗ar1,ar2,….arn are model coefficients; s.e.: standard errors; AIC: Akaike information criteria; AICc: Second order Akaike information criteria; BIC: Bayesian Information criterion; ME: Margin of error; RMSE: Root mean square error of fitted model; MAE: Mean absolute error; MPE: Mean posterior estimate; MAPE: Median absolute prediction error; MASE: Mean absolute scaled error; ACF: Aberrant crypt foci.

Predictive and confidence intervals (CI) of registered case model (graph A), and recovered case model (graph B) (Black line: actual data, Blue line:60-day forecast, Gray zone: 80% of CI, White zone: 95% of CI). Probability plots of registered cases (left), and recovered cases (right). ARIMA model comparison. ∗ar1,ar2,….arn are model coefficients; s.e.: standard errors; AIC: Akaike information criteria; AICc: Second order Akaike information criteria; BIC: Bayesian Information criterion; ME: Margin of error; RMSE: Root mean square error of fitted model; MAE: Mean absolute error; MPE: Mean posterior estimate; MAPE: Median absolute prediction error; MASE: Mean absolute scaled error; ACF: Aberrant crypt foci. The probability of new positive cases and recovered cases in Italy for next two months based on available data were computed. It is evident from Fig. 4, the 60-day forecasting of infected cases might rise in between the range of 105,732–182,757, and recovered cases could increased in between the range of 16,742–81,635 with CI of 80–95%. The regressive distribution of patient cases while two plots had observed to estimate the fitting accuracy. The model validation was assessed by prediction errors. Based on the ARIMA model accuracy evolution of COVID-19 Italian epidemic data on mentioned time period, we considered mean absolute prediction error (MAPE) parameter. The accuracy (Acc) is defined in equation (2); The models of ARIMA(1,2,0) registered, and ARIMA(3,2,0) recovered cases are validated with an accuracy of 93.75%, 84.4% respectivly.

Discussions

We used existed COVID-19 epidemic data of Italian patients to evaluate the probability of infected and recovered pateint number after having 60-day country lockdown. Simple automatic forecasting package (AUTOARIMA) of ‘R’ was applied to conduct predictive modelling. Our data driven model analysis highlights the necessity of country lockdown and self isolation to control disease transmissibility among Italian population at the moment. At the present, Italy is becoming the worst epidemic corona outbreak center. On 3 March 2020, 11 towns in North Italy announced quarantine after result of 17 deaths and 650 positive cases. Unfortunately, in consequence of many Italian citizens continued their daily life routine irrespective of outbreak results epidemic spread all over the country. After about one week, the Italian government announced more than 9000 positive cases with 97 deaths. On 9 March 2020 the Italian prime minister announced country lock down and strictly passed regulations to close malls, educational institutions, and sport events in order to stop infection among the other citizens. As mentioned, due to extreme characteristic of COVID-19 is not expose the immediate symptoms while in the incubation time. After Italy's lockdown, government officials make sure that people were at home. All national administration websites encourages companies to offer free online services. Educational institutions and universities involved e-learning methods, any data or publications on COVID-19 made available for free to general public. COVID-19 response team also conducting screening tests for domicile or long stay in high hit areas like north Italy provinces. Hospitals and medical centers are successfully handling patient flow to local hospitals and addresses individual issues about bed facilities, overcrowding in emergency departments, and patient transfer to other specialized facilities. All these critical circumstances were considered to understand what exactly happened in between the period of lockdown (9 March 2020) announcement and incubation period (possibly 23 March 2020). It can be observed in the Fig. 6 , the residual plot of positive COVID-19 cases during the given period. From the plot, it is clear that the first two weeks trend seems normal and after 3 March 2020, a huge spike in case variance can be observed (i.e., 24 to 26th days after quarantine had begun).
Figure 6

Residual plot of positive registered cases.

Residual plot of positive registered cases. One positive sign of this COVID-19 epidemic in Italy is after having established isolation, there is a significant growth of recovered case number, particular with last weeks of March (Fig. 7 ). This could be because of the increased availability of medical devices, medications and health professionals in the most affected areas that might affect lowering of pandemic rates.
Figure 7

Residual plot of recovered cases.

Residual plot of recovered cases. At present, the Italian citizens are also taking more preventive measures and maintaining social distancing to control speed of infection. As a result, disease transmission is expected to be reduced in the near future. Preliminary results of this study suggest that if Italian government and citizens could continue to be quarantined for another two months there could be chance of low tendency rate in infective cases. Predictions mentioned that another 78,701 infected cases might be the registered in 60 days which is lower than last 45 days. ARIMA models can forecast the simple up and downs and more predictive than regressive models without change in the overall trend. It is because ARIMA can only look back the data of dependent variables (i.e, registered and recovered cases). This represents a primary limitation of this study. Secondly, due to unwillingness to join in hospital, some confirmed cases are not ready to inform the medical authorities. This measure could affect the natural transmission of disease to family members which will also affect the study outcomes. Finally, used data was retrieved from official Italian Health Ministry websites, if any delay or mismatch of data reporting could results incorrect forecasting. COVID-19 is a severe pandemic that all countries are facing. This results about half of the global population went into lockdown. At the present, Italy is facing serious epidemic of positive and mortality rates. We estimated an increase in the size of registered cases and recovered case number population if the present lockdown could continue for another two months. Results of this study indicate that nearly 35% of decay in positive cases and 66% of growth in recovered cases could be possible. In addition, present government taking some serious contaminant measures such as suspending training sessions of sports persons, professionals, and non professionals. All emergency issues remained same including to prohibit natural persons to move with public and private means of transport. Advertising of prevention measures such as hand washing, mask wearing and disinfection was done continuously through national media which is largely influences the reproductive number of corona virus cases. The future of COVID-19 diffusion in Italy will largely depend on government regulations and motivation to carry self isolation of individual citizens.

Author contributions

NC: Data analysis, methods, results and study design; GB: Manuscript preparation and statistical analysis; FA: Final revision and study approval.

Declaration of Competing Interest

No author does not have any conflicts of interest.
  5 in total

1.  Critical Care Utilization for the COVID-19 Outbreak in Lombardy, Italy: Early Experience and Forecast During an Emergency Response.

Authors:  Giacomo Grasselli; Antonio Pesenti; Maurizio Cecconi
Journal:  JAMA       Date:  2020-04-28       Impact factor: 56.272

2.  The response of Milan's Emergency Medical System to the COVID-19 outbreak in Italy.

Authors:  Stefano Spina; Francesco Marrazzo; Maurizio Migliari; Riccardo Stucchi; Alessandra Sforza; Roberto Fumagalli
Journal:  Lancet       Date:  2020-02-28       Impact factor: 79.321

3.  Complexity of the Basic Reproduction Number (R0).

Authors:  Paul L Delamater; Erica J Street; Timothy F Leslie; Y Tony Yang; Kathryn H Jacobsen
Journal:  Emerg Infect Dis       Date:  2019-01       Impact factor: 6.883

4.  Estimation of COVID-19 outbreak size in Italy.

Authors:  Ashleigh R Tuite; Victoria Ng; Erin Rees; David Fisman
Journal:  Lancet Infect Dis       Date:  2020-03-19       Impact factor: 25.071

5.  Distribution of the COVID-19 epidemic and correlation with population emigration from Wuhan, China.

Authors:  Ze-Liang Chen; Qi Zhang; Yi Lu; Zhong-Min Guo; Xi Zhang; Wen-Jun Zhang; Cheng Guo; Cong-Hui Liao; Qian-Lin Li; Xiao-Hu Han; Jia-Hai Lu
Journal:  Chin Med J (Engl)       Date:  2020-05-05       Impact factor: 2.628

  5 in total
  57 in total

1.  Role of Precautionary Measures in Containing the Natural Course of Novel Coronavirus Disease.

Authors:  Arwa Alumran
Journal:  J Multidiscip Healthc       Date:  2020-07-16

2.  ARIMA models for predicting the end of COVID-19 pandemic and the risk of second rebound.

Authors:  Zohair Malki; El-Sayed Atlam; Ashraf Ewis; Guesh Dagnew; Ahmad Reda Alzighaibi; Ghada ELmarhomy; Mostafa A Elhosseini; Aboul Ella Hassanien; Ibrahim Gad
Journal:  Neural Comput Appl       Date:  2020-10-23       Impact factor: 5.606

3.  Integrating County-Level Socioeconomic Data for COVID-19 Forecasting in the United States.

Authors:  MichaelC Lucic; Hakim Ghazzai; Carlo Lipizzi; Yehia Massoud
Journal:  IEEE Open J Eng Med Biol       Date:  2021-07-09

4.  Random-Forest-Bagging Broad Learning System With Applications for COVID-19 Pandemic.

Authors:  Choujun Zhan; Yufan Zheng; Haijun Zhang; Quansi Wen
Journal:  IEEE Internet Things J       Date:  2021-03-17       Impact factor: 10.238

5.  Trend Analysis and Forecasting the Spread of COVID-19 Pandemic in Ethiopia Using Box-Jenkins Modeling Procedure.

Authors:  Yemane Asmelash Gebretensae; Daniel Asmelash
Journal:  Int J Gen Med       Date:  2021-04-21

Review 6.  A Survey on Mathematical, Machine Learning and Deep Learning Models for COVID-19 Transmission and Diagnosis.

Authors:  Christopher Clement John; VijayaKumar Ponnusamy; Sriharipriya Krishnan Chandrasekaran; Nandakumar R
Journal:  IEEE Rev Biomed Eng       Date:  2022-01-20

7.  Continent-Wide Analysis of COVID 19: Total Cases, Deaths, Tests, Socio-Economic, and Morbidity Factors Associated to the Mortality Rate, and Forecasting Analysis in 2020-2021.

Authors:  Muhammad Nauman Zahid; Simone Perna
Journal:  Int J Environ Res Public Health       Date:  2021-05-18       Impact factor: 3.390

Review 8.  A Review of Prolonged Post-COVID-19 Symptoms and Their Implications on Dental Management.

Authors:  Trishnika Chakraborty; Rizwana Fathima Jamal; Gopi Battineni; Kavalipurapu Venkata Teja; Carlos Miguel Marto; Gianrico Spagnuolo
Journal:  Int J Environ Res Public Health       Date:  2021-05-12       Impact factor: 3.390

9.  A Citywide 'Virus Testing': Chinese Government's Response to Preventing and Controlling the Second Outbreak of SARS-CoV-2.

Authors:  Liting Zhou; Hans Nibshan Seesaghur; Nadeem Akhtar; Jason Boolakee; Cornelius B Pratt
Journal:  Front Public Health       Date:  2021-06-16

10.  Forecasting the COVID-19 transmission in Italy based on the minimum spanning tree of dynamic region network.

Authors:  Min Dong; Xuhang Zhang; Kun Yang; Rui Liu; Pei Chen
Journal:  PeerJ       Date:  2021-06-29       Impact factor: 2.984

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.