Literature DB >> 32334157

Prediction for the spread of COVID-19 in India and effectiveness of preventive measures.

Anuradha Tomar1, Neeraj Gupta2.   

Abstract

The spread of COVID-19 in the whole world has put the humanity at risk. The resources of some of the largest economies are stressed out due to the large infectivity and transmissibility of this disease. Due to the growing magnitude of number of cases and its subsequent stress on the administration and health professionals, some prediction methods would be required to predict the number of cases in future. In this paper, we have used data-driven estimation methods like long short-term memory (LSTM) and curve fitting for prediction of the number of COVID-19 cases in India 30 days ahead and effect of preventive measures like social isolation and lockdown on the spread of COVID-19. The prediction of various parameters (number of positive cases, number of recovered cases, etc.) obtained by the proposed method is accurate within a certain range and will be a beneficial tool for administrators and health officials.
Copyright © 2020 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  COVID-19; Curve fitting; LSTM; Prediction; Recurrent neural network

Mesh:

Year:  2020        PMID: 32334157      PMCID: PMC7169890          DOI: 10.1016/j.scitotenv.2020.138762

Source DB:  PubMed          Journal:  Sci Total Environ        ISSN: 0048-9697            Impact factor:   7.963


Introduction

World is moving through a very distressing stage by the spread of novel coronavirus (SARS-CoV-2). It is a highly contagious disease and the World Health Organization (WHO) has declared it as a global public health emergency (L.-s. Wang et al., 2020). It is originated in Wuhan, Hubei Province, People's Republic of China (PRC) in late December 2019, when a case of unidentified pneumonia was reported (Huang et al., 2020). PRC Centers for Disease Control (CDC) experts declared that pneumonia as novel coronavirus pneumonia (NCP) as caused by a novel coronavirus and WHO officially named the disease COVID-19 (Huang et al., 2020). However, the International Committee on Taxonomy of Viruses (ICTV) named the virus as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). This is a class of β-coronavirus and has many potential natural hosts, intermediate hosts and final hosts as shown in Fig. 1 . Due to these characteristics, there is a great challenge for prevention and treatment of the virus infection (Vellingiri et al., 2020). Despite of the large number of cases worldwide (as shown in Fig. 2 (Statista, 2020)) and low mortality rate (Liu et al., 2020) compared to SARS and the middle east respiratory syndrome (MERS) (as shown in Fig. 3 (Statista, 2020)), this virus has high infectivity and transmissibility. Preventive measures for COVID-19 include maintaining social distancing, washing hands frequently, avoiding touching the mouth, nose, and face (WHO, 2020).
Fig. 1

Transmission of COVID-19.

Fig. 2

No of cases of COVID-19 worldwide (Statista, 2020).

Fig. 3

Fatality rate of major virus outbreaks (Statista, 2020).

Transmission of COVID-19. No of cases of COVID-19 worldwide (Statista, 2020). Fatality rate of major virus outbreaks (Statista, 2020). The first case of COVID-19 was reported in India on 30th January 2020 with origin from China (PIB, 2020). It spreads to the maximum of districts of the country. As on 9th April 2020 the total cases reported in India are 5734 with 472 recoveries and 166 deaths (Covid-19.in, 2020). However, the rate of infection is lower as compared to other countries. There is a lot of stress on the part of administration and health officials for accommodating patients with possible symptoms of COVID-19. So, for that some prediction tools must be used to know about the number of cases in coming days for making preparations at the administrative level (Tobías, 2020; L. Wang et al., 2020; L.-s. Wang et al., 2020). In this paper, we propose the data-driven LSTM method and the classical curve fitting method for the prediction of number of patients to be accommodated in the subsequent days based to the data available. The proposed model can approximately predict the number of new COVID-19 cases so, the administration can make preparations accordingly to accommodate them. This paper has been organized as follows. In Section 2, LSTM technique for the prediction of COVID-19 has been explained in detail. In 3, 4, the results and conclusions of the work have been presented, respectively.

LSTM based technique for prediction of COVID-19

Deep learning methods like recurrent neural networks (RNN) proved to be effective for prediction (Jiang and Schotten, 2020) due to automatically extracting relevant features from the training samples, feeding the activation from the previous time step as input for the current time step and networks self-connections. RNN is good at processing data and exhibiting great potential in time-series prediction (Connor et al., 1994) through storing large historical information in its internal state. However, it has drawback of vanishing and gradient exploding problems which leads to the large training time or training does not work at all. To remove that shortcomings, Hochreiter and Schmidhuber designed long short-term memory RNN structure in 1997 (Hochreiter and Schmidhuber, 1997) to deal with a long-term dependency with the multiplicative gates that regulate the information flow and memory cells in the recurrent hidden layer. The structure of LSTM consists of four gates i.e. input gate, forget gate, control gate, and output gate which is shown in Fig. 4 (Sun et al., n.d.).
Fig. 4

Basic structure of LSTM.

Basic structure of LSTM. The input gate is defined as It decides which information can be transferred to the cell. The information from the input of previous memory which is to be neglected is decided by the forget gate and is defined as: The update of the cell is controlled by the control gate and is given by the following equations: The hidden layer (h ) is updated by output layer which is also responsible for updating the output as is given by: In the above equations, tanh is used to scale the values into range −1 to 1, σ is the activation function which is taken as sigmoid and W are the corresponding weight matrices.

Results

In this section, we study the spread of COVID-19 in India as there are hundreds of cases reported each day. For validation and analysis of the proposed model, data pertaining to India from (Covid-19.in, 2020) has been used with the Matlab environment.

Data-driven methods to predict COVID-19

The data has been used from 30th January 2020 (when the first case of COVID-19 was reported in India) to 4th April 2020 with 80% data is used for training and rest 20% for forecasting and validation purposes. The resulting plot showing the total number of confirmed cases is shown in Fig. 5(a). In this figure, the observed data (blue color) is the data used for training purposes (80% of the total data), official data (green line) indicates the official data available (Covid-19.in, 2020) and forecasted data (red line) indicates the forecast of a total number of confirmed cases. From this graph, it is observed that the forecasted number of total confirmed positive cases closely matches with the available official data.
Fig. 5

(a) Total number of confirmed cases prediction using LSTM. (b) Daily total number of positive cases prediction using LSTM. (c) Total number of recovered cases prediction using LSTM. (d) Total number of deceased cases prediction using LSTM. (For interpretation of the references to color in this figure, the reader is referred to the web version of this article.)

(a) Total number of confirmed cases prediction using LSTM. (b) Daily total number of positive cases prediction using LSTM. (c) Total number of recovered cases prediction using LSTM. (d) Total number of deceased cases prediction using LSTM. (For interpretation of the references to color in this figure, the reader is referred to the web version of this article.) Similar observations have also been made for daily reported positive cases, total recovered cases and total deceased cases, as shown in Fig. 5(b), (c) and (d) respectively. The classical curve fitting technique also has been applied, considering two objectives: First, is to verify the above method and second is to analyze the impact of lockdown and social distance considering the various spread ratios. From this method it is observed that the data follows a power law (f(x) = ax ) and resulting estimation for total confirmed cases, daily positive cases, total recovered cases and total deceased cases are shown in Fig. 6(a), (b), (c) and (d) respectively. In these figures, official data is the data available for prediction (data of 66 days), estimated is the estimation/prediction curve and pred bnds represents the confidence of prediction, which is resulted as ±5% in the proposed work.
Fig. 6

(a) Total number of confirmed cases prediction by curve fitting. (b) Daily number of positive cases prediction by curve fitting. (c) Total number of recovered cases prediction by curve fitting. (d) Total number of deceased cases prediction by curve fitting.

(a) Total number of confirmed cases prediction by curve fitting. (b) Daily number of positive cases prediction by curve fitting. (c) Total number of recovered cases prediction by curve fitting. (d) Total number of deceased cases prediction by curve fitting. The recovery rate of confirmed cases is also high in case of COVID-19, however, the time taken for the patients to recover is also large. With a large number of patients, the stress on the medical resource increases, so estimation/prediction of time taken for recovery is also required for proper arrangement and utilization of available resources. Towards this goal estimation of the number of patients recovered has also been made, which is shown in Fig. 7(a).
Fig. 7

(a) Estimation of number of days required for recovery (b) Effect of the transmission rate r on number of cases (c) Effect of r on number of cases with 6th April as initial point.

(a) Estimation of number of days required for recovery (b) Effect of the transmission rate r on number of cases (c) Effect of r on number of cases with 6th April as initial point. From this figure, it is observed that for the total number of confirmed cases up to 90 days from the starting point of the first case, 120 days are required for total recovery. For these data-driven estimations, the data has been taken up to 4th of April. The comparison has also been made for the total positive reported cases and daily reported cases with estimated cases (by data-driven model) from 5th to 9th April 2020 as shown in Table. 1 .
Table 1

Comparison of reported and estimated cases.

DayDateOfficial dataEstimationError percentage (%)
Comparison for total positive confirmed cases
67th5th April 202042894012−6.44
68th6th April 2020477846762.12
69th7th April 2020535154381.64
70th8th April 2020591663116.6
71st9th April 2020672573088



Comparison for daily reported positive cases
67th5th April 2020605455−24
68th6th April 20204895226.7
69th7th April 20205735984.4
70th8th April 202056568320.8
71st9th April 2020809779−3.7
Comparison of reported and estimated cases. From the above tables and Fig. 7(a), it is observed that number of cases under different heads can be estimated by using the above two techniques with the available data. However, the limitation is only that limited data is available and initial data is nearly a flat curve and much cannot be deciphered from that.

Effect of measures adopted to prevent the spread of COVID-19

With the outbreak of the COVID-19 pandemic, various measures are has been adopted by the Govt. of India to prevent its spread (Covid-19.in, 2020). One of the measures is social isolation and lockdown. Social isolation is the complete lack of contact between an individual and society, while, lockdown is an emergency protocol that usually prevents people from leaving an area. These two measures prevent the spread of COVID-19 from effected person to healthy individual to a great extent. Analysis of the adopted preventive measures has been done with different transmission rate (r) (Zhou et al., 2020) i.e. from r = 0.001 to r = 2.3. The transmission rate is the quantitative measure of spread of the virus from an effected individual to healthy individuals and low value of transmission rate can be obtained by strict social isolation and lockdown measures (Worldometers, 2020). Let us consider a scenario that in case of India, the value of r before lockdown was 2.3 (i.e. an infected individual can infect the 2.3 persons) and after lockdown it reduced to 0.15. The estimated total positive cases have been compared with actual cases using different values of transmission rate and are shown in Fig. 7(b). From this figure, it is observed that preventive measures (social isolation and lockdown) have worked well in containing this contagious virus in India. Now, let us consider a scenario with 6th April 2020 having 4289 number of positive cases as the initial start and strict lockdown and social isolation measured adopted rigorously, the number of positive cases with variation in transmission rate is shown in Fig. 7(c). From Fig. 7(c), it is observed that preventive measures of social isolation and lockdown will work perfectly in reducing the number of cases and making the curve flat. So preventive measure must be strictly adopted after 6th April.

Conclusion

In this paper, a data-driven forecasting/estimation method has been used to estimate the possible number of positive cases of COVID-19 in India for the next 30 days. The number of recovered cases, daily positive cases, deceased cases has also been estimated by using LSTM and curve fitting. The effect of preventing measures like social isolation and lockdown has also been observed which shows that by these preventive measures, spread of the virus can be reduced significantly.

CRediT authorship contribution statement

Anuradha Tomar: Conceptualization, Investigation, Methodology, Software, Validation, Writing - review & editing. Neeraj Gupta: Conceptualization, Data curation, Writing - original draft, Visualization, Writing - review & editing.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  9 in total

1.  Recurrent neural networks and robust time series prediction.

Authors:  J T Connor; R D Martin; L E Atlas
Journal:  IEEE Trans Neural Netw       Date:  1994

2.  Long short-term memory.

Authors:  S Hochreiter; J Schmidhuber
Journal:  Neural Comput       Date:  1997-11-15       Impact factor: 2.026

Review 3.  The reproductive number of COVID-19 is higher compared to SARS coronavirus.

Authors:  Ying Liu; Albert A Gayle; Annelies Wilder-Smith; Joacim Rocklöv
Journal:  J Travel Med       Date:  2020-03-13       Impact factor: 8.490

4.  Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China.

Authors:  Chaolin Huang; Yeming Wang; Xingwang Li; Lili Ren; Jianping Zhao; Yi Hu; Li Zhang; Guohui Fan; Jiuyang Xu; Xiaoying Gu; Zhenshun Cheng; Ting Yu; Jiaan Xia; Yuan Wei; Wenjuan Wu; Xuelei Xie; Wen Yin; Hui Li; Min Liu; Yan Xiao; Hong Gao; Li Guo; Jungang Xie; Guangfa Wang; Rongmeng Jiang; Zhancheng Gao; Qi Jin; Jianwei Wang; Bin Cao
Journal:  Lancet       Date:  2020-01-24       Impact factor: 79.321

5.  Evaluation of the lockdowns for the SARS-CoV-2 epidemic in Italy and Spain after one month follow up.

Authors:  Aurelio Tobías
Journal:  Sci Total Environ       Date:  2020-04-06       Impact factor: 7.963

Review 6.  COVID-19: A promising cure for the global panic.

Authors:  Balachandar Vellingiri; Kaavya Jayaramayya; Mahalaxmi Iyer; Arul Narayanasamy; Vivekanandhan Govindasamy; Bupesh Giridharan; Singaravelu Ganesan; Anila Venugopal; Dhivya Venkatesan; Harsha Ganesan; Kamarajan Rajagopalan; Pattanathu K S M Rahman; Ssang-Goo Cho; Nachimuthu Senthil Kumar; Mohana Devi Subramaniam
Journal:  Sci Total Environ       Date:  2020-04-04       Impact factor: 7.963

7.  Preliminary prediction of the basic reproduction number of the Wuhan novel coronavirus 2019-nCoV.

Authors:  Tao Zhou; Quanhui Liu; Zimo Yang; Jingyi Liao; Kexin Yang; Wei Bai; Xin Lu; Wei Zhang
Journal:  J Evid Based Med       Date:  2020-02-12

8.  Real-time estimation and prediction of mortality caused by COVID-19 with patient information based algorithm.

Authors:  Lishi Wang; Jing Li; Sumin Guo; Ning Xie; Lan Yao; Yanhong Cao; Sara W Day; Scott C Howard; J Carolyn Graff; Tianshu Gu; Jiafu Ji; Weikuan Gu; Dianjun Sun
Journal:  Sci Total Environ       Date:  2020-04-08       Impact factor: 7.963

  9 in total
  80 in total

1.  ξboost: An AI-Based Data Analytics Scheme for COVID-19 Prediction and Economy Boosting.

Authors:  Darshan Vekaria; Aparna Kumari; Sudeep Tanwar; Neeraj Kumar
Journal:  IEEE Internet Things J       Date:  2020-12-25       Impact factor: 9.471

Review 2.  Applications of artificial intelligence in battling against covid-19: A literature review.

Authors:  Mohammad-H Tayarani N
Journal:  Chaos Solitons Fractals       Date:  2020-10-03       Impact factor: 5.944

3.  Data Analysis and Forecasting of the COVID-19 Spread: A Comparison of Recurrent Neural Networks and Time Series Models.

Authors:  Daniela A Gomez-Cravioto; Ramon E Diaz-Ramos; Francisco J Cantu-Ortiz; Hector G Ceballos
Journal:  Cognit Comput       Date:  2021-06-03       Impact factor: 5.418

4.  A Deep Learning-Based Method for Forecasting Gold Price with Respect to Pandemics.

Authors:  Mahtab Mohtasham Khani; Sahand Vahidnia; Alireza Abbasi
Journal:  SN Comput Sci       Date:  2021-06-12

5.  Analysis and forecasts for trends of COVID-19 in Pakistan using Bayesian models.

Authors:  Navid Feroze; Kamran Abbas; Farzana Noor; Amjad Ali
Journal:  PeerJ       Date:  2021-07-07       Impact factor: 2.984

6.  Transmission trend of the COVID-19 pandemic predicted by dendritic neural regression.

Authors:  Minhui Dong; Cheng Tang; Junkai Ji; Qiuzhen Lin; Ka-Chun Wong
Journal:  Appl Soft Comput       Date:  2021-07-07       Impact factor: 6.725

7.  Deming least square regressed feature selection and Gaussian neuro-fuzzy multi-layered data classifier for early COVID prediction.

Authors:  Rathnamma V Mydukuri; Suresh Kallam; Rizwan Patan; Fadi Al-Turjman; Manikandan Ramachandran
Journal:  Expert Syst       Date:  2021-03-26       Impact factor: 2.812

8.  Role of Threat and Coping Appraisal in Protection Motivation for Adoption of Preventive Behavior During COVID-19 Pandemic.

Authors:  Arista Lahiri; Sweety Suman Jha; Arup Chakraborty; Madhumita Dobe; Abhijit Dey
Journal:  Front Public Health       Date:  2021-07-05

9.  Time series prediction of COVID-19 transmission in America using LSTM and XGBoost algorithms.

Authors:  Junling Luo; Zhongliang Zhang; Yao Fu; Feng Rao
Journal:  Results Phys       Date:  2021-06-22       Impact factor: 4.476

10.  Prediction of global spread of COVID-19 pandemic: a review and research challenges.

Authors:  Saloni Shah; Aos Mulahuwaish; Kayhan Zrar Ghafoor; Halgurd S Maghdid
Journal:  Artif Intell Rev       Date:  2021-07-16       Impact factor: 9.588

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.