Literature DB >> 33250530

Novel spatiotemporal feature extraction parallel deep neural network for forecasting confirmed cases of coronavirus disease 2019.

Chiou-Jye Huang1, Yamin Shen1, Ping-Huan Kuo2, Yung-Hsiang Chen3.   

Abstract

The coronavirus disease 2019 pandemic continues as of March 26 and spread to Europe on approximately February 24. A report from April 29 revealed 1.26 million confirmed cases and 125 928 deaths in Europe. To refer government and enterprise to arrange countermeasures. The paper proposes a novel deep neural network framework to forecast the COVID-19 outbreak. The COVID-19Net framework combined 1D convolutional neural network, 2D convolutional neural network, and bidirectional gated recurrent units. COVID-19Net can well integrate the characteristics of time, space, and influencing factors of the COVID-19 accumulative cases. Three European countries with severe outbreaks were studied-Germany, Italy, and Spain-to extract spatiotemporal features and predict the number of confirmed cases. The prediction results acquired from COVID-19Net are compared to those obtained using a CNN, GRU, and CNN-GRU. The mean absolute error, mean absolute percentage error, and root mean square error, which is commonly used model assessment indices, were used to compare the accuracy of the models. The results verified that COVID-19Net was notably more accurate than the other models. The mean absolute percentage error generated by COVID-19Net was 1.447 for Germany, 1.801 for Italy, and 2.828 for Spain, which was considerably better than those of the other models. This indicated that the proposed framework could accurately predict the accumulated number of confirmed cases in the three countries and serve as an essential reference for devising public health strategies. And also indicated that COVID-19 has high spatiotemporal relations, it suggests us to keep a social distance and avoid unnecessary trips.
© 2020 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  COVID-19Net; Confirmed cases forecasting; Parallel deep neural network

Year:  2020        PMID: 33250530      PMCID: PMC7687416          DOI: 10.1016/j.seps.2020.100976

Source DB:  PubMed          Journal:  Socioecon Plann Sci        ISSN: 0038-0121            Impact factor:   4.923


Introduction

CORONAVIRUS disease 2019 (COVID-19) is caused by a highly contagious novel virus and was first discovered in Wuhan City in Hubei, China, where the pandemic initially began. The pandemic quickly spread across China and then the entire world. Because of the high contagion rate of COVID-19 and the likelihood of an outbreak, a lockdown was imposed on Wuhan City, which has a population of nearly 11 million people. Meanwhile, the Chinese government implemented a series of strict measures to control the pandemic. Fig. 1 illustrates the timeline of the COVID-19 pandemic.
Fig. 1

Timeline of the COVID-19 pandemic.

Timeline of the COVID-19 pandemic. In December 2019, a pneumonia case of unknown cause was discovered. Shortly afterward, patients with similar symptoms appeared across China and substantially burdened the healthcare system. On January 12, 2020, the World Health Organization (WHO) named the pneumonia 2019-nCoV. The disease was then confirmed to be contagious between people by Zhong Nanshan, an academician of the Chinese Academy of Engineering, on January 20, 2020. Ten days later (January 30), the WHO designated 2019-nCoV a public health emergency of international concern. On February 11, 2020, WHO Director-General Tedros Adhanom Ghebreyesus officially renamed the disease COVID-19 during an international meeting in Geneva. On March 11, 2020, Tedros announced a global outbreak of COVID-19. Mahase [1] reported in the BMJ that UK Prime Minister Boris Johnson warned citizens to avoid contact with people. Imperial College London indicated that COVID-19 is highly contagious to older adults aged over 70 years and pregnant women. As of 11:00 a.m. Beijing Time on March 16, the accumulated number of confirmed cases in Italy has reached 24 938, with 20 794 active cases, 1518 severe cases, 2335 fully recovered cases, and 1809 deaths. Italy (60.43 million people) has only a 20% larger total population than does South Korea (51.63 million people). However, the number of confirmed cases in Italy is three times of that South Korea. The number of deaths presents the greatest difference; that of Italy (1809) is 24 times that of Korea (75). Overall, the COVID-19 mortality rate in Italy exceeds 7%, which is substantially higher than the global average mortality rate (3.7%) and is the highest rate worldwide. Italy implemented measures against the pandemic earlier than did other countries. The day after COVID-19 was designated a public health emergency of international concern by the WHO, the Italian government became the first among European Union countries to announce a national health emergency and ceased all flights to and from China. On February 22, 2020, Italy imposed lockdowns on multiple cities, and the military was deployed to assist with pandemic prevention 3 days later. Recently, Italy has deployed healthcare personnel across the country to contain the pandemic. Italy's severe situation and its worldwide leading mortality rate may be attributable to the fast spread of the virus and numerous citizens' unwillingness to reduce social activities despite warnings from the government. Meanwhile, an increasing number of countries have implemented strict pandemic prevention measures. After Italy, Spain imposed multiple strict measures in which the central government announced a state of emergency for 14 days, during which all people were requested to remain indoors unless commuting to work or purchasing daily necessities. In addition to schools, which were closed previously, all shops and commercial areas unrelated to sustaining daily necessities were required to cease operation. On March 14, 2020, a lockdown was imposed on Madrid. Madrid, Spain on Saturday, March 14th became the second European country to impose a nationwide lockdown, ordering its 47 million people to mostly stay in their homes as coronavirus cases surge and the government announced that the wife of the Spanish prime minister had tested positive for the virus [2]. Staring from the next day, the police began to use unmanned drones for patrols and ask crowds in parks to leave and remain at home. However, the pandemic in Spain has not subsided. The accumulated number of confirmed cases in Spain has reached 74 386, and this number is higher than that of the five most affected Chinese provinces excluding Hubei combined (i.e., Guangdong, Henan, Zhejiang, Hunan, and Anhui), despite the total population of Spain being 12% of the combined population of these five provinces. The incremental rate of confirmed cases in Spain is also greatly concerning. The accumulated number of confirmed cases was 3146 on March 13, 5232 on March 14, and 7753 on March 15, which presents a daily incremental rate of 50%. In addition, the number deaths in Spain reached 288 and a 3.7% mortality rate, which is the highest after that of Italy. On February 9, 2020, Fernando Simón, the Director of the Center for Coordination of Health Alerts and Emergencies in Spain, stated that the country would only have a few confirmed cases. However, a report by the center on March 6 warned that the number of deaths in nearby Italy was increasing rapidly and that the pandemic may spread to Spain. At that time, the number of confirmed cases in Spain was roughly 280, with four deaths. Two weeks later, the number of deaths in Spain exceeded 7000, three times that disclosed by Iran. The statistics reported by the Spanish government on March 30 indicated that 813 people died because of COVID-19 in the past 24 h, which caused the number of deaths in Spain to reach 7340, during which the number of confirmed cases increased by 6398. Consequently, the number of accumulated confirmed cases reached 85 195 in Spain, the third highest number worldwide. The pandemic in Spain appeared to have subsided in the past day. However, a sudden increase in the number of confirmed cases and deaths forced the Spanish government to impose further lockdown measures: Since March 30, all people have been required to remain at home for at least 2 weeks, except for those engaging in essential occupations. On March 11, 2020, the Chancellor of Germany, Angela Merkel, hosted a press conference jointly with Jens Spahn, the Federal Minister of Health, and Lothar Wieler, the President of the Robert Koch Institute. For the first time, Merkel publicly addressed her opinion of COVID-19 and stated that the public is currently not immune to this disease. Merkel warned that because no vaccine or medication has been developed for COVID-19, 60%–70% of the German population is expected to be infected at the current rate according to expert analysis. A few hours after the press conference, the WHO characterized COVID-19 as a pandemic, when the accumulated number of confirmed cases in Germany reached 2000, with four cases of death. This marks the zero-death record held by Germany while the pandemic spreads across Europe. In the past week, the number of confirmed cases in Germany has increased considerably, with daily increases of over 1000 cases. To contain the pandemic, the German government closed its borders with France, Switzerland, and Austria at 08:00 on March 16. The German railway also progressively reduced the number of domestic trains in service. Social activities in each German state are being gradually reduced. Events with more than 50 participants must be canceled; schools, bars, and nightclubs have been closed; some companies and governmental units allow their employees to work from home; and restrictions were imposed on foreign travel. In response to COVID-19, the German government has conducted research in the Robert Koch Institute since January 6, and the scale of the research continues to increase. In addition to its comprehensive laboratory system and highly trained personnel, Germany began conducting screening tests early to identify patients in advance. After the contagion rate and number of patients are reduced, the national healthcare system can allocate more capacity to provide treatment for all patients instead of passively conducting screening tests after patients experience severe symptoms and require hospitalization. Germany currently has the lowest COVID-19 mortality rate among European countries with populations of more than 10 million people. According to 2017 Organization for Economic Cooperation and Development statistics, the number of ward beds per person in Germany was the highest among European countries, and an average of 8 beds are available for every 1000 people. A crucial factor in containing pandemics is the density of intensive care units (ICUs). Patients with severe COVID-19 symptoms must receive treatment in ICU wards and generally require artificial ventilation. According to the German Federal Ministry of Health, Germany currently has 28 000 intensive care beds, 25 000 of which are equipped with artificial ventilation systems. Assume Germany's healthcare system possesses enough facilities to contain the pandemic for the total population. However, many of these beds are already occupied. Consequently, this assumption is valid only if the national healthcare system does not collapse. A critical factor in preventing this pandemic is reducing the viral contagion rate and not overwhelming the healthcare system, which can be extremely difficult. The number of intensive care beds in Italy is roughly 5000. If the pandemic continues or becomes more severe, Germany's large number of available beds will be a notable advantage in containing the epidemic. As of 16:00 on March 20, the accumulated number of confirmed cases in Germany is 19 356, with 52 deaths and a mortality rate of 0.26%. However, the situation in Germany is not optimistic. Political and academic figures, including Minister Spahn, warned that the pandemic has not peaked. Before the COVID-19 outbreak, the German healthcare system already faced labor shortages. Experts warned that when the real peak of the pandemic arrives, Germany may lack nurses despite having sufficient ventilators. Local health departments lack epidemiology surveyors; hence, during the pandemic peak, whether Germany can continue to effectively track the contact history of cases and stop chains of infection remain questionable. Concerns have emerged regarding European countries with close interactions with Germany, including the United Kingdom and Switzerland. If these countries fail to contain the pandemic, Germany may face continuous streams of imported cases because the strict border controls currently imposed in Europe cannot continue indefinitely. In response to COVID-19, which is considered a “never before threat” to Germany, the German government clearly stated that additional restrictions may be imposed on the daily life of citizens. Merkel also emphasized that the government is ready to take any necessary measures to help the country overcome the pandemic. As the country with the largest population, economic activity, and number of European Parliament seats in the European Union, Germany is a critical reference for the global community to assess the progression of COVID-19. Currently, the pandemic in China appears to have subsided considerably, while outbreaks continue in other countries, particularly European countries such as Italy, Spain, and Germany. On March 19, 2020, the number of newly confirmed cases in Italy surpassed 4000, and that in Spain and Germany each surpassed 2000. The COVID-19 outbreak substantially threatens the global health system and considerably affects the global economy. Decreases in European and US stock markets have led to notable economic losses. Extracting information from the spatiotemporal data of the pandemic is increasingly crucial to helping countries determine next-stage pandemic prevention measures and planning resource allocation and aid among cities and countries. The remaining sections of this paper are structured as follows. Section 2 presents a literature review on articles recently published by relevant scholars. Section 3 details the model proposed in this study and the principles behind models. Section 4 discusses the analysis results, and Section 5 presents its conclusions.

Related works

COVID-19 had a severe impact on economic, social, and production in the world. Nicola et al. [3] gave a summarization of the socio-economic effects of COVID-19 on individual aspects of the world economy, and it indicated the necessity of a broad socio-economic development plan. COVID-19 disease prompts the government to impose drastic lockdown measures of the economy and society, to let the economy recover while preventing and controlling the epidemic. Ocampo et al. [4] applied the decision-making trial and evaluation laboratory (DEMATEL) method with intuitionistic fuzzy (IF) sets to the domain of public health and the emerging COVID-19 pandemic. People warned to avoid all non-essential travel, COVID-19 has caused considerable losses to tourism in many countries. COVID-19 has also hit the stock market. In Spain's stock market [5] (IBEX 35), the most significant one-day drop in its history appears, which experienced a decline of up to 14% at the closing of shares. Some researchers analyzed the opportunities and challenges in their research areas during the epidemic. Javaid et al. [6] found that Industry 4.0 not only could fulfill the requirements for custom face masks, gloves but also could collect information from the healthcare system to control and treat COVID-19 patients properly. Pu et al. [7] discussed the impact of COVID-19 on agricultural production in China. The research indicated that unreasonable restrictions would block the outflow channels of farm products, hinder necessary production inputs, destroy production cycles, and finally undermine production capacity. Singh et al. [8] researched the Internet of Things (IoT), and the research indicated that the IoT healthcare system was useful for proper monitoring of COVID-19 patients. The use of interconnected networks can help improve patient satisfaction and reduces readmission rates in the hospital. Above all, whatever the stock market or others, total affect by COVID-19. There are opportunities as well as challenges, and this is bound to have some impact on the economy. A large number of researchers have found that the prediction of COVID-19 cases has become more and more critical. Nikolopoulos et al. [9] found that due to travel restrictions and other measures in various countries, short-term real-time prediction of the epidemic has become an essential management and decision-making task. Accurately predicting the development of new cases can more effectively manage the resulting excess demand in the entire supply chain. Zeroual et al. [10] found that effective management of infected patients has become a challenging problem facing hospitals. Accurate short-term prediction of the number of new infections and recovered cases is essential to optimize available resources and prevent or slow the development of such diseases. Shastri et al. [11] believe that predicting COVID-19 cases can help reduce morbidity and mortality. The proposed COVID-19Net can forecast next day total confirmed cases and provide a reference for enterprises to develop production plans and response plans during the epidemic. It helps enterprises seize opportunities and meet challenges. Commonly used COVID-19 prediction models are divided into two types: those using mathematics or artificial intelligence (AI) black box algorithms to construct models and analyze the progression and contagion of the pandemic. Li et al. [12] used a mathematical approach to construct a function model and describe the daily number of newly confirmed cases and deaths in Hubei. They used nonlinear regression to obtain essential parameter values, summarize the pandemic trend in Hubei, and predict that the pandemic will end in Hubei on March 10. Nishiura et al. [13] estimated the serial intervals of COVID-19 patients. Using a truncated dataset with days as the unit, they employed a double-interval Bayes likelihood function to estimate serial intervals. Under a 95% confidence interval, their results revealed a mean of 3.9 d between successive cases in a chain of transmission. Tang et al. [14] used a temporal dynamic model to simulate COVID-19 transmission in China. Markov chain Monte Carlo methods were used to fit the pandemic data of the country, and Geweke's diagnostic was used to assess model convergence. Their experiment suggested that 58%–76% of contagion sources must be isolated to contain the pandemic. Fan et al. [15] used data on the population distribution and flow of Wuhan City to predict COVID-19 cases imported from nearby cities and provinces. Their results revealed that the population of other cities or provinces was highly correlated with the daily number of confirmed cases in Wuhan. Roosa et al. [16] predicted pandemic progression by using three mathematical models: a generalized logistic growth model, the Richards growth model, and a sub-epidemic wave model. Their results verified that the sub-epidemic model most accurately predicted the pandemic situation in Hubei on February 15. Zhong et al. [17] used the susceptible–infected–removed model and epidemic data combined with parameter estimation to estimate the accumulated number of confirmed cases in China. They predicted that the COVID-19 outbreak in China will gradually subside between early May and late June. The Diamond Princess cruise is considered a major epidemic zone of COVID-19 and has received considerable attention from the global community. Nishiura [18] used the infection timeline of the Diamond Princess to back-calculate the incidence rate under a 95% confidence interval and revealed that implementing isolation measures on February 5 reduced the number of confirmed cases to 776. Currently, Internet and big data platforms are gradually improving to incorporate comprehensive functions. This indicates the feasibly of using AI to resolve disease-related problems [19]. For example, in response to the increasing need for resources during the severe acute respiratory syndrome (SARS) outbreak from China in 2002, Yu et al. [20] used a multitarget and multicycle mixed-integer programming model to determine the optimal allocation and provision of temporary facilities and resources. They used Language for Interactive General Optimizer to optimize facility site selection and suggested that selecting the sites of temporary medical facilities built in late February substantially affects subsequent pandemic prevention. McAleer [21] employed the Global Health Security Index to assess 195 countries worldwide and compared COVID-19 with SARS and Middle East respiratory syndrome. The results indicated that the global community could better contain COVID-19 than the other two diseases, with the index score increasing to 51.9. For deep learning, Metsky et al. [22] employed the CRISPR-Cas13 system to compare COVID-19 virus nucleic acid sequences with those of other representative viruses. They also tested various COVID-19 screening methods and identified ADAPT as the most accurate. Guo et al. [23] used a bi-path convolutional neural network (CNN) to construct a virus host prediction model. They used the basic local alignment search tool to compare the gene sequences of two virus hosts and determined that the area under curve approach most accurately described human hosts. Yang et al. [24] combined a long short-term memory (LSTM) model and the Susceptible-Exposed-Infected-Resistant (SEIR) model and used the population flow data obtained near January 23, 2020 and COVID-19 epidemic parameters to predict the pandemic peak in China. Comparing the prediction curves derived using these two models and that plotted using actual data indicated that the number of infected patients would reach a peak of 4000 between February 4 and February 7. Anastassopoulou et al. [25] used the susceptible–infectious–recovered–dead model to examine the medical data of Hubei acquired between January 11 and February 10, 2020 and predict mortality and recovery rates through 95% confidence intervals [26]. Their results showed that the accumulated number of confirmed cases could reach 45 000–180 000 on February 29, and the number of deaths could exceed 2700. Kucharski et al. [27] used a random transmission model to predict the number of confirmed cases in Wuhan and imported cases from Wuhan. Their results indicated that viral transmission could subside in Wuhan in late January 2020; however, new outbreaks could occur in regions with transmission potential similar to that of Wuhan. Huang et al. [28] proposed to use the deep neural network CNN architecture to predict the cumulative number of confirmed cases in seven regions of Hubei, Shenzhen and Wenzhou. By comparing with the training effects of MLP, LSTM and GRU models, it is found that the overall effect of CNN in this experiment is better than other models. Few scholars have used deep learning to predict pandemic trends because this approach requires much data. To fully utilize available pandemic data, the time-variant characteristics of pandemic progression and the strong spatial correlation between neighboring countries and cities with high transmission potential must be considered. Accordingly, a parallel mixed deep learning network was employed in this study to extract the spatiotemporal features of COVID-19 transmission and predict subsequent pandemic trends.

Methodology

Data were acquired from the daily situation reports disclosed by the WHO [29] and information in the GitHub website [30]. To ensure the integrity and rigorousness of research procedures, we used data from January 22 to April 28, during which 98 pieces of data were collected daily and divided into 93 samples. The first 83 samples were used as the training set, and the remaining 10 samples were used as the testing set. Data accumulated over a 5-d period were used to predict the accumulated number of confirmed cases of the subsequent day. The data were from three European countries with severe situations: Germany, Italy, and Spain. The data were divided into two parts, as shown in Fig. 2 .
Fig. 2

Overview of the COVID-19Net model experiment process.

The first part consisted of six characteristic factors from each country: the daily number of newly confirmed cases, deaths, and recovered cases and the accumulated number of confirmed cases, deaths, and recovered cases. The second part consisted of three characteristic factors—the accumulated number of confirmed cases from the three countries. Overview of the COVID-19Net model experiment process. Based on the different types of these specific factors, a new hybrid spatiotemporal architecture called COVID-19Net employ predict the total number of confirmed cases on the next day, as shown in step 2 of Fig. 2. The proposed architecture is mainly composed of CNN and bidirectional gated recurrent unit (BiGRU). The experimental COVID-19Net flow chart also shown in advance below. The detailed explanations of this method will give in sections 3, 4. Fig. 3 illustrates the geographical locations of the three countries. Many studies did not consider the spatial characteristics of the epidemic. For the first time, COVID-19Net has extracted and integrated the temporal and spatial (Germany, Italy, Spain) characteristics and influencing factors of cumulative COVID-19 cases, and performed well as shown in Fig. 2. COVID-19Net makes full use of the feature extraction capabilities of CNN to extract spatiotemporal features of COVID-19 cases and extract relevant features from each feature factor.
Fig. 3

Geographical locations of Germany, Italy, and Spain [31].

Geographical locations of Germany, Italy, and Spain [31].

Convolutional neural network

CNN's are a typical class of deep neural networks. They are used wildly in many fields, such as image classification [32], semantic segmentation [33], time-series forecasting [34] et al. CNN is divided into one-dimensional CNN (1D-CNN) and two-dimensional CNN (2D-CNN) according to different application data dimensions. The movement direction of the 1D-CNN filter is unidirectional, but while the movement direction of the 2D-CNN filter is two-directional. Both 1D-CNN and 2D-CNN have a strong feature-learning ability. A CNN can be divided into layers to extract features from a dataset. Fig. 4 presents the structural diagram of a CNN.
Fig. 4

CNN structure.

CNN structure. A CNN mainly comprises convolutional, pooling, and fully connected layers. Convolutional layers are the core structure where most computations occur. When data pass through the kernel, features are extracted and sent to pooling layers, which ensure that features remain invariant. The resulting feature maps are then compressed to reduce computation loads. Finally, fully connected layers transform all features into one-dimensional (1D) vectors and send them to the dense layer for outputs.

Bidirectional long short-term memory neural network & bidirectional gated recurrent unit

A bidirectional LSTM (BiLSTM) model [35] presents an improved framework compared with LSTM [36]. The neuron structure of BiLSTM is similar to that of LSTM, as illustrated in Fig. 5 (a) and (b). LSTM comprises an input gate, output gate, and forget gate.
Fig. 5

(a) Neuron structure of an LSTM model; (b) neuron structure of a BiLSTM model.

(a) Neuron structure of an LSTM model; (b) neuron structure of a BiLSTM model. The input gate is expressed using (1), in which current input and previous information and memory (i.e., and ) are selectively retained according to corresponding weights and are sent to the forget gate. The corresponding weight matrices are , , and , and denotes the bias matrix of the input gate. The forget gate is expressed using (2), (3). In (2), current input and previous information and memory (i.e., and ) are selectively forgotten according to weight matrices , , and , and is the bias matrix of the forget gate. In (3), information retained in the input gate is applied to the tanh function and added to previously forgotten information to derive the target retained information , where and are the weight matrices of the forget gate, and is the bias matrix of the forget gate. The output gate is acquired by passing current input , previous information , and currently retained memory through the sigmoid layer. Then, is standardized using tanh and multiplied to the output of the sigmoid layer to derive output information . In (4), (5), , , and, are the weight matrices of the output gate, and is the bias matrix of the output gate. A BiLSTM model is an LSTM model with backpropagation learning, as presented in Fig. 5(b). A gated recurrent unit (GRU) [37] resolves the vanishing gradient problem in a recurrent neural network and uses an update gate and reset gate, as displayed in Fig. 6 (a). These two gates control output information, retain previous information, and require fewer parameters than does an LSTM model. Accordingly, a GRU-based neural network is more efficient than an LSTM model.
Fig. 6

(a) GRU neuron structure; (b) BiGRU neuron structure.

(a) GRU neuron structure; (b) BiGRU neuron structure. The update gate controls information sent previously. In (6), the update gate can selectively retain previous information , and and are the weight matrices of the update gate. The reset gate selectively forgets previous information . The computation of (7) is identical to that of (6), and and are the weight matrices of the reset gate. Reset gate output is subjected to matrix multiplication with , as expressed in (8). A BiGRU [38] is similar to a BiLSTM [39] model in which backpropagation learning is introduced on the basis of GRU, as shown in Fig. 6(b).

Proposed method

Fig. 7 illustrates the flow chart of the proposed algorithm COVID-19Net.
Fig. 7

Flow chart of COVID-19Net.

Step 1: Required data are selected from the aforementioned data sources to perform correlation analysis and identify highly correlated data. Step 2: All data are divided into testing and training sets according to the data collection time. Step 3: Data are refactored into a matrix with six factors and five time steps named Input 1. Data are refactored into a matrix with three regions and five time steps named Input 2. Step 4: Because Input 1 is mainly used to extract temporal features, a 1D-CNN and BiGRU-based parallel deep learning network is employed. Because Input 2 is mainly used to extract spatiotemporal features and each country has a specific geographical location and order, a two-dimensional (2D) CNN is employed. The training set refines the models. Step 5: The testing set is used to test the models and calculate the mean absolute error (MAE), mean absolute percentage error (MAPE), and root mean square error (RMSE). Flow chart of COVID-19Net. The COVID-19Net algorithm proposed in this study is parallelly connected using a 1D-CNN [40], a 2D-CNN [41], and BiGRUs to form a mixed deep learning network. Fig. 8 illustrates this framework.
Fig. 8

Framework of COVID-19Net.

Framework of COVID-19Net. COVID-19Net was used to process spatiotemporal and temporal data separately. Because Input 1 contained data from each country regarding the daily number of newly confirmed cases, deaths, and recovered cases and the accumulated number of these cases in the past 5 days, temporal features related to the accumulated number of confirmed cases were extracted from these factors. Given the outstanding performance of BiGRUs in learning from time series data, 1D-CNN is combined with BiGRU to extract temporal features from Input 1, as shown in the left branch in Fig. 7. The highly contagious nature of COVID-19 and frequent population flows among European countries may result in a strong spatial correlation between their pandemic trends. Therefore, extracting spatial features considerably increased the accuracy of the prediction model. This indicated that features related to changes in the accumulated number of confirmed cases in each country are highly crucial. Based on the parallelly combined 1D-CNN and BiGRUs used for extracting temporal features from Input 1, a 2D-CNN was constructed to extract the spatiotemporal features of the three countries from Input 2, as shown in the right branch in Fig. 7. The 1D-CNN had 16, 32, 32, 64 convolutional kernels, respectively, each with a kernel size of 3. The size of the corresponding maximum pooling layer was 2. In the 2D-CNN model, the two convolutional layers had 32 and 64 convolution kernels, respectively, and the sizes of the convolution kernels were 3 × 3 and 4 × 1, respectively. Because the COVID-19 data employed in this study were insufficient, a dropout layer was used to prevent overfitting. The COVID-19Net algorithm proposed in this study parallelly combines CNNs and BiGRUs to concurrently process spatiotemporal and spatial features. The proposed algorithm can extract temporal and spatiotemporal features, which provides an alternative for researching data with both temporal and spatiotemporal correlation. In this experimental, COVID-19Net were coded using Tensorflow backend and trained using 16 GB memory with an Intel i7-9700K CPU and single NVIDIA RTX2080 GPU. The experimental environment is the ‘keras’ module in Python 3.7 (Table 1 ). There are about 65 433 parameters in the training step, 100 epochs, and each time takes about 100 s. The hyperparameters are selected by grid-search for each model as shown in appendix Table 5.
Table 1

Basic configuration parameters of the experimental platform list.

CPUIntel i7-9700K
GPU
NVIDIA RTX2080
Hard Disk Drive512 GB
Memory16 GB
OSWindows 10
EnvironmentPython 3.7
Basic configuration parameters of the experimental platform list.

Experimental and discussion

The MAE, MAPE, and RMSE equations used in this study are expressed as follows: The recovered cases confirmed cases and deaths indirectly reflect the local medical load, prevention, and control efforts, and they will have a more significant impact on the total confirmed cases. Hence, experimental data were collected from Germany, Italy, and Spain regarding six factors: the daily number of newly confirmed cases, deaths, and recovered cases and the accumulated number of confirmed cases, deaths, and recovered cases. Fig. 9 presents the correlation heatmaps of these factors. The results revealed that the accumulated number of confirmed cases was strongly correlated with other factors in Italy, with all correlation coefficients greater than 0.62 (Fig. 9(b)). Fig. 9(a) and (c) displays the correlation heatmaps of Germany and Spain, respectively. The pandemic situations in these countries were less severe than that of Italy. Therefore, the correlation between accumulated confirmed cases and daily recovered cases was low, with correlation coefficients of 0.78 and 0.65. However, the correlation between the remaining factors remained high, with correlation coefficients exceeding 0.54. The results verify the necessity of using these factors as one input of the COVID-19Net parallel network. It also suggests that people can analyze these factors to evaluate whether the government has done enough to prevent and control the disease.
Fig. 9

Correlation heatmaps of (a) Germany, (b) Italy, and (c) Spain.

Correlation heatmaps of (a) Germany, (b) Italy, and (c) Spain. A separate correlation heatmap was created for the accumulated number of confirmed cases in the three countries (Fig. 10 ). The results indicated correlation coefficients greater than 0.99. The correlation is striking, and this shows that it is necessary to extract the spatial characteristics of the epidemic, this is a reason to use it as other input to the COVID-19Net parallel network. The geographical proximity of these countries and their frequent interaction lead to faster disease transmission. Hence, if the epidemic is to be contained, some travel restrictions should be applied, and social distance should be kept.
Fig. 10

Correlation heatmaps of the accumulated number of confirmed cases in Germany, Italy, and Spain.

Correlation heatmaps of the accumulated number of confirmed cases in Germany, Italy, and Spain. This study used a CNN, a GRU, a CNN-GRU, and COVID-19Net to predict the accumulated number of confirmed cases in Germany, Italy, and Spain (Fig. 11 ). The results showed that the predictions produced by CNN-GRU were the least accurate and could not reflect any features or patterns. GRU showed an unstable growth trend; therefore, it could not be used as a reliable reference. The prediction results produced by COVID-19Net show better than CNN-GRU and GRU. This was because the three countries have obvious spatial characteristics in geographical location, and it was unreasonable to consider only the time factor. Overall, the prediction result produced by the proposed COVID-19Net model was more accurate than that of the other models, and more precise predictions realized. It can provide a reference for the government to arrange resources such as the number of hospital beds, masks, etc., and for enterprises to make production plans and response plans during the epidemic.
Fig. 11

Results produced using the CNN, GRU, CNN-GRU, and COVID-19Net to predict the accumulated number of confirmed cases in (a) Germany, (b) Italy, and (c) Spain.

Results produced using the CNN, GRU, CNN-GRU, and COVID-19Net to predict the accumulated number of confirmed cases in (a) Germany, (b) Italy, and (c) Spain. Table 2, Table 3, Table 4 list the MAE, MAPE, and RMSE values of the three countries obtained from the various models. Comparing these values verified that the proposed model was considerably more accurate than the other models and exhibited an MAPE value greater than 3 for the three countries. This indicated that COVID-19Net can accurately predict the accumulated number of confirmed cases in each country. It also highlights the importance of spatial characteristics in prediction.
Table 2

MAEs of the three countries.

CountryCOVID-19NetCNNGRUCNN-GRU
Germany2241.1002561.0008301.20010425.100
Italy3376.5009033.40011689.80012962.600
Spain6247.20016509.30019228.40022626.300
Avg.3954.9339367.90013073.13315338.000
Table 3

MAPEs of the three countries.

CountryCOVID-19NetCNNGRUCNN-GRU
Germany1.4471.7065.3646.697
Italy1.8014.6916.0806.693
Spain2.8287.3858.76110.276
Avg.2.0254.5946.7357.889
Table 4

RMSEs of the three countries.

CountryCOVID-19NetCNNGRUCNN-GRU
Germany2549.0303179.5808772.43011941.700
Italy4051.9409381.49011996.60014035.600
Spain7183.00021118.80020824.90024770.300
Avg.4594.65711226.62313864.64316915.867
MAEs of the three countries. MAPEs of the three countries. RMSEs of the three countries. Fig. 12 presents the box-plots of test set forecasting errors based on (a) Germany, (b) Italy, (c) Spain's COVID-19 prediction model. Judging from the level of the box plot, the average error of COVID-19Net is small. At the same time, the boundary range formed by the upper quartile and the lower quartile indicates that the error range of COVID-19Net is the smallest, that is, the COVID-19Net model has a short fluctuation range and the model is more stable. It can see from the median line Q2 that the performance of the GRU and CNN-GRU models in this experiment is similar. Still, after comparing the boundaries of the box diagram, the GRU is more accurate than the CNN-GRU hybrid architecture. It can see that the proposed network architecture is sufficient to meet related computing requirements, and the performance is better than the deeper network architecture such as CNN-GRU. The residual test also is applied to test COVID-19Net; the results of the three countries in Fig. 12 are 0.378, 0.582, and 0.326, respectively. These indicate that COVID-19Net is qualified to predict the accumulative number of confirmed cases.
Fig. 12

The box plot of absolute error based on (a) Germany, (b) Italy, (c) Spain's COVID-19 prediction model.

The box plot of absolute error based on (a) Germany, (b) Italy, (c) Spain's COVID-19 prediction model. Fig. 13 presents the bar chars of the MAE, MAPE, and RMSE values generated from COVID-19Net, CNN, GRU, and CNN-GRU for Germany, Italy, and Spain. Fig. 13(a), (b), and 13(c) respectively correspond to Table 2, Table 3, Table 4. The results indicated that the CNN-GRU model was the least accurate, and the GRU and CNN exhibited less favorable performance than did COVID-19Net. The models in descending order of prediction accuracy were COVID-19Net, CNN, GRU, and CNN-GRU, which verified that the proposed algorithm was the most accurate.
Fig. 13

Bar charts of the (a) MAE, (b) MAPE, and (c) RMSE values derived from the prediction results of the three countries by using COVID-19Net, CNN, GRU, and CNN-GRU.

Bar charts of the (a) MAE, (b) MAPE, and (c) RMSE values derived from the prediction results of the three countries by using COVID-19Net, CNN, GRU, and CNN-GRU. Above all, it is necessary to analyze six factors to evaluate whether the government has done enough to prevent and control the disease. If the epidemic is to be contained, some travel restrictions should be applied, and social distance should be kept. And It is necessary to extract the spatial characteristics of the epidemic to improve the accuracy of prediction. COVID-19 poses considerable challenges to healthcare systems worldwide. By using the proposed algorithm, predicted that the demand for ward beds and ICU beds in Germany, Italy, and Spain would increase substantially, particularly before the pandemic peaks. If problems from lacking healthcare resources and social distancing cannot be resolved, demand will increase. COVID-19 may overwhelm the capacity of hospitals, particularly ICU nurses. The predicted values produced in this study can help countries develop and implement disease prevention measures and reduce gaps between the strategies employed by countries, including reducing services unrelated to COVID-19 prevention and temporarily increasing the capacity of the healthcare system. Based on the estimation results of Zhang et al. [26], It can be concluded that COVID-19 will stabilize after the beginning of June (about four weeks), during which healthcare resources will be in heavy demand. However, this demand will also depend on social distancing measures implemented and other measures already imposed by each country. During this pandemic, relevant disease prevention measures must be maintained, and the importance of these measures must be highlighted to reduce the deaths of civilians and healthcare personnel. COVID-19 also have a severe impact on economic, social, and production in the world. COVID-19Net can provide a reference for enterprises to develop production plans and response plans during the epidemic. It helps enterprises seize opportunities and face challenges.

Conclusions

The likelihood of flattening the epidemic curve, as discussed in Western media, is overly optimistic because this entails no increase in COVID-19 cases. Currently, only China claims to have achieved this after implementing 2 months of lockdowns and strict measures. In this study, COVID-19Net is proposed to predict the accumulated number of confirmed cases in Germany, Italy, and Spain, which are heavily affected by the pandemic. The accumulated numbers of confirmed cases, deaths, and recovered cases and the daily numbers of newly confirmed cases, deaths, and recovered cases in the past 5 days were used to predict accumulated confirmed cases the next day. By comparing the prediction results and evaluation indicators of COVID-19Net, CNN, GRU and CNN-GRU, it could be verified that the accuracy of COVID-19Net was the best one. The MAPE values of the three countries were 1.447, 1.801 and 2.828, respectively. This means that if you only considered time series factors, you could not extract features from the data of these countries. The effect of the hybrid model CNN-GRU was also very poor. Although the accuracy of CNN was slightly higher than that of GRU and CNN-GRU, it was still impossible to predict the cumulative number of confirmed cases in these three countries. COVID-19Net was verified to be more accurate than the other three models because COVID-19 trend data contain spatiotemporal features that can be extracted using the deep neural network of COVID-19Net. The results of this study can not only serve as an essential reference for devising public health strategies against COVID-19. The proposed model provides to improve the allocation of hospital resources but also a reference for enterprises to develop production plans and response plans during the epidemic. It helps enterprises seize opportunities and meet challenges. The experimental results indicate that it is necessary to analyze six factors to evaluate whether the government has done enough to prevent and control the disease. If people want to control the epidemic effectively, people should implement stay home isolation, travel restrictions and keep their social distance. Some researchers consider that the temperature will affect the spread of COVID-19 [42]. The coronavirus also has some mutations, along with the development of the outbreak. The prevention and control efforts will continue to strengthen. The explosion of circulation from the space will reduce relatively, so the space correlation will also have a certain reduce these will affect the learned pattern. In future work, the research will carry out on these aspects.

CRediT authorship contribution statement

Chiou-Jye Huang: Conceptualization, Data curation, Funding acquisition, Methodology, Resources, Software, Supervision, Validation, Visualization, Writing - original draft, Writing - review & editing. Yamin Shen: Data curation, Formal analysis, Software, Validation, Writing - original draft, Writing - review & editing. Ping-Huan Kuo: Methodology, Project administration, Resources, Software, Writing - original draft, Writing - review & editing. Yung-Hsiang Chen: Investigation, Validation, Writing - original draft, Writing - review & editing.
Table 5

Basic parameter table of each model.

ModelCOVID-19NetCNNGRUCNN-GRU
Connection TypeParallel (spatio & temporal)SerialSerialSerial
InputThree countries (5,3) & Six factors (5,6)Six factors (5,6)Six factors (5,6)Six factors (5,6)
No. of layers5 & 7636
Other hyper-parameters(2-D Convolution filters: 32, 64) & (1-D Convolutional filters: 16, 32, 32, 64)GRU cells: 60 & 0Convolution filters: 16, 32, 32, 64GRU cells: 16, 16Convolution filters: 16, 32, 32, 64GRU cells: 16
  25 in total

1.  Early Prediction of the 2019 Novel Coronavirus Outbreak in the Mainland China Based on Simple Mathematical Model.

Authors:  Linhao Zhong; Lin Mu; Jing Li; Jiaying Wang; Zhe Yin; Darong Liu
Journal:  IEEE Access       Date:  2020-03-09       Impact factor: 3.367

2.  Backcalculating the Incidence of Infection with COVID-19 on the Diamond Princess.

Authors:  Hiroshi Nishiura
Journal:  J Clin Med       Date:  2020-02-29       Impact factor: 4.241

3.  On the Coronavirus (COVID-19) Outbreak and the Smart City Network: Universal Data Sharing Standards Coupled with Artificial Intelligence (AI) to Benefit Urban Health Monitoring and Management.

Authors:  Zaheer Allam; David S Jones
Journal:  Healthcare (Basel)       Date:  2020-02-27

Review 4.  Internet of things (IoT) applications to fight against COVID-19 pandemic.

Authors:  Ravi Pratap Singh; Mohd Javaid; Abid Haleem; Rajiv Suman
Journal:  Diabetes Metab Syndr       Date:  2020-05-05

5.  Serial interval of novel coronavirus (COVID-19) infections.

Authors:  Hiroshi Nishiura; Natalie M Linton; Andrei R Akhmetzhanov
Journal:  Int J Infect Dis       Date:  2020-03-04       Impact factor: 3.623

6.  Reverse Logistics Network Design for Effective Management of Medical Waste in Epidemic Outbreaks: Insights from the Coronavirus Disease 2019 (COVID-19) Outbreak in Wuhan (China).

Authors:  Hao Yu; Xu Sun; Wei Deng Solvang; Xu Zhao
Journal:  Int J Environ Res Public Health       Date:  2020-03-09       Impact factor: 3.390

7.  Real-time forecasts of the COVID-19 epidemic in China from February 5th to February 24th, 2020.

Authors:  K Roosa; Y Lee; R Luo; A Kirpich; R Rothenberg; J M Hyman; P Yan; G Chowell
Journal:  Infect Dis Model       Date:  2020-02-14
View more
  6 in total

1.  Random-Forest-Bagging Broad Learning System With Applications for COVID-19 Pandemic.

Authors:  Choujun Zhan; Yufan Zheng; Haijun Zhang; Quansi Wen
Journal:  IEEE Internet Things J       Date:  2021-03-17       Impact factor: 10.238

2.  Deep Spatiotemporal Model for COVID-19 Forecasting.

Authors:  Mario Muñoz-Organero; Paula Queipo-Álvarez
Journal:  Sensors (Basel)       Date:  2022-05-05       Impact factor: 3.847

3.  Spatio-temporal characteristics and control strategies in the early period of COVID-19 spread: a case study of the mainland China.

Authors:  Jiachen Ning; Yuhan Chu; Xixi Liu; Daojun Zhang; Jinting Zhang; Wangjun Li; Hui Zhang
Journal:  Environ Sci Pollut Res Int       Date:  2021-04-27       Impact factor: 4.223

4.  COVID-19 lockdowns and air quality: Evidence from grey spatiotemporal forecasts.

Authors:  Mingyun Gao; Honglin Yang; Qinzi Xiao; Mark Goh
Journal:  Socioecon Plann Sci       Date:  2022-01-11       Impact factor: 4.641

5.  Editorial for the special issue on "Transforming public health through artificial intelligence, machine learning and internet of things".

Authors:  Sang-Bing Tsai; Tao Wu; Chia-Huei Wu; Xiaohang Yue
Journal:  Socioecon Plann Sci       Date:  2022-02-15       Impact factor: 4.923

6.  How heterogeneous is the dengue transmission profile in Brazil? A study in six Brazilian states.

Authors:  Iasmim Ferreira de Almeida; Raquel Martins Lana; Cláudia Torres Codeço
Journal:  PLoS Negl Trop Dis       Date:  2022-09-12
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.