Literature DB >> 34794082

Forecasting COVID-19 recovered cases with Artificial Neural Networks to enable designing an effective blood supply chain.

Ertugrul Ayyildiz1, Melike Erdogan2, Alev Taskin3.   

Abstract

This study introduces a forecasting model to help design an effective blood supply chain mechanism for tackling the COVID-19 pandemic. In doing so, first, the number of people recovered from COVID-19 is forecasted using the Artificial Neural Networks (ANNs) to determine potential donors for convalescent (immune) plasma (CIP) treatment of COVID-19. This is performed explicitly to show the applicability of ANNs in forecasting the daily number of patients recovered from COVID-19. Second, the ANNs-based approach is further applied to the data from Italy to confirm its robustness in other geographical contexts. Finally, to evaluate its forecasting accuracy, the proposed Multi-Layer Perceptron (MLP) approach is compared with other traditional models, including Autoregressive Integrated Moving Average (ARIMA), Long Short-term Memory (LSTM), and Nonlinear Autoregressive Network with Exogenous Inputs (NARX). Compared to the ARIMA, LSTM, and NARX, the MLP-based model is found to perform better in forecasting the number of people recovered from COVID-19. Overall, the findings suggest that the proposed model is robust and can be widely applied in other parts of the world in forecasting the patients recovered from COVID-19.
Copyright © 2021 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Artificial neural networks; Blood supply chain; CIP Therapy; COVID-19; Forecasting

Mesh:

Year:  2021        PMID: 34794082      PMCID: PMC8590479          DOI: 10.1016/j.compbiomed.2021.105029

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


Introduction

After being seen for the first time in December 2019 in Wuhan, China, COVID-19 became a pandemic within a matter of months [1,2]. Reportedly for China, 81% of the patients showed mild symptoms during the recovery period and the mortality rate was 2.3%. Moreover, approximately 5% of the patients suffered more serious symptoms, including respiratory failure, septic shock, and multi-organ failure; in this group, the mortality rate reached 5% [3]. It is widely found that the spread of COVID-19 varies among men and women, different age groups, and those with underlying medical conditions [4]. In response to the pandemic, governments all over the world including Turkey have quickly taken a series of drastic measures to slow down its spread. A challenging task in developing effective infection prevention and control strategies to stop the spread of the virus is being able to forecast new cases, deaths, and recoveries. Many studies have already proven the effectiveness of various forecasting approaches in estimating daily new infections and deaths. However, the efforts dedicated to forecasting the recovery cases are relatively limited. Some patients can be treated using the plasma received from recovered patients who are reportedly able to donate plasma after 14 days [[5], [6], [7]]. To facilitate the process of promptly transferring the plasma from recovered patients to new COVID-19 patients, developing an effective supply chain system is crucial. Therefore, accurately forecasting the number of recovered people as potential donors is key in developing such a system. As such, this study introduces a forecasting model to estimate the number of recovery cases for effective plasma donation in CIP therapy that is a treatment of COVID-19. The proposed model is based on the Artificial Neural Networks (ANN) and hypothesized to achieve a high prediction rate. Research to develop vaccines and drugs for COVID-19 is ongoing and no effective therapy or antiviral drug has been found yet. At this point, a passive antibody therapy CIP that is currently applied to COVID-19 patients is an option for COVID-19 therapy. In producing CIP, the liquid component (plasma) of whole blood is taken from recovered COVID-19 patients [8]. Better planning and execution of supply chain processes with plasma donation to patients of CIP therapy is crucial for effective management of the fight against COVID-19. It is aimed to develop a forecasting model in this study that will play a key role in designing an effective blood supply chain mechanism in the struggle with the treatment of this disease. Therefore, we focus on the problem of forecasting the number of people diagnosed with COVID-19 who have completed successful treatment (who have a negative COVID-19 test result) and now have the potential to donate plasma. Blood, plasma, or stem cell donations were studied in many dimensions before the pandemic. This issue is addressed from many different aspects, such as the need for immediate use, supply and storage of blood and other tissues, as well as the psychological aspects of people's reluctance to donate blood, misdirection, and anxiety about getting sick. The severe progress of COVID-19 in Turkey, as in all other countries, accelerates the studies focusing on the elimination of this disease as soon as possible; and at this point, CIP therapy stands out as one of the most effective treatments to tackle the disease [[8], [9], [10], [11]]. Recovered patients can donate plasma through the Red Crescent and pandemic hospitals in Turkey. At this point, the number of recovered patients must be already determined. The donation process of blood and other tissues is a vital problem even in pre-pandemic periods. In this context, it clearly shows that proper planning of the plasma donation process under pandemic conditions will contribute significantly to the literature.

Related works

The impacts of the COVID-19 pandemic on the status quo have been attracting researchers' attention. The researchers focus specifically on forecasting cases of COVID-19 for death and infections. In this section, a systematic literature review is conducted, which adopts the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) to gather literature by reviewing the studies focusing on forecasting of COVID-19 cases. PRISMA is a systematic approach developed by David Moher to review literature [12]. The PRISMA approach has five steps in literature search: defining criteria, identifying sources, selecting literature, collecting data, and selecting data items [13]. We adopt the PRISMA approach as Systematic Literature Review Method (SLR) to minimize bias in the review process and make it more systematic [14]. The literature search was performed from January 9, 2020 to August 1, 2021, with the keywords used shown in Table 1 .
Table 1

Studies found in literature search.

DatabaseDetails of the SearchNumber of studies
SCOPUS(TITLE-ABS-KEY ( covid-19) AND TITLE-ABS-KEY (prediction) AND TITLE-ABS-KEY ("artificial neural network") )145
(TITLE-ABS-KEY ( covid-19) AND TITLE-ABS-KEY (estimation) AND TITLE-ABS-KEY ("artificial neural network") )21
(TITLE-ABS-KEY ( covid-19) AND TITLE-ABS-KEY (forecasting) AND TITLE-ABS-KEY ("artificial neural network") )82
Studies found in literature search. A total of 248 papers is found as a result of the search in the Scopus database with the keywords shown in Table 1. Later, papers directly related to this study are determined by specifying both inclusion and exclusion criteria for the studies found. Thus, it is planned to reveal similar studies that can form a background for the approach adopted in the study and compare the inferences. Table 2 shows the inclusion and exclusion criteria for the papers.
Table 2

Inclusion and exclusion criteria for the papers in the literature search.

Inclusion CriteriaExclusion Criteria
The studies include forecasting implementation for COVID-19 casesStudies whose full text could not be reached
The studies include forecasting implementation for COVID-19 deathsStudies that do not explicitly mention the method used and the results
The studies include forecasting implementation for COVID-19 recoveriesStudies are adopting different artificial intelligence approaches in the prediction of COVID-19 cases
Studies using up-to-date forecast data on COVID-19 and taking advantage of the ANN methodStudies written in languages other than English
Inclusion and exclusion criteria for the papers in the literature search. When the studies in the literature search are filtered with the criteria in Table 2, it is seen that 26 studies are similar to the methodology adopted in this paper. To reveal the differences between these studies with our paper and to determine their contributions, a summary table has been presented in Table 3 .
Table 3

The literature that adopted the ANN approach for COVID-19 forecasts.

#AuthorsAimInput variablesContributionsLimits
1Ahmad and Asad [15]Predicting of coronavirus COVID-19 casesActual confirmed casesActual DeathsActual RecoveriesA new ANN design is proposed to estimate the number of deaths, recovered and confirmed COVID-19 casesInsufficient number of input parameters collected from real life
2Bodapati et al. [16]Forecasting of Daily Cases, Deaths Caused, and Recovered CasesNumber of casesNumber of DeathsThe total number of people recovered from the virusAbility to integrate with an application that streams live data from government sites to view real-time graphs of dataExposure of the model to limited data
3Saba and Elsheikh [17]Predicting the different number of COVID-19 cases at the end of the epidemicConfirmed casesApplying ARIMA and NARANN methods to forecast COVID-19 casesNot making comparisons using different time series methods
4Istaiteh et al. [18]Predicting the COVID-19 cases in each country all over the worldConfirmed casesUsing spatio-temporal forecasting for 189 countries worldwide to predict COVID-19 casesEncountering high error rates with the methods adopted for some countries
5Al-qaness et al. [19]Forecasting the number of confirmed cases of COVID-19 in Brazil and RussiaConfirmed casesUsing an enhanced version of marine predators algorithm (MPA), called chaotic MPA to improve ANFIS performance and avoid the shortcomings of traditional ANFISHaving longer computation time of the proposed method than the compared methods
6Abbasimehr and Paki [20]Predicting COVID-19 confirmed casesConfirmed casesUtilizing Bayesian Optimization in determining estimation parameters and adopting a multi-output modeling approachExposure of the model to limited data
7Elsheikh et al. [21]Forecasting the number of total confirmed cases, total recovered cases, and total deaths in Saudi ArabiaTotal number of confirmed casesTotal recoveriesTotal deathsUsing seven different statistical evaluation criteria in the estimation accuracy of the model (RMSE, R-squared, MAE, efficiency coefficiency, overall index, coefficient of variation, and coefficiency of residual mass)Using a limited number of data
8Hamadneh et al. [22]Forecasting the number of confirmed and recovered cases of COVID-19The requested dateDetermining the parameters of the MLPNN using the prey-predator algorithm (PPA)There is no comparative analysis to compare the results
9Moftakhar et al. [23]Predicting the number of patientsThe observed number of newly infected casesApplying ANN and ARIMA models for predictionModels cannot be trained well due to few observations and the Inability to evaluate any risk factor for this disease due to insufficient data on demographic information and social networks of patients.
10Ünlü and Namlı [24]Predicting COVID-19 confirmed cases and deaths in seven countriesConfirmed casesDeathsApplying e Support Vector Machines (SVM), Holt-Winters, Facebook's Prophet, and Long-Short Term Memory (LSTM) for forecastingUsing only RMSE in the interpretation of results
11Niazkar and Niazkar [25]Estimating the confirmed cases of COVID-19 in China, Japan, Singapore, Iran, Italy, South Africa, and the United States of America.Chronological data of confirmed and death casesApplying fourteen ANN-based models to predict the COVID-19 outbreakUsing a limited number of data
12Hamadneh et al. [26]Forecasting the number of total confirmed cases, total recovered cases, and total deaths in Brazil and MexicoThe requested dataUsing ANN to estimate the number of cases of COVID-19 with prey predator algorithm (PPA)There is no comparative analysis to compare the results
13Toga et al. [27]Predicting the infected cases, the number of deaths, and the recovered cases with ARIMA and ANN in TurkeySusceptible casesDaysCurfewsLaboratory testsDetermining the susceptible case number for each day after the first recovered case
14Kumari and Toshniwal [28]Forecasting the COVID-19 outbreak in India with ANNCumulative confirmed, new, and cumulative deceased cases recorded dailyUtilizing a mathematical curve fitting model to understand the performance of the proposed modelNot conducting a comparative analysis
15Conde-Gutiérrez et al. [29]Estimating the cumulative number of deaths from COVID-19 in MéxicoCumulative number of deathsComparing ANN with Gompertz model in estimating the number of deaths
16Safi and Sanusi [30]Predicting the total confirmed cases and the recovery or death rate worldwideTotal casesDeath ratesApplying the ARIMA, ETS, ANN models, and the hybrid combination of the three models.Using data around the world and not separating regions or countries by region (??)
17Wieczorek et al. [31]Forecasting number of cases each day worldwideInformation from the last 12 days plus geolocation coordinates of latitude and longitude have an impact on the cases correlations between neighboring countriesProposing a model, which can work as a part of an online system as a real-time predictor to help in the estimation of COVID-19 spread worldwideUsing a limited number of data
18de Barros Braga et al. [32]Estimating the daily and cumulative cases and deaths caused by COVID-19 and demand for hospital bedsCumulative casesCumulative deathsMunicipal demography Occurrence dateName of State's municipalities Names of the health regionsTraining ANN with data from 6 different moments for providing the ability to evaluate the forecasting qualityUsing a limited number of data
19Shetty and Pai [33]Forecasting the number of infected casesDaily reported casesApplying a fast training algorithm that is Extreme Learning machine to reduce the training time and using cuckoo search (CS) algorithm to select the parametersNot conducting a comparative analysis
20Tamang et al. [34]Estimating the number of rising cases and deaths in India, the USA, France, and the UKThe number of daysPresenting intelligent based optimum curve fitting and forecasting for different non-linear modelsNot conducting a comparative analysis
21Melin et al. [35]Predicting confirmed and COVID-19 deaths for 26 countriesThe confirmed and deathsAdopting the firefly algorithm for ensemble neural network optimization to COVID-19 time series prediction with type-2 fuzzy logic in a weighted average integration methodNot conducting a comparison with different fuzzy extensions to measure the performance of the adopted model
22Ardabili et al. [36]Estimating the cumulative number of cases for five countriesCumulative number of casesDemonstrating a comparative analysis of machine learning and soft computing models in forecasting the COVID-19 outbreakNot applying comparative studies on various machine learning models for individual countries.
23Ahmar and Boj [37]Predicting infection fatality rate of COVID-19 in Brazil using NNAR and ARIMATotal data of confirmed casesTotal data of death of COVID-19Presenting Neural Network Time Series (NNAR) and ARIMA to Forecast Infection Fatality Rate (IFR)Using a limited number of dataUsing only MSE as a performance measureNot specifying which approach is used in parameter selection
24Ardabili et al. [38]Estimating the cumulative number of cases for five countries total and daily cases worldwideTotal casesDaily casesProposing artificial neural network-integrated grey wolf optimizer for COVID-19 outbreak estimationsNot conducting a comparative analysis
25Alsmadi et al. [39]Forecasting the cumulative cases of COVID-19 for the four Canadian provincesThe cumulative number of infected casesApplying three models, smooth transition autoregressive (STAR) models, neural network (NN) models, and susceptible-infected-removed (SIR) models for forecasting the casesData reliability problem
26de Oliveira et al. [40]Estimating the number of confirmed cases and deaths of COVID-19 for Brazil, Portugal, and the United StatesThe daily cumulative number of cases and deathsComparing different training functionsNot conducting a comparative analysis
The literature that adopted the ANN approach for COVID-19 forecasts. In addition, the input data used in these studies and the limitations of these studies, as well as their contributions are summarized and a detailed perspective that can be used in a glance at the relevant literature is presented. When we examine the studies in Table 3, it has been observed that no analysis has been conducted to validate the results in many forecasting studies. Generally, many of them use much less data than the number of data in our study. In addition, no study has carried out as much detailed input research as in this study, and the problem has not been examined in detail from a multi-faceted perspective. In most of the studies, forecasting has been applied using only time-series data. However, in this study, many input candidates that can effectively forecast the number of recovered patients are identified and the inputs to be used in the proposed model are selected through analytical methods. The dataset in different countries is used to determine the effectiveness of the proposed method and it is concluded that the results of the method are reliable and useful for other countries. Besides, it is investigated whether the proposed model gives the best results by applying to ARIMA, LSTM, and NARX artificial intelligence tools for comparative analysis. As a result of the analysis carried out, it has been seen that the model proposed by adopting multi-layer perceptron (MLP) gives the best results according to the select key performance indicators (KPIs). Apart from the reviewed studies, our model using MLP forecasts the number of recovered patients to ensure the efficient plasma donation used for treating COVID-19 patients. Unlike existing studies using MLP, a large number of potential input variables are investigated. MLP is a valuable network structure to model complex nonlinear problems [41]. MLPs can handle large datasets for inputs. They also work well even with small datasets. MLP provides faster decision-making after network training [42]. Unlike other statistical models, MLP does not need probabilistic information about the dataset or assumptions regarding probability density functions. MLP also can approximate unknown functions. MLP is implemented in many studies to solve complex real-world problems due to its high self-adapting and self-learning ability [42]. In this paper, the forecasting process using the MLP model has been conducted in Turkey for COVID-19 patients surviving the disease. By estimating patients who recover daily, the number of people who can become donors in the treatment of plasma can be determined for people who have the disease. At this point, when the patients recovering are determined, important information such as the number of patients to donate for CIP treatment, which is currently used as an effective method in the treatment of the disease, will be obtained, and thus effective planning can be realized. This study is thought to contribute to the literature with the subjects can be listed below: The daily number of recovered patients is forecasted, and a new perspective that will contribute to the treatment process is presented for COVID-19. For the first time, apart from the number of COVID-19 patients or deaths, the number of recoveries, which can be used in planning the treatment process, is estimated. Forecasting of the number of recoveries from COVID-19 is modeled using an MLP structure with multiple inputs rather than a time series problem. The most influential factors to forecast the number of recoveries are determined using correlation analysis and causal relationships. The rest of the paper is organized as follows: The proposed methodology is presented in Section 3. Section 4 presents the data cleaning process and real case application of the proposed methodology for Turkey. The comparative analysis is presented in Section 5. Finally, the conclusions and future recommendations are given in the last section.

Proposed methodology

This study proposes a novel strategy to develop more accurate forecast models. The proposed strategy can be divided into three steps: (i) candidate input variables are determined, (ii) predictors are selected by correlation analysis and causal analysis, (iii) different hyperparameters are tested to find the best model. First, the relevant literature is reviewed, and various variables are defined as candidate input variables to determine the daily number of recovered COVID-19 cases. Then, the Pearson correlation coefficients for the candidate variables are calculated, and some candidate variables are eliminated. Thus, the predictors of the number of people who have recovered from COVID-19 are determined. After the predictors are determined, the values of the predictors are compiled, and a dataset is created. Finally, the most appropriate artificial neural network structure is determined using these data, and the daily number of patients recovering from COVID-19 is forecasted. The steps of the proposed method are given in Fig. 1 .
Fig. 1

The proposed methodology.

The proposed methodology.

Determining the predictors

Some predictors have little or no effect on forecast performance or may cause overfitting. Therefore, a predictor selection/reduction process is required for effective forecasting. Correlation analysis is one of the data pre-processing techniques that provides model simplicity and prevents overfitting [43]. Correlation analysis is a statistical method used to determine whether there is a linear relationship between two numerical measurements; the direction and severity of this relationship, if any [43]. In short, it shows whether the changes in variables affect each other (x and y). The correlation coefficient (r) is a value used to establish the relationship between variables, and this coefficient (Pearson's correlation) takes a value between −1 and 1 [44]. Pearson correlation coefficient can determine the level of the linear relationship between the variables [45]. The Pearson correlation coefficient (r) for two independent variables, x and y, is calculated in Equation (1).where represents the number of observations. If the coefficient is close to ±1, it can be said that there is a perfect correlation, which means as one variable increases, the other variable tends to increase (if positive) or decrease (if negative). If the coefficient value lies between ±0.50 and ± 1, it is said to be a strong correlation. When the value lies below + 0.29, then it is said to be a weak correlation. When the value is zero, it can be said that there is no linear correlation between the variables [46]. The predictor selection step can be applied to improve the accuracy of machine learning models such as support vector machine (SVM), NARX. It provides a more relevant, consistent, and robust input dataset to forecast target variable(s).

Artificial Neural Networks (ANNs)

ANNs are computer systems designed to automatically perform new capabilities such as producing and discovering new information through learning [47]. The ANNs can be trained with examples as human intelligence [48]. The more examples are shown in the network training, the higher the success of solving the problem [49]. The training of the network occurs with the acceptance of certain error values. Learning in a network can be considered as a change in the weight matrix [50]. ANNs are used to estimate output values considering input values for estimation. In this context, ANNs can be used in tasks such as forecasting, classification, data association, data interpretation, data filtering [51]. ANNs can simulate the performance of especially complex systems when available experimental data is limited [52]. ANN are nonlinear information and data processing systems. They are developed from processing units called neurons and the connections between neurons [53]. ANN can be trained by changing the weights of these connections to deal with nonlinear problems that cannot be handled with traditional analysis methods [54]. The three main components of the ANNs structure are the neurons, connections established between these neurons, and functions [55]. ANNs consist of layers; the input layer is the layer where data is entered into the network, and the hidden layer(s) is the layer that processes the data received by the network in the input layer. The number of these layers can be one or more. The output layer is the layer where the result is expressed as output after the data is processed in hidden layers.

Multi-layer perceptron

ANN can be either feed-back (recurrent) networks or feed-forward networks. Recurrent neural networks contain a self-connecting hidden layer, and the connections between nodes form a directed graph along a temporal sequence [56]. Recurrent neural networks contain feed-forward loops and feedback loops, which can store information. Increasing the number of weights increases the complexity of the structure for recurrent neural networks [57]. Perceptrons are arranged in layers in the feed-forward networks so that the first layer receives inputs, and the last layer produces outputs. In contrast, the middle layers have no connection with the outside, therefore, they are called the hidden layers [58]. The feed-forward network adopted in this study is MLP. MLP can solve complex nonlinear problems effectively and is generally applied for classification and forecasting [59]. Backpropagation is a supervised learning technique that is used in training MLP [60]. Backpropagation is a version of the Widrow–Hoff learning rule with multiple layers and differentiable nonlinear activation functions [61]. The backpropagation algorithm consists of the feed-forward phase, where the output of the network is determined, and the feed-back phase, in which the weights are updated by spreading the error back to reduce the gradient [62]. In this algorithm, the difference between the output and the target, the error, is gradually reduced to the minimum level by propagating it on all weights [63]. In the backpropagation algorithm, training starts with a random set of weights [64]. In the feed-forward phase, the inputs of the training set are presented to the input layer of the network. The input layer contains the neurons that receive these inputs. The neurons in the input layer transmit the input values directly to the hidden layer. Each neuron in the hidden layer calculates the total value by adding the threshold value to the weighted input values, and processes them with an activation function and transmits them to the next layer or directly to the output layer. After the net input of each neuron in the output layer is calculated by adding the threshold value to the weighted input values, this value is again processed with the activation function to determine the output values. The error value is calculated by comparing the output values of the network with the expected output values [54]. MLP has an input layer, one or more hidden layers, and an output layer composed of neurons. In this study, an MLP based methodology, which is one of the most known ANN models, is proposed. MLP is developed to express the nonlinear relationship between independent and dependent variables [65]. A basic MLP architecture contains layers of consecutive neurons. MLP is a feed-forward ANN model. Input data is transferred only from the input layer to the output layer. In MLP models, the backpropagation training method is generally used to minimize the error between the output of the network and the actual results of the training set [66]. The main function of backpropagation is to determine the most appropriate weight values using the gradient descent method to identify the most accurate weights used in ANN [67]. The feed-forward step generates a set of weights, then the weights are updated by the backpropagation of the error in each iteration when training an MLP [68]. Each neuron has incoming signals () into neurons through weighted () input connections. ANNs are trained with these weights. The signals, modified according to the weight changes, are summed in the neuron. The results proceed through the activation or transfer function used to modify the results before the signal becomes the output signal. Generally, there are multiple inputs () that enter the neuron. Input weights () are used to calculate the weighted sum () of inputs as given below [69]: The value becomes the input of the nonlinear activation function, and it is modified according to function. The bias is another input to the activation function. In this way, the bias and weighted sum form the inputs of the activation function [54]. Then, an activation function is applied to produce the output signal (), as follows:where denotes the activation function. Different activation functions can be used in this stage. Sigmoid function is one of the most used activation functions in feed-forward networks [70]. Sigmoid function is represented with [71]: The linear activation function is described as follows [72]: Hyperbolic tangent activation function is given in Equation (6) [73]: To calculate output ) of the MLP structure, the weight value between the hidden neuron and the output neuron () is multiplied by the value of the hidden neuron (). This value is used as in the determined activation function. In this study, the sigmoid function is used as the transfer function. The supervised learning method is one of the most used techniques to train networks in ANN. In supervised training, both input and output are provided to the network. Then the network processes the inputs and compares the outputs with the targets (desired outputs) [74]. A comparison is performed between the calculated output of the network and the target to determine the network error. The errors are propagated back to the network, and the weights controlling the neural network are updated [75]. This happens over and over again as the weights are constantly being changed. These weights are determined carefully to reach a minimum error or tolerance, as explained earlier. Besides, errors can be used to change network parameters to improve network performance. In other words, the inputs are assumed to be at the beginning and the outputs to be at the end of the causal sequence. Supervised learning is the most common form of neural network training [76]. The mean squared error (MSE) function is used as a loss function for the training and testing of the proposed ANN, whereas to improve the performance of ANN, the correlation coefficient (R) is used. In this study, the early stopping method is used as a regulation method to avoid overfitting [[77], [78], [79]]. Early stopping consists of breaking or stopping the network training when the loss function increases. Two basic rules are considered stopping conditions (i.e., loss function value and loss function change) [80]. The loss function, MSE, is evaluated for each iteration in training. When the value of the loss function begins to increase without decreasing for the next iterations, training stops in the next iteration with the lowest loss function value in the dataset [81]. Accordingly, if the generalization capacity of the model decreases, the training stops. The general flow of the training state of ANN can be shown in Fig. 2 .
Fig. 2

The training of ANN.

The training of ANN. While designing the ANN model, the number of hidden layers, the number of neurons in the layers, and the activation functions used can be different. So, determining the number of hidden layers, fixing the number of neurons in each hidden layer, applying a suitable training algorithm and activation functions play a vital role in achieving better network performance. All these values create the neural network. The number of neurons of each hidden layer affects network performance. The learning rate and momentum coefficient are also used to achieve minimum error, and similarly, they can be different to minimize error. In this study, trials with different network structures are created, and the best results are given in the following sections.

Forecast key performance indicators (KPIs)

Different error models are applied to evaluate the performances of the forecasting models. Through calculated metrics, the accuracy of the forecasting models is measured. The smaller the metric value means the higher the forecast accuracy. In the literature, MSE, root means square error (RMSE), mean absolute error (MAE) and mean absolute percentage error (MAPE) are used as forecasting performance evaluation metrics (KPIs) [[82], [83], [84], [85]]. Different KPIs are used in this study to evaluate the accuracy of the proposed methodology from various perspectives [86]. MSE can be adopted to determine the performance of our proposed model. The MSE can be calculated as in the following [87]:where shows the predicted value and y shows the true value where j = 1,2, …,m. RMSE is another KPI that can evaluate forecasting models. The mathematical expression for RMSE is given [88]: MAPE is one of the most common KPIs used to determine forecasting accuracy [89] due to its features such as scale independence and interpretability [90,91]. The MAPE is calculated as [92]: MAE determines the variance between two continuous variables [93]. Equation (10) is used to calculate MAE [94]: The KPIs define the difference between the output and target data [95,96]. Many forecasting networks proposed are based on the minimization of the MSE [[96], [97], [98], [99], [100], [101]]. Using the MSE as KPI, the performance of neural networks is measured using a qualitative approach based on the adjustment ratio of weights and MSE [95]. MAPE is also applied in many forecasting problems to examine the accuracy of the models [90,[102], [103], [104]]. For these reasons, we used MSE, RMSE, MAPE, and MAE as the KPIs for the forecast models in this paper.

A real case application for Turkey

In this study, the daily number of patients recovering from COVID-19 in Turkey is forecasted using ANNs. In this context, the variables predicted to affect the number of patients recovered have been determined by interviews with experts from the health sector and data published by the Ministry of Health in Turkey. The input variables are given in Table 4 .
Table 4

Input variables.

Candidate Input Variables
I-1Total tests
I-2Total cases
I-3Total death
I-4Intensive cares
I-5Intubated patients
I-6Total recovered
I-7Daily tests
I-8Daily cases
I-9Daily deaths
I-10Daily change in the intensive cares
I-11Daily change in the intubated patients
Input variables.

Predictor selection

After the experts determine the variables, the data of the variables are collected from the official sources of the Ministry of Health [105]. On March 10, 2020, the Ministry of Health of Turkey announced that a Turkish man who caught the virus while traveling to Europe was the first coronavirus case in the country. Then, on March 13, the Health Minister of Turkey explained that the coronavirus was detected in the relatives of the first patient, who was taken under observation. Making a statement after the Coronavirus Scientific Committee convened on March 27, 2020, the Minister of Health announced that 42 people recovered, 341 people were in intensive care, 2069 positive cases were detected in the last 24 h, and 17 people died due to COVID-19. The number of recovered patients was first announced on this date. Therefore, relevant data for the next 245 days (between March 27 and November 26) are collected and analyzed [105]. Firstly, the correlation coefficients of candidate input variables with the number of daily recovered patients are calculated using Equation (1) and given in Table 5 .
Table 5

Correlation coefficients with the output variable.

Input VariablesCorrelation Coefficient
I-1Total tests−0.021
I-2Total cases0.178
I-3Total death0.129
I-4Intensive cares0.602
I-5Intubated patients0.402
I-6Total recovered−0.159
I-7Daily tests0.414
I-8Daily cases0.255
I-9Daily deaths0.561
I-10Daily change in the intensive cares−0.513
I-11Daily change in the intubated patients−0.516
Correlation coefficients with the output variable. After determining correlation coefficients between all variables with the number of daily recovered patients, we try to decide which variables would be used in the forecasting process. Using highly correlated and the most influential input variables can give better forecast results [106]. If the correlation coefficient takes a value between −0.5 and 0.5, that means there is a low or negligible correlation between variables [43]. Therefore, variables that have a negligible effect (correlation coefficient between −0.5 and 0.5) on the number of forecasted daily recovered patients are excluded. So “I-4 Intensive cares”, “I-9 Daily deaths”, “I-10 Daily change in the intensive cares”, and “I-11 Daily change in the intubated patients” are determined as potential input variables in forecasting calculations. Then, the correlation coefficients among the remaining four input variables are calculated as given in Table 6 .
Table 6

Correlation coefficients among the determined input variables.

VariableI-4I-9I-10I-11
I-410.85860.10380.4403
I-90.858610.14020.4684
I-100.10380.140210.2333
I-110.44030.46840.23331
Correlation coefficients among the determined input variables. When we examine Table 6, mostly weak relationships appear between the variables. The absolute value of the correlation coefficient between two variables being greater than 0.9 indicates that the values of these two variables increase or decrease together. Hence, it can be reasonable to use one of these as the predictor. The correlation coefficients between individual input variables shown in Table 6 do not take values close to ±1 to eliminate one pair of the highly correlated variables [45]. In this analysis, we do not need to remove any input variables. After all data cleaning processes, we use four different input variables as predictors such as “I-4 Total intensive cares”, “I-9 Daily deaths”, “I-10 Daily change in the intensive cares”, and “I-11 Daily change in the intubated patients”. After correlation calculations, causal analysis is also performed to determine whether input variables had a causal relationship with the target variable and whether the model fits well. For this aim, we apply structural equation modeling (SEM) to analyze the relations. In this SEM, it is checked whether the inputs have a causal relationship with the target variable. For this aim, we construct the following hypothesis: I-4 variable has a causal relationship with the target variable I-9 variable has a causal relationship with the target variable I-10 variable has a causal relationship with the target variable I-11 variable has a causal relationship with the target variable Relationship analysis is conducted with the IBM AMOS program. The diagram obtained from the IBM AMOS program and results are given in Fig. 3 and Table 7 as follows:
Fig. 3

Relationship diagram between variables.

Table 7

SEM results.

Estimate (β1)Estimate (β2)S.E.C.R.P
Target Variable<---I-110.0381
Target Variable<---I-10−0.253−5.6650.927−6.113<0.001
Target Variable<---I-90.94829.7462.15613.795<0.001
Target Variable<---I-4−0.326−0.7210.152−4.748<0.001
Relationship diagram between variables. SEM results. Fig. 3 shows the inputs and outputs, arrows extending from inputs to outputs to test the hypothesis of whether inputs have an effect on output, error variables denoted by "e", which indicates the error of each factor. In addition, the relations of the errors with each other are also shown by arrows as a result of the analysis. In Table 7, β1 shows the standardized path coefficients, β2 shows the path coefficients, S.E. shows the standard estimates, C.R. shows the critical ratio, and p shows the marginal significance level in a hypothesis test representing the probability of a particular event occurring. In the structural model, 4 hypotheses identified as H1, H2, H3, and H4 are examined. The goodness of fit of the SEM model is also evaluated by the chi-square test and the goodness of fit indices, such as Root Mean Square Error of Approximation (RMSEA), goodness-of-fit index (GFI), and comparative fit index (CFI). According to the results obtained in the constructed structural model, it can be interpreted that the model is compatible since all fit indices are within the desired limits (RMSEA<0.08, CFI>0.9, GFI>0.9 [107]). It has also been observed that all inputs are related to the output variable. By checking the p-values in Table 5, it can be interpreted that all the inputs have a significant effect on the target variable.

Forecasting of the daily number of patients recovering from COVID-19 in Turkey

After determining the predictors to be used and the values of these predictors, these values are used in ANN as inputs for forecasting the number of recovered patients in Turkey for COVID-19 cases. In our network structure, the data of the “I-4 Total intensive cares”, “I-9 Daily deaths”, “I-10 Daily change in the intensive cares” and “-11 Daily change in the intubated patients” are adopted as input variables, and the actual number of “the Daily Number of Patients Recovering” is evaluated as the output variable. Fig. 4 shows the values taken by our input variables for the time interval specified for the forecasting process.
Fig. 4

The values of input variables.

The values of input variables. As shown in Fig. 4, the number of patients receiving intensive care varies between 500 and 2500 and shows an increasing trend. The number of daily deaths shows a steady trend between May and August, then increases after August. The daily change in intensive care and daily change in intubated patients do not have a trend and, the numbers take both positive and negative values. There are lots of spikes for these two input variables. The input and output variables are normalized according to Equation (11) before being used in the neural network. The normalized values of all variables take the value in the range [0,1].where and represent values before and after normalization of the variable, respectively. and represent the minimum value and maximum value of the variable, respectively. In our proposed methodology, n-day lag is used to make the n-day forecast. Therefore, the network is constructed using (245-n) samples. n is assumed to be 7, 10, and 14 to include medically justified recovery times [[108], [109], [110], [111], [112]]. The data of each input variable with an n day delay are used to forecast daily recovered patients. Once the data is compiled, the partition for training and test is applied. Furthermore, the first n-days period is excluded from data due to n-day lags. Subsequently, the remaining data is divided into two parts as training and test data. To achieve the best performance, one and two layers are tried as the number of hidden layers. The number of hidden neurons is also tested between 4 and 12 as the most commonly used values. The learning rate and momentum coefficient are tested starting from 0.1 increased by 0.05 up to 1. In this network structure, TRAINLM (Levenberg-Marquardt backpropagation) is used as a training function. The tangent sigmoid function and logarithmic sigmoid function are tried as transfer functions at the hidden layer(s) and pure linear function in the output layer. All combinations are made for 1000 epochs and run 100 times to test different starting weights. Finally, the most appropriate ANN network structure consists of two layers with eight and five hidden neurons in the first and second layers. The sigmoid function is used as a transfer function for each layer, and the pure linear function is used to determine the value of the output layer. The learning rate is determined as 0.7, and the momentum coefficient is determined as 0.45. The network structure used in this study is shown in Fig. 5 .
Fig. 5

The neural network structure used in this study.

The neural network structure used in this study. MLP models in this paper have been developed in MATLAB R2020b software environment with different parameter values. The k-fold cross-validation is used to avoid overfitting with the dataset. For this purpose, we randomly divide the training dataset into k-folds. One of these folds is used to test the network, and the remaining folds are used to train the network. This process is repeated k times for each fold. The MSE metric for each fold is calculated, and the cross-validation error is computed by averaging the MSE values. The proposed model with predetermined parameters is validated in this way, and 5-fold cross-validation is also used. To improve the accuracy of the proposed network structure, we performed the re-forecasting process by changing the rate of training data. We compare the forecast performance with respect to MSEs at different training and test percentages to determine the best combination of parameters. While calculating the MSE values, the network was run 100 times for each distribution, and randomness was kept by calculating the average values. The 80/20 training/test split gave the best forecasting performance in our case. In this way, the split of test and training data that gives the lowest error rate were successfully determined. We adopted MSE as a performance measurement technique in different studies based on ANN [[113], [114], [115], [116], [117]]. Then, the dataset is divided into training and test sets. This division is performed considering the first 80% of the initial data for the training step and 20% of the last data for the testing step. After determining the best network structure for the forecasting problem under consideration, we also used different time lags between the predictor data and the target variable. Additionally, we have added the last value of the daily number of patients recovering from COVID-19 as network inputs to better forecast performance. Table 8 summarizes the results of different input variable sets for the determined neural network structure. While calculating the forecast KPIs, the network was run 100 times for each model, and randomness was preserved by calculating average values.
Table 8

The results of MLP models.

InputsMSERMSEMAPEMAE
7 Day Lag271181659.063131.554
7 Day Lag with Last Value258761607.139112.993
10 Day Lag6019524513.245187.785
10 Day Lag with Last Value22526747420.342307.185
14 Day Lag33113157535.681484.2
14 Day Lag with Last Value56736075338.376497.279
The results of MLP models. After the forecasting process was performed, the best model was determined as 7-Day Lag with Last Value with respect to all KPIs. This model uses both 7-day lag and the last value of output to make the forecast. Fig. 6 shows the R-squared (R2) values of the proposed network structure and the predicted values of daily recovered patients; in other words, the ANN model outputs against the desired targets.
Fig. 6

R2 values for the training and tests.

R2 values for the training and tests. R2 is one of the most used statistical measures in forecasting models, which determines the proportion of the variance for a predicted output explained by input variables in a forecasting model. R2 evaluates the fit quality of the proposed model [94]. The formula to calculate R2 is expressed as follows:where shows the predicted value, and yj shows the true value (j = 1,2, …,m). When considering the relationship of all four predictors, the ANN model accurately explained 95.89% of the variation in the training data and 97.50% on the test data. The plots of dispersion of actual values and predicted values of daily recovered people are presented in Fig. 6 for training and test data sets. In Fig. 6, the two plots represent the training and testing data. The dashed lines show the perfect results, namely outputs = targets. The solid blue and red lines show the best fit linear regression line between outputs and targets for training and testing data, respectively. For this study, both the training and testing data indicate good fits. A small quantity number for outliers in the plots means that a perfect matching of the proposed model to the actual data. This model uses predictors and generates more complex equations to interpret them simply. Fig. 7 shows a comparison of the predicted and actual data for the test stage.
Fig. 7

The predicted and actual data for daily recovered patients in Turkey.

The predicted and actual data for daily recovered patients in Turkey. As can be seen from Fig. 7, there is not much difference between forecast and actual data. The biggest error was calculated for November 21, 2020. The forecast value was 3816 on the day November 21, 2020, when 4485 people were recovered. As shown in Fig. 7, there was a sudden increase in the number of recovered patients after November 6, 2020. The proposed network model has forecasted this change well, as shown, with an error rate of about 5%. Eventually, it can be inferred that the recovered patients forecasted for each day will be ready to donate plasma. After that, we forecasted the next 7 days (December 5 to 11) using the past 7 days' data in the proposed MLP structure. For example, to forecast the number of daily recovered people on December 5, we used the value of input variables on November 28. Thus, a 7-day forecast was performed. Fig. 8 demonstrates both the actual data for the last 3 days (from December 2 to December 4) and the forecasted data for the next 7 days from December 5 to December 11, 2020.
Fig. 8

Forecasting of next 7 days.

Forecasting of next 7 days. In this study, apart from forecasting only those who died or became infected by COVID-19, the daily number of recovered patients has been forecasted, and a perspective that would contribute to the treatment process has been presented. Therefore, it has an original and different perspective as compared to other COVID-19 forecasting studies. CIP therapy is currently the only known treatment method that contributes to treating severe COVID-19 cases. As a result of the proposed approach, appropriate strategies for CIP therapy will be developed with the estimated number of recovered patients, and more people can survive until the vaccine is available for COVID-19. With estimating the number of recovered patients, the number of people who can donate plasma will be determined to realize the correct planning and timely implementation of plasma donation. This proposed forecasting mechanism is verified for Turkey. More planned steps can be taken regarding the treatment of the disease in all countries of the world struggling with this disease, and thus an effective blood supply chain system can be developed.

Comparison and testing procedure for the proposed algorithm

For testing and comparing the proposed network structure, the number of deaths from COVID-19 across the country was forecasted. For this aim, the adopted methodology had been run from scratch. Firstly, the correlation coefficients of the candidate input variables with the number of daily death from COVID-19 were calculated using Equation (1) and given in Table 9 .
Table 9

Correlation coefficients.

Input VariablesCorrelation Coefficient
I-1Total tests0.677
I-2Total cases0.558
I-3Total deaths0.590
I-4Intensive cares0.830
I-5Intubated patients0.922
I-6Total recovered0.436
I-7Daily tests0.692
I-8Daily cases0.943
I-9Daily recovered0.489
I-10Daily change in the intensive cares0.249
I-11Daily change in the intubated patients0.605
Correlation coefficients. After the determination of correlation coefficients between all variables with the number of daily death from COVID-19, “I-1 Total tests”, “I-2 Total cases”, “I-3 Total deaths”, “I-4 Intensive cares”, “I-5 Intubated patients”, “I-7 Daily tests”, “I-8 Daily cases” and “I-11 Daily change in the intubated patients” were determined as potential input variables in forecasting calculations. After that, correlation coefficients among the remaining 8 input variables were calculated as given in Table 10 .
Table 10

Correlation coefficients among the determined input variables.

VariableI-1I-2I-3I-4I-5I-7I-8I-11
I-110.97290.97590.79680.86660.97370.51640.6023
I-20.972910.99450.71380.80560.95030.41460.5127
I-30.97590.994510.71740.83300.94380.45060.5311
I-40.79680.71380.717410.84280.81270.71990.4403
I-50.86660.80560.83300.842810.85880.83790.6453
I-70.97370.95030.94380.81270.858810.53600.5724
I-80.51640.41460.45060.71990.83790.536010.5208
I-110.60230.51270.53110.44030.64530.57240.52081
Correlation coefficients among the determined input variables. When we analyze Table 10, it is noticed that there is a solid relationship between the “I-1 Total tests”, “I-2 Total cases”, “I-3 Total deaths” and I-7 Daily tests”. Having such a high correlation coefficient means that the values of these four variables increase or decrease together. Hence, it can be reasonable to use one of them as the predictor. So we used only “I-1 Total tests” as a predictor among these four variables. After all the data cleaning processes, we determined five different input variables as predictors such as “I-1 Total tests”, “I-4 Intensive cares”, “I-5 Intubated patients”, “I-8 Daily cases”, and “I-11 Daily change in the intubated patients”. Once the predictors to be used and the values of these predictors were determined, these values were used in the same ANN structure used to forecast the number of daily recovered patients as inputs in forecasting the number of daily death from COVID-19 in Turkey. When calculating the MSE values, the network structures were run 100 times for each combination, and the average MSE values were calculated to eliminate the chance factor to ensure that the lowest MSE is achieved. Fig. 9 shows the comparison of the forecasted and actual data for the test stage.
Fig. 9

The forecasted and actual data for daily deaths.

The forecasted and actual data for daily deaths. As shown in Fig. 9, there is not much difference between the forecasted and actual data. The biggest error is calculated for November 20. The forecasted value is 185, and 177 people died on November 20. The MAPE is calculated as 5.967 for the test stage. As a result of this analysis, it was seen that our forecasting model could be used with different data, and successful results were achieved by testing it with the number of deaths. Our proposed network structure also gave a very low MAPE value in forecasting the number of deaths from COVID-19.

Robustness analysis

To confirm the robustness of the model presented, we analyzed the COVID-19 data of Italy and forecasted the number of recovered people. For this aim, we used data from February 24, 2020, to January 10, 2021, meaning 322 data points were used in the forecasting process. 77 of this data were used to test the network. The proposed ANN model had made very successful forecasting results with the data determined for Italy. The MAPE was calculated to be only 12.8. MSE and RMSE values were determined to be 1500 and 38.73, respectively. Fig. 10 shows the actual and predicted data for the test stages.
Fig. 10

The forecasted and actual data of recovered people from COVID-19 in Italy.

The forecasted and actual data of recovered people from COVID-19 in Italy. As can be seen from Fig. 10, the proposed forecasting model can be used for different countries. The biggest error was calculated for November 10. The forecast value was 12941, while 17734 people were recovered. As shown in Fig. 10, there were sudden changes, up and down, in the number of patients recovering after the first day of November. The proposed network model had forecasted this change well, as it is shown. As a result of the robustness analysis, the proposed forecasting model has again yielded successful results. It has been proven to be a model that can be used to determine strategies to combat COVID-19 and the flawless operation of the blood supply chain mechanism.

Comparative analysis

In this study, we forecasted daily recovered people from COVID-19 by MLP structure, unlike the current studies in the literature that use time series based forecasting methodologies, such as ARIMA, LSTM, NARX. In this section, the proposed MLP based forecasting methodology is compared with different forecasting methods to demonstrate its robustness. For this purpose, ARIMA (a time series-based method, and LSTM and NARX, which are two machine learning methods, are utilized in this study to forecast daily recovered people from COVID-19 in Turkey. The same dataset, designed after the predictor selection step, is used to compare the results with the proposed MLP application. The results were compared according to their KPIs. This section gives brief information and results about the methods employed in this study.

ARIMA

ARIMA is one of the most used techniques for parametric univariate time series modeling. ARIMA models are applied to non-stationary series but can also be converted to stationary series by difference-taking. ARIMA strongly assumes that future data values are linearly dependent on current and past data values, similar to other linear methods [118]. In this way, ARIMA provides highly accurate results in stationary time series forecasting. The ARIMA method uses autoregressive (AR) and moving average (MA) models. AR includes lagged terms, and MA includes lagged terms on the residuals or noise [119]. ARIMA is used stationary time series data with no missing values. So time series data can be modeled as stationary or can be transformed into stationary by differencing. The Dickey-Fuller statistics are used to test whether the series is stationary [120]. Thus, the letter ‘I’ (Integrated) in ARIMA means that the first-order difference is applied to transform time series to stationary. The general representation of the models is ARIMA (p, d, q). Here, p and q are the degrees of the AR model and MA model, respectively, and d is the degree of difference. The equation representing the ARIMA model for the time sequence is given in Equation (13). is a normal random variable white noise sequence with zero mean and variance σ2 and B is the backshift operator whose effect on the can be represented as . In this study, the Box-Jenkins approach is utilized in ARIMA models to find the most suitable time series model for the past values of the time series. The preliminary analysis of the daily recovered people data is performed using time plots of both the original time series and 1-differencing, as shown by Fig. 11 .
Fig. 11

Original series and 1-differencing.

Original series and 1-differencing. The time series used in the study was non-stationary according to the result of the Dickey-Fuller test with a test result of a p-value of 0.256, at the significance level of 0.05. The data reached stationarity with one order of differencing, as can be seen in Fig. 11. To construct the ARIMA model, the non-stationary data was transformed stationary by the 1-differencing method. Before further analysis using the Box-Jenkins approach, the data were transformed to achieve stationary time series data. After applying the 1-differencing method, the time series data became stationary. So d was determined as “1” in this study. The next step in the Box Jenkins approach was to identify whether the model needs any AR terms or not. A partial Autocorrelation plot can be helpful to determine the required number of AR terms, namely the value of p. Any autocorrelation in a stationary time series can be revised by adding enough p. So, the order of p was taken to be equal to many lags that cross the significance limit in the partial autocorrelation plot. Fig. 12 shows a partial autocorrelation plot of stationary time series.
Fig. 12

Partial autocorrelation plot.

Partial autocorrelation plot. Lag 1 crossed the significance limit (blue region), as can be seen in Fig. 12. So lag 1 was quite significant. Therefore, p could be fixed 1 in this time series. An autocorrelation plot can be used to determine the number of MA terms, namely q. Fig. 13 shows the autocorrelation plot of stationary time series.
Fig. 13

Autocorrelation plot.

Autocorrelation plot. Again, lag 1 crossed the significance limit (blue region), so q could be fixed as 1. Therefore, according to the Box Jenkins approach, the most suitable model was determined as ARIMA (1, 1, 1). The results of an ARIMA (1,1,1) model were estimated by maximum likelihood estimation and given in Table 11 .
Table 11

Parameter estimates of ARIMA (1, 1, 1) model.

VariableCoefficientStandard Errorz-statisticp-value
AR(1)0.46130.8840.5220.602
MA(1)−0.42950.899−0.4780.633
Constant15.714920.6790.7600.447

Akaike's Information Criterion (AIC) and Bayesian Information Criterion (BIC) values are determined as 3492 and 3506, respectively.

Parameter estimates of ARIMA (1, 1, 1) model. Akaike's Information Criterion (AIC) and Bayesian Information Criterion (BIC) values are determined as 3492 and 3506, respectively. To achieve the best performance, different values of “p” and “q” are tried. In this study, the ARIMA models are implemented in Python ARIMA. The results of the ARIMA for forecasting the daily recovered patients are presented in Table 12 . The best model with respect to MAPE is determined as (1,1,1) model. To make a fair comparison of the different ARIMA models, MSE, RMSE, MAPE, and MAE are calculated.
Table 12

The results of ARIMA models.

(p,d,q)MSERMSEMAPEMAE(p,d,q)MSERMSEMAPEMAE
(0,1,0)9284830512.579185.736(2,1,0)9265430412.501185.595
(0,1,1)9279530512.595185.790(2,1,1)9264330412.267185.449
(0,1,2)9263130412.572185.772(2,1,2)8586729314.051187.560
(0,1,3)9248830412.556185.317(2,1,3)8911429912.696185.313
(1,1,0)9279130512.578185.787(3,1,0)9254830412.445185.159
(1,1,1)9272430512.223185.498(3,1,1)9225930412.633185.857
(1,1,2)9261230412.292185.644(3,1,2)8899529812.764185.432
(1,1,3)9220030412.731186.211(3,1,3)8874729812.762184.982
The results of ARIMA models.

LSTM

Long short-term memory networks, often referred to as LSTM, are a special type of RNN that can learn long-term dependencies [121]. It is aimed to store and transfer the state information of the ANN while processing data in RNNs. However, as a result of continuous processing and transfer of state information, it is unlikely to be transferred without breaking long-term dependencies [122]. In other words, while short-term dependencies are transferred very successfully, sometimes long-term dependencies are not transferred effectively. LSTMs are designed to address long-term dependency issues. LSTM provides faster convergence for the training data and can identify long-term dependencies in the input data [123]. The LSTM module consists of 3 separate gates: forget gate, input gate, and output gate. Forget gate determines how much of the information is forgotten and how much is passed on to the next stage. In the next step, the input gate controls which information should be stored. Finally, the output gate determines which information to read and output. The equations for the forget gate, input gate, and output gate are given in Equations (14), (15), (16), respectively [124]. Where be the sigmoid function, are relevant weights in forget, input and output gate, denotes the previous output at time , is the input vector at time , and are bias neurons at the respective gate associated with each LSTM block. In this study, the sigmoid function is used as the transfer function. Then, Equation (17) is given for cell state: and represent the new and previous cell states. means the element-wise multiplication of the vectors. The equations for candidate cell state are given: In this study, tanh is used as an activation function. The LSTM is implemented in Python by using the Keras library with Tensorflow backend. We have the data of daily recovered patients in Turkey from March 28, 2020, to November 27, 2020. The first 80% of the time-series data is used to train the network, and the remaining data is used to test the network. Namely, 196 data is used for training, and 49 data is used for testing. Fig. 14 shows the training and test data used in this study.
Fig. 14

Input data of LSTM network.

Input data of LSTM network. Several parameters were considered in this study. 5, 10, 15, and 20 hidden neurons were tested in the hidden layer, and the time step of the input sequence (lag) was analyzed between 1 and 14 to achieve better performance, that means the number of daily recovered people in the next day was forecasted using past data of n time step. The “Adam” optimizer was used, and batch size was set to 4. The number of epochs was determined as 100 in the LSTM structure. The loss function was MSE in the LSTM application. Then forecast KPIs of each trial were calculated. The best performance was determined as 11 lags with respect to all performance measures. The results of LSTM implementation are presented in Table 13 .
Table 13

The results of LSTM models.

NeuronlagMSERMSEMAPEMAENeuronlagMSERMSEMAPEMAE
51565722387.971177.638151557542367.894175.740
52517442277.349168.283152482392207.286166.117
53827292889.619220.673153590892437.987181.683
5419746144417.096377.443154785602809.289215.793
55496442237.235164.46215523881148917.360401.676
56566412387.667179.532156520722287.207165.488
57644992548.208193.347157910283029.800234.754
58454942137.423156.76415815003638712.193299.410
5910012031610.768232.96915927067652019.087410.643
51035898259914.415374.843151024875449916.302403.890
51116862141114.747339.102151136692760619.189470.035
512657272568.143200.108151223327848313.260345.633
51337177661019.705502.980151329243454116.486422.905
51412158134910.931276.037151436973760816.818449.302
101454612138.856161.969201570702398.020178.793
102468162168.952167.084202503162247.289167.332
10312265535014.59279.596203705142669.224204.683
104483232208.822167.67204835122899.943227.505
105578322409.505177.153205817652869.722218.089
106628342519.351177.218206991303159.958221.806
107416452048.136154.6592071086653309.484224.204
108497482238.626162.76420812584835513.068289.266
109479212198.776170.75320915111038913.340314.699
10106355225210.018197.9632010847752919.522218.877
1011425192067.807149.97201112894335910.230262.864
1012558552369.031180.6472012802772839.397205.864
1013533832319.031181.973201316728140912.355317.922
1014693372639.152184.405201430199255018.122456.975
The results of LSTM models.

NARX

NARX networks are a kind of dynamic neural network model that gives successful results in nonlinear system modeling and time-series forecasting applications. Compared to traditional neural networks, NARX networks converge faster and demonstrate more effective learning. The values of the output signals depend on both the input signals and the historical values of output signals in a dynamic system [125]. Thus, NARX can provide more effective results than traditional neural networks [126]. NARX model for the time sequence is given in Equation (20). Let be the inputs, f be the nonlinear ambiguity function, and lastly d be the feedback. The NARX was implemented in MATLAB R2020b using the “narx net” structure in this study. Daily recovered patients were used as the target variable, “I-4 Intensive cares”, “I-9 Daily deaths”, “I-10 Daily change in the intensive cares”, and “I-11 Daily change in the intubated patients” were used as the input variables. The tangent sigmoid transfer function and linear transfer functions were used for the hidden layer and the output neuron. The number of epochs was determined as 1000 in the NARX structure. The dataset was divided into training and test sets. This division was performed considering 80% of the initial data for the training step and the remaining 20% of the data for the testing step. Additionally, the delay was analyzed between 1 and 7 with different numbers of neurons to achieve better performance. The best performance is achieved for 10 neurons with 5 delays. The results of NARX are presented in Table 14 .
Table 14

The results of NARX models.

NeuronDelayMSERMSEMAPEMAENeuronDelayMSERMSEMAPEMAE
316794226113.334164.800718616929427.684193.713
328285128817.843188.9107211521233933.117244.976
338487029114.715187.677735879524214.722167.026
345492923413.890147.664746324625114.444162.860
356629925714.155173.820757666727711.512152.969
367977928217.251200.7947611933934521.878240.404
3724698149723.515313.4837710846732920.154243.278
417602527618.575192.396817162126815.176207.234
427060626613.824175.397826649025814.280161.866
437964928214.052178.730836052124613.271166.725
447553927515.828189.546848073228419.862206.159
456601925716.383186.433854632521514.685140.594
467621827616.056198.327869002130021.696230.516
478272828814.551197.595874232220610.293139.212
517527227414.607192.396917426727320.446175.982
529752531226.906204.123926456925414.494150.948
535958924413.540163.058936121224713.261160.839
547760127918.307194.0399411499633919.474239.338
556523025515.994174.9699511955134625.606260.760
566176424914.511162.3769617882642325.254323.668
578671429413.433191.343976426825410.874157.449
618037928419.120178.4161016252225011.805175.982
627504927418.837184.1131026883026216.206180.345
6312009134732.092264.1691036858126215.155170.314
645989724514.039160.5841047378227215.579178.332
6511132333426.986257.0461055183422810.986146.103
667078826616.536187.9461066808726114.778170.311
6716100440121.611286.87810713481636720.565258.958
The results of NARX models. Table 15 summarizes the investigated parameters for each method. 16 different ARIMA models, 56 different LSTM structures, and 56 different NARX models were utilized to compare the proposed methodology results.
Table 15

Investigated parameters for each method.

MethodParameterValues
ARIMAp0, 1, 2, 3
d1
q0, 1, 2, 3
LSTM# of neurons5, 10, 15, 20
lag1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
NARX# of neurons3, 4, 5, 6, 7, 8, 9, 10
lag1, 2, 3, 4, 5, 6, 7
MLP# of layers1, 2
# of neurons4, 5, 6, 7, 8, 9, 10, 11, 12
learning ratestarting from 0.1 increased by 0.05 up to 1
momentum coefficientstarting from 0.1 increased by 0.05 up to 1
transfer functiontangent sigmoid, logarithmic sigmoid
Investigated parameters for each method. After four different models were applied to forecast daily recovered patients, their best performance results were compared and evaluated according to forecasting KPIs as given in Table 16 . For the LSTM and ARIMA applications, three different models could be determined as the best models in terms of different KPIs. In the NARX application, one model was determined as the best structure with respect to each KPI.
Table 16

The comparison of forecasting methods.

MethodStructureMSERMSEMAPEMAE
MLPTwo layers: 8 neurons, 5 neurons258761607.139112.993
LSTM10 neurons, 11 lags425192067.807149.970
LSTM10 neurons, 7 lags416452048.136154.659
LSTM15 neurons, 6 lags520722287.207165.488
NARX8 neurons, 7 delays4232220610.293139.212
ARIMA(1,1,1)9272430512.223186.651
ARIMA(2,1,2)8586729314.051187.560
ARIMA(3,1,3)8874729812.762184.982
The comparison of forecasting methods. As shown in Table 16, the proposed MLP model provides the best performance with respect to all KPIs. LSTM model with 15 hidden neurons and uses 7-day lag is determined as the second-best performance with the MAPE value of 7.207. However, ARIMA shows the worst performance with a 12.223 MAPE value as expected and all other KPIs. It can be said that the proposed forecasting method is very effective for forecasting daily recovered patients.

Conclusion

Nowadays, where COVID-19 still seriously affects the whole world, making forecasts about the number of cases and deaths is one of the most critical steps to take the necessary precautions and implement the right strategies to fight against the disease. However, besides these forecasts, it is crucial to estimate the number of patients recovering for CIP treatment, which is applied by taking plasma from recovered patients and transferring it to infected people. It is currently considered an effective treatment method in the fight against the disease. In this way, accurate planning can be done to accomplish blood supply between the infected people and donors, and it can create a smooth supply chain system for a perishable product like blood. The daily number of recovered people, who play a key role in planning plasma supply and distribution required for the implementation of CIP therapy is forecasted. This study introduces a dynamic forecasting model to the literature, which can be used daily, especially during emergencies. The proposed forecasting model is used with different datasets. With using this model, the number of people, who have been diagnosed with COVID-19, received treatment, and tested negative for COVID-19, is forecasted on a daily basis. For this aim, ANN approach is used to forecast the daily number of people recovered from COVID-19 to determine the number of donors in the treatment of the disease. The variables used in the forecasting are determined as a result of expert opinions and a literature review. Subsequently, the variables are analyzed and eliminated to be used as predictors by correlation analysis and causal analysis. Thus, a novel strategy is proposed to increase forecast accuracy by systematically determining the predictors in forecasting models. After the predictors are determined, their data is compiled, and the best ANN network structure is determined. The most suitable structure is found by trying many different structures. Besides, the robustness of the proposed model is evaluated by analyzing data from Italy for the number of patients recovered from COVID-19 and the number of deaths. This helped to validate and test the proposed model. As a result of the study, it is shown that ANN is an effective approach to forecast the daily number of people recovered from COVID-19. This study has a different point of view than other forecasting studies on COVID-19, and no similar study has been found in the literature to the best of the author's knowledge. In this regard, this study is thought to have a high contribution to the literature and practice, and it is believed that it would be a leading paper for the researchers and practitioners who want to study in this area. This paper can be a guide to establish a smooth and secure collection/distribution infrastructure on supplying plasma for giving a more effective response to the pandemic. It can also serve as a reference point for the uninterrupted flow of COVID-19 treatment and solutions that enable matching the highest number of donors with patients. In the future, the approach adopted in this paper can be used with updated data or different forecasting methods can be applied to compare the current results. Other forecasting methods may be applied to find even better forecasting results, or easier to use methodologies would be developed apart from ANN. A comparison of different ANN-based forecasting models would also be interesting. Support vector regression may also improve the accuracy of the proposed methodology as used in many studies [[127], [128], [129], [130], [131]]. With the increase in available data, time series-based forecasting models can be developed. The proposed forecasting approach could eventually help the decision-makers or managers in planning new pandemic strategies. Studying with a network trained on historical data of other pandemics can be helpful in the long term. To improve the developed predictor selection methodology, data collection and data selection methods based on big data using deep-learning technologies can be applied. This study can be improved with model selection for each input setting for time lags. Additionally, the training stage can be enhanced with a calculation of entropy in the input dataset. The forecasting methodology can be extended to other countries because of the same pandemic process and the treatment mechanism they need, including guidelines for predictor selection.

Funding

No funds, grants, or other support was received.

Ethical approval

Ethics committee approval is not required.

Availability of data and material

Not applicable.

Code availability

Not applicable.

Consent to participate

Not applicable.

Consent to publish

The authors confirm that the final version of the manuscript has been reviewed, approved, and consented to for publication by all authors.

Authors' contributions

Ertugrul Ayyildiz and Melike Erdogan developed the theoretical formulation, performed the analytic calculations, and performed the numerical analysis. Ertugrul Ayyildiz and Alev Taskin contributed to the sensitivity analysis and final version of the manuscript. Ertugrul Ayyildiz and Melike Erdogan drafted the manuscript. Alev Taskin supervised the work.

Declaration of competing interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  2 in total

1.  Comparative study of artificial neural network versus parametric method in COVID-19 data analysis.

Authors:  Anum Shafiq; Andaç Batur Çolak; Tabassum Naz Sindhu; Showkat Ahmad Lone; Abdelaziz Alsubie; Fahd Jarad
Journal:  Results Phys       Date:  2022-05-16       Impact factor: 4.565

2.  Study of Multidimensional and High-Precision Height Model of Youth Based on Multilayer Perceptron.

Authors:  Lijian Chen; Xinben Fan; Keji Mao; Amr Tolba; Fayez Alqahtani; Ahmedin M Ahmed
Journal:  Comput Intell Neurosci       Date:  2022-06-18
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.