Literature DB >> 36089985

An innovative ensemble model based on multiple neural networks and a novel heuristic optimization algorithm for COVID-19 forecasting.

Zongxi Qu^1,2, Yutong Li^1,2, Xia Jiang³, Chunhua Niu^1,2.

Abstract

During the global fight against the novel coronavirus pneumonia (COVID-19) epidemic, accurate outbreak trend forecasting has become vital for outbreak prevention and control. Effective COVID-19 outbreak trend prediction remains a complex and challenging issue owing to the significant fluctuations in the COVID-19 data series. Most previous studies have limitations only using individual forecasting methods for outbreak modeling, ignoring the combination of the advantages of different prediction methods, which may lead to insufficient results. Therefore, this paper develops a novel ensemble paradigm based on multiple neural networks and a novel heuristic optimization algorithm. First, a new hybrid sine cosine algorithm-whale optimization algorithm (SCWOA) is exercised on 15 benchmark tests. Second, four neural networks are used as predictors for the COVID-19 outbreak forecasting. Each predictor is given a weight, and the proposed SCWOA is used to optimize the best matching weights of the ensemble model. The daily COVID-19 series collected from three of the most-affected countries were taken as the test cases. The experimental results demonstrate that different neural network models have different performances in various complex epidemic prediction scenarios. The SCWOA-based ensemble model can outperform all comparable models with its high accuracy and robustness.

Entities: Chemical

Keywords: COVID-19 forecasting; Ensemble forecasting; Optimization algorithm; Neural network

Year: 2022 PMID： 36089985 PMCID： PMC9444161 DOI： 10.1016/j.eswa.2022.118746

Source DB: PubMed Journal: Expert Syst Appl ISSN： 0957-4174 Impact factor: 8.665

Introduction

The global outbreak of COVID-19 caused by novel coronavirus pneumonia began in December 2019. By October 2021, the epidemic had spread to more than 200 countries worldwide, with nearly 230 million confirmed cases and 4.8 million deaths. Its strong infectivity, rapid spread, broad epidemic scope, and unpredictability pose a serious threat to global human health, social stability, and public security (Chakraborty & Maity, 2020). To slow the spread of COVID-19, governments around the world have implemented different degrees of prevention and control measures. However, strict measures such as lockdowns and quarantines may cause severe socio-economic consequences, and the variation of the epidemic is highly uncertain, making it difficult for governments to make the best decisions (Haug et al., 2020). When the number of new cases increases by thousands every day, even the health systems of developed countries have become overwhelmed and unable to deal with this large number of patients in such a short time. Effective prediction and indoctrination of prediction models can help governments estimate healthcare requirements and provide advice and information to the public (Dehning et al., 2020). For example, from the identification of exclusion zones and the organization of economic activities to the management of medical resources and the planning of emergency hospitals, effective forecasting is of strategic importance to decision-makers, which helps governments decide whether to impose or relax a restriction, thus minimizing the economic and political effects of the pandemic (Petropoulos, Makridakis, & Stylianou, 2020). Hence, forecasting the outcome of outbreaks as accurately as possible is crucial for decision-making and policy implementations. Recently, many significant studies have been devoted to forecasting the upcoming number of cases and the spread of COVID-19 in the near future. Traditional epidemiological models have been widely adopted in predicting COVID-19 cases. The time-dependent susceptible, infectious, and/or recovered (SIR) model is frequently used to model the growth of COVID-19 and to predict the future condition of infection and recovery rates (Alenezi et al., 2021, Cumsille et al., 2022, Masuhara and Hosoya, 2021). In addition, many studies also used the susceptible, exposed, infectious, and/or recovered (SEIR) model for COVID-19 epidemic prediction (Annas et al., 2020, Das et al., 2021, Paul et al., 2021, Piovella, 2020). The epidemiological approach attempts to model disease states, considering biological and disease processes, which requires preliminary assumptions, thus making the calculation process more complex. Another method of epidemic prediction is the statistical forecasting model. Ceylan (Ceylan, 2020) have applied an autoregressive integrated moving average (ARIMA) model to forecast the epidemiological trend in Italy, Spain, and France. Ghosal et al. (Ghosal, Sengupta, Majumder, & Sinha, 2020) used linear and multiple linear regression methods to predict the number of deaths in India over a short period of 6 weeks. Moftakhar & Seif (Moftakhar & Seif, 2020) used the ARIMA model to forecast the patients of COVID-19 in Iran in the next 30 days. A further noteworthy forecasting method is to use machine learning models such as artificial neural networks (NNs) and support vector algorithms, which have recently become more prevalent in predicting infectious diseases. For example, Ly (Ly, 2020) employed an adaptive neuro-fuzzy inference system (ANFIS) to predict COVID-19 cases in the UK. The result shows that data from Spain and Italy can increase the forecasting ability of COVID-19 cases in the UK. Parbat & Chakraborty (Parbat & Chakraborty, 2020) used support vector regression (SVR) for a 60-day forecast of 2019 coronavirus cases in India based on time-series data reported from March 1, 2020, to April 30, 2020. Tomar & Gupta (Tomar & Gupta, 2020) assessed the number of confirmed cases of COVID-19 in India in the next 30 days and tested the effectiveness of quarantine measures using a long short-term memory (LSTM) model and curve fitting. Chimmula & Zhang (Chimmula & Zhang, 2020) used the LSTM model to predict the time series of COVID-19 transmission in Canada and compared the transmission rates in Canada, Italy, and the United States. A LSTM model based on a recurrent neural network (RNN) was also used to predict the future mutation rate of SARS-COV-2. Behnood et al. (Behnood, Mohammadi Golafshani, & Hosseini, 2020) used ANFIS and LSTM models to predict new infection cases in Bangladesh and compared the results of the two experiments, believing that LSTM results were more satisfactory. Most previous studies are based on NNs for forecasting outbreaks in a data-driven manner. Nevertheless, different forecasting techniques have their strengths and weaknesses, implying that there is no single best model that can be applied all the various complex forecasting scenarios. To deal with sampling and modeling uncertainties, NNs are typically used as an ensemble of several network models. The ensemble combines the results of the different models that compose them to improve the accuracy and robustness of the predictions (Kourentzes, Barrow, & Crone, 2014). Although the utilization of ensembles is accepted nowadays as the standard for NN prediction (Crone, Hibon, & Nikolopoulos, 2011), their performance is a function of combining the individual forecasts (Stock & Watson, 2004). Improving the operation mode of ensemble prediction has a direct effect on prediction accuracy and the decision-making of prediction support, including some prediction applications in different fields, such as economic modeling and policy-making (McAdam and McNelis, 2005, Stock and Watson, 2004), temperature and weather (Langella et al., 2010, Roebber et al., 2007), wind energy forecasting (Xiao, Wang, Dong, & Wu, 2015; W. Zhang et al., 2017), fault detection (X. Zhang et al., 2021), impulsive reaction–diffusion NNs (Wei, Li, & Stojanovic, 2021), and climate modeling (Fildes & Kourentzes, 2011). The ensemble of NNs is the basis for achieving precise forecasts for these various applications; hence, it is essential to enhance the construction of the ensembles. Currently, a commonly accepted way to combine individual models is to assign a weighting coefficient to each individual model, i.e., the weighting-based combination method (Xiao et al., 2015). These methods assign a weighting coefficient to each component model based on its performance in the determination procedure (by checking the measured and predicted values). Compared with ensemble averaging, the weighted ensemble can combine the strengths of various models to obtain more stable forecasting results. However, the determination of optimal weights is vital to ensemble prediction. Several previous studies have attempted to utilize heuristic algorithms for weight optimization. For example, Yang et al.(Yang, Chen, Wang, Li, & Li, 2016) used a differential evolution (DE) heuristic algorithm to construct ensemble models to determine the optimal weights for electricity demand ensemble forecasting. Xiao et al. (Xiao et al.,2015) used the cuckoo search optimization (CSO) algorithm to optimize the weight coefficients of the combined model. The search ability of the heuristic algorithm affects the prediction performance of the weight-based ensemble model. The whale optimization algorithm (WOA), proposed by Mirjalili & Lewis (Mirjalili & Lewis, 2016b), is a novel heuristic algorithm that imitates the hunting mechanism of humpback whales. Despite its reasonable convergence rate, it is not suitable for highly complicated functions and may still encounter problems such as being trapped in an optimal local solution and slow convergence (Deepa and Venkataraman, 2021, Vijaya Lakshmi and Mohanaiah, 2021). To overcome these weaknesses and improve its search capability, we propose a new hybrid heuristic algorithm, which is a combination of WOA and the sine cosine algorithm (SCA). SCA is based on sine and cosine functions and can be applied for exploitation and exploration phases in global optimization functions. Therefore, we use SCA to search for the initial random position of WOA to improve the number of iterations and the convergence rate. The proposed hybrid variant is called SCWOA. Through this analysis, a novel ensemble paradigm based on multiple NNs and a new type of heuristic optimization algorithm (SCWOA) is introduced in this paper. First, four individual NN methods, backpropagation neural network (BPNN), Elman neural network (ENN), ANFIS, and LSTM, are selected to predict the COVID-19 outbreak. Then, an ensemble forecasting method, which integrates the characteristics of each forecasting method, is developed to improve the fitting accuracy and forecasting capability of the model. Finally, the best weight coefficients for each prediction member are automatically obtained by a new optimization algorithm SCWOA. To verify and validate the proposed model, the daily COVID-19 series collected from three of the most-affected countries were taken as test cases to conduct the empirical study. The main contributions in this paper are as follows. (1) An innovative ensemble prediction framework is proposed for COVID-19 epidemic prediction. In this architecture, four NN models are combined, and the weights of the ensemble model are optimized by SCWOA to achieve better performance than obtained with a single prediction model (2) A new type of heuristic optimization algorithm (SCWOA) is introduced. To improve both the exploration and exploitation capacities, the hybrid SCWOA is proposed, and to evaluate the improved algorithm, 15 benchmark functions are used. (3) The proposed ensemble model can effectively integrate the advantages of multiple NN models to achieve stable and accurate prediction results. The remainder of this paper is organized as follows. Section 2 introduces four NN methods and the proposed heuristic optimization algorithm SCWOA. Section 3 describes the main procedure of the proposed ensemble forecasting model. Section 4 describes data collection and evaluation criteria. Section 5 discusses the forecasting results of the proposed model and provides a comparison of results with other models. Finally, Section 6 concludes with the essential results of this paper.

Methodology

BPNN

BPNN is a multilayer feedforward NN with a wide variety of applications. The diagram of BPNN includes an input layer, one or more hidden layers, and an output layer, as shown in Fig. 1 .

Fig. 1

The structure diagram of BPNN.

The structure diagram of BPNN. BPNN has two learning processes, forward and reverse propagation. Assuming an -layer, n-node NN, each layer of neurons only accepts the output information of the previous layer and then propagates to the next layer of neurons. Suppose the network outputs y, N samples (x, y) (k = 1, 2…, N), and the output of node i is O. The network input is x, the output is y, then the output of node i is O. For node i in the first layer, when the sample k is input, the input of node i is.where represents the output of the jth node in the previous layer of L layers. Here is the network weight of; when the kth sample is input, the output of the ith node is. The error function and total error are.where is the actual output of the NN and is the ideal output. Assume, then, If the ith node is the output unit, then. If the ith node is not the output unit, then.where is the input of the next layer of L layers. To calculate, we calculate it from the next layer of L layers:where m represents the mth unit in the next layer of the L layers. Finally, from the above two formulas, we can obtain.

ENN

Elman first proposed the ENN in 1990 (Elman, n.d.). ENN is a common example of a dynamic recurrent network, and its structure consists of an input layer with a specific context node, a hidden layer, and an output layer. The major benefit of ENN is that the context nodes may be utilized to remember previous hidden node activations, which makes it suitable in the fields of dynamic system identification and predictive control. The mathematics of the ENN can be described as follows. Let the external input of the network be u. The output is y, and the output of the hidden layer is x. Then we have. where ,and denote the connection weighting matrix from the context layer to the implicit layer the input layer to the hidden layer, and the hidden layer to the output layer, respectively. Here and are the transfer functions of the implicit layer and the output layer. From Eqs. (8)-(10),Thenwhere relies on, at different moments, so is a dynamic recursive process. Accordingly, the backpropagation algorithm used for Elman regression NN training is the dynamic backpropagation learning algorithm.

ANFIS

ANFIS, proposed by Jang (Jang, 1993), combines the advantages of fuzzy systems and NNs. It uses a NN learning mechanism to automatically extract input and sample data rules, thereby forming an adaptive neural fuzzy controller. The ANFIS network includes five layers. Layer 1: All the nodes are adaptive nodes here. The outputs of layer 1 are the fuzzy membership grade of the inputs, which are given by. where, can adopt any fuzzy membership function. Layer 2: The nodes are fixed nodes. They are labeled with M, indicating that they perform as a simple multiplier. The outputs of this layer can be represented as. Layer 3: The nodes are also fixed nodes. They are labeled with N, indicating that they play a normalization role to the firing strengths from the previous layer. The outputs of this layer can be represented as. Layer 4: The nodes are adaptive nodes. The output of each node in this layer is simply the product of the normalized firing strength and a first-order polynomial (for a first-order Sugeno model). Thus, the outputs of this layer are given by. Layer 5: There is only one single fixed node labeled with S. This node performs the summation of all incoming signals. Hence, the overall output of the model is given by. The final output of ANFIS is.

LSTM

LSTM was introduced by Hochreiter & Schmidhuber (Hochreiter & Schmidhuber, 1997) and refined and popularized by many people in the following work. LSTM was designed explicitly to avoid the long-term dependency problem. Like RNNs, LSTMs also have a chain-like structure, but the repeating module has a different structure. The first step of LSTM is to decide what information should be discarded from the cell state. This decision is made by a sigmoid layer called the “forget gate layer.” Inputs are and, and the output is a number between 0 and 1 for each number in the cell state. A “1” means “keep this completely,” whereas a “0” means “forget this completely”: The next step is to decide what information to store in the cell state, including two parts. First, the “input gate layer” determines which values will be updated, and then the tanh layer creates a vector of new candidate values,. Next, these two layers are combined to create an update to the state: The next step is to multiply the old state by, forgetting the things we decided to earlier. Then, add: The final step is to run a sigmoid layer which decides what parts of the cell state to output. Then, put the cell state through tanh (to push the values to be between − 1 and 1) and multiply it by the output of the sigmoid gate:

A novel heuristic optimization algorithm: SCWOA

To improve the performance of the ensemble model, an improved heuristic algorithm SCWOA is proposed to determine the optimal weight coefficient of the ensemble model. This section introduces the basic WOA, SCA, and the novel SCWOA.

Overview of WOA

Mirjalili and Lewis proposed WOA in 2016 to simulate the feeding mechanism of humpback whales (Mirjalili & Lewis, 2016a). Humpback whales hunt near the surface by creating distinctive bubbles along a “9-shaped” or circular path as they circle their prey. First, the humpback whale dives about 10–15 m, then releases many bubbles creating a spiral encirclement of the prey, and then swims toward the prey at the surface. The humpback whale's fins flash and use the light from the fins to surround and immobilize the prey and prevent it from escaping. The mathematical models for encircling prey, spiral bubble net foraging maneuvers, and searching for prey are now described. encircling prey The humpback whale circles its prey, increasing the number of iterations to update its position to the best search agent. This process can be expressed mathematically as. where indicates the position vector of the best solution achieved so far, t is the current iteration, denotes the position vector, and and denote the coefficient vectors. Bubble-net attacking method To mathematically model the humpback whale's bubble net behavior, we designed the following two methods. Shrinking encircling mechanism: By setting a random value in [–1, 1], the new position of the search agent can be defined as any position between the position of the current best agent and the initial position of the search agent. Spiral update position: The helical equation between the humpback whale and prey positions, mimicking the spiral motion of a humpback whale, is as follows: The probability p is a random number in [0,1] and is assumed to be chosen between the spiral-shaped path and the shrinking encircling during the optimization process. Search for prey In the exploration phase, the change of vectors can be searched for prey. Thus, can use random values greater than 1 or less than –1 to move away from the reference whale. At this stage, the mathematical model is as follows:where denotes a vector of random positions (random whales) chosen from the current population.

Overview of SCA

Mirjalili (Mirjalili, 2016) proposed SCA based on sine and cosine functions to explore different regions of the search space. In SCA, the search space dimension is determined by the number of parameters required for optimization. The SCA creates different initial random agent solutions and requires them to use mathematical models based on sine and cosine functions to swing outward or towards the best solution: where is the current position at the tth iteration in the ith dimension, is the targeted optimal global solution, and are random numbers. Equation (32) uses conditions for exploitation and exploration:

SCWOA

As mentioned earlier, the WOA was proposed recently and has been applied as an optimization tool in numerous domains. Although the WOA has a reasonable convergence rate, it is not suitable for highly complicated functions and may still face the difficulty of getting stuck in local optimization. In the native WOA algorithm, and if the random position chosen is far from the target position, it will increase the number of search iterations and, thus, affect the algorithm's convergence rate. To overcome these weaknesses and improve its search capability, we propose a new hybrid WOA based on the SCA to solve practical problems. The hybrid variant is called SCWOA. In this variant, the random initial position of WOA and the position of each iteration are searched by SCA. SCA was based on the sine and cosine functions to swing away or toward the best solution, which can provide rapid convergence at a very initial stage from exploration to exploitation. Specifically, SCA's rapid search capability first obtained a better solution. Then further exploration and exploitation were conducted by WOA. When the SCA finds an optimal solution, the acceptance of these new solutions determined by the WOA obtain a better solution near the optimum. The purpose is to overcome the blindness of the random initialization, enrich the searching behavior, and accelerate the local convergence of the WOA. The pseudo-code of the proposed SCWOA algorithm is outlined as follows. Algorithm. Pseudocodes of SCWOA.

Framework of the ensemble forecasting model

This section describes the details of the ensemble forecasting model framework, which is shown in Fig. 2 . The framework includes the ensemble forecasting theory and determining the combination weights by SCWOA.

Fig. 2

The main procedure of the proposed system.

Theory of the ensemble forecasting theory

It is increasingly recognized that the combination of models has advantages over the choice of an individual model, not only in terms of accuracy and error variability but also in terms of simplifying model building and selection. How to combine existing forecasting technologies to achieve perfect prediction results is a widely discussed topic. The ensemble prediction theory suggests that if there exist n prediction techniques for addressing a certain prediction problem with appropriately assigned weighting coefficients, the prediction results of several techniques can be summed. A weighted ensemble is a methodology that allows multiple models to contribute to a forecast in proportion to their predictive or estimated performance. Then the predicted value of the ensemble method can be formulated as. where is the actual time series data, is the weight coefficient for the ith predictor, and is the forecasting value of ith predictor at time t. The prediction error of the ensemble method is expressed as follows: The ensemble method used in this paper combines four commonly used predictors: BPNN, ENN, ANFIS, and LSTM. Then the predicted value in Eq. (36) can be written as.where,,,, and are the predicted value by the ensemble method BPNN ENN ANFIS, and LSTM at time t, respectively, is the weight coefficient assigned to BPNN, ENN, ANFIS, and LSTM, with –1 ≤ ≤ 1.

Determining the weights of the ensemble method by SCWOA

To forecast a certain problem, determining the weight coefficients of each forecasting method is a vital issue for the ensemble method. A modeling averaging ensemble is one of the most commonly used combination strategies, which combines the prediction from each model equally and often results in better performance on average than a given single model. Clemen (Clemen, 1989) demonstrated that the simple averaging ensemble (i.e., a combination of equal weights) performs almost the best among the combination methods. de Menezes et al.(de Menezes, Bunn, & Taylor, 2000) reviewed several research works and concluded that the simple averaging ensemble performed best when the performance of individual predictions was comparable. Nevertheless, there are some well-performing prediction methods that we expect to contribute more to an ensemble prediction, and perhaps some less-skilled methods that may be useful but should have a smaller contribution to an ensemble prediction. To obtain the weight coefficients of each predictor automatically, the proposed optimization algorithm SCWOA in this paper is adopted to search for optimal matching ensemble model weights. Moreover, the mean absolute percentage error function (MAPE) between the predicted and actual values of the models is chosen as the fitness function of SCWOA. Finding the optimal weights is the process of the heuristic search problem that minimizes MAPE by iteratively finding the optimal global solution. The fitness function is as follows:where and denote actual and predicted values and N is the total sample size. Finally, the weights computed by SCWOA are multiplied with each single prediction result separately, and the results of all ensemble members are added to obtain the final prediction value.

Data description and evaluation criteria

Data description

In this paper, the model's validity is verified by using the new crown outbreak prediction as a case study. The accuracy of the prediction mainly depends on the quality of the data and requires sufficient historical data. The experimental data were collected from the “COVID-19 data repository”, which updates the global daily epidemic data by the Center of Systems Science and Engineering (CSSE) of Johns Hopkins University at Github (Dong, Du, & Gardner, 2020). We select three countries significantly affected by COVID-19: the USA, India, and Brazil. To validate the model's performance, this paper collects daily data of total cases, new cases, total deaths, and new death cases in the three countries from January 1, 2021 to August 1, 2021. The obtained data are divided into two parts, with 80 % as the training set and the remaining 20 % as the test set.

Evaluation criteria

This paper considers five evaluation criteria to effectively evaluate the model's performance, as shown in Table 1 . Specifically, the root-mean-squared error (RMSE), MAPE, R 2, IA, and TIC are chosen as error criteria to reflect the prediction performance of the forecasting models.

Table 1

Five evaluation rules.

Metric	Equation	Definition
RMSE	RMSE=1N∑n=1Nyn-yn∧21/2	The root-mean-square forecast error
MAPE	MAPE=1N∑n=1Nyn-yn∧yn×100%	Mean absolute percentage error
R²	R2= 1 -∑n=1N(yn-y^n)∑n=1N(yn-y¯)	Coefficient of determination
IA	IA= 1 -1N∑n=1N(yn-y^n)2∑n=1Ny^n-y¯+yn-y¯2	The index of agreement of forecasting results
TIC	TIC =1N∑n=1N(yn-y^n)21N∑n=1Nyn2+1N∑n=1Ny^n2	Theil's inequality coefficient

Here and present the actual and predicted values at time n, respectively. N denotes the sample size.

Five evaluation rules. Here and present the actual and predicted values at time n, respectively. N denotes the sample size.

Experimental results and analysis

In this section, we have established two experiments: Experiment I provides the evaluation of the SCWOA by benchmark functions; Experiment II investigates the performance of the ensemble model for forecasting the COVID-19 epidemic. Details are presented in the following sections.

Experiment I: The evaluation of the SCWOA by benchmark functions.

To measure and evaluate the performance of the novel optimization algorithm, the algorithms need to process some well-defined test functions. In this experiment, various optimization benchmark problems are implemented to validate the performance of the proposed SCWOA. In this paper, 15 classical benchmark functions have been used to test the performance of SCWOA and other metaheuristic algorithms. These functions are classified into three categories, i.e., unimodal, multimodal, and fixed dimension multimodal, as listed in Table 2 . Table 2 describes the characteristics of the three types of benchmark functions, where Dim denotes the dimensionality of the function, and Range denotes the search range boundary of the function. The unimodal benchmark function has a single optimal value suitable for benchmark testing. The multimodal benchmark function has more than one optimal value and is more challenging than the unimodal function. The graphs of each test function are demonstrated in Fig. 3 , which shows that the benchmark functions usually have one global optimum, and the rest are local optima. Moreover, the robust metaheuristic algorithm can avoid the local optimum and determine the global optimum solution.

Table 2

Unimodal benchmark functions.

Function type	Function	Dim	Domain	Optimum value
Unimodal	F1(x)=∑i=1nXi2	30	[-100,100]	0
	F2(x)=∑i=1nXi+∏i=1nxi	30	[-10,10]	0
	F3(x)=∑i=1n∑j-1nXj2	30	[-100,100]	0
	F4(x)=maxixi,1⩽i⩽n	30	[-100,100]	0
	F5(x)=∑i=1n-1100(xi+1-xi2)2+(xi-1)2	30	[-30,30]	0

Multimodal	F6(x)=∑i=1nxi2-10cos2πxi+10	30	[-5.12,5.12]	0
	F7(x)=-20exp(-0.21n∑i=1nxi2)-exp(1n∑i=1ncos(2πxi))+20+e	30	[–32,32]	0
	F8(x)=14000∑i=1nxi2-∏i=1ncos(xii)+1	30	[-600,600]	0
	F9(x)=πn10xin(πyi)+∑i=1n-1(yi-1)21+10sin2(πyi+1)+(yn-1)2+∑i=1nu(xi,10,100,4)yi=1+xi+14u(xi,a,k,m)=k(xi-a)mxi>a0-a<xi<ak(-xi-a)mxi<-a	30	[-50,50]	0
	F10(x)=0.1sin2(3πxi)+∑i=1nxi-121+sin2(3πxi+1)+(xn-1)21+sin2(2πxn)+∑i=1nu(xi,5,100,4)	30	[-50,50]	0

Fixed-dimension multimodal	F11(x)=1500+∑j=1251j+∑i=12(xi-aij)6-1	2	[-65,65]	1
	F12(x)=∑i=111ai-x1(bi2+bix2)bi2+bixi+x42	4	[-5,5]	0.00030
	F13(x)=(x2-5.14π2x12+5πx1-6)2+10(1-18π)cosx1+10	2	[-5,5]	0.398
	F14(x)=1+x1+x2+12(19-14x1+3x12-14x2+6x1x2+3x22×30+2x1-3x22×(18-32x1+12x12+48x2-36x1x2+27x22	2	[-2,2]	3
	F15(x)=-∑i=14ciexp-∑j=13aijxj-pij2	3	[1,3]	−3.86

Fig. 3

The graphs of each benchmark functions.

Unimodal benchmark functions. The graphs of each benchmark functions.

Experimental settings

The simulations are implemented in MATLAB R2020a with an 11th Gen Intel® Core™ i7-11800H, 2.30 GHz CPU and 32 GB of RAM. The particle swarm optimization (PSO), grey wolf optimizer (GWO), SCA, and WOA algorithms are utilized as benchmark algorithms. Set the number of search agents for all algorithms to 30 and the maximum number of iterations to 500. In the experiments, each algorithm runs 50 times independently on all benchmark functions and stops when the maximum number of iterations is reached.

The computational results of the proposed algorithm

To statistically evaluate the proposed hybrid variant in comparison with other algorithms, the average and standard deviation of optimal values are calculated. The results of the proposed SCWOA and PSO, GWO, SCA, and WOA are reported in Table 3 . In addition, the convergence curves for all benchmark functions through the proposed SCWOA and the standard PSO, GWO, SCA, and WOA are as shown in Fig. 4, Fig. 5, Fig. 6 .

Table 3

Comparison results on unimodal, multimodal, and fixed-dimension multimodal benchmark functions.

Benchmark function	PSO		SCA		GWO		WOA		SCWOA
Benchmark function	Ave	Std	Ave	Std	Ave	Std	Ave	Std	Ave	Std
F1	1.95E-04	3.64E-04	2.50E + 00	2.32E + 00	1.15E-27	1.44E-27	1.95E-71	6.65E-71	3.19E-161	1.54E-160
F2	2.26E-02	1.92E-02	2.61E-02	3.88E-02	9.05E-17	5.70E-17	9.67E-51	3.64E-50	1.49E-147	7.96E-147
F3	9.07E + 01	3.34E + 01	8.49E + 03	5.13E + 03	6.38E-05	2.08E-04	4.08E + 04	1.47E + 04	3.68E-109	1.51E-108
F4	1.17E + 00	2.56E-01	3.83E + 01	1.36E + 01	6.94E-07	8.25E-07	4.55E + 01	2.52E + 01	1.05E-27	5.11E-27
F5	8.70E + 01	6.03E + 01	4.12E + 04	6.61E + 04	2.70E + 01	7.59E-01	2.80E + 01	4.38E-01	2.62E + 01	2.69E-01
F6	6.17E + 01	1.91E + 01	3.20E + 01	3.03E + 01	3.67E + 00	5.13E + 00	2.27E-15	1.11E-14	0.00E + 00	0.00E + 00
F7	2.13E-01	4.56E-01	1.42E + 01	8.25E + 00	9.89E-14	1.88E-14	3.73E-15	2.25E-15	3.59E-15	1.52E-15
F8	8.87E-03	1.18E-02	8.53E-01	3.71E-01	3.77E-03	6.30E-03	0.00E + 00	0.00E + 00	0.00E + 00	0.00E + 00
F9	4.15E-03	2.03E-02	1.28E + 04	4.82E + 04	4.77E-02	2.85E-02	2.42E-02	2.21E-02	9.45E-03	5.30E-03
F10	4.36E-01	2.79E-01	1.97E + 05	7.84E + 05	5.89E-01	1.87E-01	4.76E-01	2.63E-01	3.57E-03	5.14E-03
F11	3.48E + 00	2.33E + 00	1.64E + 00	9.23E-01	5.30E + 00	4.67E + 00	2.57E + 00	3.10E + 00	1.08E + 00	3.89E-01
F12	9.14E-04	9.78E-05	1.05E-03	4.04E-04	2.04E-03	5.41E-03	6.19E-04	2.26E-04	3.08E-04	6.36E-07
F13	3.98E-01	0.00E + 00	4.00E-01	1.41E-03	3.98E-01	8.94E-05	3.98E-01	4.74E-05	3.98E-01	6.27E-07
F14	3.00E + 00	1.54E-15	3.00E + 00	6.89E-05	3.00E + 00	6.99E-05	3.00E + 00	8.56E-05	3.00E + 00	1.10E-09
F15	−3.86E + 00	2.12E-15	−3.85E + 00	2.56E-03	−3.86E + 00	1.87E-03	−3.85E + 00	1.50E-02	−3.86E + 00	3.37E-03

Fig. 4

Convergence Curve of PSO, SCA, WOA, GWO and SCWOA variants on F1-F5 function.

Fig. 5

Convergence Curve of PSO, SCA, WOA, GWO and SCWOA variants on F6-F10 function.

Fig. 6

Convergence Curve of PSO, SCA, WOA, GWO and SCWOA variants on F11-F15 function.

Comparison results on unimodal, multimodal, and fixed-dimension multimodal benchmark functions. Convergence Curve of PSO, SCA, WOA, GWO and SCWOA variants on F1-F5 function. Convergence Curve of PSO, SCA, WOA, GWO and SCWOA variants on F6-F10 function. Convergence Curve of PSO, SCA, WOA, GWO and SCWOA variants on F11-F15 function. For the unimodal benchmark functions, the experimental results prove that the new proposed algorithm outperforms the other algorithms in all cases tested. The average optimal values of F1–F5 reach 3.19E-161, 1.49E-147, 3.68E-109, 1.05E-27, and 2.62E + 01, respectively, and the best results are written in bold font. The convergence performance of the algorithm is shown in Fig. 4, which similarly reveals that the SCWOA converges faster and shows a strong optimization capability. For the multimodal benchmark functions, the experimental results also prove that the new algorithm has the best performance in all test cases. The average optimal values of F6–F10 reach 3.19E-161, 1.49E-147, 3.68E-109, 1.05E-27, and 2.62E + 01, respectively, and the best results are written in bold font. The convergence performance of all algorithms has been compared using Fig. 5. Fixed-dimensional multimodal problems have multiple local optima, and their number grows exponentially with increasing dimensionality, making them a benchmark for measuring the ability of a technology to explore. According to the results of F11–F15 in Table 3, the new hybrid algorithm finds superior results on these problems, performing much better than PSO, GWO, SCA, and WOA and indicating that SCWOA has a robust exploratory capability. The convergence performance of the algorithm in multimodal problems with fixed dimensions is shown in Fig. 6. It can be seen that SCWOA decreases rapidly and can effectively avoid getting into local minima.

Experiment Ⅱ: The performance of the ensemble model for forecasting the COVID-19 epidemic

As individual NNs cannot be applied to various complex epidemic prediction scenarios, it is necessary to consider combining multiple NNs for ensemble prediction to overcome the one-sidedness of the individual model prediction. Therefore, this paper selects four commonly used NNs, BPNN, ENN, ANFIS, and LSTM (see Section 2 for model details), and the prediction performance of the four methods is compared through four epidemic data sets of three countries. However, the limitations of single models in prediction mean that each method can only provide corresponding information from a particular perspective, making it difficult to fully characterize the COVID-19 epidemic trend. Therefore, to eliminate the weaknesses of every individual model, suitable combinations of the four models are needed to constitute the ensemble prediction. The newly proposed SCWOA is applied to optimize the ensemble prediction model in this paper. The SCWOA searches for the optimal combination weights of the four single models. The MAPE error indicator is used as the objective function of SCWOA to obtain the optimal solution of the weight coefficients by iterative search. The optimal weights and MAPE results are obtained as listed in Table 4 .

Table 4

The optimal weights and MAPE results of four neural networks.

Countries	Methods	Total cases				New death cases
		Weights	MAPE(%)	Weights	MAPE(%)	Weights	MAPE(%)	Weights	MAPE(%)
USA	BPNN	−0.6120	0.3320	−0.0976	262.4478	0.0113	0.1122	1.0000	45.2748
	ENN	0.7603	0.1169	0.1443	75.8460	0.0086	0.1345	−0.9093	46.5709
	ANFIS	1.0000	0.1374	1.0000	74.6520	0.7012	0.0742	0.3244	48.8013
	LSTM	−0.1517	0.3508	0.0115	85.4976	0.2783	0.0268	0.1516	204.4707

India	BPNN	0.0107	0.3327	−0.6712	30.1301	0.0000	0.8763	0.6398	59.8135
	ENN	1.0000	0.0230	0.1037	23.5619	0.1267	0.8201	0	64.6156
	ANFIS	−0.0110	0.1445	1.0000	17.8443	−0.1359	1.4032	0	75.1196
	LSTM	0.0003	0.5561	0.3926	21.2914	1.0000	0.6363	0.0752	100.5203

Brazil	BPNN	0.0015	0.5151	0.0014	28.5270	0.0747	0.4007	0.2055	17.5701
	ENN	0.2245	0.6607	0.2260	28.9823	−0.0081	0.1422	0.2403	17.3553
	ANFIS	0.0000	0.4839	0.1412	40.9626	0.9317	0.1425	0.1561	20.1853
	LSTM	0.7740	0.2397	0.4581	31.3894	0.0000	0.7265	0.2899	22.9903

The optimal weights and MAPE results of four neural networks. As indicated in Table 4, the four methods of BPNN, ENN, ANFIS, and LSTM have their advantages and disadvantages in predicting different epidemic COVID-19 data in the three countries. Among them, BPNN obtains the smallest MAPE value in the total death cases prediction of the USA and India. ENN obtains the smallest MAPE value in the total cases prediction of the USA and India and Brazil's total death cases and total death cases prediction. ANFIS obtained the best results in the new cases prediction of the USA and India. LSTM obtained the minimum MAPE in the total death cases prediction of the USA and India and the total cases prediction of Brazil. The values marked in bold are used to indicate the best values of the model, and the optimal prediction model is selected accordingly. In summation, the situation of the COVID-19 epidemic is complex and variable, each NN model has its advantages and applicability, and it is difficult to determine the only NN model suitable for all epidemic situations. Further, ensemble weights obtained from the SCWOA search were weighted into each of the four models to obtain the ensemble prediction results, and the effectiveness of the model was verified by four different epidemic data sets in three countries. To fully assess the predictive performance of the proposed model, BPNN, ENN, ANFIS, LSTM, ARIMA, least squares support vector machine (LSSVM), and Averaging-Ensemble are selected as benchmark models to compare with the proposed model. In addition, five evaluation criteria, such as RMSE, MAPE, R 2, IA, and TIC, are used to reflect the prediction performance of the models, and the results are listed in Table 5, Table 6, Table 7, Table 8 . Fig. 7, Fig. 8, Fig. 9 shows the predicted values and observed values between the proposed model and other models. The further discussion of the experimental results is as follows.

Table 5

The comparative forecasting error of different models for COVID-19 total cases.

Countries	Methods	RMSE	MAPE	R²	IA	TIC
USA	ARIMA	286215.7956	0.5443	0.5376	0.7710	0.0042
	LSSVM	115039.0835	0.2375	0.9253	0.9776	0.0017
	BPNN	172033.0221	0.3320	0.8329	0.9390	0.0025
	ENN	52578.6263	0.1169	0.9844	0.9956	0.0008
	ANFIS	70062.9332	0.1374	0.9723	0.9923	0.0010
	LSTM	163120.2984	0.3508	0.8498	0.9520	0.0024
	Averaging-Ensemble	109180.0711	0.2091	0.9327	0.9792	0.0016
	SCWOA-Ensemble	34113.6566	0.0815	0.9881	0.9973	0.0005

India	ARIMA	610954.9038	1.4814	−0.5265	0.8222	0.0098
	LSSVM	5473172.1091	18.0088	−121.5077	0.0339	0.0951
	BPNN	115347.4440	0.3327	0.9456	0.9847	0.0019
	ENN	10048.1000	0.0230	0.9996	0.9999	0.0002
	ANFIS	53861.3340	0.1445	0.9881	0.9969	0.0009
	LSTM	194170.7690	0.5561	0.8458	0.9552	0.0032
	Averaging-Ensemble	55846.8377	0.1368	0.9872	0.9965	0.0009
	SCWOA-Ensemble	9025.2129	0.0185	0.9997	0.9999	0.0001

Brazil	ARIMA	453183.3331	1.7983	0.3322	0.9013	0.0118
	LSSVM	92408.7638	0.4386	0.9722	0.9933	0.0024
	BPNN	113125.7660	0.5151	0.9584	0.9895	0.0030
	ENN	145190.2240	0.6607	0.9315	0.9812	0.0038
	ANFIS	102804.3420	0.4839	0.9656	0.9920	0.0027
	LSTM	51934.1284	0.2397	0.9912	0.9979	0.0014
	Averaging-Ensemble	41681.8038	0.1889	0.9944	0.9986	0.0011
	SCWOA-Ensemble	29145.0877	0.1141	0.9972	0.9993	0.0008

Table 6

The comparative forecasting error of different models for COVID-19 new cases.

Countries	Methods	RMSE	MAPE	R²	IA	TIC
USA	ARIMA	39625.9112	53.8695	−0.1961	0.4116	0.6022
	LSSVM	50495.5089	59.1437	−0.9422	0.5093	0.4420
	BPNN	40631.3412	262.4478	−0.2575	0.5226	0.4341
	ENN	36895.8196	75.8460	−0.0369	0.5564	0.3983
	ANFIS	57308.4210	74.6520	−1.5017	0.5437	0.4893
	LSTM	31768.2175	85.4976	0.2313	0.6985	0.3363
	Averaging-Ensemble	33129.8781	77.3119	0.1639	0.6768	0.3576
	SCWOA-Ensemble	10882.2612	30.3627	0.8450	0.9607	0.1250

India	ARIMA	7915.9498	11.7499	−0.0429	0.4069	0.0891
	LSSVM	13613.2906	28.6263	−0.8239	0.2878	0.1571
	BPNN	13501.7673	30.1301	−0.7942	0.3572	0.1633
	ENN	13049.4068	23.5619	−0.6759	0.2745	0.1520
	ANFIS	12346.3386	17.8443	−0.5002	0.3964	0.1361
	Averaging-Ensemble	12437.0265	21.2914	−0.5223	0.2854	0.1482
	SCWOA-Ensemble	10256.7920	16.1490	−0.7509	0.3714	0.1193
	SCWOA*	10128.8276	13.7291	−0.0097	0.4310	0.1196

Brazil	ARIMA	30803.8363	73.1414	−0.8711	0.6110	0.2389
	LSSVM	21460.9495	29.8514	0.0918	0.6097	0.1908
	BPNN	20824.7056	28.5270	0.1448	0.6228	0.1877
	ENN	22265.4856	28.9823	0.0224	0.6653	0.1975
	ANFIS	25101.7439	40.9626	−0.2425	0.6201	0.2226
	LSTM	22744.8628	31.3894	−0.0201	0.6887	0.1970
	Averaging-Ensemble	20149.1588	27.0733	0.1994	0.6900	0.1798
	SCWOA-Ensemble	17425.6981	25.7089	0.4012	0.7113	0.1694

Table 7

The comparative forecasting error of different models for COVID-19 total death cases.

Countries	Methods	RMSE	MAPE	R²	IA	TIC
USA	ARIMA	2585.9083	0.3327	0.3285	0.7658	0.0021
	LSSVM	238.3001	0.0332	0.9943	0.9986	0.0002
	BPNN	869.4171	0.1122	0.9241	0.9752	0.0007
	ENN	860.8173	0.1345	0.9256	0.9818	0.0007
	ANFIS	537.0568	0.0742	0.9710	0.9922	0.0004
	LSTM	209.3502	0.0268	0.9956	0.9989	0.0002
	Averaging-Ensemble	344.8095	0.0457	0.9881	0.9967	0.0003
	SCWOA-Ensemble	197.5213	0.0259	0.9960	0.9990	0.0002

India	ARIMA	18396.8831	3.4737	−2.0467	0.7240	0.0221
	LSSVM	18826.4698	3.9881	−2.1906	0.4678	0.0235
	BPNN	3880.1558	0.8763	0.8645	0.9629	0.0048
	ENN	3664.1177	0.8201	0.8791	0.9669	0.0045
	ANFIS	6497.4214	1.4032	0.6200	0.8848	0.0080
	LSTM	2768.9240	0.6363	0.9310	0.9823	0.0034
	Averaging-Ensemble	1860.6057	0.3636	0.9688	0.9910	0.0023
	SCWOA-Ensemble	935.3028	0.1955	0.9921	0.9980	0.0011

Brazil	ARIMA	14900.6884	2.0865	0.1096	0.8754	0.0138
	LSSVM	3069.7632	0.5192	0.9622	0.9913	0.0029
	BPNN	2525.0654	0.4007	0.9744	0.9935	0.0024
	ENN	879.7529	0.1422	0.9969	0.9992	0.0008
	ANFIS	857.7813	0.1425	0.9970	0.9993	0.0008
	LSTM	4519.5498	0.7265	0.9181	0.9823	0.0042
	Averaging-Ensemble	1817.6157	0.3041	0.9868	0.9968	0.0017
	SCWOA-Ensemble	360.9762	0.0569	0.9995	0.9999	0.0003

Table 8

The comparative forecasting error of different models for COVID-19 new death cases.

Countries	Methods	RMSE	MAPE	R²	IA	TIC
USA	ARIMA	167.9180	57.4800	−0.0067	0.3893	0.3092
	LSSVM	218.5093	54.3070	−0.7119	0.3511	0.3197
	BPNN	206.8672	45.2748	−0.5344	0.2856	0.3096
	ENN	234.9546	46.5709	−0.9793	0.3990	0.3141
	ANFIS	207.3595	48.8013	−0.5417	0.4520	0.2979
	LSTM	233.2734	204.4707	−0.9511	0.4271	0.5115
	Averaging-Ensemble	163.5745	41.3553	0.0406	0.4540	0.2577
	SCWOA-Ensemble	146.1758	41.6379	0.2339	0.5764	0.2418

India	ARIMA	767.0861	96.6555	−0.6908	0.3845	0.3104
	LSSVM	768.9617	49.4638	−0.6167	0.3158	0.3439
	BPNN	759.4036	59.8135	−0.5767	0.3591	0.3482
	ENN	794.9469	64.6156	−0.7278	0.3304	0.3645
	ANFIS	928.6265	75.1196	−1.3577	0.1519	0.4208
	LSTM	899.2495	100.5203	−1.2109	0.3202	0.3850
	Averaging-Ensemble	741.4895	52.0475	−0.5798	0.3400	0.3375
	SCWOA-Ensemble	684.0992	142.5934	−0.2795	0.3909	0.3657

Brazil	ARIMA	847.1567	68.0459	−1.6752	0.6252	0.2370
	LSSVM	309.4662	18.3241	0.6430	0.8831	0.1073
	BPNN	282.0152	17.5701	0.7035	0.9175	0.0967
	ENN	291.6718	17.3553	0.6829	0.9128	0.0989
	ANFIS	397.2349	20.1853	0.4118	0.8591	0.1334
	LSTM	454.5381	22.9903	0.2299	0.8401	0.1471
	Averaging-Ensemble	256.7650	15.9107	0.7543	0.9333	0.0866
	SCWOA-Ensemble	197.3885	14.6789	0.8548	0.9553	0.0704

Fig. 7

Comparison of predicted and actual values of different models in USA.

Fig. 8

Comparison of predicted and actual values of different models in Indi.

Fig. 9

Comparison of predicted and actual values of different models in Brazil.

The comparative forecasting error of different models for COVID-19 total cases. The comparative forecasting error of different models for COVID-19 new cases. The comparative forecasting error of different models for COVID-19 total death cases. The comparative forecasting error of different models for COVID-19 new death cases. Comparison of predicted and actual values of different models in USA. Comparison of predicted and actual values of different models in Indi. Comparison of predicted and actual values of different models in Brazil. Table 5, Table 6, Table 7, Table 8 indicate that the proposed SCWOA-based ensemble model (SCWOA-Ensemble) shows more robust predictive performance compared with ARIMA, LSSVM, BPNN, ENN, ANFIS, and LSTM, and Averaging-Ensemble has the best performance among all COVID-19 data sets of three countries. For the total cases, compared with ARIMA, LSSVM, BPNN, ENN, ANFIS, and LSTM, SCWOA-Ensemble leads to 93.39 %, 79.55 %, 82.19 %, 41.74 %, 68.73 %, and 72.77 % average reductions in RMSE, 92.48 %, 79.86 %, 82.58 %, 44.14 %, 68.09 %, and 75.28 % average reductions in MAPE, 93.36 %, 79.50 %, 82.15 %, 41.69 %, 68.67 %, and 72.75 % average reductions in TIC of three countries. For the new cases, compared with ARIMA, LSSVM, BPNN, ENN, ANFIS, and LSTM, SCWOA-Ensemble leads to 52.33 %, 40.95 %, 38.17 %, 38.21 %, 43.18 %, and 35.90 % average reductions in RMSE, 58.77 %, 38.19 %, 50.91 %, 37.66 %, 39.88 %, and 39.37 % average reductions in MAPE, 49.19 %, 35.60 %, 35.90 %, 34.72 %, 36.81 %, and 32.03 % average reductions in TIC of three countries. For the total death cases, compared with ARIMA, LSSVM, BPNN, ENN, ANFIS, and LSTM, SCWOA-Ensemble leads to 94.95 %, 66.80 %, 79.63 %, 70.17 %, 68.91 %, and 54.63 % average reductions in RMSE, 94.62 %, 68.71 %, 80.13 %, 72.29 %, 70.40 %, and 54.86 % average reductions in MAPE, 94.91 %, 66.82 %, 79.65 %, 70.13 %, 68.95 %, and 54.59 % average reductions in TIC of three countries. For the new death cases, compared with ARIMA, LSSVM, BPNN, ENN, ANFIS, and LSTM, SCWOA-Ensemble leads to 33.49 %, 26.79 %, 23.09 %, 28.02 %, 35.38 %, and 39.28 % average reductions in RMSE, 44.39 %, 0.31 %, 2.28 %, 5.70 %, 16.09 %, and 48.59 % average reductions in MAPE, 24.33 %, 17.07 %, 14.30 %, 16.79 %, 26.06 %, and 36.28 % average reductions in TIC of three countries. In addition, compared with Averaging-Ensemble, SCWOA-Ensemble can obtain better prediction results than the simple averaging ensemble approach, indicating that SCWOA can search to obtain the optimal ensemble model weight coefficients, thus effectively combining the advantages of the four single models. To visually compare each model's time series prediction ability, the line graphs of the prediction results of different models compared with the actual values are shown in Fig. 7, Fig. 8, Fig. 9. From Fig. 7, Fig. 8, Fig. 9, it can be seen that the four single models have different prediction performances in total cases, new cases, total death cases, and new death cases. In contrast, by combining the advantages of each single model, the ensemble model can simulate the overall trend of epidemic changes well and utilize the sudden change information of epidemic prediction and obtain better prediction performance in all kinds of situations. The pooled model can obtain better prediction performance in all scenarios. Each single NN model has its advantages in predicting different epidemic situations, and no model can obtain the best results in various situations. The performance of each single NNs in different epidemic prediction scenarios varies, and no single model can obtain optimal results in all situations. Moreover, the proposed ensemble model based on the SCWOA has strong prediction ability and can effectively address the complexity and nonlinearity of total cases, new cases, total death cases, new death cases. The proposed optimization method SCWOA plays an essential role in improving the prediction accuracy of the ensemble model.

Discussion

From the analysis, it can be found that each NN model has its advantages in different epidemic prediction scenarios. In addition, the ensemble prediction model optimized by SCWOA achieves the best prediction performance in all prediction scenarios, which shows that the swarm intelligence algorithm SCWOA can effectively obtain the optimal weight coefficient so that the ensemble model can synthesize the advantages of every single method to achieve better prediction performance. Specifically, the prediction methods with good performance should make a more significant contribution to the ensemble model, whereas the methods with poor performance should contribute less. For a given set of data sets, if a prediction method has a better prediction performance, it makes a more significant contribution to the ensemble model and has a greater weight coefficient. Thus, in various complicated forecast scenarios, through the SCWOA to intelligently search for the optimal solution and assign the optimal weight to the set model, we can successfully combine the advantages of every single model and obtain more accurate prediction results. We have also analyzed the computational complexity and burden of our method. The computation experience of the proposed method is the two stages of parameter estimation. The ensemble model contains a large number of parameters and the ideal parameters form a complex hypersurface, so model optimization via SCWOA results in over-computing loads. The optimal method fixes some of them in the pre-trained model while the single model is constructed in the first stages, and then the remaining parameters, such as the weights of single models, are estimated by SCWOA. As the proposed method integrates each separately trained model, its complexity is about the sum of the complexity of five single models, which means that our ensemble method will not cause more computational burden than other single models. The code is run on a machine with an 11th Gen Intel® Core™ i7-11800H, 2.30 GHz CPU and 32 GB of RAM. The model time cost is listed in Table 9 . The results of Table 9 indicate that the SCWOA optimization process does not take the longest running time compared with the training of every single model, so it will not cause a computational burden. Although the time cost of the final ensemble method is about 1 min, the ensemble model significantly improves the accuracy of prediction, and this time cost is acceptable in public health practice.

Table 9

Comparison of model running time.

Methods	Time costs (s)
BPNN	0.4097
ANFIS	26.0567
ENN	1.5440
LSTM	23.7924
SCWOA optimization	16.0050
Ensemble model	67.8078

Comparison of model running time.

Conclusions and future prospects

Reliable and precise COVID-19 epidemic forecasting is vital for outbreak prevention and control. It is increasingly recognized that the combination of models has advantages over the choice of an individual model. How to combine existing forecasting technologies to achieve perfect prediction results is a widely discussed topic. This paper has proposed a novel ensemble forecasting paradigm based on multiple NNs and a new heuristic intelligence algorithm SCWOA. First, four individual NNs were selected to predict the COVID-19 outbreak. To eliminate the weaknesses of each model, an ensemble forecasting method, which integrates the characteristics of each forecasting method, has been developed. The proposed SCWOA algorithm was applied to optimize the ensemble prediction model. The SCWOA algorithm was used to search for the best matching weights of the ensemble model. The performance of the SCWOA has been evaluated by 15 classical benchmark functions, and the results show that the SCWOA performs much better than PSO, GWO, SCA, and WOA, which indicates that SCWOA has a robust exploratory capability and can effectively avoid getting into local minima. The daily COVID-19 series collected from three of the most-affected countries were taken as the test cases to conduct the empirical study to verify the proposed ensemble model. The comparison results obtained in this study demonstrate that different NN models have different prediction performances in various complex epidemic prediction scenarios. The SCWOA-based ensemble prediction model significantly outperforms all other comparable models with its high prediction accuracy, which implies that the proposed model can effectively integrate the advantages of multiple NN models to achieve stable and accurate prediction results. For future work, there are several potential research paths: (a) the selection of the base predictor in the current ensemble model tends to be more subjective; optimizing the model selection strategy is a promising direction in the next step; (b) building a big data-driven prediction model, more epidemic-related variables, such as epidemic prevention policy measures, search indices, and population mobility, need to be incorporated and how these variables affect changes in epidemic trends should be analyzed.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Objective:

Minimize and maximize the objective function fx, Xi=Xi1,Xi2,...,Xid

Parameters:

iter-iteration number.

Maxiter-the maximum number of iteration.

I-a population pop.

p-the switch probability

Generate the populationXii=1,2,3,…,I

Initialize the search agents using the sine and cosine functions to find the best solution

xit+1→=xit→+rand1×sinrand2×rand3×lit-xit→,rand4<0.5xit→+rand1×cosrand2×rand3×lit-xit→,rand4⩾0.5

Calculate the fitness of each search solution

X*=the best search agent

iter=1

While (iter <MaxIter)

for each search solution

Update a,A, C

If (p<0.5)

A<1

then

Update the position of the current agent by D→=C→X∗→(t)-X→(t)

else if |A| ≥ 1 then

Select a random search agent X_rand

Update the position of the current agent by X→t+1=Xrand→-A→·D→

end if

end for

Calculate the fitness of each search agent

Update X* if there is a better solution

iter = iter +1

end while

return