Literature DB >> 35755829

Point and interval prediction of crude oil futures prices based on chaos theory and multiobjective slime mold algorithm.

Weixin Sun1, Heli Chen1, Feng Liu2, Yong Wang1.   

Abstract

Crude oil is the most important energy source in the world, and fluctuations in oil prices can significantly influence investors, companies, and governments. However, crude oil prices have numerous characteristics, including randomness, sudden structural changes, intrinsic nonlinearity, volatility, and chaotic nature. This makes the accurate forecasting of crude oil prices a difficult and challenging task. In this paper, a hybrid prediction model for crude oil futures prices is proposed, the accuracy and robustness of which are demonstrated via controlled experiments and sensitivity analysis. This study uses a new data denoising method for data processing to improve the accuracy and stability of the predictions of crude oil prices. Furthermore, the chaotic time-series prediction method, shallow neural networks, linear model prediction methods, and deep learning methods are adopted as submodels. The results of interval forecasts with narrow widths and high prediction accuracies are derived by introducing a confidence interval adjustment coefficient. The results of the simulation experiments indicate that the proposed hybrid prediction model exhibits higher accuracy and efficiency, as well as better robustness of the forecasting than the control models. In summary, the proposed forecasting framework can derive accurate point and interval forecasts and provide a valuable reference for the price forecasting of crude oil futures.
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022.

Entities:  

Keywords:  Chaotic time-series prediction method; Crude oil futures price; Hybrid prediction model; Interval forecasts; Multiobjective slime mold algorithm

Year:  2022        PMID: 35755829      PMCID: PMC9211054          DOI: 10.1007/s10479-022-04781-6

Source DB:  PubMed          Journal:  Ann Oper Res        ISSN: 0254-5330            Impact factor:   4.820


Introduction

As the most important global energy source, crude oil has become a pivotal strategic resource. Owing to the COVID-19 pandemic (Li et al., 2021), the demand for crude oil saw an unprecedented decline of 9.3% (a decrease of 9.1 million barrels per day) in 2020. Nevertheless, according to the 2021 BP Statistical Review of World Energy (BP China, 2021), crude oil still accounts for the largest share (31.2%) of global primary energy consumption in 2020. The crude oil price is closely related to socioeconomic factors, international politics, and national security (Sun et al., 2020). For example, the three oil crises (in 1973, 1978, and 1990, respectively) caused by surges in crude oil prices led to a reduction in production and a significant slowdown in economic growth in industrialized countries worldwide (Qin, 2020). Therefore, the prediction of crude oil prices (COP) is crucial for investors, companies, and governments. The forecasting of the COP is also a challenging task because the COP can be influenced by multiple factors, including inflation, exchange rates, supply and demand, international politics, wars, and pandemics (Chai et al., 2018; You et al., 2021). COP time series have many characteristics, including intrinsic nonlinearity, randomness, sudden structural changes, volatility, and a chaotic nature (Cerqueti and Fanelli, 2021; Zhao et al., 2021). With the continuous development of machine learning (ML) models and optimization algorithms, the prediction of COP has become increasingly accurate. Because the international crude oil futures prices (COFP) are generally regarded as the reference prices for the international COP (Zhang et al., 2021), an increasing number of researchers have engaged themselves in the prediction of the COFP. Based on a review of existing studies, forecasting methods applied to the COFP can be divided into two categories, i.e., single and hybrid models. The single models used for COFP can be divided into ML and traditional time-series analysis models. Traditional time-series analysis methods such as the vector autoregression model (VAR) (Mirmirani and Li, 2004), generalized autoregressive conditional heteroskedasticity (GARCH) model (Agnolucci, 2009), and autoregressive integrated moving average model (ARIMA) (He et al., 2010) are widely used for COP forecasting. Although these models are effective in capturing the linear information in a time series, they fail to capture its nonlinear information to a certain extent (Abdollahi and Ebrahimi, 2020). Therefore, these models are not very effective when predicting the COFP owing to the nonlinear characteristics of COFP (Sun et al., 2018). Furthermore, these models have many strict assumptions (Li et al., 2019), but these assumptions are often difficult to practically satisfy. To compensate for the shortcomings of traditional time-series analysis models, an increasing number of ML methods have been used to forecast the COFP. ML models are not only free of strict assumptions, but they can also capture nonlinear information in a time series (Abedin, Chi, et al., 2021; Abedin, Moon, et al., 2021). Initially, a multilayer perceptron (MLP) (Guotai et al., 2017a), support vector machine (SVM) (Abedin et al., 2019; Abedin et al., 2019; Guo et al., 2012; Li, Chen, et al., 2020; Li, Wen, et al., 2020), back propagation neural network (BPNN) (Mingming and Jinliang, 2012), and extreme learning machine (ELM) (Wang, Athanasopoulos, et al., 2018) were used to improve the accuracy of COFP forecasts. In recent years, deep learning methods have attracted extensive attention owing to their excellent performance in terms of prediction accuracy and stability (Abedin, Chi, et al., 2021; Abedin, Moon, et al., 2021; Lv et al., 2022; Wang et al., 2022; Wang, Athanasopoulos, et al., 2018; Wang, Du, et al., 2018). Long short-term memory (LSTM) networks (Zhang et al., 2020), deep belief networks (DBN) (Zhang and Ci, 2020), and bidirectional long short-term memory networks (BiLSTM) (Abedin, Moon, et al., 2021) are gradually used in COFP predictions. However, there are still certain drawbacks in using ML models to predict the COFP, such as their poor generalization ability and local optimum problems (Wang et al., 2020). To address these issues, many hybrid models have been proposed for COFP forecasting. A hybrid model is a combination of several single models through an optimization algorithm. It can inherit the advantages of every single model, thereby improving the prediction accuracy and stability (Guotai et al., 2017b; Hu et al., 2021; Jiang et al., 2020; Khalilpourazari and Doulabi, 2021). In existing studies, the most commonly used optimization algorithms are the genetic algorithm (GA) (Yang et al., 2019), particle swarm optimization (PSO) (Ribeiro et al., 2021), ant lion optimization algorithm (ALO) (P. et al., 2018), frog-leaping algorithm (FLA) (He et al., 2021), and whale optimization algorithm (WOA) (Lin and Zhang, 2021). Only one optimization objective can be set when these optimization algorithms are used to build hybrid models. To simultaneously consider more optimization objectives, researchers have proposed many multiobjective optimization algorithms, such as the multiobjective ant lion optimization algorithm (MOALO) (Wang, Du, et al., 2018) and multiobjective whale optimization algorithm (MOWOA) (Wang et al., 2017). Due to the fact hybrid models constructed using multiobjective optimization algorithms can improve forecasting stability and accuracy, an increasing number of researchers are applying such models to COFP forecasting (Abdollahi and Ebrahimi, 2020; Chai et al., 2018; Zhao et al., 2021). The COFP time series has chaotic characteristics (Wang et al., 2020). In the present study, the Lyapunov exponents of the Brent crude oil futures price (BCOFP) and the West Texas Intermediate crude oil futures price (WCOFP) are also calculated, the results of which (i.e., 0.0376 and 0.0036, respectively, both of which are greater than zero) indicate the chaotic characteristics of these time series. However, existing studies cannot sufficiently extract chaotic information from a time series to improve the COFP prediction accuracy. The characteristics and representative studies of the COFP forecasting models are listed in Table 1.
Table 1

Summary of available studies

TypeSubtypeModelReferencesAdvantagesDisadvantages
Single modelLinear prediction modelVARMirmirani and Cheng Li (2004)These models have low complexity, fast computational speed, and effective linear time series prediction abilityThe nonlinear time series prediction ability of these models is poor
ARIMAHe et al. (2010)
GARCHAgnolucci (2009)
Shallow neural networksMLPGuotai et al. (2017a, 2017b)These models have an excellent performance to fit simple functions and fast running speedThe ability of these models to fit complex functions is limited
BPNNMingming and Jinliang (2012)
SVMGuo et al. (2012)
ELMWang, Du, et al. (2018), Wang, Athanasopoulos, et al. (2018)
Deep learning modelsDBNZhang and Ci (2020)Deep learning can approximate complex functions and is highly adaptiveThese models require a lot of computation and a long-running time
LSTMZhang et al. (2020)
BiLSTMLi et al. (2021)
Hybrid modelSingle objectiveGAYang et al. (2019)Fast convergence and global optimization searchOnly one objective can be achieved at a time
PSORibeiro et al. (2021)
ALOReddy et al. (2018)
FLAHe et al. (2021)
WOALin and Zhang (2021)
Multiple objectivesMOALOWang, Du, et al. (2018)); Wang, Athanasopoulos, et al. (2018))Multiple objectives can be achieved simultaneouslyA good balance of convergence and diversity has not yet been achieved
MOWOAWang et al. (2017)
MOSMALi, Chen, et al. (2020)
Summary of available studies In this paper, a hybrid prediction model applying time varying filtering for empirical mode decomposition (TVF_EMD) and a multiobjective slime mold algorithm (MOSMA), called TVF_EMD_MOSMA, is proposed for COFP forecasting. Based on this framework, the results of the point forecast (PF) and interval forecast (IF) are derived. First, we decompose the COFP time series using TVF_EMD and eliminate redundant noise series. Second, we obtain the PF results of COFP based on TVF_EMD_MOSMA. Finally, we apply the maximum likelihood estimate (MLE) to estimate the probability density function of the COFP residuals based on the fitting errors. Thereafter, we apply MOSMA to determine the confidence interval adjustment coefficient (CIAC). Subsequently, we obtain the IF results of COFP through the PF results, CIAC, and optimal distribution. The contributions of this study are as follows: A new hybrid prediction model (TVF_EMD_MOSMA) for COFP is presented. The results of comparative experiments show that TVF_EMD_MOSMA exhibits high accuracy and stability of PF and IF for COFP. The predictive performance of TVF_EMD_MOSMA is efficient and robust. A novel data denoising method (TVF_EMD) is used for COFP data processing. Based on the idea of “decomposition and combination,” the COFP time series is reconstructed after removing the high-frequency noise using TVF_EMD, thus improving the accuracy of the prediction model. The proposed MOSMA is applied for COFP prediction for the first time. MOSMA applies an archive component to store all non-dominated Pareto solutions and implements multiobjective optimization based on non-dominated sorting and the crowding distance mechanism. Experimental results show that MOSMA can effectively enhance the prediction accuracy and stability of the hybrid model for COFP prediction. To obtain an IF with a narrower width and higher prediction accuracy, the CIAC determined using MOSMA is added. The contradiction between the interval prediction accuracy and prediction interval width was balanced, significantly improving the IF performance. The remainder of this paper is organized as follows. Section 2 introduces the proposed hybrid prediction model and its submodels. Section 3 describes the performance evaluation metrics and data used in the study. Section 4 presents the experimental results. Section 5 further discusses TVF_EMD_MOSMA and describes the results of the sensitivity analysis, the accuracy and stability improvement ratio, and forecasting effect analysis. Section 6 discusses the practical applications, limitations, and scope of future research work. Finally, Sect. 7 presents the conclusions of this study.

Research method

In this section, the basic models, interval estimation theory, and proposed hybrid prediction model are introduced.

Time varying filter empirical mode decomposition

Empirical mode decomposition (EMD) can decompose a COFP time series into a finite number of intrinsic mode function (IMF) signals based on the time-scale characteristics of the COFP time series (Wang, Athanasopoulos, et al., 2018; Wang, Du, et al., 2018). Since the decomposed time series facilitates the extraction of time-series features, EMD is extensively used in time-series forecasting (Wang and Wang, 2020). However, EMD is plagued by marginal effects and pattern confusion which leads to reduced accuracy in a time-series decomposition. Recursive empirical mode decomposition (REMD) has been proposed to alleviate the problems of marginal effects and pattern confusion in EMD (Wang and Wang, 2020). Recently, TVF_EMD was proposed to simultaneously solve the modal separation and intermittent operation problems. Simultaneously, the physical meaning of the model parameters in TVF_EMD is clear, which facilitates parameter selection (Wang, Niu, et al., 2021). The detailed calculation process of TVF_EMD is shown below. Step 1: The frequency and instantaneous amplitude of the COFP time series are calculated using the Hilbert transform. In Eqs. (1) and (2), and are obtained by interpolating and . In addition, and are the local minimum and maximum of respectively; and represent the instantaneous mean value and instantaneous envelope; and are obtained by interpolating and . In Eq. (3), represents the local cut-off probability. Arrange to solve the intermission problem: Define a signal as and use the extreme point of as the node. By approximating the COFP time series through a B-spline interpolation, the approximate result is obtained. Step 2: Determine the cut-off condition, . If , is considered an IMF. Otherwise, set and repeat Steps 1 and 2. Step 3: Using these above steps, decompose the COFP time series into multiple IMFs.

Volterra adaptive filter based on phase space reconstruction

To extract the chaotic information in the COFP time series, the Volterra adaptive filter is used. The following introduces the principle of the phase space reconstruction, the calculation of the optimal embedding dimension and delay time, and the principle of the Volterra adaptive filter.

Phase space reconstruction

Takens theorem states that a single variable chaotic time series can be reconstructed into a multidimensional phase space, and this space can contain the chaotic features of the original time series. Thus, the laws and properties of a chaotic time series can be accurately captured (Lin and Zhang, 2021). Suppose that the COFP time series is and that an m-dimensional vector is formed through the delay time:, where is the optimal embedding dimension, is the predicted delay time, is the phase point in m-dimensional phase space, and is the number of phase points,. In addition, describe the evolutionary trajectory of a dynamical system in the phase space, and thus, the chaotic behavior of a dynamical system can be studied in reconstructed m-dimensional phase space.

Determination of delay time

In this study, the mutual information (MI) method is used to determine the delay time of the COFP time series. The probability of occurrence of in the time series is defined as ; the probability of the occurrence of in time series is defined as ; and the joint probability of occurrence of and in the two series is defined as , where and can be solved based on the probability of occurrence in their respective time series. The joint probability can then be obtained by counting the lattice on the plane . The MI function is as follows. We apply the when the MI function takes the first minimal value point as the delay time.

Determination of the optimal embedding dimension

In this study, the false nearest neighbor method (FNN) is used to determine the optimal embedding dimension. The calculation procedure for the FNN is as follows. Step 1: In the embedding space with embedding dimension m, find the Euclidean distance nearest neighbor of all points. The Euclidean distance between and is calculated using Eq. (5). Step 2: When any pair of nearest neighbors satisfies the following criterion, it is an FNN point. In Eq. (6), and represent the squared distance between any pair of nearest neighbors at the optimal embedding dimensions of m + 1 and m, respectively, and represents the threshold value. Step 3: When , the ratio of the FNN points to the total number of phase points is calculated, and m is gradually increased until the ratio is less than 5%. The chaotic attractor geometry is considered to be completely opened and is the optimal embedding dimension at this time.

Volterra adaptive filter prediction model

The Volterra adaptive filter prediction model can predict a chaotic time series using only a small sample of data, and it can automatically track the chaotic motion trajectory. This model has high prediction accuracy for a chaotic time series (Qiao et al., 2020). Assuming that the input variable is and the output variable is , the Volterra adaptive second-order filtering model is as follows: The coefficient and input vectors are expressed through Eqs. (8) and (9), respectively. Based on Eqs. (8) and (9), Eq. (7) can be expressed as

ARIMA model

The ARIMA model is a traditional time-series analysis model that is widely used for time-series forecasting (Ribeiro et al. 2021). The model provides a better prediction for a linearly smoothed time series. Here, is expressed through Eq. (11). In Eq. (11), ; indicates the autoregressive coefficient polynomial in ARIMA, and indicates the moving smoothing coefficient polynomial.

ELM

ELM is a feedforward neural network with a single hidden layer. This model structure comprises an input layer, implicit layer, and output layer, similar to an ANN (Lin and Zhang, 2021). The layers are connected to each other using a characteristic mapping function. Information from the input layer is processed by the implicit layer and passed to the output layer, which then derives the calculated value according to the mapping function. Although the random initialization of the parameters improves the generalization of the ELM, it also requires the ELM to add a large number of nodes to achieve accurate training. For large samples, several nodes consume an excessive number of computational resources and may cause overfitting.

Bidirectional long-short term memory model

The bidirectional long-short term memory model (BiLSTM) is divided into two independent LSTMs, and the input sequences are input into the two LSTMs in forward and inverse order for feature extraction. LSTM is a recurrent neural network, and it is proposed to solve the gradient disappearance and explosion problems. The design concept of BiLSTM is to simultaneously obtain the characteristics of the data with information between the past and future (Wang et al., 2020). BiLSTM outperforms a single LSTM approach in terms of efficiency and performance.

MOSMA

The slime mold algorithm (SMA) is a population-based metaheuristic algorithm proposed by Li, Chen, et al. (2020). The slime mold can establish the best pathway for connecting food in a relatively superior manner through a combination of positive and negative feedback. Therefore, SMA adjusts the search path and obtains the optimal result based on a positive and negative feedback system. SMA simulates three different morphotypes in the hunting process of slime mold: finding food, wrapping food, and approaching the food morphotype. The mathematical model of SMA is as follows:where In Eqs. (12) and (13), indicates the current position of the slime mold, indicates the number of current iterations, and represent the lower and upper bounds of the search range, is the vibration parameter, represents the weight of the slime mold, denotes the optimal fitness, denotes the worst fitness, and denotes the optimal fitness in all iterations. denotes the sequence of sorted fitness values (ascending in the minimum value problem), and and denote a random value within the range of [0,1]. SMA can effectively solve many practical problems (Li, Chen, et al., 2020; Li, Wen, et al., 2020). MOSMA is proposed to effectively achieve multiple goals. It is a multiobjective improvement algorithm based on the SMA algorithm, using an elite non-dominated sorting method to estimate the Pareto optimal solutions. In addition, to ensure the diversity of Pareto optimal solutions, MOSMA added a crowding distance mechanism to increase the coverage of all objectives (Premkumar et al., 2021). The steps for an elitist non-dominated sorting approach are as follows: Step 1: Calculate the non-dominated results of the objective function. Step 2: Sort the non-dominated results using non-dominated sorting. Step 3: Find the non-dominated ranking of all non-dominated results to determine the optimal solution. The crowding distance () is calculated through the following formula: In Eq. (14), and denote the maximum and minimum values of the objective function, respectively. The non-dominated ranking and crowding distance () are used to determine the optimal solution. To optimize the prediction accuracy and robustness of the hybrid prediction model, the multiobjective functions are defined as follows: In Eq. (15), and denote the actual and predicted COPs, respectively. In addition, denotes the mean of .

Interval forecasting method

Although the PF provides an explicit value, the reliability of providing this value is not given by the PF. Therefore, the IF is proposed to fill in this gap. The IF can provide considerable information to users regarding forecast results (Sun et al., 2020). The common distributions in the field of energy price forecasting are the Gumbel, generalized extreme value (GEV), and gamma distribution (Jiang et al., 2021; Wang, Niu, et al., 2021). In this study, the MLE is used to fit the optimal distribution of the prediction error series. However, the interval width and prediction accuracy of the IF are irreconcilable. To obtain an IF with a narrow width and high prediction accuracy, the CIACs ( and ) are added (Jiang et al., 2021), and are determined using MOSMA. The calculation formulas of the day’s IF are as shown below:where and are the lower and upper bounds of the confidence interval at the confidence level (), denotes the day’s PF result, and and are the quantile and quantile of the optimal distribution, respectively. In addition, is the prediction error sequence of TVF_EMD_MOSMA, and and are the CIACs determined through MOSMA. The objective functions for the IF in MOSMA can be defined as follows:where and denote the actual and predicted COFP, respectively; and and are the lower and upper bounds of the confidence interval at the confidence level (). When the objective functions take the minimum value, the position of the slime mold is the values of and .

Framework of proposed TVF_EMD_MOSMA

The proposed TVF_EMD_MOSMA is introduced in this section. The overall framework is shown in Fig. 1. In this study, the point and interval prediction results of WCOFP and BCOFP are derived using TVF_EMD_MOSMA. The detailed steps are as follows:
Fig. 1

Framework of the proposed TVF_EMD_MOSMA

Framework of the proposed TVF_EMD_MOSMA Using the TVF_EMD method, the high-frequency noise in the COFP time series is filtered out. Consequently, a smooth COFP time series is obtained. Thereafter, the processed COFP time series is used in the TVF_EMD_MOSMA prediction model. Step 1: Data processing The processed COFP time series is input into the four submodels (Volterra adaptive filter, ARMIA, ELM, and BiLSTM) as an input variable, and the output is . Here, denotes the predicted result from the submodel. In addition, is combined as the final predicted value by a set of weights determined using MOSMA. Step 2: Point forecast The optimal distribution of the prediction error series is determined using the MLE. The CIACs ( and ) are determined using MOSMA. Finally, the confidence intervals with confidence levels of 90%, 95%, and 99% are calculated by Eqs. (15) and (16). Step 3: Interval forecast

Studied data and performance evaluation metrics

In this section, the studied data (original, training set, and test set data) are presented in detail. The performance evaluation metrics for the PF and IF are also presented.

Studied data

WCOFP and BCOFP have a large international influence and are often regarded as the reference prices for the international oil spot market. Therefore, the study sample includes the daily settlement prices of the WCOFP and BCOFP from 4 January 2010 to 30 September 2021. These data are available on Investing.com (https://cn.investing.com/com-modities/crude-oil-historical-data). The period from 4 January 2010 to 30 September 2020 is selected as the training set for training the prediction model. The remaining data (1 October 2020 to 30 September 2021) are used as the test set to evaluate the model prediction performance. More details of the dataset are presented in Table 2.
Table 2

Detailed description of the studied data (Data from https://cn.investing.com/com-modities/crude-oil-historical-data)

TypeDatasetDateNumberStatistics ($)
MaximumMinimumMeanStdKurtosisSkewness
WTIAll samples2010/01/04–2021/09/303053113.9300 − 37.630068.865822.47511.99310.1061
Training samples2010/01/04–2020/09/302790113.9300 − 37.630069.746923.05551.90980.0329
Testing samples2020/10/01–2021/09/3026376.250035.790059.518111.39022.0058 − 0.5233
BrentAll samples2010/01/04–2021/09/303034126.650019.330076.139926.14191.77980.2194
Training samples2010/01/04–2020/09/302776126.650019.330077.431126.74141.68860.1232
Testing samples2020/10/01–2021/09/3025879.530037.460062.246211.48752.0842 − 0.6089
Detailed description of the studied data (Data from https://cn.investing.com/com-modities/crude-oil-historical-data)

Performance evaluation metrics

In this section, a system of metrics for evaluating the performances of PF and IF is presented (Wang, Niu, et al., 2021; Wang Wang et al., 2021; Wang et al., 2021; Wang et al., 2021). A detailed description of the metrics is presented in Table 3. The abbreviations used in this study are described in "Appendix 1".
Table 3

Description of the metrics used by the model

TypeMetricFull nameCalculation formula
PFMAPEMean absolute percentage error\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\text{MAPE}} = \frac{1}{n}\sum\limits_{i = 1}^{n} {\left| {{{\left( {OP_{i} - FP_{i} } \right)} / {OP_{i} }}} \right|} \times 100\%$$\end{document}MAPE=1ni=1nOPi-FPi/OPi×100%
RMSERoot mean square error\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\text{RMSE}} = \sqrt {\frac{1}{n}\sum\limits_{i = 1}^{n} {\left( {OP_{i} - FP_{i} } \right)}^{2} }$$\end{document}RMSE=1ni=1nOPi-FPi2
MAEMean absolute error\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\text{MAE}} = \frac{1}{n}\sum\limits_{i = 1}^{n} {\left| {OP_{i} - FP_{i} } \right|}$$\end{document}MAE=1ni=1nOPi-FPi
MdAPEMedian absolute percentage error\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\text{MdAPE}} = median\left( {\left| {{{\left( {OP_{i} - FP_{i} } \right)} / {OP_{i} }}} \right|} \right) \times 100\%$$\end{document}MdAPE=medianOPi-FPi/OPi×100%
IFFICPPrediction interval coverage probability\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\text{PICP}} = \frac{1}{n}\sum\limits_{i = 1}^{n} {B_{i} } ,$$\end{document}PICP=1ni=1nBi,\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$B_{i} = \left\{ {\begin{array}{*{20}l} {1,} \hfill & {if\;OP_{i} \in \left[ {Lb_{i} ,Ub_{i} } \right]} \hfill \\ {0,} \hfill & {else} \hfill \\ \end{array} } \right.$$\end{document}Bi=1,ifOPiLbi,Ubi0,else
AISAverage interval score\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\text{AIS}} = \sum\limits_{i = 1}^{n} {A_{i}^{(1 - \alpha )} } ,$$\end{document}AIS=i=1nAi(1-α),\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$A_{i}^{(1 - \alpha )} = \left\{ {\begin{array}{*{20}l} { - 2\alpha \cdot IW_{i}^{(1 - \alpha )} - 4\left( {Lb_{i}^{{\left( {1 - \alpha } \right)}} - FP_{i} } \right),} \hfill & {if\;FP_{i} < Lb_{i}^{{\left( {1 - \alpha } \right)}} } \hfill \\ { - 2\alpha \cdot IW_{i}^{(1 - \alpha )} ,} \hfill & {if\;FP_{i} \in \left[ {Lb_{i}^{{\left( {1 - \alpha } \right)}} ,Ub_{i}^{{\left( {1 - \alpha } \right)}} } \right]} \hfill \\ { - 2\alpha \cdot IW_{i}^{(1 - \alpha )} - 4\left( {FP_{i} - Ub_{i}^{{\left( {1 - \alpha } \right)}} } \right),} \hfill & {if\;FP_{i} > Ub_{i}^{{\left( {1 - \alpha } \right)}} } \hfill \\ \end{array} } \right.$$\end{document}Ai(1-α)=-2α·IWi(1-α)-4Lbi1-α-FPi,ifFPi<Lbi1-α-2α·IWi(1-α),ifFPiLbi1-α,Ubi1-α-2α·IWi(1-α)-4FPi-Ubi1-α,ifFPi>Ubi1-α
FINAWPrediction interval normalized average width\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\text{PINAW}} = \frac{1}{n}\sum\limits_{u = 1}^{n} {\left| {\frac{{Ub_{i}^{(1 - \alpha )} - Lb_{i}^{{\left( {1 - \alpha } \right)}} }}{{\max \left\{ {OP_{u} ,u = 1,2,...,n} \right\} - \min \left\{ {OP_{u} ,u = 1,2,...,n} \right\}}}} \right|}$$\end{document}PINAW=1nu=1nUbi(1-α)-Lbi1-αmaxOPu,u=1,2,...,n-minOPu,u=1,2,...,n

and denote the actual and predicted COFPs, respectively; ; and and are the lower and upper bounds of the confidence interval at the confidence level (). In addition, MAPE and MdAPE are used to reflect the predictive accuracy of the model, and the smaller the value, the higher the accuracy; RMSE and MAE are used to reflect the predictive robustness of the model, and the smaller the value, the higher the robustness; FICP is used to reflect the accuracy of the IF, and the larger the value, the higher the accuracy; FINAW is used to measure the width of the prediction interval, and the smaller the value, the higher the accuracy; and AIS is a composite indicator that reflects the accuracy and stability of the interval forecast, and the larger the value, the higher the accuracy

Description of the metrics used by the model and denote the actual and predicted COFPs, respectively; ; and and are the lower and upper bounds of the confidence interval at the confidence level (). In addition, MAPE and MdAPE are used to reflect the predictive accuracy of the model, and the smaller the value, the higher the accuracy; RMSE and MAE are used to reflect the predictive robustness of the model, and the smaller the value, the higher the robustness; FICP is used to reflect the accuracy of the IF, and the larger the value, the higher the accuracy; FINAW is used to measure the width of the prediction interval, and the smaller the value, the higher the accuracy; and AIS is a composite indicator that reflects the accuracy and stability of the interval forecast, and the larger the value, the higher the accuracy

Experiments and analysis

In this section, two control experiments to test the performance of the proposed TVF_EMD_MOSMA prediction model are introduced in detail. Experiment I demonstrated the excellent PF performance of the proposed hybrid prediction model by setting up multiple sets of controlled trials. Experiment II determined the optimal distribution of the prediction error series using MLE. Confidence intervals with confidence levels of 90%, 95%, and 99% were calculated using MOSMA. The IF performance of the proposed hybrid prediction model was tested.

Experiment configuration

Experimental environment

All experiments in this study were conducted using MATLAB R2020a. The experimental platform was a laptop computer with a 64-bit 1.80 GHz AMD Ryzen7 4800U CPU with a Radeon graphics card and 16 GB of RAM, running on Windows 10.

Model parameter settings

The proposed hybrid prediction model consists of four submodels (Volterra adapter filter, ARIMA, ELM, and BiLSTM) and MOSMA optimization algorithms. Three types of control models have been developed: (1) Benchmark models: BPNN, LSTM, Volterra adapter filter, ARIMA, ELM, BiLSTM, TVF_EMD_BPNN, TVF_EMD_Voterra, TVF_EMD_ARIMA, TVF_EMD_ELM, and TVF_EMD_BiLSTM. (2) The models using different data denoising methods: EMD_MOSMA and REMD_MOSMA. (3) The models using different optimization algorithms: TVF_EMD_SFL, TVF_EMD_SMA, TVF_EMD_MOALO, and TVF_EMD_MOWOA. The detailed parameter settings for the models are listed in Table 4.
Table 4

Parameter settings of the model and optimization algorithm

MethodParameterSymbolValueReason
TVF_EMDBandwidth threshold\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$w_{b}$$\end{document}wb0.08Common value
Max number of IMFs\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{I}$$\end{document}NI50Preset
B-spline order\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$O_{b}$$\end{document}Ob26Common value
BPNNHidden layer nodes number\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{h}$$\end{document}Nh10,15,10Trial–error manner
Iteration number\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{i}$$\end{document}Ni10,000Trial–error manner
Learning rate\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$L_{r}$$\end{document}Lr0.1Trial–error manner
Training requirements precision\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P_{tr}$$\end{document}Ptr0.0001Preset
LSTMHidden layer nodes number\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{h}$$\end{document}Nh50Trial–error manner
Iteration number\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{i}$$\end{document}Ni1000Trial–error manner
Learning rate\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$L_{r}$$\end{document}Lr0.001Trial–error manner
VolterraTime delay\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$T_{d}$$\end{document}Td21(WTI), 14(Brent)The MI method
Embedding dimension\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$D_{e}$$\end{document}De10(WTI), 11(Brent)The FNN method
Order\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$O_{n}$$\end{document}On3Common value
ELMIteration number\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{i}$$\end{document}Ni10,000Preset
Hidden layer nodes number\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{h}$$\end{document}Nh5Trial–error manner
BiLSTMVariable learning rate\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$L_{r}$$\end{document}Lr0.001(0.5)Trial–error manner
Iteration number\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{i}$$\end{document}Ni3000Trial–error manner
Hidden layer nodes number\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{h}$$\end{document}Nh100Trial–error manner
FLA, SMAIteration number\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{i}$$\end{document}Ni100Trial–error manner
Population size\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S_{p}$$\end{document}Sp200Trial–error manner
MOALO, MOWOA, MOSMAPopulation size\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S_{p}$$\end{document}Sp200Trial–error manner
Archive size\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S_{a}$$\end{document}Sa100Trial–error manner
Iteration number\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{i}$$\end{document}Ni100Trial–error manner
Parameter settings of the model and optimization algorithm

Experiment I: PF result analysis

Three types of control trials were conducted to demonstrate the superiority of the PF performance of the proposed hybrid prediction model. The results of Experiment I are shown in Fig. 2.
Fig. 2

Results of Experiment I

Results of Experiment I

Comparison with benchmark models

Multiple benchmark models were used as control models to demonstrate that the TVF_EMD_MOSMA outperformed the single model. The PF performances of TVF_EMD_MOSMA and multiple benchmark models are presented in Table 5.
Table 5

PF performance of TVF_EMD_MOSMA and benchmark models

ModelMAPEMdAPEMAERMSE
WCOFPBCOFPWCOFPBCOFPWCOFPBCOFPWCOFPBCOFP
Volterra1.92402.04421.78471.71141.32661.49641.55591.8056
BPNN1.61811.50951.30781.10530.94700.91871.24341.2259
LSTM2.23602.63641.65591.88581.22661.44161.62392.1953
BiLSTM1.90951.52751.62791.13421.13720.93361.43241.2803
ARIMA2.14121.53581.65131.20291.22950.92591.59271.2467
ELM1.60581.58231.35581.29010.93920.97561.23651.2750
TVF_EMD_Volterra1.22461.21220.89810.89800.86540.85651.12641.1004
TVF_EMD_BiLSTM1.20331.17050.98630.86190.70630.72250.91600.9825
TVF_EMD_ARIMA1.02590.99160.74600.73420.59510.60390.80460.8117
TVF_EMD_ELM0.93171.04810.72310.90220.54310.64130.71140.8209
TVF_EMD_MOSMA0.75380.82810.51220.71090.53060.58620.72830.7488

Bold is used in Table to highlight the results of the “TVF_EMD_MOSMA” model proposed in this paper

By comparing the model using the TVF_EMD denoising method and the model without denoising, the PF’s accuracy and stability of the model using TVF_EMD are better than those of the model without denoising. For instance, the evaluation metrics of the ELM using TVF_EMD for WCOFP are , , , and . However, the evaluation metrics of the ELM without denoising for WCOFP are ,,, and . The MAPE, MdAPE, MAE, and RMSE of the TVF_EMD_ELM are much smaller than those of the ELM, indicating that the PF performances of ELM are improved by using the TVF_EMD denoising method. The benchmark models show different predictive performances for the different datasets. For WCOFP, TVF_EMD_ELM achieved the best PF performance. However, TVF_EMD_ARIMA was the best-performing benchmark model for BCOFP. Therefore, to obtain the optimal prediction performance for different datasets, the proposed hybrid prediction model combines submodels using the optimal weights determined by MOSMA. For the BCOFP, all evaluation metrics of TVF_EMD_MOSMA are smaller than those of all the benchmark models (,,, and ). For WCOFP, the model comparison results are similar to those of BCOPF, demonstrating that the TVF_EMD_MOSMA prediction model achieves a better PF performance than all benchmark models. PF performance of TVF_EMD_MOSMA and benchmark models Bold is used in Table to highlight the results of the “TVF_EMD_MOSMA” model proposed in this paper

Comparison of different denoising methods

To compare the different denoising methods (TVF_EMD, EMD, and REMD), EMD_MOSMA and REMD_MOSMA were set as the control models. The PF performances of TVF_EMD_MOSMA and the control models with different denoising methods are listed in Table 6.
Table 6

The PF performance of TVF_EMD_MOSMA and the control models

ModelMAPEMdAPEMAERMSE
WCOFPBCOFPWCOFPBCOFPWCOFPBCOFPWCOFPBCOFP
EMD_MOSMA1.08951.46580.78281.18800.77301.02980.99961.2920
REMD_MOSMA1.12221.11120.85650.94610.77010.79850.97891.0035
TVF_EMD_FLA1.04411.47310.92111.09380.73511.04830.91021.5689
TVF_EMD_SMA0.85561.22380.72890.97580.60050.86360.82291.1538
TVF_EMD_MOALO0.84760.84010.75170.67980.59520.59580.74350.7526
TVF_EMD_MOWOA1.08901.46570.78231.18190.77281.02950.99901.2919
TVF_EMD_MOSMA0.75380.82810.51220.71090.53060.58620.72830.7488

Bold is used in Table to highlight the results of the “TVF_EMD_MOSMA” model proposed in this paper

The PF performance of TVF_EMD_MOSMA and the control models Bold is used in Table to highlight the results of the “TVF_EMD_MOSMA” model proposed in this paper By comparing the evaluation indicators of TVF_EMD_MOSMA, EMD_MOSMA, and REMD_MOSMA, the evaluation indicators (MAPE, MAE, MdAPE, and RMSE) of TVF_EMD_MOSMA were smaller than those of EMD_MOSMA and REMD_MOSMA. The results indicate that the TVF_EMD method shows excellent performance. In other words, TVF_EMD is more suitable for data processing of COFP prediction than the other denoising methods (EMD and REMD).

Comparison of different optimization algorithms

In this section, different optimization algorithms are compared. To demonstrate the superiority of MOSMA, multiple control models (TVF_EMD_SFL, TVF_EMD_SMA, TVF_EMD_MOALO, and TVF_EMD_MOWOA) were constructed by keeping the submodels and denoising methods unchanged and solely changing the optimization algorithm. The parameter settings for these models are listed in Table 4. The PF performances of TVF_EMD_MOSMA and the control models with different optimization algorithms are listed in Table 6. Comparing the evaluation metrics of TVF_EMD_MOSMA and the control models using different optimization algorithms, only the MdAPE of TVF_EMD_MOSMA for BCOFP is larger than that of the optimal model (TVF_EMD_MOALO) by 0.04. For other cases, TVF_EMD_MOSMA has the smallest evaluation metric values (MAPE, MAE, MdAPE, and RMSE) for the WCOFP and BCOFP. In other words, MOSMA achieves higher stability and accuracy than SFL, SMA, MOALO, and MOWOA. Experiment I helps conclude that the proposed hybrid prediction model is more effective for the PF of COFP.

Experiment II: IF result analysis

In this section, the optimal distribution of the prediction error series is determined using MLE. The confidence intervals with confidence levels of 90%, 95%, and 99% and the IF performance of the proposed hybrid prediction model are presented. The results of Experiment II are shown in Fig. 3.
Fig. 3

Results of experiment II

Results of experiment II

Selection of the optimal distribution function for COFP prediction error series

The IF can provide more information to users regarding the forecast results. The common distributions in the field of energy price forecasting are the Gumbel, GEV, and gamma distributions (Jiang et al., 2021; Wang, Niu, et al., 2021). Therefore, we chose these distribution functions to fit the distribution characteristics of the COFP prediction error series using the MLE. The and RMSE are used as evaluation metrics to evaluate the fitting effect of the distribution function. And is the correlation coefficient of the fitted distribution (Gumbel, GEV, and gamma) with empirical distribution (observations). The results of fitting the distribution functions of the WCOFP prediction error series (WCOFP_E) and BCOFP prediction error series (BCOFP_E) are listed in Table 7.
Table 7

Results of fitting the distribution functions of WCOFP_E and BCOFP_E

DataMetricGumbelGEVGamma
WCOFP_ERMSE0.29100.72200.2510
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R$$\end{document}R0.97400.92500.9770
BCOFP_ERMSE0.31401.10600.1240
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R$$\end{document}R0.96300.84300.9870
Results of fitting the distribution functions of WCOFP_E and BCOFP_E For WCOFP_E, the of the Gumbel and gamma distributions are close, and the RMSE of the gamma distribution is the smallest. Therefore, the gamma distribution is chosen as the distribution function of WCOFP_E. For the distribution function of WCOFP_E, the shape parameter is 5.2 and the inverse scale parameter is 0.4, as estimated using the MLE. For BCOFP_E, the of the gamma distribution was the largest, and the RMSE of the gamma distribution was the smallest. Therefore, the gamma distribution is considered to be the optimal distribution of BCOFP_E. For the distribution function of BCOFP_E, the shape parameter is 16.63 and the inverse scale parameter is 0.19, as estimated using the MLE.

IF performance of TVF_EMD_MOSMA

After determining the optimal distribution function of COFP_E using the MLE, the CIACs were added to the hybrid prediction model to enable the IF with a narrow width and high prediction accuracy. The confidence intervals with confidence levels of 90%, 95%, and 99% were derived using the PF results, optimal distribution, and CIACs. The IF performance of the proposed TVF_EMD_MOSMA is presented in Table 8.
Table 8

IF performance of the proposed TVF_EMD_MOSMA

DataConfidence level \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\left( {1 - \alpha } \right)$$\end{document}1-α%AISFICPFINAW
BCOFP90 − 0.59430.90080.1423
95 − 0.34410.95420.1676
99 − 0.09440.99240.2706
WCOFP90 − 0.63630.90430.1638
95 − 0.40120.94680.1718
99 − 0.08680.98940.3112
IF performance of the proposed TVF_EMD_MOSMA The AIS is used to measure the accuracy of the IF, and a larger AIS indicates a higher prediction accuracy. The FICP is also used to measure the accuracy of IF, and it is the frequency at which observations fall into the prediction intervals. FINAW is used to measure the width of the confidence interval, with smaller values indicating a better IF performance. As presented in Table 8, the value of the FICP is extremely close to the confidence level, indicating that the IF results in this study are reasonable. The values of FINAW and AIS are quite small when the given confidence level is achieved, which indicates that the IF results show high accuracy and narrow interval width. In other words, the IF performance of the proposed hybrid prediction model is excellent.

Discussion

TVF_EMD_MOSMA is further discussed in this section, including the sensitivity analysis, the accuracy and stability improvement ratio, and the forecasting effect analysis.

Sensitivity analysis

Sensitivity analysis was employed to measure the robustness of the model’s predictive performance; in particular, sensitivity analysis was used to measure the effect of a change in the model parameters on the prediction results. This study considered the effect of the variation of two parameters, population size and iteration number, on the model’s prediction performance. The sensitivity analysis indicators are calculated as follows (Wu et al., 2022).where is the evaluation metric (MAPE, MdAPE, MAE, and RMSE) of experiment, is the mean of , and is the number of experiments. In this study, the population size takes individual values from (100, 150, 200*, and 250), and the number of iterations takes individual values from (50, 100*, 150, and 200). The * indicates the optimal parameter of MOSMA. The pattern indicates that the population size changes and the number of iterations remains unchanged at 100. The pattern indicates that the number of iterations changes and the population size remains unchanged at 200. The results of the sensitivity analysis are listed in Table 9.
Table 9

Results of the sensitivity analysis

DataPattern\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$A_{s}^{MAPE}$$\end{document}AsMAPE\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$A_{s}^{MdAPE}$$\end{document}AsMdAPE\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$A_{s}^{MAE}$$\end{document}AsMAE\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$A_{s}^{RMSE}$$\end{document}AsRMSE
WCOFP\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S_{p} \overline{\overline{{N_{i} }}}$$\end{document}SpNi¯¯0.00040.00250.00020.0001
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\overline{\overline{{S_{p} }}} N_{i}$$\end{document}Sp¯¯Ni0.00040.00590.00020.0001
BCOFP\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$S_{p} \overline{\overline{{N_{i} }}}$$\end{document}SpNi¯¯0.00020.00220.00010.0004
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\overline{\overline{{S_{p} }}} N_{i}$$\end{document}Sp¯¯Ni0.00010.00080.00010.0000
Results of the sensitivity analysis The values of all sensitivity analysis indicators are extremely small. Take as an example, is 0.0004. However, the smallest MAPE for WCOFP is 0.8281, which is more than two thousand times as large as . Therefore, the effect of the parameter changes on the prediction performance is small. In other words, the prediction performance of the proposed hybrid prediction model is robust.

Accuracy and stability improvement ratio

The improvement of the proposed TVF_EMD_MOSMA was measured in comparison with the control model. and were proposed to measure the improvements in the prediction accuracy and stability, respectively. and were calculated as follows:where and denote the MAPE and MAE of the control model, respectively, and and denote the MAPE and MAE of the proposed TVF_EMD_MOSMA. The results of and are presented in Table 10.
Table 10

Results of and

ModelWCOFPBCOFP
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$IR_{a}$$\end{document}IRa(%)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$IR_{s}$$\end{document}IRs(%)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$IR_{a}$$\end{document}IRa(%)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$IR_{s}$$\end{document}IRs(%)
Volterra60.822660.007059.490360.8264
BPNN53.416143.972645.140836.1938
LSTM66.289256.746368.589759.3378
BiLSTM60.525153.345445.787237.2133
ARIMA64.796756.846946.080236.6911
ELM53.058643.510747.664839.9139
TVF_EMD_Volterra38.447338.689931.686231.5614
TVF_EMD_BiLSTM37.357824.885529.252518.8610
TVF_EMD_ARIMA26.525610.840216.48592.9233
TVF_EMD_ELM19.09942.314020.98958.5942
EMD_MOSMA30.814731.362943.505343.0763
REMD_MOSMA32.830731.104425.477026.5874
TVF_EMD_SFL27.806427.824143.785244.0809
TVF_EMD_SMA11.901111.646232.333732.1214
TVF_EMD_MOALO11.069610.85941.42841.6113
TVF_EMD_MOWOA30.782931.345143.501443.0597
Average39.096533.456337.574932.6658
Results of and Compared with the control model, the proposed TVF_EMD_MOSMA showed a significant improvement in the prediction accuracy and stability. For the WCOFP, TVF_EMD_MOSMA improved the prediction accuracy and stability by an average of 39.0965% and 33.4563%, respectively. For the BCOFP, TVF_EMD_MOSMA improved the prediction accuracy and stability by an average of 37.5749% and 32.6658%, respectively. Compared with LSTM, TVF_EMD_MOSMA achieved the largest improvement in the prediction accuracy ( and ). Compared with Volterra, TVF_EMD_MOSMA has the largest improvement in prediction stability ( and ). Compared with the benchmark model, the model using different denoising methods, and the model using different optimization algorithms, the proposed hybrid prediction model significantly improved prediction accuracy and stability.

Forecasting effect analysis

The first- and second- order effectiveness ( and , respectively) were introduced to measure the forecasting effect of the model (Wang et al., 2021b, c, d; Wang, Niu, et al., 2021).where and denote the actual and predicted COFP, respectively, and is the number of predicted values. In addition, and are larger, indicating a higher predictive efficiency of the model. The evaluation results regarding the forecasting efficiency of the multiple models are listed in Table 11.
Table 11

Evaluation results of multiple models’ forecasting efficiency

ModelWCOFPBCOFP
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$TE_{1}$$\end{document}TE1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$TE_{2}$$\end{document}TE2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$TE_{1}$$\end{document}TE1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$TE_{2}$$\end{document}TE2
TVF_EMD_MOSMA0.99250.98530.99210.9856
TVF_EMD_MOALO0.99150.98500.99160.9851
TVF_EMD_SMA0.99140.98320.98780.9768
TVF_EMD_ELM0.99070.98250.99080.9827
TVF_EMD_ARIMA0.98980.98040.99010.9812
TVF_EMD_SFL0.98960.98200.98530.9696
TVF_EMD_MOWOA0.98910.98030.98530.9741
EMD_MOSMA0.98910.98020.98530.9741
REMD_MOSMA0.98880.97990.98890.9805
TVF_EMD_BiLSTM0.98800.97790.98830.9776
TVF_EMD_Volterra0.98780.97760.98790.9782
ELM0.98390.97040.98420.9710
BPNN0.98380.97040.98490.9718
BiLSTM0.98090.96670.98470.9704
Volterra0.98080.96880.97960.9661
ARIMA0.97860.96100.98460.9710
LSTM0.97760.95590.97360.9367
Evaluation results of multiple models’ forecasting efficiency The results in Table 11 demonstrate that the first- and second- order effectiveness ( and , respectively) of the proposed TVF_EMD_MOSMA are the highest. In other words, the proposed hybrid prediction model achieved the highest prediction efficiency in comparison with the other control models. By comprehensively considering the sensitivity analysis, accuracy and stability improvement ratio, and forecasting effect analysis, the proposed TVF_EMD_MOSMA can achieve a higher forecasting accuracy, better forecasting stability, and higher forecasting efficiency compared with other control models. Therefore, the proposed hybrid prediction model is reliable, valid, and significant.

Practical applications and limitations of the model

In this section, the practical applications and limitations of the proposed TVF_EMD_MOSMA, as well as future research in this area, are presented.

Practical applications

COPs are closely related to socioeconomic, international, and national security. However, international COPs are volatile and uncertain, owing to various factors. The proposed TVF_EMD_MOSMA for the prediction of COFP is reliable, valid, and significant. It can provide valuable reference information for investors (Medina–Olivares et al., 2021), companies, and governments. By considering a combination of the spot prices and the forecast prices of COFP, investors can decide on an investment strategy to achieve their profit goals. Companies that use crude oil as a feedstock or produce oil for sale can apply hedging operations based on the forecast results of COFP to control their production costs and sales risks. Governments can decide on the import, export, storage, and use of crude oil according to the forecast results of the COFP to ensure the stability of domestic oil prices. Stable domestic oil prices are crucial for the smooth development of economies and society.

Limitations and future research

The proposed TVF_EMD_MOSMA uses only the COFP time series to predict future prices. COFP can be influenced by multiple factors, including inflation, exchange rates, supply and demand, international politics, wars, and epidemics. Hence, these factors should be considered in future studies. In the future, the proposed TVF_EMD_MOSMA should be extended to the forecasting of other energy prices, such as coal and natural gas prices.

Conclusion

Crude oil is the most important energy source in the world, and fluctuations in crude oil prices can have a significant impact on investors, companies, and governments. Therefore, accurate prediction of COFP is crucial. In this paper, the TVF_EMD_MOSMA prediction model is proposed to improve the accuracy and robustness of the prediction. A new data denoising method, TVF_EMD, is used for COFP data processing. The chaotic time-series prediction method, shallow neural networks, linear model prediction methods, and deep learning methods are adopted as submodels for COFP prediction. The predicted values of submodels are combined with the optimal weight that is determined using MOSMA. The results of IF with a narrower width and higher prediction accuracy were derived by introducing CIACs determined using MOSMA. The conclusions of this study are as follows. The new data denoising method, TVF_EMD, can significantly improve the prediction accuracy of COFP. Comparison experiments helped determine that the prediction accuracy of the model using TVF_EMD was significantly higher than that of other denoising methods (EMD and REMD). The chaotic time-series prediction method, shallow neural network, linear prediction model, and deep learning method were adopted as submodels. Combining their prediction results with MOSMA can obtain accurate and stable prediction results. The MAPE and MAE of WCOFP and BCOFP were ,,, and , respectively. The PF performance of the proposed TVF_EMD_MOSMA is highly robust. The sensitivity analysis demonstrated that the variation in the model parameters had a slight effect on the prediction performance. The maximum value of the sensitivity analysis indicator is 0.0059, which is extremely small. The IF performance of the proposed TVF_EMD_MOSMA is excellent. By introducing the CIAC determined using MOSMA, the contradiction between the prediction accuracy and interval width is balanced.
Table 12

List of abbreviations

AbbreviationThe full name
AISAverage interval score
ALOAnt lion optimization algorithm
ARIMAAutoregressive integrated moving average model
BiLSTMBidirectional long short-term memory networks
BCOFPBrent crude oil futures price
BPNNBack propagation neural network
COFPCrude oil futures price
COFP_ECOFP prediction error series
COPCrude oil price
CIACConfidence interval adjustment coefficient
DBNDeep belief network
EMDEmpirical mode decomposition
ELMExtreme learning machine
FICPPrediction interval coverage probability
FLAFrog leaping algorithm
FINAWPrediction interval normalized average width
FNNThe false nearest neighbor method
GARCHGeneralized autoregressive conditional heteroskedasticity model
GEVGeneralized extreme value
IFInterval forecasts
LSTMLong short-term memory network
MdAPEMedian absolute percentage error
MAEMean absolute error
MLEMaximum likelihood estimate
MAPEMean absolute percentage error
MIThe mutual information method
MLMachine learning model
MLPMulti-Layer Perceptron
MOALOMultiobjective ant lion optimization algorithm
MOSMAMultiobjective slime mold algorithm
MOWOAMultiobjective whale optimization algorithm
PFPoint forecasts
PSOParticle swarm optimization
RMSERoot mean square error
REMDRecursive empirical mode decomposition
GAGenetic algorithm
SMASlime mold algorithm
SVMSupport vector machine
TVF_EMDTime varying filtering for empirical mode decomposition
VARVector autoregression model
WCOFPWest texas intermediate crude oil futures price
WOAWhale optimization algorithm
  3 in total

1.  Designing a hybrid reinforcement learning based algorithm with application in prediction of the COVID-19 pandemic in Quebec.

Authors:  Soheyl Khalilpourazari; Hossein Hashemi Doulabi
Journal:  Ann Oper Res       Date:  2021-01-03       Impact factor: 4.854

2.  Work Resumption Rate and Migrant Workers' Income During the COVID-19 Pandemic.

Authors:  Jiaxiang Li; Baoju Chu; Nana Chai; Bi Wu; Baofeng Shi; Feiya Ou
Journal:  Front Public Health       Date:  2021-05-21
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.