Literature DB >> 35539002

Prediction of electricity energy consumption including COVID-19 precautions using the hybrid MLR-FFANN optimized with the stochastic fractal search with fitness distance balance algorithm.

Adem Dalcali1, Harun Özbay2, Serhat Duman2.   

Abstract

The increase in energy consumption is affected by the developments in technology as well as the global population growth. Increasing energy consumption makes it difficult to ensure electrical energy supply security. Meeting the energy demand can be achieved with the right planning. Proper planning is critical for both economical use of resources and low cost for the end consumer. On the other hand, erroneous estimation of demand may cause waste of resources and energy crisis. Accurate estimation is possible by accurately modeling the factors affecting electricity consumption. Apart from known factors such as seasonal conditions, days of the week and hours, modeling in extreme events such as pandemics that affect all our behaviors increases the success in modeling the future projection. This ensures that the security of electrical energy supply is carried out effectively with limited resources. For this purpose, in this study, a hybrid multiple linear regression-feedforward artificial neural network (MLR-FFANN) based algorithm model was proposed, taking into account the estimated impact of the COVID-19 pandemic on the energy consumption values of Bursa, an industrial city in Turkey. The aim of the hybrid MLR-FFANN approach was to simultaneously optimize the β polynomial for multiple linear regression and the weight and bias coefficients for the forward propagation neural network using the adaptive guided differential evolution, equilibrium optimizer, slime mold algorithm, and stochastic fractal search with fitness distance balance (SFSFDB) optimization algorithms. The success of the model whose parameters were optimized using the optimization algorithms was determined according to mean absolute error, mean absolute percentage error, and root mean square error evaluation criteria and statistical analysis of these results. According to the results of the analysis, the MLR-FFANN approach whose parameters were optimized with the SFSFDB algorithm was more successful in the training of the dataset containing the COVID-19 precautions.
© 2022 John Wiley & Sons, Ltd.

Entities:  

Keywords:  COVID‐19 precautions; MLR‐FFANN algorithm; SFSFDB algorithm; energy consumption; optimization

Year:  2022        PMID: 35539002      PMCID: PMC9074447          DOI: 10.1002/cpe.6947

Source DB:  PubMed          Journal:  Concurr Comput        ISSN: 1532-0626            Impact factor:   1.831


INTRODUCTION

Background, definitions, and motivation

Today, meeting electrical energy demands is a very difficult problem considering the increasing population, industrialization, depletion of fossil resources, and increasing environmental concerns. In addition to fossil fuels, there is a need to meet increasing energy demands with low‐cost and environmentally friendly renewable resources. However, as renewable energy sources are dependent on nature and exhibit seasonal characteristics, such energy production requires planning. Therefore, today, diversifying energy and ensuring supply security play a critical role for countries. Successful planning of resources means efficient use of energy and preventing waste of resources. By using estimation methods, short‐, medium‐, and long‐term electricity consumption amounts can be determined and necessary measures can be taken to ensure a secure energy supply. The consumption of electric energy is affected by human activities and seasonal factors such as air temperature and humidity. However, the presence of unexpected circumstances also affects the success of the plans. The COVID‐19 pandemic has affected the activities of the global community and as a result, has been reflected in the impact on the electricity consumption sector. The demand for electric energy is affected by human activities and mobility. Therefore, the restrictions brought about by countries during pandemic periods will affect their electricity consumption. ,

Related work

Prediction of electricity consumption over the short, medium, and long term can be carried out in any city, country, building, or structure. Consumption estimation can be performed with mathematical modeling such as least squares, regression analysis, artificial intelligence, or machine learning approaches including the genetic algorithm and fuzzy logic. , In the literature, many estimation methods have been presented comparatively for different time periods. Regression methods, Kalman filter‐based forecasting, artificial intelligence techniques, deep recurrent neural networks, and hybrids of these techniques have been used successfully in the forecasting of electricity demand. , , Chen and Lee proposed an adaptive network‐based fuzzy system (ANFIS) method for estimating electricity consumption in public buildings. In their study, a multiple ANFIS approach was presented to predict electricity consumption based on human activities and weather conditions. Temperature, precipitation, sunshine duration, insolation radiation, humidity, working days, school days, and holidays were determined as inputs to the ANFIS model. The performance of the proposed method was compared with the Levenberg–Marquardt back propagation (LM‐BP) neural network, single ANFIS model, and linear and nonlinear regression. Although the method outperformed simple regression models, it underperformed compared to the single ANFIS and LM‐BP. Similar to consumption in buildings, electricity consumption estimates have been carried out within cities, countries, and areas directly affected by human use. The support vector regression (SVR) model was used to estimate the daily electricity consumption of a metro station. In this model, similar to the previous model, meteorological data such as air temperature and relative humidity were used as input data that affected consumption. Five‐fold‐cross‐validation with the genetic algorithm was used to optimize the hyper‐parameters of the SVR to improve its performance. In addition to the aforementioned input parameters, wind speed and direction, day, and time information have also been used in the estimation of electricity consumption. Parametric and nonparametric AutoRegressive (AR, NPAR), Smooth Transition AutoRegressive (STAR), and AutoRegressive Moving Average (ARMA) models were used in a study in which the medium‐term electricity consumption estimation of Pakistan was carried out. It was stated that among the proposed methods, ARMA performed better than the other models. A multivariate gray estimation model based on first‐order linear difference equations was proposed for long‐term electricity consumption estimation for Shanxi City (China). In that study, it was stated that electricity consumption was affected by factors such as seasons, holidays, political conditions, technological development level, and the economy. In addition to daily and annual forecasts, hourly forecasts can be made over a much shorter term. The hourly electricity consumption of a building was estimated using artificial neural network (ANN) and case‐based reasoning (CBR) methods. It was determined that the performance of the ANN was better than that of the CBR. Although a relatively large dataset is needed for ANNs to be successful, it has been stated that CBR can perform estimation with relatively less data. It has also been determined in similar studies that in order to provide acceptable accuracy, an ANN should have a sufficient number of samples. Estimation of electricity consumption was also carried out for different areas such as polymer material production and electric vehicle charging stations. Wang et al. proposed a model called ESN‐DE, which uses an improved echo state network (ESN) to perform the estimation of electrical energy consumption. In the proposed model, the differential evolution (DE) algorithm is used to find the optimal values of three important parameters of the ESN. According to the results obtained, it has been determined that the ESN‐DE algorithm gives better results than the basic ESN and ESN‐GA. Therefore, the ESN‐DE is a potential candidate for the estimation of electrical energy consumption due to its easy implementation and stability. In another study by the authors, the ESN and fruit fly optimization algorithm (FOA) approach were designed to predict medium‐term industrial electricity consumption in China. Numerical results indicate that adaboostsp‐ESN with FOA is successful in predicting future industrial electricity consumption. Zhou et al. proposed a new model for electricity consumption named panel semi‐parametric quantile regression NN (PSQRNN). The PSQRNN estimation results have been reported to perform better when compared to back propagation NN (BPNN), support vector machine (SVM) and quantile regression neural network (QRNN). Artificial intelligence algorithms, such as neural network models based on various neural networks and optimization algorithms, have been applied in a variety of science fields. For example, these methods have been successfully used to calculate the viscosity of natural gas, the MMP of pure and impure CO2 flux, the density of CO2 adsorption on activated carbon, and the estimation of the viscosity of self‐diverting acids. , , , The COVID‐19 pandemic, which swept throughout the globe before the end of 2019, prompted countries around the world to take immediate anti‐pandemic measures in all sectors of life. The epidemic period, which continues until today, is affected every sector both commercially and scientifically. In the literature, the impact of the pandemic on many sectors in different countries has been examined. In scientific studies, Chen et al., used mixed data sampling (MIDAS) and extension models to estimate the variability of China's crude oil futures with high frequency data. It was determined that the MIDAS‐RV‐CJ model was more successful in the short term, while the MIDAS‐RV‐L model gave better results in long‐term predictions. Furthermore, it has been reported that the leverage effect is the strongest predictor of the COVID‐19 pandemic based on the samples collected during the pandemic. In another study, Wu et al. analyzed the US oil markets based on social media information using convolutional NN (CNN) during the COVID‐19 process. The results obtained from the experimental studies showed that although social media information contributes to the estimation of oil price, production and consumption, the oil inventory does not affect the estimation accuracy. Considering the electricity consumption in terms of the energy sector worldwide, it is seen that electricity consumption is affected like other sectors due to the restrictions applied during the COVID‐19 pandemic period. In order to examine the impact of COVID‐19 in Turkey, the effect of the number of daily cases on electricity consumption was examined. In the analysis, it was determined that there was an inverse relationship between the number of cases and electricity consumption. Similar to the example of Turkey, the electricity consumption of Spain, one of the countries most affected by the pandemic, has been examined in detail. In the March–April period, a decrease of approximately 13.49% was determined by taking the average of the previous 5 years as a reference. A similar analysis was carried out in nine European countries and the USA, and the consumption differences between countries were examined. It was determined that the differences were the result of the different restrictions carried out in those countries. From this point of view, the degree of restrictions imposed by countries is an important factor in electricity consumption. Almost no reduction in electricity demand was detected in Norway, which took a much less strict stance in terms of restrictions. Whereas electricity demand decreased by 8% in the USA, decreases by 20%, 25%, and 28%, respectively, were observed in France, Spain, and Italy, which had tight lockdown policies.

Contribution of the article

Accurate realization of demand forecasting is important for economical generation of electric energy. An overestimation of demand may waste resources, whereas underestimation may lead to energy crises. In this context, energy supply security could be ensured through government policies and maintenance‐investment plans. In the literature, parameters that affect electricity consumption include the economy, industry, climate, ambient temperature, solar radiation, humidity, wind speed, working days, and the gross domestic product. , However, apart from these classical studies, no examination of extraordinary situations such as pandemics has been carried out. Moreover, for countries that have followed a partial closure system, regional or city‐based studies are important in obtaining critical results. The main contributions of the study can be listed as follows. This study discusses in a comparative way the effect of the COVID‐19 pandemic, which has impacted the whole world with an unexpected speed, on electricity consumption and the estimation of this effect using flexible calculation methods. One of the most important contributions of the study is to reveal the electricity consumption behavior under the influence of COVID‐19 in Bursa, an industrial city of Turkey, and to create a strategy for future pandemic periods. For the first time, the stochastic fractal search with fitness distance balance (SFSFDB) algorithm was adapted and simulated for the proposed hybrid multiple linear regression‐feedforward artificial neural network (MLR‐FFANN). The design of the optimal structure of the feedforward artificial neural network (FFANN) and the determination of the polynomial coefficients of the (multi‐linear regression) MLR were carried out during the training process, and these can be defined as the most important factors for training the proposed hybrid MLR‐FFANN. Combinations of hyperbolic tangent and sigmoid activation functions were used in the hidden and output layers of the FFANN and the success of these combinations in the training and learning process of the proposed hybrid MLR‐FFANN approach was investigated. The SFSFDB algorithm was broadly compared to and outperformed the adaptive guided differential evolution algorithm (AGDE), equilibrium optimizer (EO), and slime mold algorithm (SMA). The SFSFDB algorithm successfully trained the hybrid MLR‐FFANN in a minor number of iterations and found the optimal value of weight, bias, and polynomial coefficients of both simultaneously. The simulation results were analyzed statistically and according to the results, the SFSFDB algorithm was proven to be more successful in training the MLR‐FFANN algorithm proposed in the prediction dataset compared to the other optimization algorithms.

Organization of the study

The rest of the study article is organized as follows: The progress of the pandemic in Turkey and its effect on energy consumption is discussed in Section 2. In Section 3, MLR‐FFANN, SFS, and SFSFDB algorithms are explained. Section 4 provides detailed information on the application of the SFSFDB optimization algorithm to the hybrid MLR‐FFANN model. Section 5 focuses on the analysis of the experimental results under different operational scenarios. Finally, Section 6 provides the conclusions drawn from this study and ideas for future works.

PROGRESS OF THE PANDEMIC IN TURKEY AND ITS EFFECT ON ENERGY CONSUMPTION

In this section, Turkey's energy outlook and Bursa's electric energy production and consumption are detailed. The course of the pandemic in Turkey and the measures taken by the government against the pandemic are examined.

Situation of the city of Bursa

According to the 2020 data, the population of Bursa is 3,101,832, making it the fourth largest city in Turkey in terms of population. For a clearer understanding of Bursa's position in Turkey, data on industry and electricity consumption are given in Table 1.
TABLE 1

Indicators of Bursa and Turkey

2013201420152016201720182019
Exports (thousand $)Bursa9,970,9439,140,46310,364,72711,066,41411,716,92110,371,1179,076,069
Turkey166,504,862150,982,114149,246,999164,494,619177,168,756171,464,945160,514,811
The number of enterprisesBursa126,554128,169131,407136,322140,633147,618153,435
Turkey3,397,7243,434,9123,498,5863,608,4703,696,0043,845,9513,954,698
Gross domestic product per person ($)Bursa14,28213,82912,53712,27012,13211,45110,382
Turkey12,58212,17811,08510,96410,69697929213
Total electricity consumption per person (kWh)Bursa3427345537333726410642463094
Turkey2583266927602897308231494155
Industrial electricity consumption per person (kWh)Bursa213520872310225125742649
Turkey121612581315135714411435
Indicators of Bursa and Turkey The total electricity consumption value per person in Bursa is above the national average. Similarly, industrial electricity consumption per person for Bursa is high in all of the years mentioned. The number of enterprises, as an important parameter of the industry indicator, realized in Bursa corresponds to approximately 3.88% of the number of enterprises realized in Turkey. In addition, an average of 6.29% of the exports produced in Turkey for the specified years originated from Bursa. Considering all these indicators, Bursa is seen as an important industrial city.

Electricity energy outlook in Turkey

Turkey has a growing population and strong economic growth. This is also reflected in the energy requirements. In the last 10 years, the Turkish economy has grown around 5% on average. This growth also increases energy consumption and makes it difficult to ensure energy supply security. Consequently, in order to ensure energy supply security, installed power has been continuously increasing, especially after 2000. The installed power, which was 27,264.1 MW in 2000 and 49,524.1 MW in 2010, reached 96,709.6 MW as of February 2021. , As of February 2021, the installed power values according to their sources are given in Figure 1.
FIGURE 1

Distribution of installed power by resources

Distribution of installed power by resources Between 2000 and 2009, oil prices increased by 76% and natural gas prices by 114%. According to the projections made for the years 2015–2040, an increase of 186.3% in oil prices and 85.7% in natural gas prices is expected. Considering the price instability in fossil fuels and the depletion of resources, renewable energy sources come to the fore as an alternative. Figure 1 shows that 50.05% of Turkey's installed power consists of renewable resources. Among renewable resources, hydraulic resources take the first place, with 31,177.9 MW. With the licensed power plant investments that were temporarily accepted in 2020, the amount of additional installed power is 5430.31 MW. Only 60.80 MW of this additional power amount is from fossil fuel. The investments made can be evaluated as an indicator of the importance given to renewable energy in Turkey. In addition, with these investments, the energy supply will not be secured with imported resources, but with clean and renewable resources without fuel costs. According to the market development report published by the Energy Market Regulatory Authority, the total licensed electricity generation amount in Turkey in 2019 was 294,251,318.65 MWh, with Bursa contributing at a rate of 2.49%, or 7,322,483.15 MWh. Whereas the total licensed installed power of Turkey was 84,957.72 MW as of 2019, that of Bursa constituted 3.07% of this power, with 2607.03 MW. The unlicensed installed power was 6309.27 MW in Turkey for 2019, whereas in Bursa it was 70.54 MW, making up 1.12%. Unlicensed electricity generation in Turkey amounted to 9,829,447.73 MWh in total for 2019, with Bursa contributing 97,251.79 MWh (0.99%). When the electricity consumption values are examined, the total consumption billed for 2019 was 229,597,913.65 MWh in Turkey and 11,813,549.28 MWh in Bursa. With a share of 5.15%, Bursa was the city with the fourth highest electricity consumption in Turkey. Consumption amounts billed on a consumer basis are given in Table 2.
TABLE 2

Amount of billed consumption by type of consumer

Lighting (MWh)Residential (MWh)Industry (MWh)Agricultural irrigation (MWh)Commercial (MWh)Total (MWh)
Bursa126,061.661,969,089.317,260,805.16125,251.542,332,341.6211,813,549.28
Turkey5,041,682.9656,389,775.2294,462,698.788,553,367.4365,150,389.26229,597,913.65
Amount of billed consumption by type of consumer Considering the consumption amounts for Bursa, the highest was realized in the industrial section, followed by the commercial establishments. Considering its share in total consumption and the amounts by type of consumer, Bursa is seen as a robust industrial city.

Course of the COVID‐19 pandemic

The World Health Organization (WHO) gave the first notification for the coronavirus pandemic on December 31, 2019, by reporting that the disease was seen in China. On March 11, WHO declared the outbreak a pandemic. As of March 13, 2021, according to the data announced by WHO, the number of cases in the world was 118,754,336 and the number of deaths was 2,634,370. These figures forced governments to take strict actions. The measures taken caused difficulties for people and countries both socially and economically. There were significant changes in the work, education, shopping, and entertainment habits during the pandemic period. Societies were forced to carry out many activities remotely or in accordance with social distancing rules. It was important to provide uninterrupted health services during these critical measures. In order for all these services to be carried out safely, electric energy had to be provided without any problems. Gradual restrictions were introduced in Turkey with the occurrence of cases in March 2020. Due to these restrictions, periodic closures or capacity reductions were experienced in the manufacturing sector. These restrictions and slowdowns in activities were also reflected in the electricity consumption. Figure 2 presents the change in consumption for the years 2019 and 2020.
FIGURE 2

Electricity consumption amounts

Electricity consumption amounts Especially in April and May, a significant decrease was observed in consumption amounts, followed by a recovery. In August, as a continuation of this recovery, 2020 consumption increased compared to the same time period of the previous year, and this trend continued until the end of the year. COVID‐19 has caused a decrease in electricity demand in many countries and an increase in the share of renewable energy sources in electricity production. In a study carried out in Germany, energy production from renewable sources increased by 8% in the first quarter of 2020, taking the same period of 2019 as a reference. Similarly, in a study conducted in Italy, the effect of COVID‐19 on electricity consumption was examined by taking into account its environmental dimensions. It was determined that the significantly reduced electricity consumption was supported by wind‐based generation, resulting in a 30% decrease in wholesale energy prices. This has also had a positive impact on reducing carbon dioxide emissions. In addition, the researchers observed that the decrease in consumption affected the amount of energy supplied from traditional (thermal) sources. In order to reduce and control the impact of the pandemic, governments have applied different restrictions. These policies can cause disadvantageous situations in many sectors. The detection of these situations is important, especially in the later stages of the pandemic or similar pandemics that may occur later, in terms of precautions to be taken. The first case in Turkey was seen on March 11, 2020, after which, various measures were taken to control the pandemic. The steps taken during the pandemic process are summarized in Table 3.
TABLE 3

Milestones of COVID‐19 and the steps taken in Turkey

Date
February 3, 2020Turkey suspends all flights from China.
March 11, 2020The first coronavirus case was announced in Turkey.
March 14, 2020Passenger transportation banned with 16 countries, 9 of which were European countries.
March 18, 2020The first death due to coronavirus occurred.
March 20, 2020With the Presidential circular, all scientific, cultural, artistic, and similar meetings and activities were stopped until the end of April.
March 21, 2020Curfew restrictions were imposed on citizens aged 65 and over.
March 22, 2020With the Presidential circular, flexible and remote working was allowed in public institutions and organizations.
March 23, 2020Formal education was interrupted and distance education programs were started.
March 27, 2020International flights were completely suspended. Intercity transportation was subject to the permission of the Governorship. Entrance to places such as picnic areas, forests, and historical sites was prohibited at the weekend.
April 3, 202030 metropolitan areas and 1 province were closed to entry and exit of vehicles with some exceptions. Curfews were imposed on those under 20 years old throughout the whole country.
April 11–12, 2020The Government declared curfew in 30 metropolitan areas and 1 province (2 days)
April 18–19, 2020The Government declared curfew in 30 metropolitan areas and 1 province (2 days)
April 23–26, 2020The Government declared curfew in 30 metropolitan areas and 1 province (4 days)
May 1–3, 2020The Government declared curfew in 30 metropolitan areas and 1 province (3 days)
May 10, 2020The curfew for 65 years and older has been lifted between 12:00 and 18:00.
May 13, 2020The curfew for children aged 0–14 has been lifted between 11:00 and 15:00.
May 15, 2020The curfew for young people between the ages of 15 and 20 has been lifted between 11:00 and 15:00.
May 23–26, 2020A curfew was imposed during the Ramadan Holiday (4 days).
May 24, 2020The curfew for 65 years and older has been lifted between 12:00 and 18:00.
June 1, 2020The intercity travel restriction has been lifted. Flexible and remote working in the public sector has come to an end.
June 3, 2020The curfew imposed on people over the age of 65 has been lifted.
June 20, 2020A curfew was imposed during the university entrance exam (first step).
June 27–28, 2020A curfew was imposed during the university entrance exam (second step).
August 26, 2020Flexible and remote working in public institutions and organizations was allowed.
November 17, 2020A curfew will be imposed on weekends outside of 10.00–20.00. Restaurants will only provide takeaway service, shopping malls and markets will be closed at 20:00. (First starting on 04.12.2020.)
November 30, 2020A curfew will be imposed, starting at 21:00 on Fridays and ending at 05:00 on Mondays.
November 30, 2020Until a new decision, a curfew will be applied between 21.00 and 05.00 on weekdays throughout the country. (First started on 01.12.2020.)
December 31, 2020–January 4, 2021A curfew will be imposed, starting from 21:00 on Thursday, December 31, 2021, and ending at 05:00 on Monday, January 4, 2021.
February 2, 2021Face‐to‐face education will begin gradually in village schools as of February 15, 2021 and for other classes as of March 1, 2021.
February 3, 2021The South African and Brazilian variants were also seen in Turkey.
March 1, 2021As of March 2, 2021, pre‐school education institutions, primary school, secondary school 8th and 12th grades will start education in all cities and will start at other levels, including secondary and high schools, in low and medium risk provinces.
The scope of the curfews has been changed according to the provinces with the new Controlled Normalization Process announced on March 1, 2021. The provinces have been divided into four different risk groups (low, medium, high, and very high) and the level of precaution has been determined according to the risk group.
March 2, 2021

BURSA: In the medium risk group. According to this:

• A curfew will be applied between 21.00 and 05.00 on weekdays.

• A curfew will be applied between 21.00 and 05.00 on weekends.

• For our citizens aged 65 and over and under the age of 20, the curfew will be lifted outside the hours stated above.

• The food and beverage sector, which will operate between 07:00 and 19:00, will serve with a 50% capacity limitation.

RestrictionInformRule bending
Milestones of COVID‐19 and the steps taken in Turkey BURSA: In the medium risk group. According to this: • A curfew will be applied between 21.00 and 05.00 on weekdays. • A curfew will be applied between 21.00 and 05.00 on weekends. • For our citizens aged 65 and over and under the age of 20, the curfew will be lifted outside the hours stated above. • The food and beverage sector, which will operate between 07:00 and 19:00, will serve with a 50% capacity limitation.

METHODS

Multi‐linear regression

Regression analysis is a statistical technique used to investigate and model the relationship between variables in order to obtain an approximate function between the relevant response (output) and independent variables (inputs). If there is more than one independent variable in the regression analysis, it is called MLR. In most MLR problems, it is not known how the independent variables relate to the dependent variables. Therefore, the first step and the main problem are to find an approximation method to relate them. Generally, first‐order and second‐order polynomial models are used, as given in Equations (1) and (2). where β denotes the polynomial coefficients and x the independent variables, y is defined as the dependent variable, and ε identifies other forms of variation, such as errors or lack of numerical convergence. The k index represents the total number of variables. The i and j indices represent a certain variable between 1 and k. If Equations (1) and (2) are written in a matrix form, Equation (3) is obtained. The related matrices and vectors can be elaborated as in Equation (4). To estimate the coefficients of MLR models, the least squares method given for the vector sample β in Equation (5) is generally used. Thus, MLR models can be developed according to the existence of errors.

Feedforward artificial neural networks

ANNs are a distributed computing technology, inspired by the information processing technique of the human brain that can record the experimental information of systems. The ANN topology is divided into two types: feedforward (FF) and back propagation neural networks (BPNN). The FFANN is used more widely because it requires less memory in the implementation phase. The FFANN allows a one‐way signal flow. In addition, feedforward neural networks are organized in multiple layers: the input layer, the hidden layer, and an output layer. The neurons of each layer are interconnected by the weights and bias terms of the other neurons in the previous layers. This system is represented by the mathematical expression given in Equation (6). where n is the number of input nodes and is the connection weight of the jth node in the hidden layer from the ith node in the input layer; θ denotes the jth hidden node bias and x denotes the ith input. Accordingly, Equation (7) is used to calculate the output of each hidden node. The final outputs are defined as given in Equations (8) and (9). where is the connection weight from the jth hidden node to the kth output node and represents the bias value of the kth output node. As can be understood from the equations, the most important parts of these structures are the weight and bias values because they determine the final value of the output. In order to find these values at the optimum value, the training of the neural network is very important. The aim is to obtain the minimum error value at the end of both the training process and the testing process of the neural network. Based on calculating the difference between the actual and predicted values, the mean square error (MSE) is used to evaluate the forward propagation neural network. This mathematical expression is given in Equation (10). where represents the error value of the ith data, and are defined as the actual and estimated value of the ith data, respectively, and n is the total number of data. The FFANN model with the structure of 4‐5‐1 is illustrated in Figure 3.
FIGURE 3

FFANN scheme with the structure of 4‐5‐1

FFANN scheme with the structure of 4‐5‐1

Proposed optimization algorithm

Stochastic fractal search algorithm

The stochastic fractal search (SFS) algorithm was developed because there was no information exchange between fractals and the many parameters that needed to be determined according to the problem to be optimized by the fractal search algorithm. The SFS algorithm is constituted of two basic processes: the propagation and update processes. The update process also consists of two parts: the first and second update processes. The propagation process performs the local search feature in the algorithm, and the update processes perform the global search functions. The two functions given in Equations (11) and (12) are proposed for the propagation process of fractals. In Equation (12), μ BP represents the position of the fractal that gives the best fitness value found up to that generation, BP is the fractal that gives the best fitness value, and P denotes the propagating fractal; the ε expression defines randomly generated numbers in the range of [0, 1]. In addition, μ in Equation (12) represents the position of the spreading fractal. The σ value is the standard deviation value that expresses the step length of the Gaussian walking function. The standard deviation expression is calculated as given in Equation (13). Equation (13) shows that the difference between the fractal with the best fitness value and the spreading fractal is obtained by multiplying the expression log(g)/g. The g in the log(g)/g expression added to Equation (13) represents the generation value and creates a damping effect in the standard deviation function, that is, the value of the standard deviation decreases as the number of generations progresses. Although this causes a more local search, it increases the probability of finding a solution with an even better fitness value in the immediate vicinity of the found solution point. After the propagation process is performed, the first and second update processes are run. For these operations, first of all, a probabilistic value is given to each fractal by taking into account the order of the fitness values, using the expression given in Equation (14). where N is the number of fractals in the generation and the expression rank(P ) expresses the rank of the fractal in the generation according to its fitness value. The first update process starts after the probabilistic value is calculated. Depending on whether the randomly generated ε expression is smaller or larger than the probabilistic value, the relevant dimension of the relevant fractal is updated. This process is as given in Equation (15). The r and t subindices given in Equation (15) represent randomly selected fractals from within the generation. After the first update process, Equation (14) is reordered and the fractals are assigned probabilistic values. For fractals entering the second update process, the position of the fractal is changed according to whether a randomly generated number between 0 and 1 is smaller or larger than 0.5. The second update process is performed with the expression given in Equation (16). The algorithm continues to generate a new generation and generate new solution points until it meets the termination criterion.

Fitness distance balance selection method

The purpose of the fitness distance balance (FDB) method is to effectively guide meta‐heuristic search (MHS) algorithms in the search process. The feature that distinguishes the FDB method from other selection methods is the calculation of the score values of the solution candidates and the selection process according to the score values. In the calculation of the score, the fitness values of the solution candidates and their distance from the best solution candidate in the population are taken into account. This ensures that the solution candidate with a high fitness value is selected. On the other hand, this prevents the selection of a solution candidate that is very close to the best solution in the population. This selection strategy of the FDB method contributes to the solution of the early convergence problem frequently encountered in the MHS process. , , The steps to implement the FDB method are as follows: The score values of the solution candidates in the 𝑃 population should be calculated. If 𝑋𝑏𝑒𝑠𝑡 is assumed to be the best solution in population 𝑃, the scores of the solution candidates in population 𝑃 are expressed by the vector 𝑆𝑝 given in Equation (17). While calculating the score of each solution candidate, the values of fitness and distance are taken into account. In population 𝑃, the distance between the ith solution candidate and 𝑋𝑏𝑒𝑠𝑡 (the best solution in 𝑃) is calculated using the Euclidean metric as given in Equation (18). The distance vector 𝐷𝑝 of the solution candidates in the population 𝑃 can be represented as given in Equation (19). Another parameter used in calculating the scores of the solution candidates is the fitness value. The fitness values of the solution candidates in the population 𝑃 are represented by the vector 𝐹. The 𝐹 and 𝐷𝑝 vectors are normalized so that the two parameters do not dominate each other in the score calculation. The score calculation of the ith solution candidate is given in Equation (20). The ω parameter given in Equation (20) is used to weight the effects of 𝐹 and 𝐷𝑝 parameters. The ω parameter can have an infinite number of values in the range [0, 1]. A larger value of the ω parameter increases the effect of the fitness value on the score. In this case, the effect of the distance value on the score decreases. When ω = 1, the score value of the solution candidate depends only on the fitness value. When ω = 0, the solution candidate's score depends only on the distance value. This ensures that the solution candidate farthest from P is selected in the population. Therefore, the variation in the population increases as the ω parameter is close to zero. This provides discovery capability for MHS algorithms. This parameter gives the FDB the ability to adapt. The ω parameter has the important function of switching and balancing between operating and reconnaissance tasks.

FDB based stochastic fractal search algorithm

A step‐by‐step explanation can be given for improving the search performance of MHS algorithms using the FDB method. Before applying the FDB method to an MHS algorithm, the search process lifecycle of the MHS algorithm should be analyzed and the process in which the algorithm provides diversity should be determined. Therefore, the SFS algorithm needs to be rearranged based on the general steps of the MHS process. The purpose of rearranging the algorithm is to reveal the exploration and exploration processes, and the step in which the FDB method is applied. After the exploration process, the diffusion process is carried out, and the tasks of searching for a neighborhood and providing diversity in the update process are fulfilled. In the next step, reference positions that guide the search process lifecycle in the SFS algorithm are examined. Thus, reference positions and the methods used to select them are determined. In this process, three reference positions are selected, one using the turn‐based method and the other two randomly. According to the information obtained here, the FDB selection method was applied. Thus, the process of ensuring diversity in the population was realized. Equations ((17), (18), (19), (20)) are taken into account in the FDB selection method. The SFSFDB flowchart is given in Figure 4.
FIGURE 4

Flowchart of SFSFDB algorithm

Flowchart of SFSFDB algorithm

SFSFDB ALGORITHM APPLIED TO HYBRID MLR‐FFANN

In the training of ANNs, a large number of uncertainties occur with the relationship between different datasets and the combination of many solutions. This uncertainty and different relationship status make their training process difficult. Moreover, ANNs have a nonlinear structure, which affects the behavior of the network in the learning process. Therefore, the importance is increasing of determining the most appropriate values of the weight and bias coefficients used in the training of the network and the activation functions used in the neurons. Considering this situation, the weight and bias coefficients of FFANN and the β polynomial coefficients of MLR in the proposed MLR‐FFANN model were determined simultaneously by optimization algorithms. In this study, the application of the SFSFDB algorithm described in Section 3.3 to the training of the proposed MLR‐FFANN model is explained step by step as follows. Step 1: In this step, the prepared dataset is read from the relevant file and the values of the dataset are normalized between 0 and 1 using Equation (21). The maximum number of iterations of the SFSFDB algorithm, the size of the population and the number of dimensions are entered into the system by the user. In addition to this step, the initial population is created within the minimum and maximum values determined for the parameters (weight, bias, and β polynomial coefficients) of the proposed MLR‐FFANN model to be optimized. Step 2: The fitness function values of the initial population are calculated and the position value with the best fitness value is determined. Step 3: The algorithm checks whether the current iteration number has reached the determined maximum number of iterations. If not, the natural process of the algorithm continues. If reached, the best value and best position values of the fitness function are displayed. Step 4: Depending on the structure of the algorithm, the maximum diffusion number is determined. Depending on the maximum diffusion number, the steps of the diffusion process in Figure 4 are applied. Step 5: According to Step 4, after the condition in the diffusion process is met and the necessary operations are performed, the algorithm performs the first updating process step in the flow diagram. The algorithm checks whether each ordered P value is less than the randomly generated number. Depending on this condition, the necessary processing is carried out. Step 6: When the condition step in Step 5 is completed, the second updating process is started. In this step, all solution candidates in the population are ranked. Depending on the specified condition, either the selection method of the SFS algorithm is applied or the FDB selection method is applied, depending on Equations ((17), (18), (19), (20)). After the selection steps are performed depending on the specified conditions, the position of the P solution candidate in the population is updated. Step 7: The current iteration number of the algorithm is increased and the algorithm goes to Step 3 and checks whether the condition there is met. Figure 5 shows the application of the SFSFDB algorithm to the training of the proposed MLR‐FFANN model. Here, the input dataset is expressed as environmental conditions, days of the week, COVID‐19 pandemic precautions, the average electrical energy consumption values for the MLR method, and the average electric energy consumption estimated error values for FFANN, whereas the output dataset shows the daily average energy consumption of the city of Bursa. By using the normalization expression shown in Equation (21) for the input and output datasets, the values of the data are transformed into a dataset varying between 0 and 1.
FIGURE 5

Flowchart of SFSFDB algorithm applied to MLR‐FFANN

Flowchart of SFSFDB algorithm applied to MLR‐FFANN

EXPERIMENTAL RESULTS

In this study, a multiple linear regression and feedforward neural network‐based hybrid algorithm was proposed to predict the electric energy consumption of the city of Bursa during the COVID‐19 pandemic, and the design of this algorithm was carried out using AGDE, EO, SMA, and SFSFDB optimization algorithms. Although the optimization algorithms used minimized the mean squared error, they aimed to find the most appropriate weight and bias coefficients of the FFANN model and the β polynomial coefficients of the multiple linear regression algorithm. Furthermore, the success of the following design structures in training of the FFANN model was examined in the optimization process. The case studies of the FFANN structural designs were carried out as follows. Case 1: Using the hyperbolic tangent function in the hidden and output layers for the FFANN model. Case 2: Using the hyperbolic tangent function in the hidden layer and the sigmoid function in the output layer for the FFANN model. Case 3: Using the sigmoid function in the hidden layer and the hyperbolic tangent function in the output layer for the FFANN model. Case 4: Using the sigmoid function in the hidden and the output layers for the FFANN model. Each of the structures for the FFANN model case studies is shown in Table 4. The simulation studies were conducted according to these network structures.
TABLE 4

FFANN structures used in study cases

Case 1Case 2Case 3Case 4
Hidden layerOutput layerHidden layerOutput layerHidden layerOutput layerHidden layerOutput layer
Hyperbolic tangentHyperbolic tangentHyperbolic tangentSigmoidSigmoidHyperbolic tangentSigmoidSigmoid
FFANN structures21 × 5 × 1
21 × 6 × 1
21 × 7 × 1
21 × 8 × 1
21 × 9 × 1
21 × 10 × 1
FFANN structures used in study cases The input and output parameters of the proposed hybrid MLR‐FFANN model are as follows. Input parameters of the hybrid MLR‐FFANN model Days of the week COVID‐19 pandemic process precautions Environmental conditions Average electric energy consumption values for MLR Predicted error values of average electric energy consumption for FFANN Output parameters of the hybrid MLR‐FFANN model Electricity energy consumption values of the Bursa Metropolitan Province During the COVID‐19 pandemic, 80% of the dataset created for the estimation of the electric energy consumption of the Bursa Metropolitan Province was used in the training of the hybrid MLR‐FFANN model and 20% in the testing of the model. The mean absolute error (MAE), mean absolute percentage error (MAPE), and root mean square error (RMSE) criteria were used to evaluate the success of the proposed model in training and testing data. These evaluation criteria are shown in Equations ((22), (23), (24)). The MAE, MAPE, and RMSE evaluation criteria results from the training and testing data of different FFANN structures for all working cases of the AGDE algorithm are given in Table 5. Depending on these results, all working cases and the ranking of the network structures are shown in Table 6 in order to identify which working condition among the network structures used in the different working cases was more successful in the FFANN training process of the AGDE algorithm.
TABLE 5

MAE, MAPE and RMSE values obtained from the different FFANN structures optimized by AGDE algorithm

AlgorithmFFANN structuresData typeEvaluation criteriaCase 1Case 2Case 3Case 4
Hidden layerOutput layerHidden layerOutput layerHidden layerOutput layerHidden layerOutput layer
Hyperbolic tangentHyperbolic tangentHyperbolic tangentSigmoidSigmoidHyperbolic tangentSigmoidSigmoid
AGDE21 × 5 × 1Training dataMAE27.394126.189130.236030.1717
MAPE5.26785.10115.99405.9849
RMSE38.747438.853144.042844.1025
Test dataMAE56.758453.198044.823042.6288
MAPE10.876110.52338.94358.5632
RMSE98.592878.259064.479061.7684
21 × 6 × 1Training dataMAE27.542527.776028.635430.0492
MAPE5.32055.30315.55395.9714
RMSE39.764338.442441.292144.0109
Test dataMAE52.585454.733049.225942.6781
MAPE10.315410.71559.82328.5095
RMSE78.641276.531673.469261.5986
21 × 7 × 1Training dataMAE25.356226.977027.386525.8093
MAPE4.81135.31835.32524.9777
RMSE35.198442.222739.672336.9277
Test dataMAE58.701742.728045.361350.1409
MAPE11.46378.68799.29569.9568
RMSE93.843560.872968.730571.9617
21 × 8 × 1Training dataMAE26.472427.724725.850325.2653
MAPE5.00115.33674.90744.9397
RMSE36.989039.815036.222336.9891
Test dataMAE51.193852.955652.833960.1406
MAPE10.176910.354710.480611.5400
RMSE79.522275.210378.241597.8941
21 × 9 × 1Training dataMAE26.605726.252727.565731.0802
MAPE5.09625.08665.32656.1767
RMSE37.760138.109539.392445.5558
Test dataMAE47.702453.515454.517541.0055
MAPE9.464610.480910.79268.3016
RMSE73.329980.301180.993959.6515
21 × 10 × 1Training dataMAE24.304226.261726.816128.0184
MAPE4.57085.07305.16975.4671
RMSE34.752736.886238.232239.9554
Test dataMAE63.111350.419151.850750.8396
MAPE12.35879.808110.192710.0942
RMSE99.355676.002876.314674.0462
TABLE 6

Ranking of results of simulation studies of the AGDE algorithm for different operational cases

AlgorithmCasesLayersActivation functionsData typeEvaluation criteriaFFANN structures
21 × 5 × 121 × 6 × 121 × 7 × 121 × 8 × 121 × 9 × 121 × 10 × 1
AGDECase 1Hidden layer/output layerHyperbolic tangent/hyperbolic tangentTraining dataMAE562341
MAPE562341
RMSE562341
Test dataMAE435216
MAPE435216
RMSE524316
Case 2Hidden layer/output layerHyperbolic tangent/sigmoidTraining dataMAE164523
MAPE345621
RMSE436521
Test dataMAE461352
MAPE561342
RMSE541263
Case 3Hidden layer/output layerSigmoid/hyperbolic tangentTraining dataMAE653142
MAPE653142
RMSE654132
Test dataMAE132564
MAPE132564
RMSE132564
Case 4Hidden layer/output layerSigmoid/sigmoidTraining dataMAE642153
MAPE542163
RMSE541263
Test dataMAE234615
MAPE324615
RMSE324615
MAE, MAPE and RMSE values obtained from the different FFANN structures optimized by AGDE algorithm Ranking of results of simulation studies of the AGDE algorithm for different operational cases Figure 6 shows that the training of the proposed MLR‐FFANN model was carried out in 250 iterations by all optimization algorithms. During the training of the MLR‐FFANN model in different operational situations, it is clearly seen that the SFSFDB algorithm in Cases 1 and 3, and the AGDE algorithm in Cases 2 and 4 compared to the other algorithms, were more successful in finding the optimal value of the objective function. Moreover, it can be seen from Figure 6B that the SFSFDB algorithm was as successful as the AGDE algorithm in the training method proposed for Case 2.
FIGURE 6

(A–D) Convergence curves of optimization algorithms for different operational cases

(A–D) Convergence curves of optimization algorithms for different operational cases The results of MAE, MAPE, and RMSE evaluation criteria obtained from the training and test data of different FFANN structures for all working cases of EO, SMA, and SFSFDB algorithms are given in Tables 7, 8, 9, respectively. The ranking of the evaluation criteria results obtained for the training and testing data at the end of the optimization process of the FFANN design structures by the EO, SMA, and SFSFDB optimization algorithms in the different study cases is shown in Tables 10, 11, 12.
TABLE 7

MAE, MAPE, and RMSE values obtained from the different FFANN structures optimized by the EO algorithm

AlgorithmFFANN structuresData typeEvaluation criteriaCase 1Case 2Case 3Case 4
Hidden layerOutput layerHidden layerOutput layerHidden layerOutput layerHidden layerOutput layer
Hyperbolic tangentHyperbolic tangentHyperbolic tangentSigmoidSigmoidHyperbolic tangentSigmoidSigmoid
EO21 × 5 × 1Training dataMAE30.466829.887431.381131.8208
MAPE5.97115.87856.27326.3167
RMSE43.832243.338846.740645.9720
Test dataMAE44.764243.851039.074141.4298
MAPE9.00758.87218.08248.4246
RMSE66.329662.030859.162559.7926
21 × 6 × 1Training dataMAE29.326530.332430.650530.9127
MAPE5.71695.97515.95256.1542
RMSE41.696342.678644.353045.5809
Test dataMAE50.837149.814942.164540.7968
MAPE10.18279.95338.42568.2865
RMSE74.714271.320363.278359.3978
21 × 7 × 1Training dataMAE28.214327.748530.038329.5706
MAPE5.47865.43955.95375.8674
RMSE41.095840.258343.476243.6306
Test dataMAE42.565953.879248.836342.6071
MAPE8.557610.45889.75828.6709
RMSE63.306679.309369.861664.3107
21 × 8 × 1Training dataMAE28.619529.023330.567430.9093
MAPE5.45985.69435.96486.1330
RMSE38.572140.656342.804544.4658
Test dataMAE61.613251.917345.926645.8902
MAPE12.182910.32619.14889.2047
RMSE86.706073.320467.527266.7979
21 × 9 × 1Training dataMAE28.682727.576929.971030.0967
MAPE5.54485.35865.87275.8982
RMSE40.093139.147042.829842.6977
Test dataMAE54.327144.222745.562049.3985
MAPE10.59018.98839.17019.6140
RMSE77.518866.242068.761369.9643
21 × 10 × 1Training dataMAE26.330724.945326.880729.1426
MAPE5.06764.77705.17265.7892
RMSE37.293436.146437.846642.4553
Test dataMAE49.080247.410555.073644.3604
MAPE9.72129.379510.74148.9299
RMSE74.532570.933586.737265.7630
TABLE 8

MAE, MAPE, and RMSE values obtained from the different FFANN structures optimized by the SMA

AlgorithmFFANN structuresData typeEvaluation criteriaCase 1Case 2Case 3Case 4
Hidden layerOutput layerHidden layerOutput layerHidden layerOutput layerHidden layerOutput layer
Hyperbolic tangentHyperbolic tangentHyperbolic tangentSigmoidSigmoidHyperbolic tangentSigmoidSigmoid
SMA21 × 5 × 1Training dataMAE33.751532.150934.932235.0882
MAPE6.72716.45736.97167.0292
RMSE49.435146.508250.234250.9672
Test dataMAE38.662746.791741.169037.9265
MAPE7.98849.41618.43937.8526
RMSE57.040265.449958.620155.8728
21 × 6 × 1Training dataMAE36.098534.152835.239636.2255
MAPE7.29396.86097.09437.2331
RMSE53.237850.189751.122751.4780
Test dataMAE41.108440.732742.499843.9866
MAPE8.44128.30308.60268.8966
RMSE56.454858.774259.144261.4382
21 × 7 × 1Training dataMAE33.070832.687332.732132.6362
MAPE6.61206.53536.56246.5491
RMSE48.936247.418148.385148.8187
Test dataMAE39.632741.781039.639037.0926
MAPE8.10208.42358.13057.6234
RMSE58.510859.818158.157654.8335
21 × 8 × 1Training dataMAE33.093634.942033.997134.3282
MAPE6.62716.96436.80006.8296
RMSE48.154349.294549.369849.5032
Test dataMAE42.087444.191340.340038.3903
MAPE8.62239.13828.21487.8880
RMSE61.782463.071058.810356.4119
21 × 9 × 1Training dataMAE32.533032.600034.024833.0650
MAPE6.47076.48836.81256.6219
RMSE46.996647.295949.133748.5220
Test dataMAE44.299742.852239.254938.5194
MAPE8.91438.72908.07047.8535
RMSE62.696860.967957.084756.4597
21 × 10 × 1Training dataMAE31.689832.485535.293633.8145
MAPE6.35666.44317.05466.7522
RMSE46.909845.709250.262548.8474
Test dataMAE36.796647.813840.216442.9139
MAPE7.64979.45738.39518.7180
RMSE55.323369.952459.664160.5452
TABLE 9

MAE, MAPE, and RMSE values obtained from the different FFANN structures optimized by the SFSFDB algorithm

AlgorithmFFANN structuresData typeEvaluation criteriaCase 1Case 2Case 3Case 4
Hidden layerOutput layerHidden layerOutput layerHidden layerOutput layerHidden layerOutput layer
Hyperbolic tangentHyperbolic tangentHyperbolic tangentSigmoidSigmoidHyperbolic tangentSigmoidSigmoid
SFSFDB21 × 5 × 1Training dataMAE29.784826.393625.069325.8692
MAPE5.77965.08504.83395.0302
RMSE41.600437.370535.998238.1307
Test dataMAE51.597149.713654.000955.3014
MAPE10.30109.794710.713710.8689
RMSE77.281569.123688.625985.7601
21 × 6 × 1Training dataMAE26.450727.254627.726929.7217
MAPE5.03565.28105.32985.8510
RMSE37.849539.074139.730742.3777
Test dataMAE59.996853.479253.648448.2257
MAPE11.699710.510710.65779.7180
RMSE88.554178.140680.779869.5811
21 × 7 × 1Training dataMAE25.375826.779427.381128.3706
MAPE4.85275.23265.22935.5572
RMSE35.439338.767138.542040.9998
Test dataMAE60.161248.006351.682947.8107
MAPE11.97269.609610.01169.4816
RMSE89.818970.865184.339870.8205
21 × 8 × 1Training dataMAE24.791326.653927.268127.8048
MAPE4.69645.16375.19865.4324
RMSE35.333637.586437.975140.4902
Test dataMAE66.822252.546055.513845.6329
MAPE13.322710.348611.01769.0612
RMSE106.694775.468981.127269.7324
21 × 9 × 1Training dataMAE23.399723.267922.783623.5089
MAPE4.39954.41334.26574.4659
RMSE33.738833.213832.305233.6666
Test dataMAE63.880262.884156.883254.1425
MAPE12.673912.225011.310510.6307
RMSE105.348495.027986.869884.1150
21 × 10 × 1Training dataMAE22.331021.972622.097521.7047
MAPE4.09064.12944.12564.1347
RMSE32.197931.035431.215932.0118
Test dataMAE69.550561.486455.828656.7811
MAPE13.639511.911711.238211.1619
RMSE103.011592.556785.688782.4051
TABLE 10

Ranking of results of simulation studies of the EO algorithm for different operational cases

AlgorithmCasesLayersActivation functionsData typeEvaluation criteriaFFANN structures
21 × 5 × 121 × 6 × 121 × 7 × 121 × 8 × 121 × 9 × 121 × 10 × 1
EOCase 1Hidden layer/output layerHyperbolic tangent/hyperbolic tangentTraining dataMAE652341
MAPE653241
RMSE654231
Test dataMAE241653
MAPE241653
RMSE241653
Case 2Hidden layer/output layerHyperbolic tangent/sigmoidTraining dataMAE563421
MAPE563421
RMSE653421
Test dataMAE146523
MAPE146523
RMSE146523
Case 3Hidden layer/output layerSigmoid/hyperbolic tangentTraining dataMAE653421
MAPE634521
RMSE645231
Test dataMAE125436
MAPE125346
RMSE125346
Case 4Hidden layer/output layerSigmoid/sigmoidTraining dataMAE652431
MAPE652431
RMSE653421
Test dataMAE213564
MAPE213564
RMSE213564
TABLE 11

Ranking of results of simulation studies of SMA for different operational cases

AlgorithmCasesLayersActivation functionsData typeEvaluation criteriaFFANN structures
21 × 5 × 121 × 6 × 121 × 7 × 121 × 8 × 121 × 9 × 121 × 10 × 1
SMACase 1Hidden layer/output layerHyperbolic tangent/hyperbolic tangentTraining dataMAE563421
MAPE563421
RMSE564321
Test dataMAE243561
MAPE243561
RMSE324561
Case 2Hidden layer/output layerHyperbolic tangent/sigmoidTraining dataMAE154632
MAPE254631
RMSE264531
Test dataMAE512436
MAPE512436
RMSE412436
Case 3Hidden layer/output layerSigmoid/hyperbolic tangentTraining dataMAE451236
MAPE461235
RMSE461325
Test dataMAE562413
MAPE562314
RMSE352416
Case 4Hidden layer/output layerSigmoid/sigmoidTraining dataMAE561423
MAPE561423
RMSE562413
Test dataMAE261345
MAPE261435
RMSE261345
TABLE 12

Ranking of results of simulation studies of the SFSFDB algorithm for different operational cases

AlgorithmCasesLayersActivation functionsData typeEvaluation criteriaFFANN structures
21 × 5 × 121 × 6 × 121 × 7 × 121 × 8 × 121 × 9 × 121 × 10 × 1
SFSFDBCase 1Hidden layer/output layerHyperbolic tangent/hyperbolic tangentTraining dataMAE654321
MAPE654321
RMSE654321
Test dataMAE123546
MAPE123546
RMSE123654
Case 2Hidden layer/output layerHyperbolic tangent/sigmoidTraining dataMAE365421
MAPE365421
RMSE365421
Test dataMAE241365
MAPE241365
RMSE142365
Case 3Hidden layer/output layerSigmoid/hyperbolic tangentTraining dataMAE365421
MAPE365421
RMSE365421
Test dataMAE321465
MAPE321465
RMSE613254
Case 4Hidden layer/output layerSigmoid/sigmoidTraining dataMAE365421
MAPE365421
RMSE365421
Test dataMAE532146
MAPE532146
RMSE613254
MAE, MAPE, and RMSE values obtained from the different FFANN structures optimized by the EO algorithm MAE, MAPE, and RMSE values obtained from the different FFANN structures optimized by the SMA MAE, MAPE, and RMSE values obtained from the different FFANN structures optimized by the SFSFDB algorithm Ranking of results of simulation studies of the EO algorithm for different operational cases Ranking of results of simulation studies of SMA for different operational cases Ranking of results of simulation studies of the SFSFDB algorithm for different operational cases The comparative analysis and ranking of FFANN structures in simulation cases for all optimization algorithms are given in Table 13. These rankings are shown depending on the sum of the ranking results, mean values of these results, and standard deviation values given in Tables 6 and 10, 11, 12. According to these ranking results, the FFANN structure that completed the training and testing process the most successfully was determined among the simulation cases for each optimization algorithm. The identified FFANN structures are denoted in Table 13 in different colors for each optimization algorithm.
TABLE 13

Comparison analysis of the ranking results of the FFANN structures for optimization algorithms

AlgorithmCasesFFANN structuresFinal rankSumMeanStandard deviationAlgorithmCasesFFANN structuresFinal rankSumMeanStandard deviation
AGDECase 121 × 5 × 16284.66670.5164EOCase 121 × 5 × 13244.00002.1909
21 × 6 × 15264.33331.861921 × 6 × 16274.50000.5477
21 × 7 × 13203.33331.505521 × 7 × 12122.00001.2649
21 × 8 × 12162.66670.516421 × 8 × 14254.16672.0412
21 × 9 × 11152.50001.643221 × 9 × 15264.33330.8165
21 × 10 × 14213.50002.738621 × 10 × 11122.00001.0954
Case 221 × 5 × 14223.66671.5055Case 221 × 5 × 13193.16672.4014
21 × 6 × 16294.83331.329221 × 6 × 16294.83330.9832
21 × 7 × 12183.00002.280421 × 7 × 15274.50001.6432
21 × 8 × 15244.00001.549221 × 8 × 14274.50000.5477
21 × 9 × 13213.50001.760721 × 9 × 11122.00000.0000
21 × 10 × 11122.00000.894421 × 10 × 12122.00001.0954
Case 321 × 5 × 14213.50002.7386Case 321 × 5 × 15213.50002.7386
21 × 6 × 15244.00001.095421 × 6 × 12183.00001.2649
21 × 7 × 11162.66670.816521 × 7 × 16274.50000.8367
21 × 8 × 13183.00002.190921 × 8 × 13213.50001.0488
21 × 9 × 16294.83331.329221 × 9 × 11183.00000.8944
21 × 10 × 12183.00001.095421 × 10 × 14213.50002.7386
Case 421 × 5 × 16244.00001.5492Case 421 × 5 × 14244.00002.1909
21 × 6 × 12193.16670.983221 × 6 × 13183.00002.1909
21 × 7 × 11172.83331.329221 × 7 × 12162.66670.5164
21 × 8 × 14223.66672.582021 × 8 × 16274.50000.5477
21 × 9 × 13203.33332.582021 × 9 × 15264.33331.8619
21 × 10 × 15244.00001.095421 × 10 × 11152.50001.6432
Comparison analysis of the ranking results of the FFANN structures for optimization algorithms In Table 14, the average values of the evaluation criteria obtained for the FFANN structures used in all cases are calculated and the sorting studies of the algorithms according to these values are indicated in both the training and test datasets. In this table, the most successful cases of the AGDE algorithm can be expressed as Cases 2 and 3. The most successful cases for EO, SMA, and SFSFDB algorithms are defined as Cases 2, 4, and 3, respectively. The results of the evaluation criteria obtained in both training and test data for each case (Table 14) are shown in detail in the bar graphs in Figure 7.
TABLE 14

Comparison analysis of the ranking results of study cases for optimization algorithms

AGDEEO
CasesMean values of the results for all of casesCasesMean values of the results for all of cases
Training data resultsTest data resultsTraining data resultsTest data results
MAEMAPERMSEMAEMAPERMSEMAEMAPERMSEMAEMAPERMSE
Case 126.2791835.01128337.20198355.00883310.775987.2142Case 128.606755.539840.43048350.53128310.04033373.851283
Case 226.8635335.20313339.05481651.25818310.09506674.529616Case 228.25235.520540.370948.5159339.66301670.52605
Case 327.7483335.3794539.80901649.7687169.92136673.7047Case 329.9148335.86491643.0084546.1061839.22108369.22135
Case 428.3990165.5862541.256947.9055839.49421671.153416Case 430.4087836.0264544.13371644.0804668.855164.337716
FIGURE 7

(A–D) Evaluation criteria for all optimization algorithms

Comparison analysis of the ranking results of study cases for optimization algorithms (A–D) Evaluation criteria for all optimization algorithms The results of the evaluation criteria of the best cases in each optimization algorithm (Figure 7 and Table 14) are shown in bar graph form in Figure 8 for both training and test data so that the results can be better interpreted.
FIGURE 8

Evaluation criteria for the best case of all optimization algorithms

Evaluation criteria for the best case of all optimization algorithms After determining the best cases used in the training process of FFANN for each optimization algorithm (Table 14), the results of the evaluation criteria of the best FFANN structures in the cases, the ranking of these results for both training and test data, and the sum of these ranking results, mean value, and standard deviation values are shown in Table 15. In other words, the final ranking in Table 15 clearly shows that the EO algorithm was the first in the training of the dataset, followed by the SFSFDB and SMA algorithms.
TABLE 15

Determination of the best network structures for all optimization algorithms

The evaluation criteria of the best FFANN structures for the training and test process
AlgorithmsCasesFFANN structuresTraining data resultsTest data results
MAEMAPERMSEMAEMAPERMSE
AGDECase 321 × 7 × 127.38655.325239.672345.36139.295668.7305
EOCase 221 × 9 × 127.57695.358639.147044.22278.988366.2420
SMACase 421 × 7 × 132.63626.549148.818737.09267.623454.8335
SFSFDBCase 321 × 10 × 122.09754.125631.215955.828611.238285.6887
Determination of the best network structures for all optimization algorithms The FFANN structures determined for all optimization algorithms were adapted to the proposed hybrid MLR‐FFANN algorithm. By using MLR‐FFANN algorithms, where the most appropriate weight, bias, and β polynomial coefficients were determined by the optimization algorithms, the electric energy consumption of the Bursa Metropolitan Province was predicted during the COVID‐19 pandemic period until September 2020. As a result of the prediction process, the evaluation criteria, the ranking of these criteria, and the final ranking of the algorithms are shown in Table 16. According to the final ranking of the algorithms, the MLR‐FFANN method trained with the SFSFDB algorithm was more successful than other algorithms in estimating the electric energy consumption values of Bursa during the COVID‐19 pandemic. When Tables 15 and 16 were evaluated together for the final rankings of the optimization algorithms, the MLR‐FFANN, whose training process was completed with the SFSFDB algorithm, was more successful than the MLR‐FFANN trained with the other (EO, SMA, and AGDE) algorithms.
TABLE 16

Evaluation criteria and ranking results for all optimization algorithms for the year 2020

Prediction of the year 2020 for all optimization algorithms
AlgorithmsCasesFFANN structuresMAEMAPERMSE
AGDECase 321 × 7 × 152.523011.247772.7393
EOCase 221 × 9 × 141.19358.249056.9251
SMACase 421 × 7 × 167.509714.685292.2592
SFSFDBCase 321 × 10 × 133.22847.095848.9234
Evaluation criteria and ranking results for all optimization algorithms for the year 2020 In other words, the SFSFDB algorithm had a value of 33.2284 for the MAE evaluation criterion, which is 36.7355%, 19.3358%, and 50.7798% less than the results of the AGDE, EO, and SMA algorithms, respectively. When the results were evaluated in terms of the MAPE criterion, the SFSFDB algorithm had a value of 7.0958, a value that is 36.9133%, 13.9798%, and 51.6825% less than the results of the system trained by the other optimization algorithms. Considering the RMSE values, the result of the model designed with the SFSFDB optimization algorithm was 23.8159%, 8.0017%, and 43.3358% less than the results of the other algorithms, respectively. Figure 9A gives the comparison of the most suitable MLR‐FFANN structures, determined according to the results obtained from all optimization algorithms, in estimating the electric energy consumption values of the city of Bursa until September 2020 during the COVID‐19 pandemic process, according to the evaluation criteria. Figure 9B shows the difference between the actual and estimated electric energy consumption values. Actual and predicted electricity energy consumption data are detailed in Figure 9C.
FIGURE 9

For the year 2020: (A) Prediction for the year 2020, (B) error values between actual and predicted consumption values, and (C) actual and predicted electric energy consumption values of the city of Bursa for the year 2020

For the year 2020: (A) Prediction for the year 2020, (B) error values between actual and predicted consumption values, and (C) actual and predicted electric energy consumption values of the city of Bursa for the year 2020

Accuracy and reliability of the developed SFSFDB based MLR‐FFANN

Outlier detection

Outlier detection is critical in the development of appropriate models for recognizing individual data that may differ from the data chunk contained in a dataset. In the literature, the methods consisting of both numerical and graphical algorithms have been proposed for this purpose. , , Leverage approach is an algorithm that is considered quite good for detecting these different values. This method considers the deviations of a model from the experimental data, a matrix of experimental data known as the Hat matrix, and the predicted values. The main criterion in the application of this method is to use a model that can calculate or estimate the relevant data within acceptable limits. Leverage or hat indices (hat matrix (H)) are calculated using Equation (25) , , , , , : where X is a (n × k) matrix consisting of n data (rows) and k parameters (columns) of the model and t denotes the transpose matrix. Hat values of the data form the diagonal elements of the H value. The Williams plot is created by calculating the H values in Equation (25). Outliers or suspicious data are graphically identified in this plot. This graph shows the correlation of the Hat indices with the standardized cross‐validated residuals (R). A warning leverage (H*) is usually fixed at a value equal to 3p/n; where n represents the number of data points and p represents one more than the number of model parameters. A leverage set to three is normally considered a cut‐off value to accept points within ±3 standard deviations from the mean. The fact that the majority of data points are in the range of 0 ≤ H ≤ H* and −3 ≤ R ≤ 3 indicates that both the model development and its predictions are within the applicability limits. Thus, a statistically valid model is presented. “Good high leverage” points must be in the H* ≤ H and −3 ≤ R ≤ 3 fields. Points greater than H* or within the specified value range (R < −3 or 3 < R) are defined as model outliers or “bad high leverage” points. These erroneous representations/predictions are called as the doubtful data. In this study, the statistical hat matrix, Williams plot, and leverage approach, which enables outliers to be recognized in the model results, were used to check the reliability of the proposed model. H values were calculated with Equation (25) and the evaluation steps were followed step by step according to the procedure mentioned above. Thus, the Williams plot for the results from the SFSFDB model is plotted in Figure 10. The cyan color line in Figure 10 is the critical H* value or the leverage constraint which is computed as 0.1317 in this study. Moreover, the red line of R = ±3 is defined as the suspected limits. It can be seen that the majority of data points lie in the range 0 ≤ H ≤ H* and −3 ≤ R ≤ 3. As a result, the points corresponding to 96.3068% of the total number of data in the data set fall into the domain, while the remaining 3.6932% can be expressed as points outside the domain. These results show that the applied model is statistically acceptable and valid.
FIGURE 10

Detection of potentially suspect data and the domain of applicability of the developed SFSFDB model

Detection of potentially suspect data and the domain of applicability of the developed SFSFDB model

Analysis of the cumulative frequency versus absolute percent relative error

The cumulative frequency graph can be consulted for further comparison of the accuracy of the proposed models. , , , , , , , , , , This statistical methodology gives an idea of the average absolute percent relative error or absolute percent relative error models below a certain value. Equation (26) is explained as a relative percent error: where E is identified as the relative deviation of a represented/predicted value from an experimental value. x and o are defined as the actual and estimated/predicted value of the ith data, respectively, and n is the total number of data. The mean absolute percentage relative errors given in Equation (27) are used to calculate the absolute deviation of the experimental data. In this study, the cumulative frequency curve given in Figure 11, in which absolute percent relative error values are plotted against the cumulative frequency values, provides further comparisons regarding the accuracy of the proposed models. As can be seen from Figure 11, 90% of the predictions made by the optimized models by the SFSFDB, SMA, AGDE, and EO algorithms have absolute percent relative errors smaller than 27.1328%, 79.6028%, 53.9706%, and 32.6391%, respectively. These results represent another validation analysis of the superiority of the SFSFDB model over other models. Moreover, when the results of all proposed soft calculation methods are evaluated in themselves, they can be expressed as methods that provide fair enough accuracy for estimating electrical energy consumption data involving COVID‐19 precautions.
FIGURE 11

For the cumulative frequency versus absolute percent relative error: (A) The results of the different models and (B) zoom version of the results

For the cumulative frequency versus absolute percent relative error: (A) The results of the different models and (B) zoom version of the results

Sensitivity analysis

The robust predictions can be made as a result of the proposed algorithm being a sufficiently successful model. Therefore, it is necessary to determine whether it is a sufficiently successful model. The important thing here is that accuracy should not be evaluated alone when determining the adequacy of the model. Considering this situation, the degree of effectiveness of the independent variables affecting the model should be investigated by performing a sensitivity analysis. The cosine amplitude method (CAM), which is defined as one of the best sensitivity analyzes in the literature, was used in this study. The variable (R ), which affects the electrical energy consumption the most, is used to measure the degree of sensitivity among the independent variables. The higher the R , the more the independent variables affect the electrical energy consumption. If the relationship between the independent variables and electrical energy consumption is positive, the R value is positive, and if the relationship between them is negative, the R value is also negative. X array with n arguments is X = {x 1, x , x , …, x }, each element of this array as “x ” and if there are “m” of each argument, the elements of the data set are X  = {x , x , x ,…,x }. If the relationship between the dependent and independent variables in the data set is called x and x , the R value is calculated as in Equation (28) ; Figure 12 shows the results of the sensitivity analysis according to the CAM performed within the scope of this study. er(t − 1) is defined as the error value between the previous prediction and the true value, and er(t − 2) represents the error value between the two previous prediction and the true value. As can be seen from the results of the analysis, the factor that affects the electrical energy consumption estimation the most is the average humidity value with 0.9271. Average wind speed value follows the average humidity value with 0.9206. Average pressure with 0.8555 and average temperature with 0.8326, which are very close to these values, are seen as the most the other influencing factors. It is seen that the previous error value, er(t − 1) and the two previous error values, er(t − 2), also highly affect the estimation process. It is seen that the days of the week and COVID‐19 precautions have a less effect than the others.
FIGURE 12

The sensitivity analysis of dataset

The sensitivity analysis of dataset

CONCLUSION

In this study, the recently proposed SFSFDB algorithm was used for the first time as a trainer of the hybrid MLR‐FFANN algorithm for the estimation of the energy consumption values of the city of Bursa during the COVID‐19 pandemic. The SFSFDB algorithm has the ability to search both locally and globally and optimizes the weight, bias, and β polynomial coefficients of the proposed hybrid MLR‐FFANN model. The design of the network and the activation functions to be used in the training of the FFANN are among the factors affecting the success percentage of the network in training. Considering this situation, network structures with different hidden layer numbers were considered in the FFANN design of the hybrid model. Moreover, the activation functions in the hidden and output layers used in these network structures were discussed in different study cases as combinations of hyperbolic tangent and sigmoid functions. Whereas the training process of traditional ANNs is carried out by the subjective experience of the researcher or through many iterations, the training process of the hybrid model proposed for different cases in this study was carried out in 250 iterations. In order to show the success of the proposed hybrid MLR‐FFANN approach in the training process, the simulation results obtained from the SFSFDB algorithm were compared with the AGDE, EO, and SMA heuristic optimization methods used in the optimization problems of different science fields in the literature. The MAE, MAPE, and RMSE evaluation criteria obtained from the comparison results were statistically examined to evaluate the success of the algorithms in the training of the proposed method. Among the heuristic optimization approaches used in the training process of the hybrid MLR‐FFANN, the SFSFDB algorithm was more successful in Cases 1 and 3, the SFSFDB and AGDE algorithms for Case 2, and the AGDE algorithm in Case 4, compared to other algorithms. The simulation results in terms of the evaluation criteria in both training and testing processes and the success ranking of the algorithms showed that the EO and SFSFDB algorithms were more successful than the other algorithms. Furthermore, during the 2020 COVID‐19 pandemic, the electric energy consumption values of Bursa were estimated and at the end of the estimation process, the hybrid MLR‐FFANN, which was trained by optimizing the parameters via the SFSFDB algorithm, was more successful in the prediction process than the MLR‐FFANN structures trained with the other optimization algorithms. The following conclusions were reached as a result of this research: When the results of the proposed method based on SFSFDB are examined, statistically MAE, MAPE and RMSE values have the best values with 33.2284, 7.0958, and 48.9234, respectively. The validity of the suggested method was demonstrated by outlier analysis, which shows that all data points are within an acceptable range. Environmental factors had a more pronounced effect on the estimation of electrical energy consumption than days of the week, error values, and COVID‐19 precautions, according to the results of the sensitivity analysis. As a result, COVID‐19 pandemic precautions were used as input data in the dataset for the estimation of electric energy consumption, and the hybrid MLR‐FFANN approach, whose parameters were optimized with the SFSFDB algorithm, successfully carried out the training and learning processes.

CONFLICT OF INTEREST

The authors declare that they have no conflict of interest.
  6 in total

1.  Impact of COVID-19 Measures on Short-Term Electricity Consumption in the Most Affected EU Countries and USA States.

Authors:  Javier López Prol; Sungmin O
Journal:  iScience       Date:  2020-10-05

2.  Forecasting the U.S. oil markets based on social media information during the COVID-19 pandemic.

Authors:  Binrong Wu; Lin Wang; Sirui Wang; Yu-Rong Zeng
Journal:  Energy (Oxf)       Date:  2021-03-18       Impact factor: 7.147

3.  Prediction of electricity energy consumption including COVID-19 precautions using the hybrid MLR-FFANN optimized with the stochastic fractal search with fitness distance balance algorithm.

Authors:  Adem Dalcali; Harun Özbay; Serhat Duman
Journal:  Concurr Comput       Date:  2022-03-22       Impact factor: 1.831

4.  Electricity Consumption Estimation of the Polymer Material Injection-Molding Manufacturing Process: Empirical Model and Application.

Authors:  Ana Elduque; Daniel Elduque; Carmelo Pina; Isabel Clavería; Carlos Javierre
Journal:  Materials (Basel)       Date:  2018-09-16       Impact factor: 3.623

  6 in total
  1 in total

1.  Prediction of electricity energy consumption including COVID-19 precautions using the hybrid MLR-FFANN optimized with the stochastic fractal search with fitness distance balance algorithm.

Authors:  Adem Dalcali; Harun Özbay; Serhat Duman
Journal:  Concurr Comput       Date:  2022-03-22       Impact factor: 1.831

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.