Literature DB >> 32256554

Evaluation Method of Multiobjective Functions' Combination and Its Application in Hydrological Model Evaluation.

Jiuyuan Huo1, Liqun Liu2.   

Abstract

Parameter optimization of a hydrological model is intrinsically a high dimensional, nonlinear, multivariable, combinatorial optimization problem which involves a set of different objectives. Currently, the assessment of optimization results for the hydrological model is usually made through calculations and comparisons of objective function values of simulated and observed variables. Thus, the proper selection of objective functions' combination for model parameter optimization has an important impact on the hydrological forecasting. There exist various objective functions, and how to analyze and evaluate the objective function combinations for selecting the optimal parameters has not been studied in depth. Therefore, to select the proper objective function combination which can balance the trade-off among various design objectives and achieve the overall best benefit, a simple and convenient framework for the comparison of the influence of different objective function combinations on the optimization results is urgently needed. In this paper, various objective functions related to parameters optimization of hydrological models were collected from the literature and constructed to nine combinations. Then, a selection and evaluation framework of objective functions is proposed for hydrological model parameter optimization, in which a multiobjective artificial bee colony algorithm named RMOABC is employed to optimize the hydrological model and obtain the Pareto optimal solutions. The parameter optimization problem of the Xinanjiang hydrological model was taken as the application case for long-term runoff prediction in the Heihe River basin. Finally, the technique for order preference by similarity to ideal solution (TOPSIS) based on the entropy theory is adapted to sort the Pareto optimal solutions to compare these combinations of objective functions and obtain the comprehensive optimal objective functions' combination. The experiments results demonstrate that the combination 2 of objective functions can provide more comprehensive and reliable dominant options (i.e., parameter sets) for practical hydrological forecasting in the study area. The entropy-based method has been proved that it is effective to analyze and evaluate the performance of different combinations of objective functions and can provide more comprehensive and impersonal decision support for hydrological forecasting.
Copyright © 2020 Jiuyuan Huo and Liqun Liu.

Entities:  

Mesh:

Year:  2020        PMID: 32256554      PMCID: PMC7085873          DOI: 10.1155/2020/8594727

Source DB:  PubMed          Journal:  Comput Intell Neurosci


1. Introduction

The parameters optimization of hydrological models has always been an important research content in the hydrological field, and it has a crucial impact on the overall performance of hydrological models and the reliable results of hydrological forecasting [1]. The essence of hydrological model parameters optimization is that adjusting the parameters of the hydrological model to address the errors and uncertainties in the watershed model simulation so that the model output values are as close as possible to the actual values [2, 3]. The behavior and performance evaluation of the hydrological model is commonly made and reported through comparisons of objective function values of simulated and observed variables. Based on the comparison results, the hydrologist can make mathematical subjective and/or objective estimates of the “closeness” of simulated behavior of the model to observations made within the watershed [4]. The single objective function which can only consider one aspect of hydrological features is adopted in the traditional optimization methods for hydrological model parameter [5, 6]. The engineering practices show that parameter optimization of the hydrological model is intrinsically a high dimensional, nonlinear, multivariable, combinatorial optimization problem which involves a set of different objectives (multiobjective) [7]. To fully excavate the various hydrological feature information contained in hydrological data, multiobjective research has become one of the important directions in hydrological model calibration [8]. Two or more conflicting objectives often have to be simultaneously optimized and the trade-off decisions should be taken between these objectives in the multiobjective optimization [9, 10]. The multiobjective optimization model such as the Pareto theory [11] can reduce the degree of simplification of the model and can more fully reflect the requirements of engineering design, thus providing a decision basis for obtaining the optimal solution set. Parameters optimization of hydrological model is a typical unconstrained multiobjective optimization problem which determines the model parameters' values by the optimization method to make the selected multiple objective functions achieve maximum or minimum simultaneously. Assuming that all objective functions need to be minimized, the multiobjective optimization problem of hydrological model parameters is expressed in the following equation [12]:where f(X),  i=(1,2,…, N) is the i-th objective function; N is the number of the objective functions; X is a vector composed of decision variables, namely, the parameters combination of the hydrological model; and D is the number of hydrological model parameters to be optimized. The optimization result is the noninferior parameter combination sets in which feasible regions are limited by the values' ranges of the model parameters. Some researchers have applied the multiobjective method for the parameters optimization problem of hydrological models and achieved good results [13]. In studies and literature of hydrologic modeling, there are a large number of objective functions such as the Nash–Sutcliffe efficiency coefficient (NSE), root mean square error (RMSE), and coefficient of determination (R2) that are frequently used. Because the objective functions are mutually constrained and each objective function places different emphasis on different systematic or dynamic behavioral errors, it is difficult for a hydrologist to clearly evaluate the advantages and disadvantages of the solutions of parameters optimization. Most of the related research studies focus on the design of multiobjective algorithm and construction of objective functions, which provides important theoretical basis and practical experience for multiobjective optimization of hydrological model parameters. However, they do not provide much guidance for the selection of the actual objective function combination. Thus, how to study the influence of different objective function combinations on the optimization results of hydrological model parameters and make decisions on these Pareto solution sets has become one of the most important issues. The technique for order preference by similarity to ideal solution (TOPSIS) which was first proposed by Hwang and Yoon is a well-known method for solving a multiple criteria decision-making problem. TOPSIS is designed based upon the concept that the chosen alternative should have the shortest distance from the positive ideal solution (PIS) and the farthest from the negative ideal solution (NIS) [14]. Thus, it can be adapted to evaluate and give the ranking order of all alternatives in solving a multiple attribute decision-making (MADM) problem. Because of its significant features of comprehensiveness, ease of programming, and reliability, the TOPSIS has been adopted in the decision-making of multiobjective optimization by computing the closeness of the different alternatives to an ideal solution [15]. In the state-of-the-art optimization algorithms, the artificial bee colony (ABC) algorithm [16] which was proposed by Karaboga in 2005 has the advantages of fast convergence speed, strong robustness, and so on. And it has been widely applied in different fields [17, 18]. In this paper, various objective functions related to parameters optimization of hydrological models were collected from the literature and constructed to nine combinations. Then, a multiobjective artificial bee colony algorithm named RMOABC which adopts the mechanisms of the regulation operator and adaptive grid is employed to optimize the hydrological model and obtain the Pareto optimal solution [19]. The parameter optimization problem of the Xinanjiang hydrological model was taken as the application case for long-term runoff prediction in the Heihe River basin. Finally, the TOPSIS method based on the entropy theory is adapted to sort the Pareto optimal solutions to compare the combinations of objective functions and obtain the comprehensive optimal combination. The paper is organized as follows. The related literatures are reviewed in Section 2. Section 3 describes the hydrological Xinanjiang model, the study area, the RMOABC algorithm, and the TOPSIS method. Section 4 summarizes the objective functions and their combinations commonly used in the literature for hydrological model parameters optimization. The selection and evaluation framework of objective functions for hydrological model parameter optimization is proposed in Section 5. The experiment settings, results, and corresponding analyses are discussed in Section 6, and finally, the conclusions are drawn in Section 7.

2. Literature Review

In recent years, the ABC algorithm's advantages of great accuracy and satisfactory convergence speed make it suitable for solving the multiobjective optimization problems. Hedayatzadeh et al. designed a multiobjective artificial bee colony (MOABC) based on the Pareto theory and ε-domination notion [20]. The performance of a Pareto-based MOABC algorithm has been investigated by Akbari et al. on CEC'09 datasets [21], and the experimental results show that compared with the other multiobjective algorithms, the variants of multiobjective ABC can find solutions with competitive convergence and diversity within a shorter period of time. The parameter calibration or optimization for the hydrological models has entered the era of multiobjective optimization, and a lot of literatures focus on the multiobjective research studies [22-24]. For the ABC algorithm, a novel multiobjective evolutionary algorithm named multiobjective artificial bee colony (MOABC) algorithm is presented and applied in long-term cascaded hydropower system dispatch in [25]. Pérez et al. presented a multiobjective artificial bee colony-based optimization approach to optimize the allocation of monitoring stations in the design of water quality monitoring networks in river basins [26]. In these studies, a number of objective functions were commonly used for a variety of model calibrations in hydrology such as mean squared error, absolute mean/maximum error, residual bias, Nash objective function [22], the sum of square errors [27], and so on. To evaluate and compare the objective functions, some scholars have also made some efforts. For instance, nine different efficiency criteria for the evaluation of model performance are described and compared through three examples involving an observed streamflow hydrograph from the Wilde Gera catchment in Germany [4]. And Guo et al. presented a method for the comparison of the influence of different objective function combinations on the optimization results of daily streamflow forecasting by a hydrological model [28]. In the evaluation and ranking methods, the TOPSIS is a well-known and widely used method for solving the multiple attribute decision-making problems. For example, Lia et al. extended the concept of TOPSIS to develop a methodology for solving a multiple objective decision-making problem [29]. Abo-Sinna et al. presented a method based on TOPSIS for selection of the large-scale multiobjective nonlinear programming problems with block angular structure [30]. Baky and Abo-Sinna extended the TOPSIS approach to solve bilevel MODM problems [31]. An iterative clustering around latent variable (CLV) based objective entropy weighted TOPSIS approach is proposed for benchmarking building energy performance in a multifactor manner [32]. And a multiobjective mathematical model under several constraints is developed to attain multiple goals simultaneously for sustainable energy investment planning [33]. The TOPSIS method is also adapted and applied for the multiobjective optimization problems (MOPs). For instance, Liu et al. proposed a new method of water distribution system design based on the multiobjective optimization algorithm, and the TOPSIS was used to rank the Pareto optimal solutions [34]. An intuitionistic fuzzy hybrid TOPSIS approach is proposed to handle risk assessment and is applied as a case study on a gas refinery for the welding and lamination task [35]. The concept of TOPSIS was extended to develop a methodology for solving multilevel nonlinear multiobjective decision-making (MLN-MODM) problems of maximization type [36]. TOPSIS was also used to select the optimal cutting conditions for trimming carbon fiber composite that could generate minimum tool wear, surface roughness, and tool temperature simultaneously while maintaining high production rates [15]. For the multiobjective parameter calibration of hydrological models, the TOPSIS has also been adopted by some scholars. A method which combines a genetic algorithm with TOPSIS for the Xinanjiang model is presented to handle the multicriteria parameter calibration problem with a single procedure [37]. A MOPSO algorithm and an entropy-based TOPSIS ranking method were employed to calibrate and verify the Xinanjiang model in Misai catchment [38]. In summary, as the hydrological model and the multiobjective optimization method are determined, the selection and combination of the objective functions become one of the most important factors affecting the prediction quality of the hydrological model. And the research studies on this field are still in the initial stage. Therefore, this paper mainly focuses on collecting the objective functions of hydrological models' optimization from the related literature, optimizing the parameters of the Xinanjiang hydrological model for river runoff forecasting in the Heihe River basin by the RMOABC algorithm and then evaluating and ranking the obtained nondominated solutions based on the TOPSIS method. It would provide a reference method and help for the selection and the combination of objective functions for parameters calibration of hydrological models.

3. Study Materials and Methods

3.1. Hydrological Model and Study Area

In this paper, a modified Xinanjiang hydrological model [39] with two sources, surface runoff and underground runoff, was selected as the hydrological model. The Xinanjiang model is a conceptual rainfall-runoff (RR) model proposed by Renjun Zhao in 1973 and has been widely used in the humid and semihumid areas in China [40]. The model has ten parameters which have been proved to simulate the runoff well in the working practices [41], including the potential evapotranspiration ratio to pan evaporation K; the percentage of impervious in the catchment IMP; the exponential parameter with a single parabolic curve B, which represents the spatial distribution nonuniformity of the soil moisture storage capacity over the catchment; the average storage capacity of soil moisture of the upper layer WUM (unit: mm); the average storage capacity of soil moisture of the lower layer WLM (unit: mm); the average storage capacity of soil moisture of the deep layer (unit: mm); the coefficient of soil evaporation in the lower and deep layer of the river basin C; the steady infiltration rate FC (unit: mm/hour); the outflow coefficients of the free water storage to groundwater relationships KKG; and the confluence parameters of surface runoff Kr. The value ranges of the ten parameters are shown in Table 1. The detailed description of the modified Xinanjiang model can be found in bib41[7, 12, 41].
Table 1

The value range of the modified Xinanjiang model parameters with two sources [41].

Parameter name K IMP B WUMWLMWDM C FCKKGKr
Lower limit0.0010.0010.0015.00050.00050.0000.0010.0010.0010.001
Upper limit1.0000.5001.00030.000100.000200.0000.30050.0000.99010.000
The Heihe River watershed in Northwest China was taken as the study area, and parameter optimization for runoff prediction which is a common concern in hydrology researches was taken as the case study. The observation data of daily rainfall, surface evaporation, and runoff from January 1, 1990, to December 31, 1994, were selected for the experiments [7, 12]. The period from January 1, 1990, to December 31, 1990, was chosen as the calibration period. The details of the data can be found in our previous papers [7, 12, 42].

3.2. The RMOABC Algorithm

The RMOABC algorithm is proposed to extend the original ABC algorithm based on Pareto theory [43] for handling the multiobjective optimization problems [19]. The regulation operator mechanism is integrated to balance the local search and global search in the evolution process, and the adaptive grid [44] mechanism proposed in the PAES (Pareto archive evolutionary strategy) algorithm was utilized in the RMOABC to produce well-distributed nondominated Pareto solution set in the external archive. Each nondominated solution can be mapped in a certain location in the grid according to the values of multiobjective functions. The grid can adaptively maintain the distribution of candidate solutions stored in the external archive in a uniform way in the evolution process. The details of the RMOABC algorithm and how to integrate the algorithm with the Xinanjiang model can be found in the literature [12, 19].

3.3. TOPSIS Method

The technique for order preference by similarity to ideal solution (TOPSIS) was proposed by Hwang et al. in 1981, and it is a useful technique in dealing with multiattribute or multicriteria decision-making problems in the real world [22]. It is based on the concept that the chosen alternative should have the shortest distance to the positive ideal solution (PIS) and the furthest distance from the negative ideal solution (NIS). The positive ideal solution consists of the best performance values among any other alternatives for each attribute, whereas the negative ideal solution is the combination of the worst performance measures [45, 46]. The TOPSIS method can produce a clear preference order of a set of competing designs, which can help decision makers carry out analysis, comparison, and ranking of the alternatives for the problems needed to be solved [22]. For the multiobjective optimization problems, TOPSIS can transfer the m-objective (criteria) problem, which are conflicting and difficult in comparison, into a two-objective (the shortest distance from the PIS and the longest distance from the NIS) problem. Then, the biobjective problem can be solved by calculating the relative closeness (overall performance coefficient) to the ideal solution [47]. Thus, TOPSIS can prioritize the nondominated solutions, rank the compromise solutions, and provide decision support for the researchers.

4. Multiobjective Functions for Calibration of Hydrological Model

The objective of model calibration is to select the model parameters so that the model simulates the hydrological behavior of the catchment as closely as possible. Objective functions which are defined as functions that must be minimized or maximized in the process of model parameter calibration are commonly used by hydrologists to provide a mathematical estimate of the “closeness” of the simulated behavior to the observed measurements [48]. Each objective function may place different emphasis on different types of simulated and observed behaviors; thus, the selection of the combination of the objective functions is crucial to the optimal parameter result and has been a challenge for the hydrologists. In this section, we summarized the objective functions and their combinations commonly used in the literature for hydrological model parameters optimization.

4.1. Objective Functions in Literature [49]

The Nash–Sutcliffe efficiency coefficient (NSE) [50] and its logarithmic form (LNNSE) [49] are the two common objective functions originating from statistical theory as the primary objectives for model calibration. The two objective functions are defined in equations (2) and (3), respectively. The NSE and the LNNSE functions are a pair of conflicting objective functions [7, 12], where the NSE function evaluates the ability to reproduce all stream flows, but it is known as biased to predict peak flows. On the contrary, the LNNSE function emphasizes low flows.where N is the total number of time steps in the calibration period, Qsim is the simulated runoff data value at time t, Qobs is the observed data value at time t, and is the average of the observation data. The two objective functions are optimized to the maximum at the same time in the parameters optimization process. The larger value means the better performance of the model forecasting.

4.2. Objective Functions in Literature [51]

Madsen applied the following numerical performance statistics to measure the different calibration objectives: Overall volume error: a good agreement between the average simulated and observed catchment runoff volume (i.e. a good water balance). (2) Overall root mean square error (RMSE): a good overall agreement of the shape of hydrograph. (3) Average RMSE of peak flow events: a good agreement of the peak flows with respect to timing, rate, and volume. (4) Average RMSE of low flow events: a good agreement for low flows. In Equations (4)–(7), N is the total number of time steps in the calibration period; Qobs is the observed discharge data value at time t; Qsim is the simulated runoff data value at time t; Mp is the number of peak flow events; Ml is the number of low flow events; n is the number of time steps in peak flow event j; and n is the number of time steps in low flow event k. Peak flow events are defined as periods where the observed discharge is above a given threshold level. Similarly, low flow events are defined as periods where the observed discharge is below a given threshold. The peak flow events were defined as periods with flow above a threshold value of 0.75 m3/s, and low flow events were defined as periods with flow below a threshold value of 0.2 m3/s.

4.3. Objective Functions in Literature [52]

In literature [52], the mean squared logarithmic error EMSL and the mean fourth-power error E of measured flow and forecasted flow were selected as the objective functions. The two objective functions are defined in equations (8) and (9):where N is the total number of time steps in the calibration period, Qsim is the simulated runoff data value at time t, and Qobs is the observed data value at time t. Due to the logarithmic operation, EMSL makes the fitting error of low flow more significant, so it is more focused on the fitting of low flow value. For the impact of the fourth power, the fitting error of high flow value has a greater contribution; thus, E is more focused on the fit of high flow values. The experimental results showed that there is a typical noninferior relationship between the objective functions EMSL and E.

4.4. Objective Functions in Literature [53]

The parameters of the ANN-based monthly streamflow forecasting model are estimated by several multiobjective optimization algorithms with three pairs of objective functions. (1) mean squared logarithmic error (EMSL) vs. mean squared derivative error (EMSD), (2) mean fourth-power error (E) vs. mean squared derivative error (EMSD), and (3) mean squared logarithmic error (EMSL) vs. mean fourth-power error (E). The definition of EMSD objective function is shown in the following equation:where N is the total number of time steps in the calibration period, Qsim is the simulated runoff data value at time t, and Qobs is the observed data value at time t. The EMSL function is more suitable for low flows due to the logarithmic transformation. E is considered to be an indicator of the goodness of fit for high flows because it amplifies the effect of the deviation. The experiments demonstrated that EMSD can be taken as an indicator of the fit of the shape of the hydrograph, and it should be used in combination with EMSL or E.

4.5. Objective Functions in Literature [3]

The Nash–Sutcliffe efficiency coefficient (NSE) and the overall runoff relative error R were selected as the objective functions in [3]. R is defined in the following equation:where N is the total number of time steps in the calibration period, Qsim is the simulated runoff data value at time t, and Qobs is the observed data value at time t.

4.6. Objective Functions in Literature [54]

There are four metrics used in [54]—two common statistical metrics that emphasize high and low flows and two hydrological metrics that emphasize the water balance and the midrange flow regime. These metrics capture four different but important components of the hydrograph while allowing for a comparison between statistical and hydrological metrics. The first statistical metric is the commonly used root mean squared error (ERMS) which has been defined in equation (5), and it emphasizes fitting the high flow portions of the hydrograph. The second statistical metric is transformed root mean squared error ETRMS. The simulated and observed flow time series are first transformed by a Box–Cox transformation (equation (12) with a λ value of 0.3, which has a similar effect as a log transformation. ETRMS emphasizes low flow portions of the hydrograph and is defined in equation (13).where N is the total number of time steps in the calibration period, Zsim, is the transformed simulated runoff data value at time t, and Zobs, is the observed data value at the time. After the Box‐Cox transformation, the observed and simulated flow data can help the parameter estimation method to better adapt to the low flows. The third metric is the runoff coefficient error (EROC) which captures the overall accuracy of the water balance by first combining the flows into one characteristic hydrological descriptor. It is defined in the following equation:where and are the simulated and observed mean annual runoff volume and is the mean annual precipitation which is 388.95 mm/year in the research area. The final metric is the slope of the flow duration curve error (ESFDC), which measures how well the model captures the distribution of midlevel flows. The slope of a watershed's flow duration curve indicates the variability, or flashiness, of its flow magnitudes. ESFDC metric is calculated by equation (15), and it is simply the absolute error in the slope of the flow duration curve between the 30 and 70 percentile flows.where Qsim,30 and Qsim,70 are the 30 and 70 percentile flows of the simulated flow duration curve and Qobs,30 and Qobs70 are the 30 and 70 percentile flows of the observed flow duration curve.

4.7. Objective Functions in Literature [55]

The performance measures used in [55] are the Nash–Sutcliffe efficiency coefficient (NSE), the coefficient of determination for linear regression (R2), and percent bias (PBIAS). The coefficient of determination for linear regression is a measure of how well the regression line represents the data. The coefficient of determination R2 is defined as the squared value of the coefficient of correlation which is defined in the following equation:where N is the total number of time steps in the calibration period, Qsim, is the simulated runoff data value at time t, Qobs, is the observed data value at time t, is the average of the simulated data, and is the average of the observation data. The range of R2 lies between 0 and 1 which describes how much of the observed values can be reproduced by the prediction. A bigger value means better prediction. P BIAS which is defined in equation (17) measures the average tendency (deviation) of the simulated data to their observed counterparts and is expressed as a percentage. The optimal value of PBIAS is 0.0, with low-magnitude values indicating accurate model simulation [56].

4.8. Objective Functions in Literature [8]

Three evaluation functions are selected as the objective functions for the runoff prediction of the hydrological model: the root mean square error  EMSR which represents the overall simulation result, the water volume balance function ERMOV, and the mean squared logarithmic error EMSL (overall volume error). EMSR is defined in the following equation:

4.9. Objective Functions in Literature [28]

As shown in equations (19)–(24), six objective functions were constructed based on the absolute error of the model forecasted flow and the measured flow to establish three objective function combinations, and a simple method for the comparison of the influence of different objective function combinations was presented in [28]. The objective function combinations include ERMS_ vs. ERMS_, R2 vs. R2, and EMA_ vs. EMA_. Among them, the subscripts H and L denote the flood peak flow objective and the low flow objective, respectively.where N is the total number of time steps in the calibration period; Qobs is the observed discharge data value at time t; Qsim, is the simulated runoff data value at time t; is the average of the observation data; Mp is the number of peak flow events; Ml is the number of low flow events; n is the number of time steps in peak flow event j; and n is the number of time steps in low flow event k. Peak flow events or low flow events are defined as periods where the observed discharge is above or below a given threshold level, respectively. Similar to Section 4.2, the low flow threshold is 0.2 m3/s and the peak flow threshold is 0.75 m3/s. In the literature [28], the researchers found that the combination of ERMS_, ERMS_, and EMA_ can effectively balance the multiple objectives and can provide more comprehensive decision-making support for the forecasters.

5. Selection and Evaluation Framework of Objective Function Combination

Solving a multiobjective optimization problem of parameter calibration for the hydrological model is to search for the nondominated solutions that can be expressed in terms of Pareto theory in the objective space of parameters. However, for the hydrologist, the nondominated solutions should be evaluated and ranked, and a set of parameters (i.e., one of the nondominated solutions) should be selected from the Pareto set for the application of the hydrological model. Thus, after the multiobjective optimization algorithm calibrating the parameters of a hydrological model and obtaining a set of nondominated solutions, the task of how to evaluate and rank these solutions corresponding to the objective function combination and select the optimal solution that meets the needs of decision makers has become more important. In this paper, a widely used ranking method called technique for order preference by similarity to ideal solution (TOPSIS) was used to prioritize the nondominated solutions and help the hydrologists find the appropriate combination of objective functions.

5.1. Entropy-Based TOPSIS

TOPSIS approach was originally proposed for the comprehensive evaluation of multiple scenarios in the case of multiple attributes. It is one of the best-known ranking methods of alternatives for MADM problems and can be easily adapted to different fields. If the evaluation alternative is closest to the ideal solution and far away from the negative ideal solution, it is the best alternative; otherwise, it is the worst [46]. For the multiobjective parameters optimization problem of the hydrological model in this paper, each optimization objective function represents an attribute. Thus, the nondominated solutions generated from RMOABC algorithm were treated as alternatives and the objectives were taken as attributes. In this paper, a TOPSIS method is employed to rank the nondominated solutions, and the entropy theory of information is also adapted to assign objective weights reasonably for different attributes. The entropy-based TOPSIS procedure consists of the following steps [32, 38]: Calculate the decision matrix D. The decision matrix D is constructed according to the Pareto optimal solution set. Assume that there are n Pareto optimal solutions (alternatives) A (i = 1, 2,…, n) to be evaluated against m criteria (objective functions) C (j = 1, 2,…, m). Then, as shown in equation (25), the decision matrix D contains performance ratings for the i-th alternative (Pareto solution) A with respect to the j-th criteria (objective function) C, which are denoted as x. In general, the evaluation criteria are classified into two types: benefit and cost. For criteria of benefit, larger value means more valuable, while for the criteria of cost criteria, smaller value means more valuable. (2) Construct normalized decision matrix R. Then, the above decision matrix D is normalized and trended to obtain a normalized matrix R. According to the criteria type of benefit or cost, the normalized value r can be calculated as follows: where xmax=∨x and xmin=∧x are the maximum and the minimum values of the j-th criteria among n alternatives and r represents the normalized performance of A with respect to attribute C. After the normalized transformation, the normalized decision-making matrix is represented as follows: where 0 ≤ r ≤ 1 which means the optimal value for each attribute is 1 and the worst value is 0. This step can transform various attribute dimensions into nondimensional attributes to allow comparisons across criteria. (3) Calculate the weighted normalized decision matrix. Considering the different importance of each criterion, the weighted normalized decision matrix Z can be constructed as follows: where w is the weight of the j-th criterion, w > 0, and ∑w=1. Generally, the weights of attributes are determined from the experience of decision maker depending on the specific problem. For more accuracy, a reasonable method based on the theory of information entropy for assigning objective weights of attributes is adapted to calculate objective weights [38]. According to the definition of entropy, the entropy value of each attribute (objective) j can be measured as follows: where H is the entropy value of objective j, k is a constant (k=1/ln(n)), n is the amount of nondominated solutions, and y is defined as where x is the i-th nondominated solution (alternative) under the j-th criteria (objective function). Thus, if the amount of objectives is m, the weight for each criterion j is given by the following equation: (4) Determine the positive ideal solution (best) and negative ideal solution (worst). The positive ideal solution (PIS), whose index consists of the best value for each criterion, is defined in equation (32). The negative ideal solution (NIS), whose index consists of the worst value for each criterion, is defined in equation (33). (5) Calculate the separation measures by the m-dimensional Euclidean distance for each alternative. The separation from the positive ideal alternative can be expressed as equation (34) and similarly, the separation from the negative ideal alternative can be expressed as equation (35). (6) Calculate the relative closeness (overall performance coefficient) to the ideal solution. Thus, considering the entropy-TOPSIS, the relative closeness coefficient of each nondominated solution is expressed as follows: where i=1,2,…, n and 0 ≤ C ≤ 1. (7) Rank the preference order. Finally, by sorting the closeness coefficients C calculated in Step 6, the ranking order of all alternatives can be obtained. According to the concept of the TOPSIS method, the higher value of C means that it has higher priority, and on the contrary, the smaller value of C means it has lower priority. Thus, the alternative with the highest C value will be the optimal alternative.

5.2. Combinations of Objective Functions

In this section,we summarized and collated the objective functions and their combinations from the relevant literatures on hydrological model parameter optimization. Information on the combination of objective functions, related descriptions, objective function characteristics, and related literature is summarized in Table 2. The hydrological model parameter optimization experiments will be carried out according to these combinations of objective functions, and the runoff prediction of the hydrological model will be carried out. We have summarized a total of 19 objective functions and 9 objective function combinations whose number has been denoted from 1 to 9.
Table 2

The summary of the combinations of objective functions.

No.CombinationDescriptionCharacteristicsLiteratures
1NSE vs. LNNSENSE emphasizes to high flow; LNNSE emphasizes to low flow.NSE and LNNSE are all benefit type.[49]
2 E RMOV vs. ERMS vs. ERMS_H1 vs. ERMS_L1 E RMOV is a good agreement for good water balance; ERMS emphasizes to high flows; ERMS_H1 emphasizes to high flows; ERMS_L1 emphasizes to low flows.These four functions are all cost type.[51]
3 E MSL vs. EM4 E MSL emphasizes to low flows; EM4 emphasizes to high flows. E MSL and EM4 are all cost type.[52]
4 E MSL vs. EMSD vs. EM4 E MSL emphasizes to low flow; EM4 emphasizes to high flows; EMSD emphasizes to high flows. E MSL, EM4 and EMSD are all cost type.[53]
5NSE vs. RE NSE emphasizes to high flows; RE emphasizes to low flows.NSE and RE are all benefit type.[3]
6 E RMS vs. ETRMS vs. EROC vs. ESFDC E RMS emphasizes to high flows; ETRMS emphasizes to low flows; EROC emphasizes to water balance; ESFDC emphasizes to midlevel flows. E RMS, ETRMS, EROC and ESFDC are all cost type.[54]
7NSE vs. R2 vs. PBIASNSE and R2 emphasize to high flow; PBIAS measures the average tendency of the simulated data to their observed one.NSE and R2 are benefit type; PBIAS is cost type.[55]
8EMSR vs. ERMOV vs. EMSLEMSR emphasizes to high flows; ERMOV is a good agreement for water balance; EMSL emphasizes to low flows.EMSR, ERMOV, and EMSL are all cost type.[8]
9 E RMS_L2 vs. ERMS_H2, vs. EMA_L E RMS_H2 emphasizes to high flows; ERMS_L2 and EMA_L emphasize to low flows. E RMS_L2, ERMS_H2, and EMA_L are all cost type.[28]
There are several reasons why we do not take all the object functions as the combination. (1) All objective functions are researched and summarized by many scientists in hydrological fields based on domain knowledge and practical work experience. It cannot be replaced and modified at will. (2) Because there are 19 kinds of objective functions, multiobjective functions cannot be optimized by algorithms at the same time, and an objective solution set that can balance each objective function cannot be obtained. (3) According to the experiments in Section 6.2.1 in the paper, it can be seen that there are strong conflicts in some objective functions, which generate a large number of NAN values and illegal values so that the convergence of all objective functions cannot be achieved. Since NSE, LNNSE, R, and R are the objective functions for finding the maximum, other objective functions are optimized for the minimum values. Therefore, to unify the types of the extreme value of these objective functions and to easily compare the values of objective functions, these four objective functions have been modified and converted into minimum types which are shown in equations (37)–(40):

5.3. Selection and Evaluation Framework

To comprehensively evaluate these objective functions and their combinations which are summarized in Table 2, we constructed a simple selection and evaluation framework of objective functions which is shown in Figure 1.
Figure 1

Selection and evaluation framework of combinations of objective functions.

Firstly, a library of objective function combinations for hydrological model parameter optimization is established. Assuming that there are K kinds of combinations of objective functions, the library can be shown in the following equation:where OFC is the i-th combination of objective functions, i=1,2,…, K, and j=1,2,…, Li is the number of the objective functions in the i-th combination. The objective functions in each combination can be partially identical but cannot be fully identical; thus, Li is different for OFC. Then, the RMOABC optimization algorithm is employed to solve the parameter optimization problem of Xinanjiang model one by one for the objective function combination, and the corresponding Pareto optimal solutions (i.e., parameters sets) for Xinanjiang model are obtained. According to the model parameters in the Pareto solution, other remaining objective function values are calculated one by one. Finally, the relative merits of the optimization scheme corresponding to the Pareto solution obtained for each objective function combination are compared using the method of entropy-based TOPSIS. The selection and evaluation framework of combinations of objective functions is a simple method for comparing the optimization performance of different objectives combination, and the specific steps are described as follows: Select one combination in turn from the combinations library of objective functions which is defined as Use the RMOABC optimization algorithm to call the Xinanjiang model for simulating the basin runoff of the Heihe River. The noninferior solution sets (optimal model parameters) under all combinations of objective functions can be obtained, as shown in the following equation: where i = 1, 2,…, m; j = 1, 2,…, ni; Solution_OFC is the non-inferior solutions set of the i-th combination of the objective function; X is the combination of model parameters corresponding the j-th noninferior solution to the i-th combination of the objective functions; and ni is the number of obtained noninferior solutions at the i-th combination of objective function. Take the parameters set obtained by each combination of objective functions in sequence and set them as the parameter to run the Xinanjiang hydrological model . Then, the values of the rest of the objective functions in the library are calculated. In this way, the values of all the objective functions corresponding to each combination of objective functions can be obtained, as shown in the following equation: where i = 1, 2,…, m; j = 1, 2,…, ni; and OtherSolution_OFC are other objective functions to the i-th combination of the objective function. To eliminate the influence of random factors of the algorithm, the algorithm runs 30 times independently for each combination of objective functions. Then, the noninferior solution sets of each combination of objective functions obtained in 30 runs are combined together. Collect and organize all the objective function values corresponding to the parameter set of all objective function combinations and eliminate the dominated solutions in the model parameter solution sets to obtain all noninferior solution sets under each combination, as shown in the following equation: where i = 1, 2,…, m; j = 1, 2,…, ni; and AllSolution_OFC is the noninferior solution set obtained by the i-th combination of objective functions. (6) Establish a decision matrix based on the entropy-based TOPSIS method to sort all noninferior solutions of the combination of objective functions. (7) Comprehensively evaluate the objective function combination. According to the TOPSIS ranking results of the noninferior solution sets obtained in the foregoing operations, quantitatively describe and compare the influence of each combination of objective functions on the parameters optimization of the hydrological model.

6. Experiment Analyses

6.1. Experiment Preparations

The experiments of the simulation were taken on the multiobjective evolutionary algorithm (MOEA) framework [57] in the paper. The MOEA is a powerful and efficient platform and provides a Java library of open source for developing and evaluating the multiobjective optimization algorithms. The hardware environment of simulation experiments is a PC with 4 cores of Intel i7 2.6 GHz CPU and 8 GB of RAM.

6.1.1. Parameter Settings

Because the selection of algorithm parameters can greatly affect the execution performance, we adopted the recommended parameter settings from the related research studies [12] for the RMOABC algorithm. The swarm size of bees (NP) is set to 100, the external archive capacity is set to 100, and the adaptive grid number is set to 25. For the Limit parameter, it is set to 0.25 ∗ NP ∗ D [58]. D denotes the dimension of decision variables of the optimized problem, that is, the number of Xinanjiang model parameters, 10. The dimension of the objective functions is set to the number of objective functions in the selected combination. Based on our previous experience [12], for the calibration of the parameters of Xinanjiang hydrological model, a maximum number of model evaluations in the range 2000–4000 normally ensure an efficient calibration. Thus, the stopping criterion for the optimization algorithm is the maximum number of model evaluations, and it is set to 3000.

6.1.2. Evaluation Indicators of Multiobjective Optimization Algorithms

An important issue in multiobjective optimization is to evaluate and compare the quality of solution sets. The straightforward way to compare the quality of solution sets is visualization, but visual comparison cannot quantify the difference between solution sets and also becomes harder with more objectives involved [59]. Thus, quantitative evaluation methods can be interpreted as how well it represents the Pareto front and used to measure the performances of the multiobjective optimization algorithms [28]. In general, the quality indicators of solution set can be divided into four aspects: convergence, spread, uniformity, and cardinality. Convergence of a solution set refers to the closeness of the set to the Pareto front which denotes the distance between the calculated noninferior front and the known true noninferior front or approximate true noninferior front. Spread of a solution set considers the region of the set covering. Uniformity of a set refers to how even the solution distribution is in the set, and an equidistant spacing amongst solutions is desirable. Cardinality of a solution set refers to the number of solutions in the set [59]. According to the literature [59], the most frequently used evaluation indicators were selected to evaluate all the four aspects of performance of the nondominated solution sets in this paper. The generational distance (GD) [60] was selected to evaluate the convergence performance, and the spacing (SP) [61] was selected to evaluate the uniformity performance. The inverted generational distance (IGD) [62] was chosen to evaluate the comprehensive quality of convergence and spread, and the hypervolume (HV) [63] was also chosen to evaluate the comprehensive quality of convergence, spread, and cardinality. Due to the desirable practical usability and theoretical properties, the HV indicator is arguably the most commonly used quality indicator and is suitable for many real-world optimization scenarios. And the HV indicator is the only popular unary indicator having the strict Pareto compliance which implies that only the Pareto front achieves a unique optimal value for a problem [59]. We also added the computational time (Time) which was recorded in seconds to evaluate the execution efficiency of the algorithms.

6.2. Experimental Result Analyses

6.2.1. Model Parameter Optimization Result

According to the above experimental scheme and algorithm parameter setting, the RMOABC optimization algorithm is used to optimize the parameters of the Xinanjiang model based on the nine objective function combinations of the hydrological model parameters compiled in Section 5.2. Each group of experiments was carried out 30 times, and the optimal parameter set of each group of experiments was collected together, and the dominant solution in each group was removed. As shown in Table 3, a total of 8,435 optimal parameter solution sets are obtained. The Pareto optimal solutions obtained by the objective function combination 1, 3, 4, 5, and 8 are relatively small. They are 95, 120, 178, 325, and 355, respectively. The objective function combination 6, 7, and 9 obtained more than 1,000 Pareto optimal solutions, and in particular, the objective function combination 2 obtained a total of 3,629.
Table 3

A number of optimal solutions in the preprocessing of the objective functions values generated by the RMOABC algorithms for the 9 objective functions' combinations.

No.CombinationNumber of optimal solutionsNumber of optimal solutions after removing NAN valuesNumber of optimal solutions after removing NAN values and illegal valuesNumber of optimal solutions after removing NAN values, illegal values, and dominated solution
1NSE vs. LNNSE957866
2 E RMOV vs. ERMS vs. ERMS_H1 vs. ERMS_L1362930167575
3 E MSL vs. EM41208822
4 E MSL vs. EMSD vs. EM417814022
5NSE vs. RE32526233
6 E RMS vs. ETRMS vs. EROC vs. ESFDC10428381212
7NSE vs. R2 vs. PBIAS166714964444
8EMSR vs. ERMOV vs. EMSL3553201111
9 E RMS_L2 vs. ERMS_H2, vs. EMA_L10248271010

Total 84357065165165
However, since the nine objective function combinations include 19 objective functions, we take each parameter solution from each objective function combination in turn according to the steps of the selection and evaluation framework and then calculate the corresponding values of the remaining objective functions. For example, in the objective function combination 2, four objective functions are included, and one parameter solution is sequentially extracted from the corresponding 3,629 solutions, and the runoff data are simulated by the Xinanjiang model, and then the remaining 15 objective functions' values are calculated. In this way, we can get all the values of 19 objective functions corresponding to the 8,435 optimal parameter solution sets. Nevertheless, we found that there are many NAN values in the newly calculated objective function values which are mainly concentrated in the objective functions of LNNSE, EMSL, ETRMS, EMSR, and EMA_. After analysis, the main reasons for these NAN values are as follows. (1) As there is a value of 0 in the observation data or the prediction data, the natural logarithm calculation in LNNSE and EMSL will generate the NAN value. (2) In the hydrological model, 0 is taken as the divisor. The occurrence of the NAN value in the objective function affects the parameter solution which cannot be sorted. Therefore, we removed all the solutions which contain NAN value in the objective function. After elimination of all the solutions containing NAN values, a total of 6,918 remaining multiobjective parameter solutions are obtained. As described in Section 5.2, in order to facilitate the comparison of these 19 objective functions, we have converted all of these objective functions into minimization functions, and the function values should be in the range [0, 1], and the closer the function value is to 0, the higher the degree of optimization of the solution. However, after reviewing the result set of the objective function values, it was found that a large amount of data larger than 1 exists in the remaining objective function values. It shows that there are strong conflicts between these objective functions. The optimal parameter set obtained for some objective functions does not perform well or even worse on other objective functions. To compare the performance of the objective function, we removed all model parameter solutions containing objective function values greater than 1 from the experimental result set. After eliminating the solutions with NAN value and value greater than 1, a total of 165 optimal parameter solution sets are obtained, and there are about 6.900 solutions whose objective function value exceeds 1. Next, we compared the Pareto dominance of 19 objective function values with 165 optimal parameter solutions and found that there is no dominant solution in these optimal parameter solutions. Therefore, the nine objective function combinations finally obtained 165 optimal parameter solutions. The number of optimal solution sets of each objective function combination in the total number of solution sets is shown in Table 4 and Figure 2. It can be seen that the objective function combination 2 and the objective function 7 obtain more optimal solutions, respectively. Their proportions are 45.45% and 26.67%, respectively. The optimal solutions obtained by the remaining objective combinations are relatively small, especially the objective function combinations 3 and 4 which only obtained two optimal solution sets. The objective function combinations 2 and 7 also obtained 3,629 and 1,667 solution sets before the data preprocessing, respectively, which indicates that these two objective function combinations have stronger ability to obtain the optimal parameter solution sets, and the ability of their obtained parameter solution sets is also strong for adapting to other objectives.
Table 4

The proportion of the optimal solution set of each objective function combination in the total number of solution sets.

IDNumber of optimal solutionsProportion (%)
163.64
27545.45
321.21
421.21
531.82
6127.27
74426.67
8116.67
9106.06
Figure 2

The proportion of the optimal solution set of each objective function combination in the total number of solution sets.

6.2.2. Calculating the Closeness of the Optimal Solution Sets of Objective Function Combinations

After the above preprocessing for the Pareto optimal parameter solutions generated by the RMOABC algorithms based on the 19 objective functions for a validation period, a total of 165 model optimal parameter sets are obtained for the Xinanjiang model. The sample Pareto optimal parameter solutions are shown in Table 5.
Table 5

The sample Pareto optimal parameter solutions for the Xinanjiang model after the preprocessing.

ID K IMP B WUMWLMWDM C FCKKGKr
10.447020.245670.9892611.4394985.16284143.354090.0001029.638790.990004.08941
10.149490.304690.2017330.0000050.00000169.390660.063207.276590.990005.31799
10.241080.369770.7725012.5433775.82264138.123160.0328642.770990.990006.00000
20.178730.296570.1335830.0000050.00000200.000000.0001038.983370.990006.00000
20.224210.267620.6091916.4625792.4366576.035920.0853414.961320.990005.04023
The Pareto optimal solution of the hydrological model contains 19 optimization objectives, that is, there are 19 attribute values. Except the optimal solution corresponding to the attribute value of the objective function combination, the remaining objective function values are all calculated by the parameter solution. Then, the 165 attribute value matrices of the nine objective function combinations are combined to form a TOPSIS decision matrix D of 165 × 19. Then, the decision matrix D will be solved according to the steps of the entropy-based TOPSIS method described in Section 5.1. In the TOPSIS method, the weight of each objective function is the key factor affecting the ranking result of optimal solutions, but the common method is to assign the corresponding weight values to the objective functions based on the experience of the decision maker for the specific problem. In this paper, 19 objective functions have been selected for the research; therefore, we cannot assign weights to these objective functions reasonably and scientifically. Thus, we adopt the method of calculating the information entropy of each objective function in the optimal solutions to determine their weights for more accuracy [38]. Table 6 indicates the entropy value and weights of 19 functions which were calculated according to equations (29) and (31). From the table, it can be found that the 19 objective functions show unequal weights. R function shows the highest weight, and NSE and LNNSE also have an important effect on the model parameters optimization. EROC indicates the lowest weight, 0.00000, which means the objective function almost has no effect on the hydrological model optimization results. And E, EMSD, ESFDC, and  EMSR also have little importance for the optimization problem.
Table 6

Entropy values and weights of the 19 objective functions.

EntropyNSELNNSE E RMOV E RMS E RMS_H1 E RMS_L1 E MSL E M4 E MSD R E E TRMS E ROC E SFDC R 2 P BIAS EMSR E RMS_L2 E RMS_H2 E MA_L
Value0.9985670.9986810.9998680.9999120.9994560.9999150.9995530.9999860.9999960.9984930.9999601.0000000.9999970.9991010.9991160.9999960.9999340.9993990.999780
Weight0.172780.159110.015960.010670.065580.010290.053910.001720.000520.181770.004820.000000.000350.108480.106630.000430.007940.072480.02657
After the completion of the weight calculation for each objection function, the weighted normalized decision matrix Z can be calculated. Then, the positive ideal solution (PIS) and negative ideal solution (NIS) will be determined. And the separation measures by the m-dimensional Euclidean distance for each alternative will be obtained. Finally, based on the entropy-TOPSIS method, the overall performance coefficient of each nondominated solution which is expressed as relative closeness to the ideal solution will be obtained according to equation (36). Since the number of optimal solutions is large, Table 7 shows all the 19 objective function values of the sample Pareto optimal parameter solutions in Table 5 and their relative closeness which is calculated by the entropy-based TOPSIS method.
Table 7

The sample objective function values of Pareto optimal parameter solutions and their closeness calculated by the entropy-based TOPSIS method.

ID19 objectivesCloseness
NSELNNSE E RMOV E RMS E RMS_H1 E RMS_L1 E MSL E M4 E MSD R E E TRMS E ROC E SFDC R 2 P BIAS EMSR E RMS_L2 E RMS_H2 E MA_L
10.746140.994160.000000.243610.335850.089710.466530.038920.029140.026060.162320.000020.047910.243350.026760.028810.057900.556310.109760.52341
10.362910.636280.026810.169890.212610.058840.298590.007320.016670.005220.121590.000000.039490.194580.005190.017200.047250.338070.083450.77118
10.516560.487200.046730.202690.290840.075220.228630.016450.018120.105160.135640.000110.041310.189540.117520.017780.048470.444530.072740.73661
20.864200.962550.001830.262170.519690.097510.451700.049220.018760.034940.190230.000030.022460.860610.033760.036920.130230.627790.240850.44116
20.857870.846180.009420.261210.324740.139900.397090.026460.017770.293740.181600.000380.043190.298740.415900.032690.109880.486000.184800.41459
20.430250.729050.018200.184990.180380.138940.342120.003780.013310.212600.142250.000240.036360.301910.270010.023380.115860.245680.180470.63268

6.2.3. Ranking the TOPSIS Preference Order

Finally, by sorting the closeness coefficients C in descending order, the ranking order of all alternatives (i.e., Pareto optimal solutions) can be obtained. The higher value of C means the higher priority; the smaller value of C means the lower priority. The maximum value of the overall ranking of closeness coefficients calculated by the 165 optimal parameter solution sets is 0.95196 which means the solution is the optimal alternative, and the minimum value is 0.25942 which means the related solution is the worse alternative. To compare the optimal solution sets obtained by each objective function combination, we multiply the value of closeness coefficients by 100 and convert it into a 5-point system, that is, the converted value is “Excellent” between 90 and 100, “Good” between the 80 to 89, “Medium” between 70 and 79, “Pass” between 60 and 69, and “Poor” for values lower than 60. After dividing the ranking score into 5 levels, we then analyzed the distribution of the optimal parameter solution sets obtained by the nine combinations of the objective functions in the 5-segment position interval. The results are shown in Table 8 and Figure 3. We can see that in the 165 optimal solution sets, objective combinations 2 and 7 obtain a larger number of optimal solutions, while other objective combinations obtain fewer optimal solutions. Taking objective combination 1 as an example, five of its six optimal solutions are above the “Pass” level, indicating that the quality of the solution corresponding to the objective combination is higher, but its number of solutions is very small, accounting for only 3.64% of the total solution. It shows that the ability of these objective function combinations to obtain the optimal parameter solution sets is not strong, and the obtained parameter solution set cannot adapt to other objective function requirements which means that they do not have advantages in the competition.
Table 8

The distribution of the optimal parameter solution sets obtained by the nine combinations of the objective functions in the 5-segment position interval.

IDNumber
ExcellentGoodMediumPassPoorSum
1032016
261914181875
3200002
4010102
5011013
60632112
7671411644
82312311
91202510
Sum1742353635165
Figure 3

Distribution of the optimal parameter solutions after TOPIS sorting in the 5-segment position interval.

It can be seen from the distribution diagram of TOPSIS sorting position in Figure 3 that the number of distributions of objective combinations 2 and 7 in the five sorting intervals is very uniform, especially in the Excellent and Good intervals, and the solutions of the top ranking are superior in number and ratio to the other objective functions. It shows that objective function combinations 2 and 7 can achieve more and better solutions than other objective combinations. To compare the quality of the solutions corresponding to each objective combination, we extracted all the solutions that reach the excellent level and sorted them according to the closeness value. A total of 17 Pareto optimal parameter sets have reached the excellent level whose relative closeness values are bigger than 0.90, and the ranking results are shown in Table 9. The solution having the highest TOPSIS relative closeness indicated with an asterisk in the table is the compromise solution which fits the decision maker's preference most.
Table 9

The relative closeness values and the related combination ID of objective functions of the top 17 Pareto optimal solutions.

RankIDCloseness
1 2 0 .9519 6
2 2 0.95024
3 2 0.94885
4 2 0.94530
580.94344
6 2 0.93781
770.93676
880.93549
970.93484
1090.92266
1170.92189
1230.92087
1370.92072
1470.91376
1530.91131
16 2 0.91125
1770.90254
It can be seen from the table that among the top 10 solutions, the first to fourth and the sixth are the optimal solution sets corresponding to the objective combination 2, and in all 17 solution sets, the objective combination 2 has 6 which accounts for 35.29%. It shows that the optimal solution corresponding to the objective combination 2 not only has higher quality but also has stronger competitiveness and can basically adapt to the goals and requirements of all 19 objective functions. The other outstanding objective function combination, i.e., combination 7, also has 6 solution sets to reach the excellent level, but its corresponding highest order solution is only in the 7th position, and the rest of the solutions are ranked in the lower positions. The solutions related to the remaining objective combinations do not perform very well. For example, the objective combinations 3 and 8 only have 2 solutions, and the objective combination 9 only has 1 solution in the excellent level. Not only the quality of the solution is low, but the number of solutions is also very small. And for the objective combinations 1, 4, 5, and 6, they even do not have the optimal solution set to achieve the “Excellent” level. Therefore, it can be seen from the experimental results that the solutions corresponding to the objective function 2 and the objective function 7 are more widely distributed than the other objective combinations, and their performances are better, especially the solution quality corresponding to the objective function 2 is more excellent.

6.2.4. Comparison of Optimal Parameter Results Corresponding to Each Objective Function Combination

First, each parameter set is named in turn by the sequence number sorted by TOPSIS. Then, according to the ID number of each objective function combination, the solution set with the highest ranking of each target function combination is extracted from the TOPSIS sorted parameter solution set, and the 10 optimized parameter values, the corresponding closeness values, and the corresponding sorting results are shown in Table 10. For example, the objective combination 2 corresponds to solution 1, the objective combination 8 corresponds to solution 5, and so on. Meanwhile, the 19 objective function values for optimal parameter solution related to the 9 combinations of objective functions are listed in Table 11. The solutions of the objective combinations 2, 8, 7, 9, and 3 are at the “Excellent” level, and the solutions corresponding to the remaining objective combinations are at the “Good” level.
Table 10

The calibrated model parameters related to the optimal solution for 9 combinations of objective functions.

ID K IMP B WUMWLMWDM C FCKKGKrClosenessRankScore
20.224210.267620.6091916.4625792.4366576.035920.0853414.961320.990005.040230.951961Excellent
80.229000.271941.0000030.00000100.0000050.000000.0628950.000000.990006.000000.943445Excellent
70.254600.272880.5857320.5531250.00000155.846050.3000043.068520.990004.685650.936767Excellent
90.227560.288690.2075117.3157799.45902188.868700.3000027.708880.990005.206260.9226610Excellent
30.265360.337200.289655.0000086.74806135.533690.0357214.867290.990006.000000.9208712Excellent
10.149490.304690.2017330.0000050.00000169.390660.063207.276590.990005.317990.8909123Good
60.262750.325610.477885.0000050.00000194.306140.2231526.128750.990004.423700.8843830Good
40.404760.280150.5706317.5139964.47000146.245380.0413846.784190.990005.621860.8284147Good
50.348800.310210.3844929.6477176.45051158.227210.0740030.969840.990005.791010.8050655Good
Table 11

The 19 objective function values for optimal parameter solution related to the 9 combinations of objective functions.

IDRankedNSELNNSE E RMOV E RMS E RMS_H1 E RMS_L1 E MSL E M4 E MSD R E E TRMS E ROC E SFDC R 2 P BIAS EMSR E RMS_L2 E RMS_H2 E MA_L
210.2244110.3241770.1463830.1335980.2112390.0682810.1521270.0021540.0139080.002090.0982361.89E − 060.0339920.2228010.0020850.0106520.0515420.2544290.069254
850.2285450.3311140.1393740.1348230.2111540.0846070.1553830.0023140.014360.0283680.0984562.64E − 050.032390.2272140.0291960.0107260.0682020.2622090.070093
770.2317880.3608390.1091570.1357760.2017550.0819970.1693320.001980.0141180.0357320.1009863.35E − 050.0341860.2269610.0370560.0115120.0630490.2459860.075922
9100.2208350.3340220.1292870.132530.2197240.0650590.1567470.0021630.014390.0731920.0977336.17E − 050.0332350.2130950.06820.0107450.0489410.2587470.072094
3120.2279310.3407340.1250540.1346420.1957930.0431170.1598970.0023050.0145680.0788540.0990016.61E − 050.0346320.2018410.0730910.0109680.0305860.2545260.062349
1230.3261840.3050820.1619830.1610680.2168170.0598660.1431660.0079550.0149080.0601370.1099455.79E − 050.0398570.2052920.0639850.0117870.0395730.3468550.059945
6300.2843050.3794210.1035170.1503730.1928090.0851780.1780520.0036340.0152730.0935780.1095889.34E − 050.0373410.2210260.1032390.0129060.057720.2766690.074355
4470.27230.3873950.1009260.1471640.2698880.0377620.1817940.0037250.014590.1857580.1078681.42E − 040.0316690.2326810.1566580.012840.0266980.3032010.064164
5550.2592730.4429130.0592190.1436010.2466450.0394340.2078470.0028930.0145250.2124670.1069721.59E − 040.0326530.2110870.1752350.0133690.0277120.2792330.071378
In Section 5.2, we made corresponding modifications to the objective functions and converted the maximum objective function to the minimum type, and the values of each objective function were normalized whose value range is limited in [0, 1]. The smaller the value, the better the solution related to the objective function. Therefore, to compare the relative differences of the attributes of the nine objective function combinations corresponding to the optimal solutions, we compare the 19 objective function values in Table 11, and the comparison results are shown in Figure 4.
Figure 4

Comparison of the 19 objective function values of the optimal parameter solution related to the 9 combinations of objective functions.

It can be seen from the figure that among these optimal solutions, the parameter set of the objective function combinations 2, 8, and 7 performs better, and smaller value can be obtained in each objective function. It shows that the objective combinations can meet the optimization requirements of these 19 objective functions and can achieve better optimization results. Among them, the objective function combination 2 performs particularly well, and R and PBIAS clearly achieved better results. However, the corresponding solutions of objective combinations 9 and 3 are also in excellent grade, but they have some fluctuations in the objective function values; that is, they perform better in some objective functions but perform poorly in other objective functions. Especially for the solutions corresponding to the objective combinations 1, 6, 4, and 5 in the “Good” level, the fluctuations are more obvious. Taking the objective combination 5 as an example, the optimal optimization effect is obtained on ERMOV, but it achieved the worst optimization results in LNNSE, EMSL, R, and PBIAS. It is indicated that the solutions related to these objective functions are not stable enough to fully cover the requirements of the 19 objective functions and they do not have strong adaptability.

6.2.5. Result Comparison of Multiobjective Evaluation Indicators

It can be seen from the above experimental results that compared with other objective function combinations, the objective function combinations 2 and 7 can obtain a widely distributed and high-quality parameter solution set. In order to compare the quality of the corresponding solutions of these two objective function combinations, the generational distance (GD), spacing (SP), inverted generational distance (IGD), and Hypervolume (HV) were adopted to evaluate the performance of convergence, uniformity, spread, and cardinality of the solution set. And the computational time (Time) was used to evaluate the execution efficiency of the algorithms. Firstly, according to the approximate noninferior solution set obtained in the previous experiments and the noninferior solution sets obtained by the 30 independent runs of algorithm, the five quality indicators of noninferior solutions under each objective combination are calculated, and then the performances of the two objective combinations on the optimization of hydrological model parameters can be quantitatively described and compared. The five performance metrics for parameter optimization problem are shown in Table 12. Mean and SD represents the average value and the standard deviation of the experiments results, respectively. The best values among the results for the optimization problem are shown in bold.
Table 12

Performance comparison of combinations 2 and 7 of objective functions for the parameter optimization problem.

IDGDSPIGDHVTime (second)
MeanSDMeanSDMeanSDMeanSDMeanSD
2 0.290118 0.00223 0.03885 0.08202 0.294291 0.09878 0.423085 0.04389 6.6179570.13140
70.2901270.018280.096350.097660.294298 0.01061 0.4190020.05696 4.154882 0.08859
From the results of Table 12, the conclusions consistent with the above can be obtained. (1) The optimal noninferior parameter solution sets corresponding to objective combinations 2 and 7 have good performance in convergence, uniformity, spread, and cardinality. (2) In comparison, the optimal parameter solutions of objective function combination of 2 have relatively better results for generational distance (GD), spacing (SP), inverted generational distance (IGD), and hypervolume (HV). It demonstrated that the combination 2 is better than the combination 7 in terms of the performance indicators. (3) Because the number of objective functions in combination 2 is bigger than combination 7, it has to take more execution time to obtain the final solution set. (4) Overall, the objective combinations 2 and 7 can effectively coordinate between the 19 various conflicting objectives which are arranged in the paper, and it is expected to provide forecasters with a more comprehensive basis for decision-making.

7. Conclusions

In the multiobjective parameters' optimization process of the hydrological model, a combination of efficiency criteria (objective functions) is defined as the mathematical measures to evaluate how well the model simulation fits the available observed data. There are a large number of objective functions, and each objective function may place more or less emphasis on different aspects of the hydrological model. However, how to choose and evaluate the different combination of objective functions efficiently has been a challenge for even the most experienced hydrologists. To help the researchers select and evaluate the objective functions that are suitable for their research, nine objective function combinations consisting of 19 objective functions were collected from the literature and arranged in this paper. Then, a selection and evaluation framework of objective function combinations was presented. The daily runoff forecasting model, Xinanjiang model, is taken as an application case, and the RMOABC algorithm is introduced to find the nondominated parameter solutions for handling the optimization problem of the hydrological model in the Heihe River basin. And a simple information entropy-based TOPSIS ranking method is adapted to rank the nondominated parameter solutions and compare the performance of optimization results of different objective function combinations. From the optimization results, it can be found that there are a large number of conflicts between the objective functions of objective combinations, which leads to the result that optimal parameter solutions are difficult to satisfy the optimization tasks of all objective functions. Therefore, the combination of objective functions should be chosen carefully and reasonably; otherwise, the obtained optimal parameters will not reflect the real characteristics of the hydrological model and thus cannot complete the hydrological model runoff forecasting task well. Then, the information entropy-based TOPSIS ranking method was taken to calculate the closeness of the optimal solution sets of the objective function combinations, and the ranking order of Pareto optimal solutions was obtained by sorting the closeness coefficients C in descending order. Based on the analysis of the optimization results, the number of distributions of the objective combinations 2 and 7 in the five sorting intervals is very uniform, especially in the “Excellent” and “Good” levels. And their Pareto solutions with the highest ranking have a great advantage in the number and proportion of the solution sets to other objective function combinations which shows that objective function combinations 2 and 7 can achieve more and better solutions than other combinations. Then, we compared the optimal parameter results corresponding to each objective function combined with the multiobjective evaluation indicator results. The solution corresponding to objective combination 2 has better performance in the quality indicators. The experiment results showed that the convergence and distribution of combination 2 of objective functions are better than the other combinations, and it can effectively balance the performance of various objectives. It also showed that the entropy-based TOPSIS can provide an effective and better method to calculate objective weights that permit trade-offs between attributes and rank the nondominated solutions impersonally for the parameters' optimization problem of the hydrological model, and it can also provide a more comprehensive decision basis for hydrological forecasting. The future work is to construct a more accurate and balanced combination of objective functions that has good distribution and convergence in different objective spaces.
  1 in total

1.  Approximating the nondominated front using the Pareto Archived Evolution Strategy.

Authors:  J D Knowles; D W Corne
Journal:  Evol Comput       Date:  2000       Impact factor: 3.277

  1 in total
  1 in total

1.  Modulation of the goodness of fit in hydrological modelling based on inner balance errors.

Authors:  Francisco Pellicer-Martínez; Francisco Gomariz-Castillo; María Manuela Portela; Isabel María Martínez-Alcalá; José Miguel Martínez-Paz
Journal:  PLoS One       Date:  2021-11-18       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.