Literature DB >> 28192508

Multiswarm comprehensive learning particle swarm optimization for solving multiobjective optimization problems.

Xiang Yu1, Xueqing Zhang2.   

Abstract

Comprehensive learning particle swarm optimization (CLPSO) is a powerful state-of-the-art single-objective metaheuristic. Extending from CLPSO, this paper proposes multiswarm CLPSO (MSCLPSO) for multiobjective optimization. MSCLPSO involves multiple swarms, with each swarm associated with a separate original objective. Each particle's personal best position is determined just according to the corresponding single objective. Elitists are stored externally. MSCLPSO differs from existing multiobjective particle swarm optimizers in three aspects. First, each swarm focuses on optimizing the associated objective using CLPSO, without learning from the elitists or any other swarm. Second, mutation is applied to the elitists and the mutation strategy appropriately exploits the personal best positions and elitists. Third, a modified differential evolution (DE) strategy is applied to some extreme and least crowded elitists. The DE strategy updates an elitist based on the differences of the elitists. The personal best positions carry useful information about the Pareto set, and the mutation and DE strategies help MSCLPSO discover the true Pareto front. Experiments conducted on various benchmark problems demonstrate that MSCLPSO can find nondominated solutions distributed reasonably over the true Pareto front in a single run.

Entities:  

Mesh:

Year:  2017        PMID: 28192508      PMCID: PMC5305220          DOI: 10.1371/journal.pone.0172033

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


1. Introduction

Multiobjective optimization deals with multiple objectives that often conflict with each other. The presence of such multiple objectives gives rise to a set of nondominated solutions. Multiobjective optimization methods are either generating or preferences-based [1]. Regarding the generating methods, no preferences of the objectives are given, and nondominated solutions reasonably covering the entire extension of the true Pareto front need to be found so as to provide the decision maker diverse information to determine the final tradeoff [2]. The preferences-based methods, with the preferences of the objectives known in advance, convert the multiple objectives into a single objective through techniques such as weighting and ε-constraint; the single-objective problem can then be solved using a single-objective optimizer. It has been noted in [3] that the weighting technique cannot find a nondominated solution on the nonconvex portions of the Pareto front and the ε-constraint technique finds a nondominated solution only if certain conditions are satisfied. The Pareto dominance relationship doesn’t rely on any preferences knowledge and can be used in the generating methods to handle the multiple objectives directly. Over the past several decades, a number of generating multiobjective metaheuristics (MOMHs) have been applied to solve real-world multiobjective optimization problems (MOPs) in a wide range of areas. Compared with traditional optimizers such as linear programming, nonlinear programming, optimal control theory, and dynamic programming, MOMHs are significantly more flexible as they don’t require the objectives and constraints to be continuous, differentiable, linear, or convex, and MOMHs are rather efficient. In addition, population-based MOMHs using a population of individuals (with each individual representing a candidate solution) facilitate the discovery of multiple nondominated solutions in a single run. This paper aims to propose a high performance MOMH based on particle swarm optimization (PSO). PSO is a swarm intelligence inspired metaheuristic introduced in 1995 [4, 5]. PSO is population-based and solves a single-objective optimization problem (SOP) using a swarm of particles. All the particles “fly” in the search space. Each particle, denoted as i, is associated with a position, a velocity, and a fitness that indicates its performance. PSO relies on iterative learning to find the optimum. In each iteration (or generation), i adjusts its velocity according to its previous velocity, its historical best position (i.e. personal best position), and the personal best positions of its neighborhood particles. As indicated by the reported experimental results of some recently proposed PSO variants such as comprehensive learning PSO (CLPSO) [6], orthogonal learning PSO (OLPSO) [7], selectively informed PSO (SIPSO) [8], and PSO with limited information (LIPSO) [9], the personal best positions of i’s neighborhood particles need to be nontrivially and appropriately leveraged during the update of i’s flight trajectory so as to achieve satisfactory exploration performance on multimodal SOPs. CLPSO and OLPSO encourage i to learn from different exemplars (i.e. i’s personal best position or a position determined from i’s neighborhood) on different dimensions. For SIPSO, the particles take different learning strategies based on the degree of connections; a densely-connected hub particle gets full information from all the neighbors while a non-hub particle with few connections only follows the best-performed neighbor. LIPSO adjusts i’s velocity through the use of limited yet adequate search experience information regarding i’s neighborhood. PSO can handle large scale SOPs with the aid of parallelization [10]. When extending PSO to the domain of multiobjective optimization, elitists need to be stored externally [11-14] or internally [15-18]. An elitist is a solution nondominated among all the candidate solutions generated so far. Existing MOMHs either treat the outstanding MOP as a whole or involve decomposition. For multiobjective PSOs (MOPSOs) that treat the MOP as a whole [12, 13], i’s personal best position is determined based on Pareto dominance. An external repository stores elitists. i learns from its (and other particles’) personal best position(s) and an elitist selected from the external repository. Decomposition based MOPSOs decompose the MOP into multiple different SOPs. Multiple swarms/particles are used, with each swarm/particle independently optimizing a separate SOP. i’s personal best position is thus determined according to the corresponding single objective. The multiple swarms/particles collaborate to derive nondominated solutions through direct and/or indirect information exchange. Vector evaluated PSO (VEPSO) [11] and coevolutionary multiswarm PSO (CMPSO) [14] take advantage of multiple swarms, with each swarm focusing on optimizing a separate objective of the original multiple objectives. In VEPSO, i learns from its personal best position and the search experience of its neighboring swarms. In CMPSO, the swarms don’t exchange information directly, but instead the personal best position and an elitist randomly selected from the external repository are used to update i’s velocity. The external repository is shared by all the swarms. Multiobjective evolutionary algorithm based on decomposition (MOEA/D) [19] is a framework that lets each individual to optimize a separate SOP. Each single objective is attained using aggregation techniques such as weighted sum, Tchebycheff, and boundary intersection. Each individual evolves based on its personal search experience and its neighboring individuals’ search experience. The works [15-18] are MOPSOs based on the MOEA/D framework. Extending from the powerful single-objective PSO variant CLPSO, this paper proposes multiswarm CLPSO (MSCLPSO) for multiobjective optimization. CLPSO has been extended to handle multiobjective optimization in [13, 20]. No decomposition is involved in multiobjective CLPSO (MOCLPSO) and attributed MOCLPSO (A-MOCLPSO) respectively proposed in [13] and [20]. MSCLPSO is the same with CMPSO [14] in terms of using multiple swarms, the way of determining the personal best position, and storing elitists in a shared external repository. MSCLPSO promotes the diversity of the elitists through respectively the crowding distance technique [21] for two-objective MOPs and the M-nearest-neighbors product-based vicinity distance technique for MOPs with more than two objectives [22]. The crowding distance technique works excellently in the case of two objectives, but it fails to effectively approximate the diversity of the elitists when the number of objectives is three or more [22]. MSCLPSO is novel in three aspects. First, each swarm focuses on optimizing the associated SOP strictly using CLPSO, without learning from the elitists and any other swarm. Second, mutation is applied to the elitists and the mutation strategy appropriately exploits the personal best positions and elitists. Third, a modified differential evolution (DE) strategy is applied to some extreme and least crowded elitists. The DE strategy updates an elitist based on the differences of the elitists. The personal best positions carry useful information about the Pareto set, and the mutation and DE strategies help MSCLPSO discover the true Pareto front. MSCLPSO takes the decomposition based multiswarm architecture and updates each particle i’s velocity purely based on the search experience of the particles in i’s host swarm because information determined based on Pareto dominance or some other single objective might not contribute to the optimization on i’s associated objective. MSCLPSO was applied to the 2-objective sustainable operation of China’s Three Gorges cascaded hydropower system in [23]. This paper gives a detailed description of MSCLPSO and presents the algorithm’s performance on a variety of benchmark MOPs. The rest of this paper is organized as follows. In Section 2, the working principle of CLPSO, definitions related to multiobjective optimization, and a brief literature review on MOMHs are presented. Section 3 details the implementation of MSCLPSO. In Section 4, the performance of MSCLPSO is evaluated on some 2- and 3-objective benchmark MOPs. Section 5 concludes the paper.

2. Background

2.1 Comprehensive learning particle swarm optimization

Let there be D decision variables, the swarm of N particles fly in a D-dimensional search space. Each particle i (1 ≤ i ≤ N) is associated with a position P = (P, P, …, P) and a velocity V = (V, V, …, V). V and P are initialized randomly. In each generation, V and P are updated as follows. where d (1 ≤ d ≤ D) is the dimension index; w is the inertia weight; c is the acceleration coefficient and c is suggested to be 1.5 [6]; r is a random number uniformly distributed in the range [0, 1]; and E = (E, E, …, E) is the guidance vector of exemplars. The inertia weight w linearly decreases. Specifically, let kmax be the predefined maximum number of generations, in each generation k, w is updated according to Eq (3). where wmax and wmin are respectively the maximum and minimum inertia weights. The recommended values for wmax and wmin are respectively 0.9 and 0.4 [6]. The dimensional velocity V is usually clamped to a positive value . If , then V is set to ; or if , then V is set to . Let and respectively be the lower and upper bounds of the search space on dimension d, is suggested to be set as 20% of [6]. Let B = (B, B, …, B) be the personal best position of i. After the position P is updated, P is evaluated and will replace B if P has a better fitness value. The exemplar E can be B or B with j ≠ i. The decision to learn whether from B or B depends on a learning probability L. For dimension d, a random number uniformly distributed in the range [0, 1] is generated. If the generated number is no less than L, i will learn from B on dimension d; otherwise from B. The particle j is selected from a 2-tournament procedure. If E happen to be the same as B, ECLPSO will randomly choose one dimension to learn from some other particle’s corresponding dimensional personal best position. An empirical expression is developed in CLPSO to set the learning probability L for each particle i. CLPSO allows each particle i to learn from the same exemplars until i’s fitness values cease improving for a refreshing gap of h consecutive generations. h is suggested to be 7 [6]. CLPSO calculates the fitness value of a particle i only if i is feasible (i.e. within on each dimension d).

2.2 Multiobjective optimization and pareto dominance

Without loss of generality, consider a multiobjective minimization problem described in Eq (5). where x = (x1, x2, …, x) is the decision vector; M (M ≥ 2) is the number of objectives; f is the function or numerical simulation procedure used to evaluate the fitness of x on objective m, m = 1, 2, …, M; and g is the combination of constraints. Some definitions related to multiobjective optimization and Pareto dominance are given below. Definition 1: The search space is SS = {x ∈ R | g{x} ≤ 0}. Definition 2: The objective space is OS = {f(x) ∈ R | x ∈ SS}. Definition 3: Given two points a = (a1, a2, …, a) and b = (b1, b2, …, b) in the objective space OS, b dominates a if b ≤ a for all m = 1, 2, …, M, and b ≠ a. Definition 4: A point a in the objective space OS is nondominated if there is no other point b in OS such that b dominates a. Definition 5: A point x in the search space SS is Pareto-optimal if f(x) is nondominated. Definition 6: The Pareto set is PS = {x ∈ SS | x is Pareto-optimal}. Definition 7: The Pareto front is PF = {f(x) ∈ OS | x∈ PS}. The objective space OS is partially ordered in the sense that two arbitrary points are related to each other in two possibly ways: either one dominates the other or neither dominates. Definition 3 can be modified based on the concept of ε-dominance [24]. Definition 8: Given two points a = (a1, a2, …, a) and b = (b1, b2, …, b) in the objective space OS, b s a if b—a ≤ ε for all m = 1, 2, …, M, and there exists one m such that b—a < ε, where ε is a predefined small positive number.

2.3 Brief literature review on MOMHs

MOMHs have attracted extensive research interests. A large number of MOMHs have been proposed in literature. The MOMHs were tested on some commonly used benchmark MOPs and/or applied to real-world MOPs. The main challenges to achieve high performance multiobjective optimization include guiding the search towards the true Pareto front and obtaining reasonably distributed nondominated solutions. Nondominated sorting genetic algorithm II (NSGA-II) [21] and MOEA/D [19] adopt crossover and mutation to evolve the individuals. MOEA/D-DE [25] replaces the crossover operator of MOEA/D with DE to solve MOPs with complicated Pareto sets. MOEA/D-DE updates an individual based on the difference of two individuals selected from the updated individual’s neighborhood, assuming that such a difference provides a good search direction. Gaussian mutation was employed in CMPSO [14] to help refine the externally stored elitists. The diversity of the elitists can be promoted using techniques such as adaptive grid adopted in Pareto archived evolution strategy (PAES) [26] and PAES2 [27], clustering in strength Pareto evolutionary algorithm (SPEA) [28], crowding distance in NSGA-II [21], fitness sharing in niched Pareto genetic algorithm (NPGA) [29], maximin sorting in [30], M-nearest-neighbors product-based vicinity distance in [22] and multiobjective immune algorithm with nondominated neighbor-based selection 2 (NNIA2) [31], nearest neighbor density estimation in SPEA2 [32], and weighting based aggregation in MOEA/D [19]. Hypervolume, introduced by Zitzler and Thiele [28], is a desirable metric used to evaluate the performance of a MOMH, as the maximization of hypervolume constitutes the necessary and sufficient conditions for deriving maximally diverse nondominated solutions over the true Pareto front [33]. Recently, some hypervolume based MOMHs [34-36] have been proposed and they directly use the hypervolume metric to act as a selection pressure rewarding convergence and diversity. However, it is NP-hard to calculate the hypervolume metric [37]. Many MOMHs are decomposition based. The earliest decomposition based MOMH is vector evaluated genetic algorithm (VEGA) [38]. VEPSO [11] is an adaptation of VEGA to the PSO framework. Harrison et al. [39] investigated various strategies for direct information sharing among the swarms in VEPSO. The multigroup learning automata based approach proposed in [40] utilizes a synergistic learning strategy to encourage each group to learn not only from the elitists but also from the search experience of all the other groups. Zhang et al. [41] enhanced the performance of MOEA/D-DE with a dynamic resource allocation strategy. The works [42-47] are other recent improvements on MOEA/D.

3. Multiswarm comprehensive learning particle swarm optimization

3.1 Basic architecture

MSCLPSO is a decomposition based multiswarm MOPSO [23]. As Fig 1 shows, M swarms are used and each swarm m (1 ≤ m ≤ M) focuses on optimizing objective f using CLPSO. Elitists are stored into an external repository. The repository is shared by all the swarms. Each swarm m doesn’t learn from the elitists and the search experience of any other swarm.
Fig 1

Basic architecture of MSCLPSO.

3.2 Maintenance of the external repository

The external repository REP is initialized to be empty. As the number of elitists quickly grows during the run, REP has a fixed maximum size Rmax. REP is maintained as follows in every generation [23]: Step 1) A temporary set TMP is initialized to be empty. Step 2) All the elitists in REP are added into TMP. Step 3) For each particle i in each swarm m, the particle’s position P is added into TMP if P is feasible (i.e. within on each dimension d). Step 4) Apply mutation to some elitists of REP and add the mutated solutions into TMP. Step 5) Apply DE to a number of extreme and least crowded elitists of REP and add the differentially evolved solutions into TMP. Step 6) Set REP to be empty. Step 7) Each solution in TMP is checked whether it is dominated by any other solution in TMP. Any dominated solution is removed from TMP. Step 8) Sort the remaining elitists in TMP in the decreasing order of crowding/vicinity distances. If the number of the elitists in TMP is larger than Rmax, the first Rmax elitists are allowed to stay in TMP and the others are removed from TMP. All the elitists in TMP are then copied to REP. In Step 8), the elitists in TMP are sorted using respectively the crowding distance technique [21] for two-objective MOPs and the M-nearest-neighbors product-based vicinity distance technique [22] for MOPs with more than two objectives. The crowding/vicinity distance of an elitist provides an estimate of the density of the surrounding solutions. The crowding distance of an elitist is a weighted distance of the elitist’s two neighboring solutions, while the vicinity distance is the multiplication product of the distances between the elitist and the elitist’s M nearest neighbors. Allowing the nondominated solutions with larger crowding/vicinity distances to stay in REP enhances the diversity of the resulting elitists on the Pareto front.

3.3 Mutation

As briefly discussed in [23], the personal best positions and elitists carry useful information about the Pareto set. The mutation strategy adopted in MSCLPSO exploits the personal best positions and the differences of the elitists. After a sufficient number of generations, the personal best position B of particle i in swarm m is an exact-optimum or near-optimum corresponding to objective f. If the Pareto-optimal decision vectors are indifferent on dimension d, B might be close to dimension d of the Pareto-optimal decision vector that is optimal on objective f, hence learning from B might contribute to the search of the Pareto set on dimension d; on the other hand, if the Pareto set is complicated on dimension d, the personal best positions obtained by different swarms often differ considerably on that dimension, accordingly learning from the personal best positions leads to the search of different regions of the Pareto set on dimension d. In addition, the difference between two different elitists on dimension d is often small in the simple cases and could be large in the complicated cases. To be more specific, let the number of the elitists in REP be R, the maximum number of mutations be Nmut, and the mutation tradeoff probability be α, the details of the mutation strategy in Step 4) of the external repository maintenance procedure are described in the following. Step 4.1) Initialize the mutation counter n = 1. Step 4.2) If n ≤ min{Nmut, R}, go to Step 4.3); otherwise, return. Step 4.3) Randomly select an elitist l from REP. l’s decision vector is copied as Q = {Q, Q, …, Q}. Randomly select a dimension d. Generate a random number rmut1 uniformly distributed in the range [0, 1]. If rmut1 < α or R < 2, go to Step 4.4); otherwise, go Step 4.5). Step 4.4) Randomly select a swarm m. Randomly select a particle i in swarm m. Mutate Q according to Eq (6). where rmut2 is a random number uniformly distributed in the range [0, 1]. Step 4.5) Randomly select two different elitists l1 and l2 from REP. l1 and l2 don’t need to be different from l. Mutate Q according to Eq (7). where rmut3 is a random number uniformly distributed in the range [0, 1]; and Z and Z are respectively the decision vector of l1 and that of l2. Step 4.6) Repair Q using the re-initialization technique introduced in MOEA/D-DE [25, 41] if Q is outside the dimensional search space . Step 4.7) Add Q into TMP. Step 4.8) Increase the mutation counter n = n + 1, and go back to Step 4.2). The mutation tradeoff probability α controls whether to mutate based on the personal best position using Eq (6) or the difference of the elitists using Eq (7). α is suggested to take a medium value in the range [0, 1] so as to achieve a balanced use of the information embodied in the personal best positions and elitists. The number of maximum mutations Nmut can be less than Rmax. As can be seen from Step 4.3), each elitist and each dimension has an equal chance to be selected.

3.4 Differential evolution

Let the maximum number of DEs be Nde, for each elitist l in REP with 1 ≤ l ≤ min{Nde, R}, l’s decision vector is copied as Q. Let the DE tradeoff probability be β, a random number rde1 uniformly distributed in the range [0, 1] is generated. If rde1 < β and R ≥ 2, each dimension of Q is differentially evolved according to Eq (8); otherwise according to Eq (9). All the dimensions use the same pair of DE coefficients rde2 and rde3, with rde2 and rde3 being two random numbers generated from a normal distribution with mean 0.5 and standard deviation 0.5. where l1 and l2 are two elitists randomly selected from REP; Δ is the Euclidean distance between l1 and l in the objective space; Δ is the Euclidean distance between l2 and l in the objective space; and δ is the velocity limiter which is a number in the range (0, 1]. Q is further repaired using the re-initialization technique introduced in MOEA/D-DE [25, 41] if Q is outside the dimensional search space . After all the dimensions are differentially evolved, Q is added into TMP. The above details explain Step 5) of the external repository maintenance procedure. The number of maximum DEs Nde can be less than Rmax. Remember that in Step 7), the elitists in TMP are sorted in the decreasing order of crowding/vicinity distances, and then copied to REP. Therefore, the smaller the index of an elitist in REP, the larger the crowding/vicinity distance of that elitist. MSCLPSO applies DE to a number of elitists with the smallest indices in REP. In other words, the differentially evolved elitists are extreme and least crowded. Note that an elitist that is extreme on a single objective is assigned an infinite crowding/vicinity distance [21, 22]; though such an elitist may actually be crowded, it may however be still far from the corresponding true extreme nondominated solution on the Pareto front. The application of DE to the extreme and least crowded elitists is expected to improve the diversity of the elitists [23]. In Eq (8), l1 and l2 don’t need to be different, but at least one of them is different from l. Eq (8) pushes l towards the more distant (measured in the objective space) elitist of l1 and l2, and in the meanwhile pulls l away from the nearer elitist, with the purpose of exploring the search space. In addition, Eq (8) provides more diverse search directions than the DE operators used in literature MOMHs. Eq (9) also provides diverse search directions. l1, l2, and l don’t need to be mutually different. The term rde2Z—rde3Z is clamped to the range . δ is suggested to take a small value in order to facilitate exploiting the region near l. The mutation tradeoff probability β thus is suggested to take a medium value to achieve a balance between the exploration and exploitation of the search space. The assumption of the DE strategy is that the Pareto-optimal decision vectors are somewhat correlated and the learning from l1 and l2 could provide an appropriate search direction for the evolution of l.

3.5 Flow chart and complexity analysis

The swarms obtain personal best positions that carry useful information about the Pareto set. The mutation and DE strategies help MSCLPSO discover the true Pareto front. The flow chart of MSCLPSO is depicted in Fig 2.
Fig 2

Flow chart of MSCLPSO.

MSCLPSO needs to store various data structures and algorithm parameters. The largest data structure is TMP and it requires O(MN + Rmax + Nmut + Nde) memory space. Hence, the space complexity of MSCLPSO is O(MN + Rmax + Nmut + Nde) plus the space required by the objective functions and constraints. The maintenance of the external repository in Step 3 mainly involves the dominance checking. The dominance checking compares each solution with every other solution in TMP on all the objectives. There are maximally MN + Rmax + Nmut + Nde solutions in TMP. Hence the dominance checking requires O((MN + Rmax + Nmut + Nde)2D) comparisons. In addition, Step 3 requires O(Nmut + Nde) function evaluations (FEs). The time requirement of CLPSO is O(ND) basic operations plus O(N) FEs. As there are M swarms, Step 4 thus requires O(MND) basic operations plus O(MN) FEs. Step 1 is executed once. Accordingly, overall MSCLPSO requires O(kmax(MN + Rmax + Nmut + Nde)2D) basic operations plus O(kmax(MN + Nmut + Nde)) FEs.

4. Experimental studies

4.1 Performance metric

The inverted generational distance (IGD) [14, 21, 41] has been widely adopted and strongly recommended as a performance metric for evaluating MOMHs in recent years, as it can reflect both convergence and diversity of the obtained nondominated solutions. Assuming that the true Pareto front PF is known and U is a set of uniformly distributed points sampled along PF, the IGD metric is calculated according to Eq (10). where u is a point in U; s is the Euclidean distance between u and the nondominated solution in REP that is nearest to u, measured in the objective space; and U is the number of points in U. It is clear that if the nondominated solutions in REP have a good spread along the true Pareto front, the IGD value will be small.

4.2 Benchmark problems

Various benchmark MOPs have been proposed in literature to evaluate MOMHs. The following benchmark MOPs are chosen: ZDT2 and ZDT3 from the ZDT test set [2], two modified versions of ZDT4 called ZDT4-V1 and ZDT4-V2, WFG1 from the WFG test set [48], UF1, UF2, UF7, UF8, and UF9 from the UF test set [49]. Eqs (11) and (12) respectively describe the ZDT4-V1 and ZDT4-V2 problems. As can be seen from Eqs (11) and (12), y is similar to the complex multimodal Rastrigin’s function [50]. ZDT4-V1 is the same with ZDT4 except that the search space of x (2 ≤ d ≤ D) is [-1, 1] in ZDT4-V1 while [-5, 5] in ZDT4. The characteristics of the 10 benchmark MOPs are listed in Table 1. The problems exhibit different characteristics such as 2 and 3 objectives, 10 and 30 dimensions, unimodal and multimodal objective functions, linear, convex, concave, disconnected, and mixed Pareto fronts, and simple and complicated Pareto sets. Therefore, the problems can be used to evaluate the performance of MOMHs from various aspects. In Table 1, o denotes the number of position-related working parameters for the WFG1 problem and is set as 2. S1 File gives the source code of the MSCLPSO algorithm with all the benchmark problems.
Table 1

Characteristics of all the benchmark problems.

ProblemMDSearch spacePareto setPareto front
ZDT2230xd ∈ [0, 1], 1 ≤ dDx1 ∈ [0, 1], xd = 0, 2 ≤ dDConcave, f1 U*, f2 M**
ZDT3230xd ∈ [0, 1], 1 ≤ dDx1 ∈ [0, 1], xd = 0, 2 ≤ dDConvex, disconnected, f1 U, f2 M
ZDT4-V1210x1 ∈ [0, 1],xd ∈ [-1, 1], 2 ≤ dDx1 ∈ [0, 1], xd = 0, 2 ≤ dDConvex, f1 U, f2 M
ZDT4-V2210x1 ∈ [0, 1],xd ∈ [-5, 2], 2 ≤ dDx1 ∈ [0, 1], xd = 0.5, 2 ≤ dDConvex, f1 U, f2 M
WFG1210xd ∈ [0, 2d], 1 ≤ dDxd ∈ [0, 2d], 1 ≤ do,xd = 0.35, o + 1 ≤ dDConvex, mixed, f1 U, f2 U
UF1230x1 ∈ [0, 1],xd ∈ [-1, 1], 2 ≤ dDx1[0,1],xd=sin(6πx1+dπD),2dDConvex, f1 M, f2 M
UF2230x1 ∈ [0, 1],xd ∈ [-1, 1], 2 ≤ dDx1[0,1],xd=(0.3(x1)2cos(24πx1+4dπD)+0.6x1)cos(6πx1+dπD), if 2dD and d is odd,xd=(0.3(x1)2cos(24πx1+4dπD)+0.6x1)sin(6πx1+dπD), if 2dD and d is evenConvex, f1 M, f2 M
UF7230x1 ∈ [0, 1],xd ∈ [-1, 1], 2 ≤ dDx1[0,1],xd=sin(6πx1+dπD),2dDLinear, f1 M, f2 M
UF8330x1 ∈ [0, 1], x2 ∈ [0, 1],xd ∈ [-2, 2], 3 ≤ dDx1[0,1],x2[0,1],xd=2x2sin(2πx1+dπD),3dDConcave, f1 M, f2 M, f3 M
UF9330x1 ∈ [0, 1], x2 ∈ [0, 1],xd ∈ [-2, 2], 3 ≤ dDx1[0,0.25][0.75,1],x2[0,1],xd=2x2sin(2πx1+dπD),3dDLinear, disconnected, f1 M, f2 M, f3 M

* U refers to unimodal.

** M refers to multimodal.

* U refers to unimodal. ** M refers to multimodal.

4.3 MOMHs compared and parameter settings

Two performance issues are investigated: 1) can the personal best positions, mutation, and DE help MSCLPSO discover the true Pareto front? and 2) how does MSCLPSO perform compared with other literature MOMHs? For the first issue, two MSCLPSO variants, namely MSCLPSO-1 and MSCLPSO-2, are studied. In MSCLPSO-1, the mutation tradeoff probability α = 0, i.e. the mutation strategy doesn’t learn from the personal best positions. In MSCLPSO-2, the maximum number of DEs Nde = 0, i.e. the DE strategy is not invoked. Concerning the second issue, MSCLPSO is compared with four representative MOMHs: CMPSO [14], MOEA/D [19], MOEA/D-DE [41], and NSGA-II [21]. The parameters of the MSCLPSO variants are determined based on trials on all the benchmark MOPs and are listed in Table 2. The parameters of CLPSO take the recommended values stated in Subsection 2.1. The parameters of CMPSO, MOEA/D, and NSGA-II take the recommended values given in the corresponding references. The elitists are externally stored in a repository for the MSCLPSO variants and CMPSO and internally preserved in the evolving population for MOEA/D and NSGA-II. To facilitate fair comparison, the (maximum) number of externally/internally stored elitists is set as 100 on the 2-objective benchmark MOPs and 300 on the 3-objective MOPs. U is set as 1000 and 10000 respectively for the 2- and 3-objective MOPs. The benchmark MOPs require different number of FEs to obtain the shape of the true Pareto front due to their different difficulty levels. The FEs values on the problems are listed in Table 3. ε-dominance is applied to the MSCLPSO variants and CMPSO and ε = 0.0001. The MSCLPSO variants, CMPSO, MOEA/D, and NSGA-II are executed for 30 independent runs on each problem.
Table 2

Algorithm parameters of the MSCLPSO variants.

MSCLPSO variantParameters
MSCLPSON = 10, α = 0.5, β = 0.5, δ = 5%, Nmut = Rmax (M– 1) / 5, Nde = Rmax (M– 1) / 10
MSCLPSO-1N = 10, α = 0, β = 0.5, δ = 5%, Nmut = Rmax (M– 1) / 5, Nde = Rmax (M– 1) / 10
MSCLPSO-2N = 10, α = 0.5, β = 0.5, δ = 5%, Nmut = 3 Rmax (M– 1) / 10, Nde = 0
Table 3

Number of function evaluations on all the benchmark problems.

ProblemZDT2ZDT3ZDT4-V1ZDT4-V2WFG1UF1UF2UF7UF8UF9
FEs10E410E410E425E4100E435E4100E440E490E490E4
In [41], MOEA/D-DE was evaluated on the UF problems with complicated Pareto sets. MOEA/D-DE used a population of 600 individuals to solve the 2-objective UF problems and a population of 1000 individuals to solve the 3-objective UF problems. 100 elitists and 150 elitists were selected from the final population to calculate the IGD metric respectively on the 2- and 3-objective problems. In contrast, MOEA/D used populations of 100 and 300 individuals respectively on the 2-objective ZDT and 3-objective DTLZ problems with simple Pareto sets in [19]. It seems like MOEA/D-DE doesn’t have a unified parameter setting framework for various benchmark MOPs. Therefore, MSCLPSO is compared with MOEA/D-DE only on the UF1, UF2, UF7, UF8, and UF9 problems with 300000 FEs on each of the problems. The performance data of MOEA/D-DE are directly copied from [41]. On the UF8 and UF9 problems, 150 elitists with the largest vicinity distances are selected from the final external repository of MSCLPSO.

4.4 Experimental results and discussions

Table 4 lists the IGD results related to the 30 runs of the MSCLPSO variants, CMPSO, MOEA/D, and NSGA-II on all the benchmark MOPs. MSCLPSO, CMPSO, MOEA/D, and NSGA-II are ranked according to their mean IGD results and the MOMHs are compared using the wellknown Wilcoxon rank sum test with the significance level 0.05. The ranking and Wilcoxon rank sum test results are listed in Table 5. The final single-objective best solutions obtained by the swarms of MSCLPSO on all the benchmark MOPs are listed in Table 6. Table 7 compares MSCLPSO and MOEA/D-DE on the UF1, UF2, UF7, UF8, and UF9 problems. Table 8 gives the IGD results of MSCLPSO using some different parameter settings. In Tables 4 and 7, the best results on each problem are marked in bold. The final nondominated solutions obtained by MSCLPSO and some literature MOMHs on all the benchmark MOPs are illustrated in Figs 3 and 4. The data set underlying Figs 3 and 4 can be found in S2 File.
Table 4

IGD results of the MSCLPSO variants, CMPSO, MOEA/D, and NSGA-II on all the benchmark problems.

ProblemIGD resultMSCLPSOMSCLPSO-1MSCLPSO-2CMPSOMOEA/DNSGA-II
ZDT2Mean4.35E-34.38E-37.95E-34.14E-33.80E-34.81E-3
SD1.08E-49.79E-51.06E-31.01E-41.53E-72.07E-4
Best4.11E-34.20E-36.18E-33.99E-33.80E-34.49E-3
Worst4.57E-34.54E-31.02E-24.39E-33.80E-35.37E-3
ZDT3Mean4.87E-34.89E-35.55E-36.35E-38.77E-35.51E-3
SD9.41E-51.04E-42.12E-48.43E-41.88E-52.38E-4
Best4.72E-34.76E-35.10E-35.31E-38.75E-35.10E-3
Worst5.07E-35.21E-36.01E-38.93E-38.86E-36.15E-3
ZDT4-V1Mean4.38E-33.16E-28.19E-31.16E-27.81E-24.18E-2
SD1.25E-46.05E-22.22E-21.04E-29.02E-28.12E-2
Best4.19E-34.21E-33.97E-34.30E-33.87E-34.36E-3
Worst4.81E-32.56E-11.26E-15.13E-22.56E-12.56E-1
ZDT4-V2Mean4.26E-33.30E-28.20E-33.677.37E-16.60E-1
SD1.04E-46.23E-22.22E-22.443.85E-13.51E-1
Best4.04E-34.13E-33.94E-39.59E-11.86E-12.58E-1
Worst4.59E-32.56E-11.26E-19.931.711.60
WFG1Mean1.35E-21.39E-21.38E-25.48E-17.72E-12.53E-1
SD4.11E-45.57E-44.94E-42.10E-12.53E-21.38E-1
Best1.29E-21.31E-21.29E-22.57E-17.29E-13.04E-2
Worst1.45E-21.52E-21.49E-29.16E-18.18E-15.60E-1
UF1Mean4.26E-34.26E-33.10E-26.29E-21.28E-17.68E-2
SD1.35E-41.02E-41.19E-21.59E-26.96E-22.07E-2
Best4.08E-34.13E-31.37E-24.02E-24.36E-25.37E-2
Worst4.79E-34.50E-36.04E-29.76E-22.70E-11.48E-1
UF2Mean4.42E-34.34E-31.02E-21.60E-22.98E-22.69E-2
SD2.14E-41.95E-42.33E-33.14E-32.24E-21.05E-2
Best4.17E-34.03E-37.01E-31.05E-21.19E-21.53E-2
Worst4.99E-34.96E-31.83E-22.55E-21.20E-15.73E-2
UF7Mean4.30E-34.33E-34.98E-21.04E-14.24E-17.32E-2
SD1.17E-41.29E-48.25E-21.04E-11.68E-11.15E-1
Best4.09E-34.08E-31.01E-23.80E-24.03E-21.99E-2
Worst4.64E-34.56E-32.91E-13.74E-16.45E-14.26E-1
UF8Mean4.75E-24.95E-21.16E-13.94E-11.88E-14.00E-1
SD3.22E-31.16E-24.06E-23.83E-21.68E-12.84E-2
Best4.28E-24.11E-27.32E-22.58E-18.23E-23.20E-1
Worst5.44E-21.07E-12.42E-14.30E-17.49E-14.22E-1
UF9Mean2.70E-26.48E-23.85E-29.38E-21.59E-13.33E-1
SD1.58E-36.62E-29.73E-33.44E-22.98E-22.77E-2
Best2.37E-22.45E-22.52E-25.63E-24.99E-22.91E-1
Worst3.01E-22.08E-16.74E-22.06E-11.90E-13.73E-1
Table 5

Ranks of MSCLPSO, CMPSO, MOEA/D, and NSGA-II in term of the mean IGD results on all the benchmark problems.

ProblemZDT2ZDT3ZDT4-V1ZDT4-V2WFG1UF1UF2UF7UF8UF9TotalRank
MSCLPSO3111111111121
CMPSO2+3-2-4-3-2-2-3-3-2-262
MOEA/D1+4-4-3-4-4-4-4-2-3-334
NSGA-II4-2-3-2-2-3-3-2-4-4-293

+ The corresponding MOMH is significantly better than MSCLPSO according to the Wilcoxon rank sum test.

- The corresponding MOMH is significantly worse than MSCLPSO according to the Wilcoxon rank sum test.

Table 6

Final single-objective best solutions obtained by the swarms of MSCLPSO on all the benchmark problems.

ProblemSwarm 1Swarm 2Swarm 3
MeanSDMeanSDMeanSD
ZDT24.30E-201.80E-194.95E-22.23E-2--
ZDT33.82E-112.09E-10-7.60E-16.25E-3--
ZDT4-V13.40E-231.81E-226.67E-28.48E-2--
ZDT4-V21.10E-296.03E-293.55E-27.00E-2--
WFG11.04E-19.85E-38.30E-21.41E-17--
UF13.02E-22.99E-22.29E-22.68E-2--
UF22.23E-168.31E-161.49E-21.88E-2--
UF79.90E-22.08E-11.55E-21.17E-2--
UF81.06E-33.34E-31.09E-52.18E-55.74E-51.12E-4
UF93.65E-57.49E-56.34E-42.38E-32.52E-24.68E-2
Table 7

Comparison of MSCLPSO and MOEA/D-DE on the UF benchmark problems.

UF1UF2UF7UF8UF9
MSCLPSOMean4.40E-36.19E-34.62E-39.29E-25.07E-2
SD1.83E-48.24E-43.21E-43.45E-22.59E-2
Best4.16E-35.05E-34.25E-36.18E-23.73E-2
Worst5.10E-38.99E-35.60E-32.04E-11.84E-1
MOEA/D-DEMean4.35E-36.79E-34.44E-35.84E-27.90E-2
SD2.90E-41.82E-31.17E-33.21E-35.42E-2
Best3.99E-34.81E-34.05E-35.07E-23.50E-2
Worst5.19E-31.09E-21.06E-26.56E-21.50E-1
Table 8

IGD results of MSCLPSO using some different parameter settings.

Parameter settingProblemIGD result
MeanSDBestWorst
α = 1UF15.44E-35.40E-44.87E-37.20E-3
β = 0UF16.40E-35.21E-34.29E-33.13E-2
β = 1UF74.49E-33.88E-44.04E-35.70E-3
δ = 1%UF74.75E-33.58E-44.24E-35.57E-3
δ = 10%ZDT24.41E-31.15E-44.16E-34.68E-3
δ = 20%UF85.04E-21.81E-24.16E-21.44E-1
Fig 3

Final nondominated solutions obtained on the ZDT and WFG benchmark problems.

(a) MSCLPSO in the best run on ZDT2 (b) MSCLPSO in the best run and MOEA/D in the best run on ZDT3 (c) MSCLPSO in the best run and CMPSO in the worst run on ZDT4-V1 (d) MSCLPSO in the best run and MOEA/D in the best run on ZDT4-V2 (e) MSCLPSO in the best run, NSGA-II in the best run, and CMPSO in the best run on WFG1.

Fig 4

Final nondominated solutions obtained on the UF benchmark problems.

(a) MSCLPSO in the best run and CMPSO in the best run on UF1 (b) MSCLPSO in the best run and CMPSO in the best run on UF2 (c) MSCLPSO in the best run and NSGA-II in the best run on UF7 (d) MSCLPSO in the best run on UF8 (e) MSCLPSO in the best run on UF9.

+ The corresponding MOMH is significantly better than MSCLPSO according to the Wilcoxon rank sum test. - The corresponding MOMH is significantly worse than MSCLPSO according to the Wilcoxon rank sum test.

Final nondominated solutions obtained on the ZDT and WFG benchmark problems.

(a) MSCLPSO in the best run on ZDT2 (b) MSCLPSO in the best run and MOEA/D in the best run on ZDT3 (c) MSCLPSO in the best run and CMPSO in the worst run on ZDT4-V1 (d) MSCLPSO in the best run and MOEA/D in the best run on ZDT4-V2 (e) MSCLPSO in the best run, NSGA-II in the best run, and CMPSO in the best run on WFG1.

Final nondominated solutions obtained on the UF benchmark problems.

(a) MSCLPSO in the best run and CMPSO in the best run on UF1 (b) MSCLPSO in the best run and CMPSO in the best run on UF2 (c) MSCLPSO in the best run and NSGA-II in the best run on UF7 (d) MSCLPSO in the best run on UF8 (e) MSCLPSO in the best run on UF9.

The personal best positions and the mutation strategy

As can be observed from the IGD results given in Table 4 and the final nondominated solutions illustrated in Figs 3 and 4, MSCLPSO can find diverse nondominated solutions reasonably distributed over the true Pareto front on all the 10 benchmark MOPs. The performance of MSCLPSO is rather robust, as indicated from the mean, standard deviation, best, and worst IGD results of MSCLPSO on all the problems. Compared with MSCLPSO, MSCLPSO-1 cannot approximate the true Pareto front on the ZDT4-V1, ZDT4-V2, UF8, and UF9 problems in some runs, as indicated from the worst IGD results of MSCLPSO-1. ZDT4-V1 and ZDT4-V2 have simple Pareto sets, while UF8 and UF9 feature complicated Pareto sets. The final single-objective solutions given in Table 6 show that MSCLPSO can derive an exact-optimum or near-optimum for each single objective on all the 10 benchmark MOPs. The personal best positions obtained by MSCLPSO on objective f2 on ZDT2, ZDT3, ZDT4-V1, and ZDT4-V2 are close to the Pareto-optimal decision vectors on most dimensions in the search space. The personal best positions obtained by MSCLPSO on both objective f1 and objective f2 on WFG1 are close to the Pareto set. The personal best positions obtained by MSCLPSO on different single objectives on UF8 and UF9 are located in rather different regions of the search space. All the observations verify that: 1) CLPSO, owing to its powerful exploration capability, is a proper choice to be adopted in MSCLPSO to help find the personal best positions; 2) the personal best positions carry useful information about the Pareto set, whether the Pareto-optimal decision vectors in the Pareto set are indifferent or significantly different on a dimension; and 3) learning from the personal best position in the mutation strategy benefits the discovery of the true Pareto front.

The DE strategy

The IGD results given in Table 4 show that MSCLPSO-2 cannot approximate the true Pareto front on the ZDT4-V1, ZDT4-V2, UF1, UF2, UF7, UF8, and UF9 problems in some or all of the runs. The comparison of MSCLPSO and MSCLPSO-2 indicates that the DE strategy, through leveraging the useful information carried by the elitists, evolves the elitists and is able to explore diverse regions of the search space. The comparison of MSCLPSO, MSCLPSO-1, and MSCLPSO-2 further demonstrates that the combined use of the personal best positions, the mutation strategy, and the DE strategy is required to achieve high performance multiobjective optimization.

Comparison of MSCLPSO with CMPSO, MOEA/D, and NSGA-II

As the IGD results given in Table 4 show, CMPSO, MOEA/D, and NSGA-II cannot approximate the true Pareto front on the ZDT4-V1, ZDT4-V2, WFG1, UF1, UF2, UF7, UF8, and UF9 problems in some or all of the runs. MOEA/D performs the best on ZDT2. As can be seen from Fig 3(b), the final nondominated solutions obtained by MOEA/D are not reasonably distributed on the true Pareto front of ZDT3. As Fig 3(c), 3(d) and 3(e) show, CMPSO sometimes gets stuck in a local Pareto front on ZDT4-V1; MOEA/D gets trapped in a local Pareto front even in the best run on ZDT4-V2; CMPSO encounters a local Pareto front in the best run on WFG1, and CMPSO even cannot approximate the entire local Pareto front on WFG1; and NSGA-II can only locate part of the true Pareto front in the best run on WFG1. As indicated from Fig 4(a), 4(b) and 4(c), the MOMHs other than MSCLPSO cannot discover the entire true Pareto on the UF1, UF2, and UF7 problems. Looking at the ranking results given in Table 5, MSCLPSO significantly beats CMPSO, MOEA/D, and NSGA-II on 9 out of the 10 benchmark MOPs and is overall ranked as the best MOMH. All the observations again verify the strengths of the novel techniques adopted in MSCLPSO.

Comparison of MSCLPSO and MOEA/D-DE

As can be seen from the IGD results given in Table 7, MSCLPSO ties with MOEA/D-DE in performance on the UF1, UF2, UF7, UF8, and UF9 problems. MOEA/D-DE cannot effectively solve the ZDT4-V1, ZDT4-V2, and WFG1 problems, because: 1) objective f2 of ZDT4-V1 and ZDT4-V2 is complex multimodal; 2) the Pareto-optimal decision vectors corresponding to WFG1 are not clearly correlated on dimension 1 and dimension 2; and 3) DE often fails in the aforementioned two cases. MSCLPSO is advantageous than MOEA/D-DE in the following aspects: 1) MSCLPSO provides a unified parameter setting framework; 2) MOEA/D-DE needs to determine weight vectors with the largest distances from 5000 randomly selected weight vectors [41]; 3) MOEA/D-DE uses considerably more individuals than MSCLPSO, e.g. MOEA/D-DE uses 600 individuals on the 2-objective UF problems, whereas MSCLPSO just uses 150 individuals in total (with 20 particles, 100 externally stored elitists, 20 individuals for mutation, and 10 individuals for DE); and 4) MOEA/D-DE requires a nontrivial procedure to select nondominated solutions from the final population [41].

Tuning of the algorithm parameters

As can be observed from the IGD results given in Table 8, the performance of MSCLPSO is sensitive to the values of the algorithm parameters. The appropriate values of the parameters are determined based on trials on all the benchmark MOPs. α = 0 is inappropriate as indicated from the performance data of MSCLPSO-1 given in Table 4, while α = 1 is also inappropriate as can be seen from Table 8. Table 8 also shows that β = 0 and β = 1 are both inappropriate, and δ needs to take an appropriate value. The observations demonstrate that: 1) the mutation strategy needs to exploit both the personal best positions and the elitists; and 2) the DE strategy needs to make a tradeoff between exploration and exploitation.

5. Conclusions

A metaheuristic called MSCLPSO has been proposed in this paper to achieve high performance multiobjective optimization. MSCLPSO involves multiple swarms, with each swarm focusing on optimizing a separate original objective strictly using the state-of-the-art powerful single-objective metaheuristic CLPSO. Elitists are stored externally. Each swarm doesn’t learn from the elitists and any other swarm. Each particle’s personal best position is determined based on the corresponding single objective, instead of Pareto dominance. MSCLPSO adopts a novel mutation strategy and a novel DE strategy to evolve the elitists. The mutation strategy appropriately exploits the personal best positions and elitists. The DE strategy achieves a balance between exploration and exploitation. MSCLPSO offers a novel technical route to handle multiobjective optimization different from those of existing literature MOMHs. Experiments conducted on various benchmark MOPs have demonstrated that MSCLPSO can robustly derive diverse nondominated solutions distributed reasonably over the true Pareto front in a single run.

Java Source Code of the MSCLPSO Algorithm with the Benchmark Problems.

(JAVA) Click here for additional data file.

Origin Data Set for Figs 3 and 4.

(OPJ) Click here for additional data file.
  8 in total

1.  Approximating the nondominated front using the Pareto Archived Evolution Strategy.

Authors:  J D Knowles; D W Corne
Journal:  Evol Comput       Date:  2000       Impact factor: 3.277

2.  Comparison of multiobjective evolutionary algorithms: empirical results.

Authors:  E Zitzler; K Deb; L Thiele
Journal:  Evol Comput       Date:  2000       Impact factor: 3.277

3.  Combining convergence and diversity in evolutionary multiobjective optimization.

Authors:  Marco Laumanns; Lothar Thiele; Kalyanmoy Deb; Eckart Zitzler
Journal:  Evol Comput       Date:  2002       Impact factor: 3.277

4.  Covariance matrix adaptation for multi-objective optimization.

Authors:  Christian Igel; Nikolaus Hansen; Stefan Roth
Journal:  Evol Comput       Date:  2007       Impact factor: 3.277

5.  MOEA/D with adaptive weight adjustment.

Authors:  Yutao Qi; Xiaoliang Ma; Fang Liu; Licheng Jiao; Jianyong Sun; Jianshe Wu
Journal:  Evol Comput       Date:  2014-02-06       Impact factor: 3.277

6.  MOEA/D-ACO: a multiobjective evolutionary algorithm using decomposition and AntColony.

Authors:  Liangjun Ke; Qingfu Zhang; Roberto Battiti
Journal:  IEEE Trans Cybern       Date:  2013-12       Impact factor: 11.448

7.  Multiple Populations for Multiple Objectives: A Coevolutionary Technique for Solving Multiobjective Optimization Problems.

Authors:  Zhi-Hui Zhan; Jingjing Li; Jiannong Cao; Jun Zhang; H S-H Chung; Yu-Hui Shi
Journal:  IEEE Trans Cybern       Date:  2013-03-07       Impact factor: 11.448

8.  Selectively-informed particle swarm optimization.

Authors:  Yang Gao; Wenbo Du; Gang Yan
Journal:  Sci Rep       Date:  2015-03-19       Impact factor: 4.379

  8 in total
  2 in total

1.  Multi/Many-Objective Particle Swarm Optimization Algorithm Based on Competition Mechanism.

Authors:  Wusi Yang; Li Chen; Yi Wang; Maosheng Zhang
Journal:  Comput Intell Neurosci       Date:  2020-02-19

2.  Enhanced Comprehensive Learning Particle Swarm Optimization with Dimensional Independent and Adaptive Parameters.

Authors:  Xiang Yu; Yu Qiao
Journal:  Comput Intell Neurosci       Date:  2021-02-05
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.