Literature DB >> 32831818

A Complex Chained P System Based on Evolutionary Mechanism for Image Segmentation.

Xiyu Liu1,2, Lin Wang1,2, Jianhua Qu1,2, Ning Wang1,2.   

Abstract

A new clustering membrane system using a complex chained P system (CCP) based on evolutionary mechanism is designed, developed, implemented, and tested. The purpose of CCP is to solve clustering problems. In CCP, two kinds of evolution rules in different chained membranes are used to enhance the global search ability. The first kind of evolution rules using traditional and modified particle swarm optimization (PSO) clustering techniques are used to evolve the objects. Another based on differential evolution (DE) is introduced to further improve the global search ability. The communication rules are adopted to accelerate the convergence and avoid prematurity. Under the control of evolution-communication mechanism, the CCP can effectively search for the optimal partitioning and improve the clustering performance with the help of the distributed parallel computing model. This proposed CCP is compared with four existing PSO clustering approaches on eight real-life datasets to verify the validity. The computational results on tested images also clearly show the effectiveness of CCP in solving image segmentation problems.
Copyright © 2020 Xiyu Liu et al.

Entities:  

Mesh:

Year:  2020        PMID: 32831818      PMCID: PMC7428845          DOI: 10.1155/2020/6524919

Source DB:  PubMed          Journal:  Comput Intell Neurosci


1. Introduction

Membrane computing, also known as membrane systems or P systems, is a novel research of bioinspired computing initiated by Păun [1]. It seeks to discover novel biological computing models from the structure of biological cells as well as the cooperation of cells in tissues, organs, and populations of cells. Over the past years, there are three investigated P systems, cell-like P systems, tissue-like P systems, and neural-like P systems, including spiking neural P systems. P systems have several characteristics: nondeterminism, programmability, extensibility, and readability [2]. Research shows that the some models of P systems present the same computing power as Turing machines and is more efficient to some extent [3]. Therefore, the analysis of computing power and computational efficiency of P systems is one of the important basic studies [4, 5]. Other studies are focused on the variation of P systems to solve optimization problems, including the variant of rules and structures [6, 7]. In addition, some intelligence techniques, such as evolutionary computation and fuzzy theory, is also introduced to the variant P systems in some specific optimization problems [8]. Because the parallel computation in membrane systems can avoid the increase in time consumption with the increase in the number of data points, the membrane systems are suitable for solving clustering problems [9]. There are a lot of interesting researches in variant P systems to solve clustering problems. Liu and Xue [10] proposed a new cluster splitting technique based on Hopfield networks and P systems. Liu et al. [11] presented an improved Apriori algorithm, named ECTPPT-Apriori, based on evolution-communication tissue-like P system with promoters and inhibitors. Peng et al. [12] developed an extended P system with active membranes, in which a modification differential evolution mechanism is used to find the optimal clustering centers in clustering problems. Peng et al. [13] introduced a multiobjective clustering framework using a tissue-like P system for solve fuzzy clustering problems. Wang et al. [14] proposed a new cell-like P system using a modified genetic algorithm to evolve the objects and using communication rules in the cell-like P system. Image segmentation is an important part of image processing; it also has critical impact on the final quality of image analysis and subsequent tasks [15]. In the previous studies, the segmentation technique can be divided into region-based methods, edge-based methods, cluster-based methods, and threshold-based methods [16], in which the threshold methods can be classified into bilevel and multilevel threshold methods based on the number of clusters [17]. The region-based methods can obtain high segmentation quality but are sensitive to the parameters. The edge-based methods have high segmentation quality in different regions or targets and are also more sensitive to noise. The cluster-based methods are simple and easily implemented, but the clustering results rely on the number of clusters and feature selection in the colour space. The threshold-based methods are simple to computation, requiring no prior knowledge, but the continuity of the regions is not guaranteed due to the lack of space information [18]. So far, image segmentation has been used with wide recognized significance in machine vision, computer-aided diagnosis of the medical imaging, feature extraction, and analysis [19]. In this paper, the experiment on tested images is simple and easily implemented, and the application of image segmentation is not used in our works, so more details are not described in the following. Because these segmentation methods mentioned above have their respective advantages and limitations, a lot of works have been done to find robust and optimum segmentation techniques [20]. The threshold technique is one of the most popular segmentation techniques which are based on the gray level of images. It is simplicity and easy implementation, which has lower computational complexity [21]. Li et al. [22] presented a novel thresholding extraction method based on variational mode decomposition (VMD). Zhao et al. [23] introduced a gradient-based adaptive particle swarm optimization (PSO) combined with improved external optimization (EO) to overcome being trapped into local optima in high-dimensional tasks. Wang et al. [24] designed a new P system with related interaction rules; the PSO mechanism is used to maximize entropy threshold. Tan et al. [25] proposed a hybrid clustering models using ensemble deep neural networks for skin lesion segmentation. P systems are a class of distributed parallel computing models that can be used to improve the global search ability of PSO [26]. The commutation rules of chained P systems can be used to accelerate convergence of PSO. Compared with the crossover and mutation operation of DE, the partitioning information only can be used in the velocity updating of particles [27, 28]. Besides, although the neural networks have high quality for solving the optimization problems, the space and time consumption is too much, and the parallel computing model of P systems is also not executed on the neural networks. Therefore, a new variant of P systems based on PSO mechanism is proposed, which is named complex chained P (CCP) systems. And, the concepts of membrane, objects, and rules based on a special chained structure are introduced in the literature [29]. In this CCP system, two kinds of evolution rules in different chained membranes are introduced to enhance the global search ability. One of evolution rules is using the traditional and modified PSO mechanism to evolve the objects, and the partitioning information is introduced as the environmental factor to improve the clustering performance. Another is based on differential evolution (DE) to evolve the global chained objects in order to enhance the global search ability. Compared with genetic algorithm (GA) [30], the DE is simple and easily implemented, which has less predefined parameters and quick convergence speed. The communication rules for global objects between chained membranes are used to accelerate the convergence speed and avoid prematurity [31, 32]. At last, the CCP system is evaluated on eight benchmark clustering problems and eight tested images with the compared clustering and image segmentation techniques; the experimental results verify the validity and performance of proposed CCP. The rest of this paper is organized as follows: the framework of the chained P systems is described in Section 2. Section 3 gives more details about the complex chained P system for clustering problems, and the evolution rules and communication rules are described in this section. In order to verify the validity of CCP, some experiments which are conducted on benchmark clustering problems, are taken in Section 4.2. Furthermore, some tested images are used to evaluate the competitive performance of CCP in Section 4.3. Section 5 provides some conclusions and outlines future research directions.

2. Chained P Systems

Some concepts can be briefly defined as follows. A special membrane is called chained membrane, which contains many objects based on a chained structure, and these objects are called chained object. The formal descriptions of the chained P system can be expressed as the following tuple [29].where O is a finite nonempty alphabet, and the symbols are called chained objects. μ is the membrane structure consisting of m membranes, which is composed of two parts: the structure of chained membranes and the structure of the whole chained P system. w1, ⋯, w are the initial finite sets of chained objects; w represents the chained objects in membrane i, for 1 ≤ i ≤ m, w ∈ O. R1, ⋯, R are the finite sets of chained rules, and R represents a finite set of chained rules in membrane i, for 1 ≤ i ≤ m, R={R, R, ⋯R} which consists of r subrules that are executed on a special order. There are many chained rules on objects, for example, object addition rule, object subtraction rule, object crossover rule, and object variation rule. σin is the input region or membrane in the chained P systems, which contains the initial objects in the systems. σout is the output region or membrane in the chained P systems. If a certain chained rule cannot be executed in the chained P system, the computation process will be stopped and the computation results or objects will be transported in the output region or membrane.

3. Proposed Complex Chained P System for Clustering

The proposed chained P system based on evolutionary mechanism with a complex chained structure for clustering problems is presented in this section. First, the general framework of this complex chained P system (CCP) is described, and the basic membrane structure is given. Next, the evolution and communication mechanisms of chained P system are introduced in the CCP system. Finally, the computational process of the CCP system is described in the last parts.

3.1. General Framework of CCP System

The general framework of CCP is similar to that of the chained P systems, but the main differences are the membrane structure of whole P system and the evolution rules for chained objects. In complex chained P system, there are two kinds of chained membranes, chained membranes σ1 to σ and membranes σ, which contains different evolution rules. These chained membranes are labelled from 1 to m+1. Respectively, the formal descriptions of the CCP system are as follows :where O is a finite set of alphabets, which includes all chained objects or strings in the CCP systems. μ is the membrane structure of CCP system consisting of m+3 membranes. w1, ⋯, w are the multisets of the initial chained objects, with w ∈ O, for 1 ≤ i ≤ m, and w represents the chained objects. R1, ⋯, R represent finite sets of evolution rules in chained membranes σ1 to σ, R,  (1 ≤ i ≤ m) represent a set of evolution subrules in chained membrane i, and the number of the subrules are denoted by r. The subevolution rules are of the form: R={u⟶v}, with u, v ∈ O, which means that chained object u will be evolved to chained object v. R1′, ⋯, R′ represent the communication rules in chained membranes σ1 to σ. The communication rules are of the form: R′={u⟶(v, in)}, with u, v ∈ O, for 1 ≤ i ≤ m+1, which means the chained object u in membrane i will be changed in object v and transported into membrane j. R represents finite sets of evolution rules in chained membrane σ. For two arbitrary chained objects u and v, u, v ∈ O, there are two kinds of evolution rules existing in the membranes can be executed: crossover rules and variation rules, R={R, R}. σin is the input region or membrane in the CCP system, which contains the initial objects of the whole system. σout is the output region or membrane in the CCP systems. Once the computation is completed, the computation results or objects will be transported to the output region or membranes. Figure 1 gives the membrane structure of the proposed CCP system.
Figure 1

The membrane structure of CCP system.

In this proposed CCP, the chained objects are considered to be feasible solutions in the search space, and the partition obtained by the clustering technique is represented by C={C}, for 1 ≤ k ≤ K. Thus, the i-th chained object u can be defined as u=(z, z, ⋯, z), where z,  (1 ≤ k ≤ K) represents the k-th cluster center, K is the number of the clusters, and the dimension of cluster center is denoted by d. At last, the best chained object that represents the best partitioning results in the system will be output to the σout when the computation is completed.

3.2. Evolution Rules on Chained Objects

There are two kinds of evolution rules on chained objects existing in the chained membranes, the evolution rules in chained membranes σ1 to σ and another evolution rules in chained membrane σ.

3.2.1. Evolution Rules in Chained Membranes σ1 to σ

There are two kinds of evolution subrules in chained membranes σ1 to σ, which are based on different velocity updating strategies for chained objects. Firstly, the particle swarm optimization (PSO) mechanism is used to evolve the chained objects, and the traditional velocity model of PSO is introduced to update the velocity of chained object. At t+1 time, the velocity formula of the i-th object u in j-th chained membrane σ,  (1 ≤ j ≤ m) is determined by (3) in the following equation:where the inertia weight is denoted by w and c1 and c2 represent the learning factor, which is usually restricted to 2. The iteration counter is denoted by t. r1 and r2 are two independent uniform random numbers. The local best of u in history is denoted by ulbest, specially, the global best in σ is denoted by ugbest. Another velocity updating strategy for objects is based on environmental factors. At t+1 time, a modified velocity formula of EPSO is determined by (4) in the following equation:where c3 represents a positive constant, r3 is a uniform random number, and E(t) represents the environmental factor around to the object u, which is based on the information of clusters. The partitioning information of cluster is changed dynamically through the evolution of objects, and the geometric center of the data point belonging to the corresponding cluster in the object u is used as the environment factor, where E(t)={e(t), e(t), ⋯, e(t)}, for k=1,2, ⋯, K, which is given by (5) at t time in the following equation:where e(t) represents the k-th cluster centroid of the environment factor E, x represents the p-th data point in the dataset, for p=1,2, ⋯, N, N is the number of the data point in the datasets, and number of data points belonging to the corresponding cluster C is denoted by N. At t+1 time, the position formula of u is determined by (6) in the following equation:and the local best of u at t+1 time is updated according to (7) in the following equation:where f(u) represents the fitness function of u for clustering problems, which can be defined by (8) in the following equation: The purpose of clustering problems is to find a partitioning result C={C1, C2, ⋯, C} obtained by techniques to minimize the values of fitness function according to equation (8). The global best ugbest at t+1 time is updated according to (7) in the following equation: At t time, a success rate of chained objects u is defined by the following equation:where S(u, t) represents the success rate of u. The success rate in the chained membrane σ, for 1 ≤ j ≤ m, is computed by (11) in the following equation:where n represents the number of the chained object in chained membrane σ and P(σ, t) is the success rate of σ, P(σ, t) ∈ [0,1], which means the improvement rate at last computation. In this study, a linearly increasing strategy is introduced to adjust the values of the inertia weight dynamically; the possible range is based on the mapping relationship of P(σ, t), which is given by (12) in the following equation:where wminand wmax represent the minimum and maximum of inertia weight and w(t) ∈ [wmin, wmax].

3.2.2. Evolution Rules in Chained Membrane σ

The evolution rules in chained membrane σ contain objects crossover rules and object variation rules, differential evolutionary (DE) approach is further used as the variant of evolution rules in σ, and the mutation and crossover mechanism is also introduced to help the objects escape the local optima. At t time, the i-th chained object in σ is denoted by u(t)=(u(t), u(t), ⋯, u(t)), where D=k∗d represents the dimension of the objects and u(t) and u(t) are two randomly chained objects in σ; the mutation operation is defined by (13) in the following equation:where v(t) represents the created donor, ugbest(t) represents the best object in σ, the scaling factor is denoted by F, and the value is given by F1=0.5 × (1+rand(0,1)), F2=0.5 × (1+rand(0,1)). The mutant u′(t) comes from the created donor through crossover mechanism, which can be defined bywhere the crossover rate is denoted by P, it is a predefined constant within the range from 0 to 1, rand is an independent uniform random number, and rand is the random dimension from 1 to D. At t+1 time, the position formula of u is determined by

3.3. Communication Rules in Chained Membranes

The communication rules of chained P system are used to enhance the cooperation between the chained membranes, which provide good foundation for exchange and sharing of the information. There are two kinds of communication rules in the CCP system: R′={ugbest(t)⟶(u(t), in)}, for j=1,2, ⋯, m. At t time, the global best ugbest in σ is sent to the chained membrane σ and transformed into the j-th object u. At each iterative computation, the chained membrane σ only contains m objects that come from the chained membranes σ1 to σ. R′={ugbest(t)⟶(ugbest(t), in)}, for j=1,2, ⋯, m. At t time, the global best ugbest in σ is transported to the chained membranes σ1 to σ, which is considered to be the global best ugbest in chained membrane σ. Meanwhile, the global best ugbest in membrane σ is sent to the output membrane σout, which is viewed as the best object or the final computation results of the CCP at t time.

3.4. Computation Process of CCP

3.4.1. Initialization

The input membrane σincontains all initial objects in the P system, denoted by Q=m∗n. The position of chained object is randomly initialized in the search space. After initialization, the chained objects in the membrane σin will be transported to the chained membrane σ1 to σ, and each chained membrane contains n objects.

3.4.2. Evolution in Chained Membranes σ1 to σ

The evolution rules on chained objects are used to completed the evolution process according to the equations (3), (4),and (6) in chained membrane σ,  (1 ≤ j ≤ m). And, the selection for velocity formula is based on a random strategy. The local best and global best in σ are updated through the equations (7) and (9).

3.4.3. Communication between Chained Membranes

The first kinds of communication rules are used to transport the global best in the σ,  (1 ≤ j ≤ m) to the chained membrane σ.

3.4.4. Evolution Rules in Chained Membranes σ

The evolution rules on chained objects are used to completed the evolution process according to the equations (13) and (14) in chained membrane σ, and global best in σ is updated by the equation (15).

3.4.5. Communication between Chained Membranes

The second kinds of communication rules are used to transport the global best in σ to each chained membrane σ,  (1 ≤ j ≤ m).

3.4.6. Halting and Output

The evolution and communication will be implemented repeatedly with an iterative form during the computation process. The termination criterion of the system is stopped to whether the maximum number of iterations or computation is reached. When the system halts, the output membrane σout will send the global best to the environment, and this object is regarded as the final computing results of CCP system.

4. Experimental Analysis of Clustering Problems

In this section, the feasibility and effectiveness of the proposed CCP will be demonstrated through the experimental analysis. The datasets of the experiment are introduced first, and the artificial dataset from the previous studies [33] is used to tune the parameters in CCP. Eight real-life datasets from UCI machine leaning repository [34] is used to compare the performance with currently existing clustering approaches. All clustering approaches, including CCP, are implemented on MATLAB 2016b, and all the experiments are conducted on a DELL desktop computer with an Intel 4.00 GHz i7-8550U processor and 8 GB of RAM in a Windows 10 environment.

4.1. Parameter Setting

The numbers of chained membranes have important influences on the performance of CCP. Therefore, four artificial datasets [33], Data_5_2, Data_9_2, Size5, and Square4, are used to tune this parameter in order to ensure the equity of the experiment. The details of the four artificial datasets are given in Table 1.
Table 1

Properties of the artificial datasets.

DatasetsDataFeatureClass
Data_5_2 25025
Data_9_2 90029
Size_5 100024
Square4 100024
Different CCP systems with different degrees of m=4, m=5, m=6 [12] are used to evaluate the effects of the number of the chained membranes. The maximum of iterations is set to tmax=100, and the positive constant c3 is a random number which distributed to 0.6 and 3. The lower and upper limits of the inertia weight are set to wmin=0.4 and wmax=1.2. The mutation probability P is randomly generated from 0.2 to 0.8. Other parameters which are not tested in this experiment will be maintained the same values for the fairness of the comparison experiments. And, the number of independent running is set to 30 to eliminate the effect of the random factors. The values of mean and SD of the fitness function obtained by CCP system with different degrees are reported in Table 2.
Table 2

Performance of CCP with different degrees.

Datasets m=4 m=5 m=10
MeanSDMeanSDMeanSD
Data_5_2 326.50710.0301326.46960.0123 326.4641 0.0110
Data_9_2 590.85870.0624590.75890.0463 590.7413 0.0287
Size_5 2493.14504.80482491.96800.0789 2491.9020 0.0565
Square4 2367.64430.03172367.60520.0260 2367.5917 0.0141
Table 2 reports the clustering results of the CCP system with different degrees on four artificial datasets. The best values of mean and SD for each dataset are given in bold. These results show that the mean and the SD values obtained by the CCP system when m=10 are the best among these different systems.

4.2. Clustering Problems

The performance of the CCP is compared with four clustering approaches that have been reported in the literature to further evaluate the effectiveness, such as standard particle swarm optimization (PSO) [35], differential evolution (DE) [36], environment particle swarm optimization (EPSO) [37], and adaptive particle swarm optimization (APSO) [38]. The comparison experiments are conducted on eight real-life datasets from the UCI machine learning repository, Iris, Newthyroid, Seeds, Diabetes, Yeast, Glass, CMC, and Lung Cancer. More details about these datasets are presented in Table 3.
Table 3

Properties of the UCI datasets.

DatasetsDataFeatureClass
Iris 15043
Newthyroid 21553
Seeds 21073
Diabetes 76882
Yeast 1484810
Glass 21496
CMC 147393
Lung Cancer 32263
In these compared clustering approaches, the crossover probability P of the DE approach is randomly generated from 0.2 to 0.8. An environment factor based on the clustering information is embedded in the velocity updating model of EPSO approach, and the learning factor c3 is randomly generated from 0.6 to 3. A nonlinear regressive function of APSO based on the population diversity is used to the adjustment formula of the inertia weight, L represents a predefined constant, and c is a predefined constant. All adjustable parameters in these clustering approaches are set to the appropriate values which are reported in the respective publications from Table 4.
Table 4

Parameter setting of the comparative clustering approaches used in the experiments.

ParametersPSODEEPSOAPSOCCP
Population (Q)100100100100100
t max 200200200200200
c 1,c22,2N2,2N2,2
c 3 NN(0.6, 3)N(0.6, 3)
r 1,r2(0,1)N(0, 1)N(0, 1)
r 3 NN(0, 1)NN
(wmin,wmax)1N(0.4, 0.6)N(0.4, 1.2)
P c N(0.2, 0.8)NN(0.2, 0.8)
L NNN2.1N
c NNN2N
m NNNN10
Each clustering approach, including CCP, ran for 50 times for each dataset to eliminate the effects of random factors. Simple statistics including worst values (Worst), best value (Best), Mean, and SD of fitness function according to (8) are used in the experiments as the evaluation criteria of clustering results. The experimental environment is the same for all comparative clustering approaches. Figure 2 shows the convergence of these clustering approaches on the eight test datasets for typical runs of these approaches. The fitness value obtained by CCP declines faster at the beginning of the evolution process and then obtains fine convergence for each dataset. The values of the fitness function of PSO and DE decrease slowly at the beginning of the evolution process and do not apparently have better convergence performance than other approaches. Although EPSO and APSO show better performance than the above clustering approaches, they are also easily trapped into local optima, as shown in Figures 2(e), 2(f), and 2(h). Therefore, CCP has better convergence speed and higher clustering quality than the comparative approaches for all these datasets, as shown in Figure 2.
Figure 2

Convergence of the approaches on each of the eight real-life datasets. (a) Iris, (b) Newthyroid, (c) Seeds, (d) Diabetes, (e) Yeast, (f) Glass, (g) CMC, and (h) Lung Cancer.

Simple statistics of the fitness function values of these clustering approaches on these datasets are reported in Table 5. Results in Table 5 show that CCP has the overall best performance on these eight test datasets. Due to the characteristics of the test datasets, some clustering approaches performed better on some specific datasets with smaller SD, but the performance of CCP on these ten datasets is all considered comparable. Table 6 provides the average computation time in seconds taken by each of the five clustering techniques when running 50 times on each of the datasets. It can be seen from Table 6 that the proposed CCP has a larger average computation time as compared with PSO, DE, EPSO, and APSO. The evolution process in the chained membranes is time-consuming so that CCP takes more time than other techniques does.
Table 5

Performance of the comparative clustering approaches on the eight real-life datasets.

DatasetsStatisticsClustering approaches
PSODEEPSOAPSOCCP
Iris Worst153.8256109.996996.818396.7113 96.6555
Best114.6361101.988896.706396.6672 96.6555
Mean133.4267105.255596.743196.6863 96.6555
S.D.8.71502.33620.03740.0093 1.23E13

Newthyroid Worst2400.01521977.37291943.60911908.2618 1895.9965
Best2040.00581924.23651887.75781885.4950 1866.5183
Mean2271.05291950.82371903.76991897.2114 1888.1277
S.D.91.131714.021711.1225 5.0863 7.5021

Seeds Worst452.8039353.1875312.1602311.9663 311.7978
Best384.2174333.9361311.8824311.8493 311.7978
Mean422.1335342.6432312.0224311.9148 311.7978
S.D.23.01065.40780.07150.0257 4.27E07

Diabetes Worst59885.083547911.271447629.862147590.4655 47561.1362
Best51628.108347693.640347585.401247561.6124 47561.1262
Mean54535.433547779.187947607.617647573.1668 47561.1262
S.D.2147.431973.224811.85706.9363 3.79E08

Yeast Worst430.0923378.2874248.9934287.4479 245.0999
Best391.0852348.2327236.9439256.7699 235.3785
Mean407.6461367.3743242.7818271.6039 239.6706
S.D.10.97767.20503.30828.0848 2.4396

Glass Worst451.0029334.2308226.0186253.7243 213.4793
Best328.2842292.1006212.6117210.8631 212.1629
Mean418.7840314.2182216.2731230.8009 212.9308
S.D.31.27438.91583.529014.8939 0.3110

CMC Worst6852.76975720.00025537.81425535.5675 5532.3096
Best6335.67135628.69905535.58535533.1403 5532.1857
Mean6551.65585683.09785536.82425534.1784 5532.1913
S.D.174.044328.14800.55430.5962 0.0278

Lung Cancer Worst170.2152153.9227127.4346125.6464 125.6485
Best184.9632159.9095138.4711126.3331 125.6685
Mean178.8476157.0802133.1676125.7308 125.6609
S.D.4.15421.60513.34660.1595 0.0052
Table 6

Average computation time in seconds over 50 runs.

DatasetsClustering approaches
PSODEEPSOAPSOKEPSO-CCP
Iris 2.46302.73153.06082.67083.3833
Newthyroid 2.47852.82513.33502.83603.6576
Seeds 2.42632.70723.28172.84463.5799
Diabetes 2.73413.11164.91563.23155.2833
Yeast 5.16745.23559.82555.683110.3316
Glass 2.65263.00693.63782.99464.0859
CMC 3.75504.03918.05833.99408.1726
Lung Cancer 2.34512.73822.82672.76553.2006
In order to evaluate the clustering performance of these compared clustering approaches, clustering accuracy (CA) is used to evaluate the quality of clustering results obtained by the clustering techniques; the overall accuracy of partitioning results is defined bywhere |C∩C| represents the number of the data points in both belonging to actual cluster k and partitioning cluster l, for k,  l=1,2, ⋯, K. The simple statistics of clustering approaches on eight datasets are reported in Table 7. It can be seen that CCP has overall better performance on these datasets. Although some approaches show a better performance on some specific datasets, the performance of CCP on these specific datasets is also considered compared from Table 7.
Table 7

Performance of the comparative clustering approaches on the eight real-life datasets.

DatasetsStatisticsClustering approaches
PSODEEPSOAPSOCCP
Iris Worst0.66670.89330.8933 0.9000 0.8667
Best0.92670.90000.90000.9000 0.9600
Mean0.80770.89830.89970.9000 0.9013
SD0.09040.00270.0015 1.03E170.0248

Newthyroid Worst0.69770.74880.77210.7721 0.7767
Best 0.8372 0.80930.80470.80470.8047
Mean0.73910.77770.80000.8000 0.8014
SD0.03370.01870.01140.0114 0.0080

Seeds Worst0.64760.84770.89520.8952 0.8952
Best0.8953 0.9143 0.89520.89520.8953
Mean0.83670.88140.89520.8952 0.8952
SD0.08500.02071.14E − 161.14E − 16 1.14E16

Diabetes Worst0.65100.65100.65100.6510 0.6510
Best0.65100.65100.65100.6510 0.6654
Mean0.65100.65100.65100.6510 0.6518
SD1.14E − 161.14E – 161.14E − 16 1.14E160.0032

Yeast Worst0.31200.33090.45080.3942 0.4683
Best0.33760.40160.53700.4569 0.5418
Mean0.31710.36400.49650.4377 0.5200
SD 0.0057 0.02480.02750.01340.0152

Glass Worst0.35510.47200.56070.5093 0.5841
Best0.48130.5234 0.5981 0.58410.5841
Mean0.41400.50210.58340.5486 0.5841
SD0.04440.01440.01220.0267 1.14E16

CMC Worst0.43320.43580.4515 0.4562 0.4515
Best0.4630 0.4582 0.45690.45620.4569
Mean0.45100.44920.45560.4562 0.4562
SD0.00870.00740.0016 5.66E170.0012

Lung Cancer Worst0.40620.40630.53130.4063 0.5625
Best0.56250.50000.5625 0.6250 0.5625
Mean0.46410.42810.55780.5188 0.5625
SD0.04330.02500.01140.0644 1.28E16

4.3. Proposed CCP for Image Segmentation

In this section, some typical experiment and analysis for tested image are proposed to evaluate the segmentation performance of this proposed CCP. These tested images are used in the previous studies and researches, which are provided from the Berkeley segmentation dataset and benchmark [39]. The size of the tested image is 481 × 321; Figure 3 gives the original image of these tested images.
Figure 3

Benchmark tested images. (a) Church. (b) Train. (c) Roman. (d) Starfish. (e) Surfer II. (f) Cow. (g) Crocodile. (h) Elephant.

OSTU proposed by Qtsu [40] is one of most popular segmentation methods, and it has been used to determine whether the optimal threshold method can give a satisfactory segmentation results. The following discriminant criterion measure of OSTU can be described as follows: An image contains N pixels from 0 to L, usually L is set to 255, need to segmented in M+1 clusters. Thus, M thresholds {t1, t2, ⋯, t} are needed that divided the original image. The number of i-th gray level pixels or frequencies is denoted by h(i), PR represents the probability of i-th gray level pixel in the image,PR=h(i)/N, where N=∑h(i). The optimal thresholds {t1, t2, ⋯, t} are determined by the following equations:where w2=∑PR, u2=∑iPR/w, for k=1,2, ⋯, M. The compared experiments are performed on the test images with a different number of thresholds, M=3, M=5, and M=8 [41], to evaluate the performance of the CCP in both low- and high-dimensional multilevel thresholding problems. And, the proposed CCP is compared with PSO, DE, EPSO, and APSO approaches as mentation above. The purpose of image segmentation is to find a set of thresholds {t1, t2, ⋯, t} to maximize the values of Ostu's function according to equation (17). Figures 4–7 give the segmented results on tested images with a different number of thresholds.
Figure 4

Segmented images of church at different thresholds calculated by all approaches: (a) PSO (Th = 3), (b) DE (Th = 3), (c) EPSO (Th = 3), (d) APSO (Th = 3), (e) CCP (Th = 3), (f) PSO (Th = 5), (g) DE (Th = 5), (h) EPSO (Th = 5), (i) APSO (Th = 5), (j) CCP (Th = 5), (k) PSO (Th = 8), (l) DE (Th = 8), (m) EPSO (Th = 8), (n) APSO (Th = 8), (o) CCP (Th = 8).

Figure 5

Segmented images of starfish at different thresholds calculated by all approaches: (a) PSO (Th = 3), (b) DE (Th = 3), (c) EPSO (Th = 3), (d) APSO (Th = 3), (e) CCP (Th = 3), (f) PSO (Th = 5), (g) DE (Th = 5), (h) EPSO (Th = 5), (i) APSO (Th = 5), (j) CCP (Th = 5), (k) PSO (Th = 8), (l) DE (Th = 8), (m) EPSO (Th = 8), (n) APSO (Th = 8), (o) CCP (Th = 8).

Figure 6

Segmented images of Surfer II at different thresholds calculated by all approaches: (a) PSO (Th = 3), (b) DE (Th = 3), (c) EPSO (Th = 3), (d) APSO (Th = 3), (e) CCP (Th = 3), (f) PSO (Th = 5), (g) DE (Th = 5), (h) EPSO (Th = 5), (i) APSO (Th = 5), (j) CCP (Th = 5), (k) PSO (Th = 8), (l) DE (Th = 8), (m) EPSO (Th = 8), (n) APSO (Th = 8), (o) CCP (Th = 8).

Figure 7

Segmented images of elephant at different thresholds calculated by all approaches: (a) PSO (Th = 3), (b) DE (Th = 3), (c) EPSO (Th = 3), (d) APSO (Th = 3), (e) CCP (Th = 3), (f) PSO (Th = 5), (g) DE (Th = 5), (h) EPSO (Th = 5), (i) APSO (Th = 5), (j) CCP (Th = 5), (k) PSO (Th = 8), (l) DE (Th = 8), (m) EPSO (Th = 8), (n) APSO (Th = 8), (o) CCP (Th = 8).

Figures 4–7 provide the segmented images on church, starfish, surfer II, and elephants obtained by the compared techniques. It can be observed that the segmented quality has been improved with the increasing number of thresholds. And, CCP has a better performance than others on these tested images. Respectively, the segmented images obtained by APSO and CCP achieve better consistency than those by PSO, DE, and EPSO when Th=8. The gray-level histogram is often regarded as a kind of distributions to determine the thresholds for the image segmentation [42]. And, the peak value of the histogram is one of the important factors that affect the segmentation accuracy. Therefore, the thresholds of the compared approaches on church and elephant images are shown in gray-level histogram as follows. Figures 8 and 9 show the segmented results of church and elephant at 8 thresholds level obtained by CCP and compared approaches. The optimal thresholds values of CCP with {40, 79, 114, 130, 158, 191, 213, 235} is similar to those of the PSO with {40, 77, 114, 130, 157, 190, 213, 235} on church images and also similar to those of the elephant image. From these figures, it can be seen that the optimal thresholds values by DE and EPSO are very different from those by PSO, APSO, and CCP in most cases. Because the segmentation results depend on the information of classes according to the thresholds level, the optimal thresholds of EPSO have statistical difference with others. Furthermore, it is not difficult to find that the selected optimal thresholds heavily depend on the objective function that is chosen. Besides EPSO, other threshold selection approaches, including the CCP approach, can segment the test images more reasonably, as shown in Figures 4–7. Table 8 provides the mean and S of Ostu's function with 3, 5, and 8 thresholds achieved by all compared approaches in 50 run times.
Figure 8

Segmented images of church at 8 thresholds level calculated by all approaches and their respective gray histogram. (a) Church, (b) PSO, (c) DE, (d) EPSO, (e) APSO, and (f) CCP.

Figure 9

Segmented images of elephant at 8 thresholds level calculated by all approaches and their respective gray histogram. (a) Elephant, (b) PSO, (c) DE, (d) EPSO, (e) APSO, and (f) CCP.

Table 8

The values of Ostu's function with a different number of thresholds calculated by all approaches.

ImagesThPSegmentation approaches
PSODEEPSOAPSOCCP
Church3Mean3276.83673276.8373276.76853276.8367 3276.8367
SD1.87E − 121.87E − 120.1558311.87E − 12 0.00E+00
5Mean3381.91073381.49233380.49293381.9236 3381.9246
SD0.01710.22451.12400.0046 9.25E13
8Mean3422.56213420.95783416.20823420.7834 3423.2006
SD2.6579 0.9181 3.55742.867251.2323

Train3Mean2606.59512606.59512606.56232606.5951 2606.5951
SD 4.67E134.67E − 130.05004.67E − 13 4.67E13
5Mean2736.22572735.31812735.26762735.6328 2736.2331
SD0.01350.52410.67432.6787 9.33E13
8Mean2790.34352788.03382786.98292790.4068 2790.4461
SD0.09130.57242.55110.0897 0.0389

Roman3Mean2142.34402142.34402142.27522142.3440 2142.3440
SD9.33E − 139.33E − 130.10739.33E − 13 1.39E13
5Mean2221.30262220.84652220.59982221.3051 2221.3056
SD0.00470.24760.46360.0012 1.67E13
8Mean2251.72182250.57662249.71172251.7258 2251.7325
SD0.05920.42111.21470.0673 0.0573

Starfish3Mean2784.22722784.22722784.07952784.2272 2784.2272
SD1.4E − 121.4E − 120.12671.4E − 12 9.25E13
5Mean2916.25572915.64662915.58622916.2726 2916.2730
SD0.02400.36930.67160.0010 0.0009
8Mean2973.99762971.81862971.22072974.0391 2974.0829
SD0.04040.76031.87860.0942 0.0151

Surfer II3Mean7953.41817953.41817953.34897953.4181 7953.4181
SD2.80E − 122.80E 120.0718 2.78E122.80E − 12
5Mean8025.58808025.10568024.61608025.6045 8025.6055
SD0.01470.26391.22320.0031 0.0019
8Mean8055.97548054.33878052.11598056.0443 8056.0530
SD0.05050.54972.27410.0361 0.0189

Cow3Mean3858.22023858.22023858.18723858.2200 3858.2202
SD0.00E + 000.00E + 000.08860.0009 0.00E+00
5Mean3965.28063964.37223964.07703965.2925 3965.2932
SD0.01960.45311.24800.0032 4.67E13
8Mean4028.84474026.04904024.42134028.9299 4028.9450
SD0.07580.86352.64470.0548 0.0107

Crocodile3Mean3155.43593155.43433155.34683155.4359 3155.4359
SD9.33E − 130.00690.09699.33E 13 9.25E13
5Mean3291.42963290.60653290.47433291.4263 3291.4407
SD0.01630.49431.05940.0323 0.00E+00
8Mean3348.63143346.48093345.80103348.6425 3348.7305
SD0.07190.63802.08890.1033 0.0168

Elephants3Mean1626.72051626.72051626.67101626.7198 1626.7205
SD 0.00E+000.00E + 000.12200.00350.00E + 000
5Mean1695.86631695.52981695.23351695.8663 1695.8676
SD0.00390.14150.87550.0023 0.0005
8Mean1728.17061726.92761726.15341728.0408 1728.2503
SD0.05920.39311.52280.6764 0.0232
The simple statistics of compared approaches on tested images are reported in Table 8. The best values of mean and SD for each image are highlighted, and it is not hard to see that the CCP system is able to find the best values. Because the PSO, DE, EPSO, and APSO are not specially techniques for image segmentation, some segmented approaches based on multilevel threshold, whale optimation algorithm (WOA) [43], gray wolf optimizer (GWO) [44], whale optimization algorithm based on thresholding heuristic (WOA-TH) [41], and gray wolf optimizer based on thresholding heuristic (GWO-TH) [41], are used to evaluate the clustering effectiveness of CCP. The maximum number of iterations is set to 2200, 3000, and 3600 with the number of thresholds being 3, 5, and 8. The mean and SD of Ostu's function are obtained by the 100 run times to avoid the effects of random factors. Table 9 provides the values of mean and SD obtained by different segmented approaches on tested images. Obviously, traditional approaches are easily trapped into local optima, and WOA and GWO have worse segmented performance than that of the others. And, the thresholding heuristic has finetuned the best thresholds to enhance the global search ability of WOA and GWO. It also can be observed that the CCP has better and stable performance than compared segmented approaches on the tested images.
Table 9

The values of Ostu's function with a different number of thresholds calculated by segmentation approaches.

ImagesThPSegmentation approaches
WOAGWOWOA-THGWO-THCCP
Church3Mean3271.44273271.43833271.44273271.4445 3276.8367
SD1.24E – 029.93E – 03 9.14 E138.27E − 051.87E − 12
5Mean3375.87213374.21973375.91183371.9813 3381.9246
SD2.65E + 006.74E + 002.62E + 009.65E + 00 9.33E − 13
8Mean3416.68153415.44153417.08763415.6558 3423.5841
SD2.40E + 001.74E + 001.78E + 003.51E + 00 2.72E − 02

Train3Mean2606.59512611.4987 2611.5081 2611.50812606.5951
SD2.26E − 022.53E − 024.57E − 124.57E − 12 9.52 E  − 13
5Mean2740.67492740.5534 2740.6800 2740.67842736.2331
SD1.57E − 021.16E − 015.77E − 036.18E − 03 2.32 E  − 03
8Mean2794.69312793.9232 2794.7170 2794.27332790.4901
SD5.57E − 021.19E +001.90E − 025.32E − 01 1.12E − 02

Roman3Mean2138.78932138.78372138.80572138.7992 2142.3440
SD2.52E − 022.62E − 020.00E +001.69E − 02 9.33 E  − 13
5Mean2218.66352218.79362218.96222218.9687 2221.3056
SD2.53E + 007.93E − 011.11E − 022.35E − 04 9.25 E  − 13
8Mean2249.37682248.64082249.64572249.6080 2251.7798
SD1.28E + 001.17E + 001.37E − 012.00E − 01 3.77 E  − 02

Starfish3Mean2779.92522779.91672779.92522779.9214 2784.2272
SD3.20E − 126.64E − 033.20E − 125.91E − 03 1.40 E  − 12
5Mean2912.85322912.70582912.85622912.8371 2916.2730
SD1.18E − 021.66E − 015.48E − 034.62E − 02 9.34E-04
8Mean2972.22182971.37252972.34792972.2159 2974.0989
SD1.21E + 001.19E + 007.36E − 031.99E − 01 2.23E − 03

Surfer II3Mean7953.41677953.40637953.41677953.4135 7953.4181
SD5.07E − 031.89E − 024.82E − 037.80E − 03 6.48 E  − 12
5Mean8025.07038025.49598025.60498025.6025 8025.6055
SD7.73E − 035.56E − 027.11E − 033.20E − 03 2.80 E  − 12
8Mean8055.42158054.41358056.06178055.8463 8056.0646
SD8.27E − 037.36E − 012.05E − 031.25E − 01 2.01 E  − 03
Cow3Mean3858.22023858.21883858.22023858.2202 3858.2202
SD5.94E − 121.03E − 025.94E − 125.94E − 12 1.85 E  − 12
5Mean3965.27953965.17393965.29323964.6096 3965.2932
SD2.01E − 029.72E − 026.40E − 122.93E + 00 1.39 E  − 12
8Mean4028.94104027.78924028.84744027.7210 4028.9605
SD1.83E − 029.47E − 016.11E − 011.69E + 00 1.50 E  − 02

Crocodile3Mean3155.43593155.43373155.43593155.4359 3155.4359
SD9.14E − 131.39E − 029.14E − 13 9.14 E  − 13 9.33E − 13
5Mean3291.43403291.29453291.43663291.4287 3291.4407
SD1.02E − 021.22E − 015.38E − 031.35E − 02 4.67E − 13
8Mean3348.03843347.45283348.74533348.6504 3348.7454
SD2.79E +001.75E +00 1.94 E  − 03 7.60E − 023.27E − 03

Elephants3Mean1626.71831626.71921626.72051626.7196 1626.7205
SD3.53E − 039.10E − 031.37E − 129.25E − 03 9.25 E  − 13
5Mean1695.04821695.32081695.86641695.8231 1695.8676
SD3.99E +002.84E +00 1.05 E  - 03 8.42E − 027.12E − 02
8Mean1727.17251726.53791728.25961728.1452 1728.2676
SD2.86E + 001.34E + 002.55E − 021.35E − 01 1.20E − 02

5. Conclusions

A complex chained P system (CCP) is proposed for solving clustering problems, which combines complex chained P systems and evolution mechanisms, including PSO and DE mechanisms. Two kinds of evolution rules for objects in different chained membranes are introduced to enhance the global search ability of PSO. One of evolution rules contain two subevolution rules, which are based on traditional and modified PSO techniques. The partitioning information as environmental factor is introduced to improve the clustering performance of PSO. Another is based on the DE mechanism to evolve the global chained objects in the chained membrane m+1 to enhance the global search ability. In addition, two kinds of communication rules in the chained P systems are defined to enhance the cooperation between chained membranes and avoid prematurity. In order to verify the validity and the performance of CCP, this proposed system is evaluated on eight benchmark clustering problems from the UCI machine learning repository as compared with four developed clustering approaches. Furthermore, eight tested images which from the Berkeley segmentation image databases BSDS300 are used to further evaluate the performance of CCP compared to four existing segmentation techniques. These experimental results verify the validity and performance of this proposed CCP. P systems, as parallel computing models, are highly effective and efficient in solving optimization problems with linear or polynomial complexity. These parallel computing models based on evolution mechanisms provide new ways for solving clustering problems. The extended clustering chained P system uses the chained P system as the computation structure, and the communication rules between chained membranes are single directional. Although these single directional communication rules are simple and easy to implement, bidirectional communication rules may be introduced in future studies to further accelerate the convergence and improve the diversity of populations. Some more complicated communication structures between different membranes may be used in future studies to improve the performance of the approach. Furthermore, the experiments only used small datasets from the artificial datasets and the UCI Machine Learning Repository, and the proposed approach may have some limitations on high dimensional and large datasets. Future studies may test the effectiveness of CCP using large datasets. Balancing the local and global search abilities is also a hard problem to resolve in the future studies. Future studies may also focus on extended P systems based on tissue-like P systems and other bioinspired computing models. More works are needed to apply these extended membrane systems to solve automatic and multiobjective clustering problems.
  6 in total

1.  An Extended Membrane System with Active Membranes to Solve Automatic Fuzzy Clustering Problems.

Authors:  Hong Peng; Jun Wang; Peng Shi; Mario J Pérez-Jiménez; Agustín Riscos-Núñez
Journal:  Int J Neural Syst       Date:  2015-12-01       Impact factor: 5.866

2.  An Adaptive-PSO-Based Self-Organizing RBF Neural Network.

Authors:  Hong-Gui Han; Wei Lu; Ying Hou; Jun-Fei Qiao
Journal:  IEEE Trans Neural Netw Learn Syst       Date:  2016-10-24       Impact factor: 10.451

3.  Spiking neural P systems with multiple channels and anti-spikes.

Authors:  Xiaoxiao Song; Jun Wang; Hong Peng; Guimin Ning; Zhang Sun; Tao Wang; Fan Yang
Journal:  Biosystems       Date:  2018-05-24       Impact factor: 1.973

4.  Extended spiking neural P systems with white hole rules and their red-green variants.

Authors:  Artiom Alhazov; Rudolf Freund; Sergiu Ivanov; Marion Oswald; Sergey Verlan
Journal:  Nat Comput       Date:  2017-11-13       Impact factor: 1.690

5.  Review on 2D and 3D MRI Image Segmentation Techniques.

Authors:  S Shirly; K Ramesh
Journal:  Curr Med Imaging Rev       Date:  2019

6.  GA-Based Membrane Evolutionary Algorithm for Ensemble Clustering.

Authors:  Yanhua Wang; Xiyu Liu; Laisheng Xiang
Journal:  Comput Intell Neurosci       Date:  2017-11-16
  6 in total
  1 in total

1.  Spiking Neural P Systems with Membrane Potentials, Inhibitory Rules, and Anti-Spikes.

Authors:  Yuping Liu; Yuzhen Zhao
Journal:  Entropy (Basel)       Date:  2022-06-16       Impact factor: 2.738

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.