Literature DB >> 31828105

Finding Community Modules of Brain Networks Based on PSO with Uniform Design.

Jie Zhang1,2, Lingkai Tang2, Bo Liao2, Xiaoshu Zhu1, Fang-Xiang Wu2.   

Abstract

The brain has the most complex structures and functions in living organisms, and brain networks can provide us an effective way for brain function analysis and brain disease detection. In brain networks, there exist some important neural unit modules, which contain many meaningful biological insights. It is appealing to find the neural unit modules and obtain their affiliations. In this study, we present a novel method by integrating the uniform design into the particle swarm optimization to find community modules of brain networks, abbreviated as UPSO. The difference between UPSO and the existing ones lies in that UPSO is presented first for detecting community modules. Several brain networks generated from functional MRI for studying autism are used to verify the proposed algorithm. Experimental results obtained on these brain networks demonstrate that UPSO can find community modules efficiently and outperforms the other competing methods in terms of modularity and conductance. Additionally, the comparison of UPSO and PSO also shows that the uniform design plays an important role in improving the performance of UPSO.
Copyright © 2019 Jie Zhang et al.

Entities:  

Mesh:

Year:  2019        PMID: 31828105      PMCID: PMC6885845          DOI: 10.1155/2019/4979582

Source DB:  PubMed          Journal:  Biomed Res Int            Impact factor:   3.411


1. Introduction

Graph theory is a very helpful mathematical tool in the field of brain network analysis [1-3]. A brain can be represented as a modular network [4, 5], which is composed of some important neural unit modules. They can provide us rich and useful information and exhibit small-world properties of brain networks [6]. These modules are known as community modules. In brain networks, each vertex denotes a region of interest (ROI) [7], and each edge and its weight represent the connectivity and its strength, respectively [8-10]. Community detection methods are frequently used to find community modules. Girvan and Newman proposed the concept of modularity [11-13], which is the widely used and best known metric. A larger modularity represents a better community partition. Modularity-based community detection methods find the best community modules by seeking the maximum modularity. Namely, when the modularity is maximal, the methods terminate. Therefore, community detection methods can be addressed by means of optimization methods. The FastQ [14] community detection method uses a greedy optimization to maximize modularity. It repeatedly joins communities together in pairs by choosing the join that results in the maximum alteration of modularity in each step. Danon et al.'s [15] community detection method is a modification of FastQ, in which the communities of different sizes are treated equally. The Louvain [16, 17] community detection method firstly calculates the gain of modularity by exchanging a node to its neighbor nodes. Then, the neighbor node obtaining the maximum gain replaces the node. Particle swarm optimization (PSO) [18-20], as one of the swarm intelligent optimization algorithms, was first put forward by Eberhart and Kennedy [21, 22]. It simulates the foraging process of birds. Each bird (particle) may search the feasible solution space individually and share its individual optimal information to the other bird (particle). The swarm can obtain the global optimal solution by comparing the best solutions of all birds (particles) in the swarm. PSO can obtain the optimal solution quickly. However, it has the drawback of premature convergence [23]. The uniform design belongs to the category of the pseudo-Monte Carlo method. It can generate the solutions scattered uniformly over the vector space, and the solutions are independent of each other [24-26]. The uniform design can be applied to many problems, including bio-inspired intelligent optimizations. Zhang et al. [27] combined the uniform design and artificial bee colony to find the community of brain networks. Zhang et al. [26] introduced the uniform design into association rule mining and presented a multiobjective association rule mining algorithm based on the attribute index and the uniform design. Leung and Wang [24] integrated the uniform design and the multiobjective genetic algorithm to obtain the Pareto optimal solutions uniformly over the Pareto frontier. Zhu et al. [28] combined the uniform design and PAM to find the Pareto optimal solutions of the multiobjective particle swarm optimization. Dai and Wang [29] presented a new decomposition-based evolutionary algorithm with the uniform design. Liu et al. [30] proposed a hybrid genetic algorithm based on the variable grouping and the uniform design for global optimization problems. Tan et al. [31] adopted the uniform design to set the aggregation coefficient vectors of the subproblems and proposed the uniform design multiobjective evolutionary algorithm based on decomposition. Feng et al. [32] presented a uniform dynamic programming to alleviate the dimensionality problem of dynamic programming by means of introducing a uniform dynamic to dynamic programming. There are only a few reports on community detection in brain networks in the literature. Liao et al. [33] utilized U-Net-based deep convolutional networks to identify and segment the brain tumor. Williams et al. [34] utilized both Louvain [16] and Infomap [35] community detection algorithms to identify modules in noisy or incomplete brain networks. Zhang et al. [27] utilized the artificial bee colony with the uniform design to detect community modules of brain networks. Wang et al. [36] used the multiview nonnegative matrix factorization to detect modules in multiple biological networks. This study presents a novel method to find community modules of brain networks by integrating PSO with the uniform design. PSO is used to maximize modularity, while the uniform design is used to alleviate premature convergence of PSO by generating sampled points scattered evenly over the vector space. The rest of this study is organized as follows: Section 2 describes the preliminaries of UPSO. Section 3 introduces two evaluation metrics. The dataset and the preprocessing method to be used are described in Section 4. The details of UPSO are shown in Section 5. The comparison between UPSO and several competing algorithms is illustrated in Section 6. The conclusion and future work are described in Section 7.

2. Preliminaries

In this section, we describe PSO and the uniform design.

2.1. Particle Swarm Optimization

In a d-dimensional search space, the position and velocity of the i-th particle are, respectively, represented as x=[x, x,…, x] and s=[s, s,…, s], where i=1,2,…, Npop, in which Npop denotes the population size. The optimal solution of the i-th particle is called the individual optimum, while the optimal solution of the whole swarm is called the global optimum. They, respectively, are denoted as Pbest=[p, p,…, p] and Gbest=[p, p,…, p]. The following formulas are utilized to update the velocity and position of each particle in the swarm [21, 22], respectively:where i=1,2,…, Npop; ω is called the inertia weight coefficient reflecting the ability to track the previous speed; c1 and c2 are called the acceleration coefficients of the individual and the global optimum, respectively, and are commonly set as 2; and r1 and r2 are two random numbers distributed uniformly in (0, 1). From the theoretical analysis of a PSO algorithm, the trajectory of a particle x converges to the mean of Pbest and Gbest. Whenever the particle converges, it “flies” to the individual best position and the global best position [37]. According to formulas (1) and (2), the individual optimum position of each particle gradually moves closer to the global optimum position. Therefore, all the particles may converge to the global optimum position.

2.2. Uniform Design

The uniform design is an experimental design method. Its main objective is to sample a small set of points from a given set of points such that the sampled points are uniformly scattered. Let n be the number of factors and q be the number of levels per factor. When n and q are given, the uniform design selects q combination from all q possible combinations such that these combinations are scattered uniformly over the space of all possible combinations. The selected q combinations are expressed in a uniform array U(n, q)=[U], where U is the level of the l2-th factor in the l1-th combination and can be calculated by the following formula [24–26, 28, 29, 38]:where σ is a parameter given in Table 1.
Table 1

Values of the parameter σ for different numbers of factors and different numbers of levels per factor [24, 25].

Number of levels per factorNumber of factors σ
52∼42

72∼63

112∼107

1325
34
4∼126

172∼1610

192∼38
4∼1814

232, 13∼14, 20∼227
8∼1215
3∼7, 15∼1917

29212
39
4∼716
8∼12, 16∼248
13∼1514
25∼2818

312, 5∼12, 20∼3012
3∼4, 13∼1922
Based on the uniform design, a crossover operator is as follows [24]. It quantizes the solution space defined by two parents into a finite number of points and then applies the uniform design to select a small sample of uniformly scattered points as the potential offspring. Consider two parents x1=(x1,1, x1,2,…, x1,) and x2=(x2,1, x2,2,…, x2,). The minimal and maximal values of each dimension for x1 and x2 can generate a novel solution space [lparent, uparent], denoted as follows: Each domain of [lparent, uparent] is quantized into Q1 levels β, β,…, β, where Q1 is a predefined prime number and β is given as follows: Then, the uniform design is applied to select a sample point as the potential offspring. The crossover operator of two parents x1 and x2 can acquire Q1 offsprings, which are scattered evenly over the vector space spanned by x1 and x2. More details of the algorithm can be obtained from references [24, 25].

3. Evaluation Metrics

There exist many evaluation metrics for community modules of complex brain networks. In this study, we adopt the following metrics.

3.1. Modularity [27]

The modularity metric is a statistic that quantifies the degree to which the network may be divided into such clearly delineated groups [39, 40]. Newman et al. introduced the modularity function and modularity matrix to avoid the influences of random factors so as to obtain the better divisions of the community structure [12, 13, 41]. The modularity Q is the number portion of edges falling within communities minus the expected number portion in an equivalent network with edges placed at random. The modularity Q can be expressed as follows [42, 43]:where δ(i, j)=1 if vertices i and j belong to the same community or δ(i, j)=0 otherwise. m=(∑k)/2 denotes the number of edges in the network, k is the degree of the vertex i, and a is the weight in the adjacent matrix A. Let B=a − (kk/2m), which is called the modularity matrix. Formula (6) can be rewritten in the matrix format as follows:where the assignment matrix X=(x), in which x=1 if vertex i belongs to the community h or x=0 otherwise. The function Trace( ) denotes the sum of diagonal elements of a matrix. A high modularity indicates a better partitioning of the graph. The search for optimal modularity Q is an NP-hard problem [44, 45] because the space of possible partitions grows faster than any power of system size.

3.2. Conductance [27]

The conductance of a cut is a metric that compares the size of a cut (i.e., the number of edges cut) and the number of edges in either of the two subgraphs induced by that cut. The conductance ϕ(G) of a graph is the minimum conductance value between all its clusters. Consider a cut that divides G into k nonoverlapping clusters C1, C2,…, C. The conductance of any given cluster ϕ(C) is given by the following formula [43, 46]:where K denotes the number of clusters, a is the weight in the adjacent matrix, C represents the k-th cluster (k=1,2,…, K), and a(C) is the number of edges with at least one endpoint in C. This ϕ(C) represents the cost of one cut that bisects G into two vertex sets C and (the complement of C). Since we want to find a number k of clusters, we will need k−1 cuts to achieve that number. The conductance for the whole clustering is the average value of those k−1ϕ cuts. The conductance metric can evaluate how difficultly a random walk is that leaves a cluster [40]. The more difficultly a random walk leaves a cluster is, the more compact cluster is. A low conductance indicates a better partitioning of the graph. The conductance metric usually ranges from 0 to 1, while 0 is the optimal score, which means that each cluster corresponds to a maximal strongly connected component of the network.

4. Dataset and Preprocessing

4.1. Dataset

A network is a mathematical representation of a real-world complex system and is determined by a collection of nodes (vertices) and links (edges) between pairs of nodes. Brain connectivity datasets comprise networks of brain regions connected by anatomical tracts or by functional associations. Nodes in brain networks usually represent ROIs, while links represent anatomical, functional, or effective connection [40]. A connectivity matrix (CM) is used to store the connectivity strength between all pairs of ROIs in a brain network [47]. The Autism dataset [6] collected 175 individuals with autism spectrum disorder (ASD) and typically developing (TD) ones, which were acquired from 79 resting-state functional MRI (rsfMRI: 42 ASD and 37 TD) brain networks and 94 diffusion tensor imaging (DTI: 51 ASD and 43 TD) brain networks. The dataset can be obtained from the UCLA multimodal connectivity database (http://umcd.humanconnectomeproject.org) [47]. Each rsfMRI imaging is composed of a 264 × 264 connectivity matrix (CM), in which each value denotes the z-transformed Pearson correlation coefficient (PCC) [6]. In this study, 79 rsfMRI brain networks (42 ASD and 37 TD) are utilized to test the proposed algorithm.

4.2. Data Preprocessing

In this study, we conduct the following preprocessing steps for the above dataset: Reverse z-transformation is performed on the original CM to acquire the PCC connectivity matrix (PCM) according to the following formula: where x ∈ CM and x′ ∈ PCM denote the original and new values, respectively. (2) The negative data in the PCM signify that the correlation among the vertices is negative correlation. In this study, these negative elements are taken as 0 to get rid of negative correlation. After conducting the above two steps, all data in the PCM are in [0, 1], and the PCM turns into a symmetric and nonnegative matrix. (3) To eliminate data noise, this study adopts the thresholding method to remove all edges with the weight less than a specific value θ. Namely, if x < θ, then x=0. In the later numerical experiment, θ = 0.2.

5. The Proposed Algorithm

In this study, we propose a novel algorithm for finding community modules of brain networks by integrating PSO with the uniform design (abbreviated as UPSO). Its coding and detailed steps are described as follows.

5.1. Coding

A brain network G can be represented as G=(V, E), where V={v1, v2,…, v} is a set of N=|V| vertices and E={(v, v) | v, v ∈ V} is a set of M=|E| weighted edges (arcs) among N vertices. The adjacent matrix of G is expressed as A=(a), where a denotes the weight between vertices i and j. From the above-mentioned dataset and data processing, we can see that A is a symmetric and nonnegative matrix. The number of community modules and centroid of a community module are denoted by K and CC=(cc,…, cc),  where k=1,2,…, K, respectively. In PSO, the position coding x of a particle is expressed aswhere x is a K∗N-dimensional row vector and i=1,…, Npop (the population size in PSO).

5.2. Detailed Steps

The proposed algorithm UPSO utilizes the uniform design to obtain the sampled points scattered evenly over the solution space. The initial method based on the uniform design can generate a group of suitable initial particles scattered evenly over the solution space. The crossover operator based on the uniform design can acquire the offspring scattered uniformly over the space spanned by two crossover parents. UPSO iteratively tries to improve a candidate solution in terms of modularity. It integrates the uniform design and PSO to find community modules of brain networks. It can not only obviate the shortcoming of premature convergence in PSO but also acquire the solutions scattered evenly over the solution space. It can find out community modules from brain networks without knowing the number of community modules. Its flow chart is illustrated in Figure 1.
Figure 1

Flow chart of the proposed algorithm.

The detailed steps of the proposed algorithm UPSO are described as follows.

Step 1 .

(generating a temporary initial swarm). The following operations are performed one after another: Let K = K + 1, where K denotes the number of community modules, and its minimal and maximal values are, respectively, 1 and N (the number of vertices). K = 1 to N is to acquire the fittest number of community modules. According to the swarm size Npop, the number of subintervals S and the swarm size of a subinterval Q0 are determined such that S × Q0 ≥ Npop, where S can be taken as 2, or 22, or 23, etc.; Q0 is one of the prime numbers in the first column of Table 1. Here, any combination satisfying S × Q0 ≥ Npop can be chosen. The generation algorithm of initial population based on the uniform design described in reference [28] is implemented to generate a temporary initial swarm Tmp_pop in terms of K, in which each element x contains K community centroids.

Step 2 .

(calculating the fitness of the temporary initial swarm). For each particle p in Tmp_pop, the following operations are performed in sequence: K community centroids CC are separated from x. For each element a in the adjacent matrix A described in Section 5.1, the distances are calculated between a and each CC. Each vertex is assigned to the closest community C to obtain its community affiliation IDX and K community modules. The modularity Q of K community modules is calculated using formula (6) or (7) in terms of IDX, and it is taken as the fitness f(x) of x.

Step 3 .

(generating the initial swarm from the temporary initial swarm). According to the acquired fitness of each particle in the temporary initial swarm, the best Npop ones of the Q0∗S particles are selected as the initial swarm pop.

Step 4 .

(regulating each community module). For each particle position x in pop, the following operations are performed in sequence. The centroid of the community C is updated according to the following formula:where a=(a, a,…, a), i=1,2,…, N  and  N  is  thenumber  of  vertices;  k=1,2,…, K; n is the number of vertices which belong to the community C; and is the new community centroid of C. K new community centroids form a new position, marked as KC, whose fitness and community affiliation are f(KC) and KC_IDX, respectively. If f(KC) > f(x), then x = KC, f(x)=f(KC), and IDX = KC_IDX.

Step 5 .

(initializing the velocity s, individual optimal Pbest, and global optimal Gbest). The velocity s and individual optimal Pbest of the particle p are initialized as its position x, and the fitness of Pbest is set as f(Pbest)=f(x). The maximal value in all Pbest is taken as the global optimal Gbest, which stores the best x and Q of the swarm. The community affiliation Gbest is stored into IDX.

Step 6 .

(increasing iterations and judging terminal conditions). Let t = t + 1, then judge whether terminal conditions are satisfied or not, where t denotes the t-th iteration and its initial value is 0. If K is known, and any of the terminal conditions is satisfied, the algorithm terminates and outputs the optimal solution and its community affiliation; otherwise, the algorithm moves to Step 7. Terminal conditions are described in Section 6.1.

Step 7 .

(computing the weight coefficient w in PSO). The weight coefficient w in PSO utilizes a linear decreasing strategy [48, 49] indicated in the following formula:where wmax and wmin are the maximal and minimal values of w and tmax is the maximal number of iterations. In the later numerical experiment, wmin=0.1 and wmax=1.

Step 8 .

(updating the velocity and position of each particle). To guide the moving trajectory of a particle by KC, formula (1) is modified into the following formula: The velocity and position of each particle in the pop are updated in terms of formulas (13) and (2), respectively.

Step 9 .

(calculating the fitness and regulating each community module). The fitness of each particle in the pop is calculated according to the operations in Step 2, and Step 4 is implemented to regulate each community module.

Step 10 .

(updating Pbest, Gbest, and IDX). For each particle in the pop, if f(x) > f(Pbest), then Pbest=x and f(Pbest)=f(x). If f(x) > f(Gbest), then Gbest=x, f(Gbest)=f(x), and IDX = IDX.

Step 11 .

(implementing the crossover operator based on the uniform design). For each particle in the pop, the following operations are performed in sequence: The crossover operator based on the uniform design is implemented on x and Pbest to acquire the Q1 offspring scattered uniformly over the space spanned by them and also on x and Gbest to acquire another Q1 offspring. The fitness of the 2∗Q1 offspring is calculated, and the best one of them is marked as Obest. The fitness and community affiliation of Obest are expressed as f(Obest) and I  DX, respectively. If f(Obest) > f(Pbest), then x = Obest, f(x)=f(Obest), Pbest=Obest, and f(Pbest)=f(Obest). If f(Obest) > f(Gbest), then f(Gbest)=f(Obest), Gbest=Obest, and I  DX=I  DX.

Step 12 .

The algorithm is returned to Step 6.

Step 13 .

If K < N, the best Q and community affiliation IDX are saved and then the algorithm returns to Step 1; otherwise, the algorithm outputs the optimal solution Gbest, the community affiliation IDX, and the fittest K.

6. Numerical Results

In this study, we select four competing community detection algorithms to compare the performances of UPSO. They include the spectral clustering [50], FastQ [14], Danon et al. [15], and Louvain [16] algorithms. FastQ, Danon, and Louvain algorithms are three commonly used community detection methods. Among five algorithms, UPSO and the spectral clustering are stochastic search algorithms, while FastQ, Danon, and Louvain algorithms are deterministic search algorithms. The parameter values of UPSO and the numerical results obtained by UPSO and four competing algorithms are described as follows.

6.1. Parameter Values

In this study, the parameters of UPSO are described as follows.

6.1.1. Parameters for PSO

The minimal and maximal inertia weight coefficients are wmin=0.1 and wmax=1 (the recommended values in PSO); the acceleration coefficients c1, c2, and c3 are all equal to 2 (the recommended values in PSO); the population size Npop = 100; the maximal number of iterations tmax = 100.

6.1.2. Parameters for the Uniform Design

As the above-mentioned each rsfMRI imaging is a 264 × 264 CM, we set the number of subintervals S as 4 (S can be 21, 22, 23, ......); the number of sample points or the swarm size of each subinterval Q0 is set as 31 because Q0 can be any values in Table 1 and the product of Q0 and S must be larger than the population size Npop, namely, (Q0∗S=31∗4=124) > (Npop = 100). The parameter Q1 is set as 5 in order to only generate 5 offsprings in uniform cross to decrease time consumption.

6.1.3. Terminal Conditions

The number of iterations t > tmax The number of fitness remains unchanged, tno, and is larger than or equal to 30% of tmax When any of the above two terminal conditions is satisfied, the algorithm terminates. It is worth noting that the above parameter values are not fixed and can be changed according to different datasets. The above parameter values are only one of the suitable values, and they do not need to be fine tuned. As the spectral clustering needs to preestimate the number of community modules, it uses the identical number of community modules to UPSO. FastQ, Danon, and Louvain algorithms do not necessarily need to estimate the number of community modules; therefore, they use their default parameters.

6.2. Results

6.2.1. Comparisons of Evaluation Metrics

All the 79 rsfMRI brain networks are utilized to test the performance of five algorithms. Five algorithms independently performed 20 runs to compare their average values. For stochastic search algorithms, UPSO and the spectral clustering, we also compare their standard deviations (the values in parentheses in Tables 2 and 3). Tables 2 and 3, respectively, show the results of modularity and conductance metrics obtained by five algorithms.
Table 2

Comparisons of the modularity metric.

DatasetUPSOSpectral clusteringFastQDanonLouvain
ASD67B 0.3127 (0.0003)0.2981 (0.0000)0.29370.29300.3092
ASD70B 0.3784 (0.0013)0.3479 (0.0037)0.31450.33410.3735
ASD73C 0.4261 (0.0010)0.3943 (0.0031)0.35610.37670.4213
ASD75B 0.4420 (0.0000)0.3805 (0.0000)0.41740.40110.4403
ASD76C 0.4201 (0.0011)0.4112 (0.0000)0.31350.39670.4192
ASD82 0.4300 (0.0000)0.3888 (0.0023)0.36880.36440.4281
ASD83B 0.3527 (0.0002)0.3321 (0.0179)0.30420.32430.3522
ASD87B 0.4551 (0.0001)0.4432 (0.0003)0.42950.43470.4536
ASD90B 0.3919 (0.0025)0.3699 (0.0000)0.33200.3555 0.3919
ASD91B 0.3522 (0.0020)0.3152 (0.0040)0)0.34020.29940.3468
ASD92 0.3987 (0.0000)0.3663 (0.0198)0.36570.37300.3954
ASD93B 0.4153 (0.0001)0.3720 (0.0038)0.36920.38180.4089
ASD95 0.3972 (0.0005)0.3265 (0.0017)0.38420.3765 0.3972
ASD96B 0.4523 (0.0000)0.4092 (0.0381)0.35500.43390.4510
ASD97 0.4233 (0.0017)0.3909 (0.0048)0.37030.37620.4187
ASD99 0.4336 (0.0001)0.4195 (0.0258)0.37350.41940.4332
ASD102 0.4428 (0.0000)0.4325 (0.0015)0.35230.34820.4358
ASD103 0.4443 (0.0000)0.4259 (0.0185)0.37680.43070.4439
ASD104 0.4362 (0.0000)0.4241 (0.0000)0.38680.38090.4355
ASD106 0.4194 (0.0003)0.3513 (0.0197)0.39890.40710.4192
ASD108 0.3997 (0.0003)0.3782 (0.0095)0.35160.35900.3885
ASD111 0.4169 (0.0000)0.4109 (0.0004)0.37420.37690.4141
ASD112 0.4515 (0.0001)0.4112 (0.0031)0.39640.39010.4477
ASD113 0.4104 (0.0016)0.3764 (0.0003)0.38060.37780.4039
ASD114 0.3960 (0.0010)0.3590 (0.0045)0.37650.37660.3914
ASD115 0.3903 (0.0010)0.3311 (0.0025)0.36240.36520.3861
ASD116 0.4095 (0.0008)0.3835 (0.0007)0.34460.30740.4031
ASD117 0.4018 (0.0034)0.3684 (0.0043)0.36130.34690.3955
ASD119 0.4386 (0.0009)0.3638 (0.0190)0.42620.42450.4368
ASD120 0.4214 (0.0001)0.3900 (0.0014)0.34990.38100.4199
ASD124 0.3926 (0.0020)0.3574 (0.0085)0.30210.30210.3893
ASD125 0.3935 (0.0008)0.3774 (0.0116)0.33470.32970.3916
ASD127 0.3955 (0.0012)0.3766 (0.0013)0.35910.33900.3903
ASD129 0.3930 (0.0003)0.3425 (0.0017)0.35790.31870.3878
ASD130 0.4208 (0.0006)0.3975 (0.0156)0.38770.37860.4115
ASD131 0.4232 (0.0021)0.3855 (0.0171)0.40280.39670.4191
ASD132 0.4473 (0.0005)0.4195 (0.0026)0.35160.36480.4445
ASD133 0.4249 (0.0003)0.3841 (0.0205)0.38180.38500.4221
ASD134 0.4018 (0.0006)0.3555 (0.0050)0.38940.34180.3988
ASD138 0.4040 (0.0001)0.3972 (0.0085)0.38530.39420.3979
ASD142 0.4349 (0.0004)0.3867 (0.0011)0.40600.40670.4337
ASD143 0.4027 (0.0001)0.3735 (0.0201)0.32010.35210.4014
TD86C 0.3837 (0.0035)0.3710 (0.0005)0.33870.31790.3836
TD100C 0.3910 (0.0001)0.3843 (0.0046)0.32080.34120.3864
TD101B 0.4257 (0.0002)0.3961 (0.0073)0.40690.39900.4239
TD102B 0.4253 (0.0001)0.3715 (0.0001)0.37410.39430.4221
TD103B 0.4369 (0.0005)0.3687 (0.0155)0.43040.42170.4363
TD105 0.4295 (0.0006)0.4222 (0.0014)0.39220.36810.4265
TD107B 0.4301 (0.0002)0.3820 (0.0000)0.41100.40420.4287
TD108B 0.4171 (0.0001)0.3689 (0.0010)0.37230.37680.4151
TD111B 0.3341 (0.0013)0.3164 (0.0029)0.29160.26170.3286
TD112B 0.4207 (0.0001)0.3744 (0.0210)0.39930.41280.4169
TD113B 0.4017 (0.0009)0.3662 (0.0000)0.34810.34130.3999
TD114 0.4068 (0.0010)0.3895 (0.0000)0.37570.35520.4065
TD118 0.4267 (0.0002)0.4154 (0.0077)0.36080.34490.4243
TD120 0.4189 (0.0014)0.3724 (0.0008)0.36990.36010.4187
TD121 0.4676 (0.0007)0.4346 (0.0001)0.44820.43370.4670
TD122 0.4738 (0.0003)0.4367 (0.0010)0.45890.44890.4733
TD123 0.4336 (0.0011)0.3805 (0.0070)0.41790.40040.4324
TD124 0.4498 (0.0001)0.4114 (0.0000)0.32070.40430.4495
TD125 0.4028 (0.0002)0.3902 (0.0005)0.34720.35060.3988
TD126 0.4174 (0.0007)0.3623 (0.0012)0.40510.37360.4121
TD128 0.4458 (0.0017)0.4294 (0.0000)0.36150.40380.4427
TD129 0.4396 (0.0002)0.3975 (0.0029)0.37450.39040.4375
TD130B 0.4074 (0.0003)0.4010 (0.0001)0.39170.36370.4022
TD131 0.4104 (0.0013)0.3729 (0.0020)0.39490.37590.4079
TD132 0.4439 (0.0002)0.4356 (0.0005)0.40680.42040.4407
TD133 0.3555 (0.0003)0.3509 (0.0005)0.31340.31540.3533
TD134 0.4317 (0.0012)0.3649 (0.0309)0.41240.39800.4285
TD135 0.4460 (0.0007)0.3664 (0.0231)0.42000.41780.4449
TD136 0.4434 (0.0001)0.4069 (0.0005)0.37640.37310.4432
TD137 0.4276 (0.0005)0.3835 (0.0029)0.41610.37260.4270
TD138B 0.3911 (0.0003)0.3631 (0.0003)0.36960.35470.3875
TD139 0.4377 (0.0000)0.4277 (0.0000)0.42300.42050.4353
TD140 0.4151 (0.0009)0.3685 (0.0021)0.39260.37910.4143
TD142 0.4373 (0.0000)0.4216 (0.0042)0.41140.37700.4295
TD143 0.4465 (0.0001)0.4392 (0.0003)0.41110.41070.4434
TD144 0.4073 (0.0002)0.3411 (0.0061)0.36860.36350.4082
TD145 0.4540 (0.0005)0.3809 (0.0011)0.43360.43060.4494
Table 3

Comparisons of the conductance metric.

DatasetUPSOSpectral clusteringFastQDanonLouvain
ASD67B 0.3479 (0.0057)0.3642 (0.0001)0.56290.55980.3793
ASD70B 0.3720 (0.0028)0.4476 (0.0058)0.59370.49610.4213
ASD75B 0.2210 (0.0000)0.2585 (0.0000)0.39120.43420.2220
ASD76C 0.2305 (0.0855)0.2546 (0.0000)0.66940.42840.3098
ASD82 0.3155 (0.0005)0.3536 (0.0013)0.58860.38700.3179
ASD83B 0.4531 (0.0006)0.4640 (0.0194)0.49420.47400.4547
ASD90B 0.3387 (0.0325)0.4247 (0.0028)0.63930.45770.4074
ASD91B 0.3461 (0.0354)0.4360 (0.0029)0.35100.61530.4206
ASD92 0.2726 (0.0000)0.3851 (0.0223)0.61380.29230.3628
ASD95 0.3407 (0.0477)0.3967 (0.0011)0.56860.49780.3723
ASD97 0.2773 (0.0471)0.3541 (0.0067)0.53750.39280.3382
ASD102 0.3095 (0.0001)0.3184 (0.0015)0.52800.54100.3169
ASD103 0.2831 (0.0003)0.2995 (0.0424)0.35650.37080.2832
ASD106 0.3569 (0.0074)0.3912 (0.0251)0.43960.41310.3577
ASD108 0.3510 (0.0019)0.3644 (0.0107)0.46660.48130.3876
ASD112 0.3104 (0.0001)0.3501 (0.0024)0.47500.48840.3159
ASD113 0.3505 (0.0298)0.3725 (0.0002)0.48010.56590.3703
ASD115 0.2821 (0.0009)0.3292 (0.0012)0.56900.34970.3770
ASD117 0.3444 (0.0624)0.4642 (0.0036)0.61170.47210.3812
ASD119 0.2305 (0.0442)0.4344 (0.0238)0.24360.24490.3521
ASD120 0.3495 (0.0010)0.3642 (0.0012)0.37210.43480.3534
ASD124 0.3445 (0.0435)0.3835 (0.0104)0.62410.61510.3821
ASD127 0.2640 (0.0004)0.3739 (0.0006)0.49040.37190.3395
ASD129 0.3648 (0.0000)0.4552 (0.0012)0.56460.38070.4096
ASD131 0.3936 (0.0200)0.4379 (0.0268)0.53570.43470.4118
ASD132 0.2245 (0.0005)0.3295 (0.0020)0.39210.36290.3633
ASD142 0.2243 (0.0011)0.3665 (0.0011)0.26530.39710.3116
ASD143 0.3914 (0.0054)0.4562 (0.0267)0.61570.46720.4204
TD86C 0.3516 (0.0570)0.4259 (0.0003)0.63060.51360.3625
TD101B 0.2422 (0.0004)0.4026 (0.0073)0.42430.42860.3870
TD103B 0.2280 (0.0030)0.3739 (0.0223)0.22920.23320.3158
TD105 0.2368 (0.0007)0.2454 (0.0017)0.42630.29480.3749
TD107B 0.3336 (0.0024)0.3661 (0.0000)0.46180.47320.3413
TD108B 0.3522 (0.0003)0.3689 (0.0005)0.48780.58750.3687
TD111B 0.3200 (0.0388)0.4257 (0.0032)0.60670.57870.4281
TD112B 0.2342 (0.0005)0.4137 (0.0225)0.39510.40470.3708
TD114 0.2541 (0.0521)0.3611 (0.0000)0.30000.45580.3848
TD118 0.3600 (0.0125)0.3902 (0.0142)0.48930.55710.3777
TD120 0.2535 (0.0508)0.3719 (0.0007)0.49580.49570.3676
TD121 0.2743 (0.0242)0.3059 (0.0001)0.38110.38710.2761
TD122 0.1898 (0.0007)0.3160 (0.0006)0.20520.37170.2880
TD126 0.2602 (0.0454)0.3878 (0.0012)0.34110.37740.3747
TD132 0.2231 (0.0006)0.2261 (0.0002)0.24650.23840.2243
TD133 0.4000 (0.0006)0.4010 (0.0011)0.53190.51890.4017
TD134 0.2502 (0.0526)0.3739 (0.0370)0.51260.41280.2649
TD135 0.2851 (0.0626)0.4011 (0.0309)0.45910.48430.3022
TD140 0.2708 (0.0017)0.3939 (0.0017)0.49860.51530.3833
TD142 0.2309 (0.0000)0.2429 (0.0032)0.52430.28890.2453
TD144 0.3574 (0.0041)0.4481 (0.0102)0.57360.36530.4091
TD145 0.2168 (0.0029)0.3712 (0.0008)0.47630.35030.3183
ASD93B0.3384 (0.0001)0.3761 (0.0023)0.2957 0.2837 0.3544
ASD96B0.2880 (0.0000)0.3455 (0.0434)0.5993 0.2223 0.2876
ASD990.3170 (0.0004)0.3303 (0.0322)0.2893 0.2439 0.3179
ASD1040.3141 (0.0000)0.3259 (0.0000)0.2855 0.2854 0.3151
ASD1110.3839 (0.0000)0.3891 (0.0004)0.2866 0.2853 0.3809
ASD1140.3693 (0.0044)0.3948 (0.0039)0.3634 0.3576 0.3751
ASD1160.3974 (0.0138)0.4165 (0.0007)0.4772 0.3825 0.3943
ASD1250.3478 (0.0264)0.4216 (0.0164)0.5709 0.3181 0.4066
ASD1300.3349 (0.0006)0.3507 (0.0174)0.4471 0.2749 0.3944
ASD1330.3337 (0.0006)0.3599 (0.0248)0.5303 0.2722 0.3385
ASD1340.3563 (0.0334)0.4460 (0.0032)0.5267 0.3155 0.4194
ASD1380.3463 (0.0032)0.3511 (0.0152)0.4444 0.2719 0.3347
TD100C0.3565 (0.0002)0.3601 (0.0044)0.5727 0.3244 0.3606
TD102B0.3387 (0.0003)0.3805 (0.0002)0.4720 0.2715 0.3414
TD1240.2949 (0.0003)0.3319 (0.0000)0.4469 0.2544 0.2959
TD1250.3443 (0.0016)0.3479 (0.0004)0.5600 0.3071 0.4080
TD1280.3055 (0.0183)0.3264 (0.0000)0.5013 0.2585 0.3645
TD1290.3146 (0.0015)0.3451 (0.0017)0.3144 0.2690 0.3323
TD1360.3175 (0.0002)0.3432 (0.0005)0.3995 0.2914 0.3177
TD138B0.3241 (0.0407)0.3734 (0.0002)0.3291 0.1568 0.3231
TD1390.3146 (0.0001)0.3255 (0.0000) 0.2431 0.52790.3200
TD1430.3040 (0.0002)0.3080 (0.0003) 0.2403 0.24910.2990
ASD73C0.3280 (0.0014)0.3459 (0.0008) 0.3091 0.43430.3503
TD113B0.2731 (0.0675)0.3749 (0.0000) 0.1559 0.16130.3938
TD1230.3128 (0.0004)0.3480 (0.0137) 0.2448 0.42280.3687
TD130B0.3272 (0.0011)0.3301 (0.0002) 0.2646 0.45170.3145
ASD87B0.3111 (0.0010) 0.2148 (0.0000)0.23250.23590.2958
TD1310.3509 (0.0242)0.3776 (0.0010)0.48970.5127 0.3437
TD1370.3402 (0.0000)0.3721 (0.0019)0.42550.4057 0.3363
From Table 2, it can be obviously observed that, for all 79 rsfMRI brain networks, the modularity metrics obtained by UPSO are all the best among five algorithms. This fully demonstrates that the proposed algorithm outperforms other four competing algorithms in terms of modularity. The main reasons for UPSO obtaining good results are explained as follows: Firstly, UPSO is a heuristic optimization algorithm, so it can search a good solution as much as possible. Secondly, as UPSO is a swarm intelligent optimization algorithm, it can use all the individuals in a swarm to search the optimal solution, while the other four algorithms can use only one individual. Last but not the least, UPSO can use the uniform design to obtain the solutions scattered evenly over the feasible solution space. In five algorithms, the gaps of the results obtained by UPSO and Louvain algorithm are much less than those by UPSO and other three algorithms, and even UPSO and Louvain algorithm obtain the identical results for ASD90B and ASD95 brain networks. Thus, the Louvain algorithm is the most competing in the other four algorithms. We can also see from Table 2 that the standard deviations obtained by UPSO and the spectral clustering are all very small compared to the average values obtained by them. This demonstrates that UPSO and the spectral clustering are both relatively stable in terms of modularity for 79 rsfMRI brain networks. Meanwhile, we can also observe that, for 65 of 79 brain networks, the standard deviations obtained by UPSO are less than or equal to those obtained by the spectral clustering. This demonstrates that UPSO has higher stability than the spectral clustering in terms of modularity. From Table 3, we can clearly see that UPSO obtains the best conductance metrics for most brain networks, but not for all 79 brain networks. This is because that the evaluation perspectives of two metrics are different. However, UPSO obtains the best conductance metrics for 50 brain networks and accounts for about 63% of 79 brain networks. This manifests that UPSO is superior to other competing algorithms in terms of conductance. Meanwhile, this also demonstrates that UPSO can acquire better conductance metrics while ensuring the best modularity metrics. The number of brain networks in that the spectral clustering, FastQ, Danon, and Louvain algorithms obtained the best conductance metrics is 1, 6, 20, and 2, respectively. We can also clearly observe that the best modularity metrics obtained by UPSO and Louvain algorithm are relatively close, but the number of brain networks in that the Louvain algorithm obtained the best conductance metrics is just 2. This fully demonstrates that UPSO outperforms the Louvain algorithm in terms of conductance. From Table 3, we can also see that the standard deviations obtained by UPSO and the spectral clustering are all very small compared to the average values obtained by them. This is also similar to data in Table 2. Namely, for 79 rsfMRI brain networks, UPSO and the spectral clustering are relatively stable in terms of both modularity and conductance metrics. Meanwhile, we can also observe that, for 42 of 79 brain networks, the standard deviations obtained by UPSO are less than or equal to those obtained by the spectral clustering. This demonstrates that UPSO has higher stability than the spectral clustering in terms of conductance. This conclusion is also similar to that concluded from Table 2.

6.2.2. Comparisons of Other Perspectives

Besides the above-mentioned comparisons, we also evaluate the performances of UPSO from other perspectives, such as influences of the uniform design, comparisons with other heuristic algorithms, and complexity analysis. To show the benefit of hybridizing the uniform design in PSO, we modify UPSO by removing the uniform design from UPSO. Namely, the initialization (Steps 1, 2, and 3) uses the random initialization method instead of the generation algorithm of the initial population based on the uniform design, and the crossover operator based on the uniform design (Step 11) is not performed. For brevity, the modified algorithm is called PSO. We compare the modularity metrics obtained by UPSO and PSO. The results are shown in Table 4.
Table 4

Comparisons of UPSO, PSO, and ABC.

DatasetUPSOPSOABCDatasetUPSOPSOABC
ASD67B 0.3127 0.31210.2996ASD142 0.4349 0.43430.4281
ASD70B 0.3784 0.37590.3646ASD143 0.4027 0.4027 0.3863
ASD73C 0.4261 0.42620.4102TD86C 0.3837 0.38240.3743
ASD75B 0.4420 0.44160.4338TD100C 0.3910 0.39090.3773
ASD76C 0.4201 0.41820.4118TD101B 0.4257 0.42000.4136
ASD82 0.4300 0.42960.4192TD102B 0.4253 0.42510.4132
ASD83B0.3527 0.3555 0.3341TD103B 0.4369 0.43420.4299
ASD87B0.4551 0.4552 0.4475TD105 0.4295 0.42910.4174
ASD90B 0.3919 0.39100.3796TD107B 0.4301 0.43000.4199
ASD91B 0.3522 0.35060.3374TD108B 0.4171 0.41390.4038
ASD92 0.3987 0.3987 0.3835TD111B 0.3341 0.33330.3164
ASD93B 0.4153 0.41390.4036TD112B 0.4207 0.41660.4124
ASD95 0.3972 0.39680.3877TD113B 0.4017 0.40160.3898
ASD96B 0.4523 0.4523 0.4447TD114 0.4068 0.40610.3917
ASD97 0.4233 0.42200.4151TD118 0.4267 0.4267 0.4147
ASD99 0.4336 0.43350.4214TD120 0.4189 0.41500.4072
ASD102 0.4428 0.44270.4324TD121 0.4676 0.4677 0.4578
ASD103 0.4443 0.4443 0.4338TD122 0.4738 0.47330.4689
ASD104 0.4362 0.43600.4253TD123 0.4336 0.42840.4226
ASD106 0.4194 0.41830.4092TD124 0.4498 0.44970.4403
ASD108 0.3997 0.39960.3842TD125 0.4028 0.40260.3860
ASD111 0.4169 0.40960.4039TD126 0.4174 0.41720.4064
ASD112 0.4515 0.45140.4392TD128 0.4458 0.43980.4371
ASD113 0.4104 0.41020.3982TD129 0.4396 0.43950.4275
ASD114 0.3960 0.39450.3836TD130B 0.4074 0.4074 0.3950
ASD115 0.3903 0.38990.3789TD131 0.4104 0.40710.4009
ASD1160.4095 0.4098 0.3975TD132 0.4439 0.44380.4378
ASD117 0.4018 0.40090.3907TD133 0.3555 0.35530.3423
ASD119 0.4386 0.43550.4307TD134 0.4317 0.42820.4244
ASD120 0.4214 0.42120.4093TD135 0.4460 0.44260.4404
ASD124 0.3926 0.38930.3793TD136 0.4434 0.44320.4304
ASD125 0.3935 0.39340.3765TD137 0.4276 0.4276 0.4185
ASD127 0.3955 0.39390.3832TD138B 0.3911 0.39030.3811
ASD129 0.3930 0.39200.3801TD139 0.4377 0.43760.4254
ASD130 0.4208 0.41880.4039TD140 0.4151 0.41280.4033
ASD131 0.4232 0.42270.4051TD142 0.4373 0.4373 0.4296
ASD132 0.4473 0.44650.4407TD143 0.4465 0.44640.4359
ASD133 0.4249 0.42480.4117TD144 0.4073 0.40720.3937
ASD134 0.4018 0.39890.3839TD145 0.4540 0.44960.4486
ASD138 0.4040 0.40380.3942
To verify the performance of UPSO, we also compare it with ABC (artificial bee colony). Similar to PSO, ABC is also a heuristic algorithm. Table 4 also shows the results obtained by ABC. From Table 4, we can clearly see that, for 67 of 79 brain networks, the modularity metrics obtained by UPSO are larger than those obtained by PSO. In comparison, there are just 4 brain networks for which the modularity metrics obtained by UPSO are less than those obtained by PSO. This fully demonstrates that the influence of the uniform design on improving the performance of UPSO is significant. Figures 2 and 3 in the next section also obviously illustrate the benefit of the uniform design.
Figure 2

Community plot of TD86C.

Figure 3

Modularity changing plot of ASD104.

By comparison of the modularity metrics obtained by UPSO and those obtained by ABC, it can be clearly seen from Table 4 that, for 79 brain networks, the modularity metrics of UPSO are all larger than those of ABC. This fully demonstrates that UPSO significantly outperforms ABC in terms of modularity. A comparison of PSO and ABC is the same as the comparison of UPSO and ABC. Namely, for 79 brain networks, the modularity metrics of PSO are all larger than those of ABC. It follows from the above that PSO is also superior to ABC for 79 rsfMRI brain networks even without the uniform design. By a detailed analysis of the proposed algorithm UPSO, its computational complexity is obtained as follows: if the number of community modules K is pregiven or preestimated, the time complexity of UPSO is O(tmax∗Npop); otherwise, the time complexity of UPSO is O(tmax∗Npop∗N), where tmax, Npop, and N, respectively, denote the maximal number of iterations, the population size, and the number of vertices in brain networks. Thus, unless it is absolutely necessary, UPSO often uses the pregiven K or the same K as that of the other methods to decrease its computational complexity.

6.2.3. Representative Brain Networks

According to different cases of the modularity and conductance metrics in Tables 2 and 3, two representative brain networks are chosen to demonstrate the performance of UPSO. TD86C Brain Network. For the TD86C brain network, the best modularity and conductance metrics are both obtained by UPSO. Figure 4 illustrates the plot of the modularity metrics obtained by UPSO and PSO.
Figure 4

Modularity changing plot of TD86C.

The community plot of the TD86C brain network is illustrated in Figure 2. From Figure 4, it can be seen that the modularity metrics obtained by UPSO and PSO both converge to a stable state when the number of iterations increases. Meanwhile, we can also clearly see that the plot of UPSO is always above that of PSO after the third iteration. This obviously illustrates that the uniform design plays an important role in improving the performance of UPSO. ASD104 Brain Network. For the ASD104 brain network, the best modularity is obtained by UPSO, while the best conductance metric is obtained by the Danon algorithm. Figure 3 illustrates the changing process of the modularity metrics obtained by UPSO and PSO with the number of iterations. Figure 5 illustrates the community plot of ASD104 brain networks.
Figure 5

Community plot of ASD104.

We can clearly observe from Figure 3 that the plots of UPSO and PSO both go up when the number of iterations increases, which show the processes of searching the optimal solution. However, the plot of UPSO is above or overlapping that of PSO in the whole iterating process. This fully illuminates that the influence of the uniform design is considerable.

7. Conclusions and Future Work

In this study, we design a particle swarm algorithm with the uniform design (UPSO) for finding the community modules in brain networks. We conduct UPSO and several competing algorithms on 79 rsfMRI brain networks. The obtained results demonstrate that UPSO can find community modules with maximal modularity and obviously outperforms other competing methods in terms of modularity. The comparison of UPSO and PSO shows that the uniform design plays an important role in improving the performance of UPSO. The comparison of PSO and ABC shows that PSO is superior to ABC for 79 rsfMRI brain networks. The proposed algorithm UPSO does not apply to very high-dimensional problems because it more likely needs long execution time. To solve the limitations, UPSO can be designed as a parallel algorithm and implemented in the cloud computing platform. In addition, our proposed algorithm is going on for further improvement, such as designing more efficient coding to speed up its converging rate and stability.
  23 in total

Review 1.  Community structure in social and biological networks.

Authors:  M Girvan; M E J Newman
Journal:  Proc Natl Acad Sci U S A       Date:  2002-06-11       Impact factor: 11.205

Review 2.  Organization, development and function of complex brain networks.

Authors:  Olaf Sporns; Dante R Chialvo; Marcus Kaiser; Claus C Hilgetag
Journal:  Trends Cogn Sci       Date:  2004-09       Impact factor: 20.229

3.  Modularity and community structure in networks.

Authors:  M E J Newman
Journal:  Proc Natl Acad Sci U S A       Date:  2006-05-24       Impact factor: 11.205

4.  Finding community structure in networks using the eigenvectors of matrices.

Authors:  M E J Newman
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2006-09-11

5.  Spectral methods for community detection and graph partitioning.

Authors:  M E J Newman
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2013-10-30

6.  Changes in structural and functional connectivity among resting-state networks across the human lifespan.

Authors:  Richard F Betzel; Lisa Byrge; Ye He; Joaquín Goñi; Xi-Nian Zuo; Olaf Sporns
Journal:  Neuroimage       Date:  2014-08-07       Impact factor: 6.556

7.  Applications of community detection techniques to brain graphs: Algorithmic considerations and implications for neural function.

Authors:  Javier O Garcia; Arian Ashourvan; Sarah F Muldoon; Jean M Vettel; Danielle S Bassett
Journal:  Proc IEEE Inst Electr Electron Eng       Date:  2018-02-01       Impact factor: 10.961

Review 8.  Network science and the human brain: Using graph theory to understand the brain and one of its hubs, the amygdala, in health and disease.

Authors:  David Mears; Harvey B Pollard
Journal:  J Neurosci Res       Date:  2016-01-13       Impact factor: 4.164

9.  The UCLA multimodal connectivity database: a web-based platform for brain connectivity matrix sharing and analysis.

Authors:  Jesse A Brown; Jeffrey D Rudie; Anita Bandrowski; John D Van Horn; Susan Y Bookheimer
Journal:  Front Neuroinform       Date:  2012-11-28       Impact factor: 4.081

10.  Feature related multi-view nonnegative matrix factorization for identifying conserved functional modules in multiple biological networks.

Authors:  Peizhuo Wang; Lin Gao; Yuxuan Hu; Feng Li
Journal:  BMC Bioinformatics       Date:  2018-10-29       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.