Literature DB >> 26470848

Quantitative assessment of gene expression network module-validation methods.

Bing Li^1,2, Yingying Zhang¹, Yanan Yu¹, Pengqian Wang¹, Yongcheng Wang³, Zhong Wang¹, Yongyan Wang¹.

Abstract

Validation of pluripotent modules in diverse networks holds enormous potential for systems biology and network pharmacology. An arising challenge is how to assess the accuracy of discovering all potential modules from multi-omic networks and validating their architectural characteristics based on innovative computational methods beyond function enrichment and biological validation. To display the framework progress in this domain, we systematically divided the existing Computational Validation Approaches based on Modular Architecture (CVAMA) into topology-based approaches (TBA) and statistics-based approaches (SBA). We compared the available module validation methods based on 11 gene expression datasets, and partially consistent results in the form of homogeneous models were obtained with each individual approach, whereas discrepant contradictory results were found between TBA and SBA. The TBA of the Zsummary value had a higher Validation Success Ratio (VSR) (51%) and a higher Fluctuation Ratio (FR) (80.92%), whereas the SBA of the approximately unbiased (AU) p-value had a lower VSR (12.3%) and a lower FR (45.84%). The Gray area simulated study revealed a consistent result for these two models and indicated a lower Variation Ratio (VR) (8.10%) of TBA at 6 simulated levels. Despite facing many novel challenges and evidence limitations, CVAMA may offer novel insights into modular networks.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2015 PMID： 26470848 PMCID： PMC4607977 DOI： 10.1038/srep15258

Source DB: PubMed Journal: Sci Rep ISSN： 2045-2322 Impact factor: 4.379

Modularity is a common characteristic of omics-based biological networks123. Module-based analyses that investigate or deconstruct omics-based biological networks have become a hot topic in recent years45. Various types of algorithms have been proposed to identify modules (also known as communities, clusters, and subnetworks), including network clustering67, heuristic search89, seed extension10, topology network1112, and matrix decomposition1314. However, in contrast to the large number of module detection methods4, there are few methods for module validation and evaluation. How to evaluate the accuracy and validity of modules has become a new challenge for researchers. Most previous studies used function enrichment methods to evaluate modules based on functional annotations, such as GO, MIPS, and KEGG15161718192021222324. However, some modules may be enriched with too many functions, whereas others may be enriched without any functions, and the background annotation database itself is constantly being updated. Other studies used molecular biological experimental techniques to verify the co-expression, transcription regulation or other interaction relationships among members of a given module252627282930. However, this method is only suitable for small modules that consist of only a few nodes, and it is nearly impossible to perform this method for a larger module. Thus, in the era of Big Data and omics revolution, an arising challenge is to explore rational strategies to validate biological network modules. Several published studies have employed Computational Validation Approaches based on Modular Architecture (CVAMA) to evaluate modules’ authenticity, reproducibility, and significance or to identify phenotype-related functional modules3132333435. These approaches are not limited by module size and supporting databases. With an increasing number of omics technologies and module analysis methods, CVAMA may become the new focus. In this paper, we summarized the available CVAMA methods, which were divided into topology-based approaches (TBA) and statistics-based approaches (SBA). One representative method of each was selected to validate modules obtained from genomic datasets, and comparative analyses were performed to illuminate the feasibility and challenges in CVAMA.

Results

Topology-based approaches (TBA) for module validation

A module may have several topological features, such as modularity2, connectivity36, density3637, clustering coefficient37, degree38, and edge betweenness39. Module detection methods may focus on one or a few topological criteria, and it is also essential to determine whether the identified modules have a modular structure. Therefore, we may use a single or composite topological index to evaluate whether a module is valid (Table 1). Any single topological index used to validate a module should be independent of the methods used to identify the module, such as the network perplexity index of Entropy4041. The entropy increases when the data are more uniformly distributed; therefore, a good quality module is expected to have a low entropy4243. Topological indexes, including intra-modular connectivity44 and NB value26, have been applied to evaluate whether the intra-modular structure is different from other parts of the whole network. Other indexes, such as compactness45 and weak community46, can be used to select good clusters from integrated clustering results. Because a single topological index is not likely to provide a global evaluation of the modular structure, an alternative choice is to combine multiple topological indexes into an integrated measure to assess a module’s validity. Both internal and external indexes, such as density, connectivity, and tabulation-based module preservation statistics, can be integrated to validate the existence of a module3547. Based on a global view of the modular structure, it may be advantageous to aggregate multiple module evaluation statistics into summary preservation statistics. In our study, we validated five preserved modules whose Zsummary value (an integrated index) was greater than 2.

Table 1

Topology- and statistics-based methods for module validation.

No.	Type	Index	Equation	Criteria	Application	Test data	Ref.
Topological validation
1	Integrated index	Z_summary		≥10, strongly preserved; 2~10, moderately preserved; ≤2, no preservation	Composite preservation statistics to validate whether a module is significantly preserved in another network. Apply to correlation networks (e.g., co-expression networks)	yes	354041
2		Z_summaryADJ		≥10, strongly preserved; 2~10, moderately preserved; ≤2, no preservation	Same as above. Apply to general networks (e.g., adjacency matrix networks)	yes	35
3		medianRank		The lower the better	Same as above.	yes	35
4	Single index	Entropy		The smaller the better	Access the quality of identified modules. A good quality module is expected to have a low entropy.	no	4243
5		M_pres	M_pres = cor(k^l,k^m)	The closer to 1, the better	Describe the preservation of intra-modular connectivity across two networks. A p-value can be assigned to evaluate the reproducibility of modules.	yes	4445
6		NB value		NB ≥ 0.5	A ratio of edges within a module and the total number of edges between modules is used to select modules with high intra-modular connectivity.	no	26
7		CS (S)		CS (S) > 0, the higher the better	Describe the compactness and neighboring conditions of a cluster. Apply to select good clusters from integrated clustering results	no	46
8		LS (S)		The higher the better	Judge the quality of a cluster S in a graph G and help to select good clusters from integrated clustering results.	no	47
9		Modularity		0.3 ≤ Q ≤ 0.7	Evaluate the level of modular structure and the best split of a network into modules.	no	239102
Statistical validation
1	Integer linear programming	C · (X₁, X₂, …, X_k)		C ≤ 0, the smaller the better	A classifier and integer linear programming model to select modules based on the activity of the module in case and control samples.	yes	4950
2	Bootstrap resampling	P-value	NULL	P ≤ 0.05	P-value is derived from multiscale bootstrap resampling to assess the uncertainty of clustering analysis and search for significant modules.	no	2533
3	Bootstrap resampling	Consensus score		≥ρ, the higher the better	A jackknife resampling procedure is used to assess the accuracy and robustness of functional modules resulting in an ensemble of optimal modules.	no	56
4	Permutation test	Combinatorial p-value	NULL	Combinatorial criteria: (1) P_(Zm) < 0.05; (2) P_GL, P_nSNPs, P_topo < 0.05; (3) P_emp < 0.05 Additional criteria: P_(Zm(eval)) and/or P_emp(eval) < 0.05	Significance and permutation tests are used to calculate the P value of module scores. Appropriate for GWAS data; multiple GWAS datasets are needed when using additional criteria.	yes	52
5	Permutation test	coClustering (q)		≥95%	A cross-tabulation-based statistic for determining whether modules in the reference dataset are preserved in a test dataset, a permutation test to determine the p value.	yes	45
6	Modular compatibility	Compatibility Score (Cp)		The closer to 1, the better	An indication of agreement or overlap between two sets of modules to measure the network modular compatibility between two networks.	yes	53
7		Matching p-value	NULL	P < 0.05	Modified hypergeometric test-derived p-values with Bonferroni correction to measure modules’ conservation between any two species or networks.	yes	54
8		IGP		The closer to 1, the better	Defined to validate an individual cluster’s reproducibility and prediction accuracy.	yes	55

The topology-based methods (TBA) and statistics-based methods (SBA) for module validation. The columns reports the types, index names, equations, criteria (the cut-off value to evaluate modules), applicable conditions, test data (whether this method requires an additional test network to validate a module) and references.

Statistics-based approaches (SBA) for module validation

In addition to topological criteria, a module should also be statistically significant, which means that the modular architecture distribution ought to be highly unlikely to be obtained by chance in a randomized network. Moreover, exploring the relationship between modules and various phenotypes or identifying consistent modules may also require significance testing31334849. For this reason, SBA is an important process to assess a module’s stability, phenotypic correlation or significance of consistency (Table 1). For responsive modules or module biomarker identification505152, binary or mixed integer linear programming models can be used to validate the causal or dependent relations between network modules and biological phenotypes345354. In phylogeny, resampling approaches are defined as a confidence measure for splits in a phylogenetic tree and are used to calculate consensus trees55, which can also be used to assess the robustness of modules in network analysis2556. A permutation test with a p value calculated by empirically estimating the null distribution can be adopted to determine whether the module composition is higher than expected by chance or associated with the disease being investigated5758. Moreover, given two or more networks, comparative network analysis is often used to identify modules across networks or species, and these modules are defined as consensus or conserved modules3159. Moreover, a module’s “reproducibility” can also be assessed, i.e., to what extent a module obtained from one network is compatible with modules in another network4655606162. In our study, by using hierarchical clustering, we identified 66 statistically significant modules based on approximately unbiased (AU) p-values. Module identification based simply on topological criteria or statistical significance may not discover certain types of biologically meaningful modules63. Because disparate results can be obtained from the same network with different algorithms, functional validation can be used to evaluate the performance of different module identification methods64. Although it is not our focus in this study, we summarize and list functional module validation methods reported in the published literature (Supplementary Table 1). Typically, the most widely used functional validation method is functional homogeneity evaluation6566, with indexes such as functional enriched p value74567 and R score768. Furthermore, the index of quantitative score based on function enrichment analysis may be applied to assess a module’s confidence level21 or disease relationship69. Moreover, known protein complex matching can also provide functional evidence for modules, and the commonly used indexes include the overlapping score (OS)770 and F-measure6771. Other measurements, such as the positive predictive value (PPV), accuracy, and separation, can also be used72. For small modules, the experimental techniques of molecular biology, such as real-time quantitative PCR (qPCR), western blotting, and siRNA knock-down, may be applied to validate the co-expression, co-regulation or other interaction relations among the genes or proteins within a module25262728.

Homogeneity of different models of TBA on the same dataset

Both Zsummary and medianRank are integrated topological indexes of module preservation. We applied these two models to evaluate modules identified from the same dataset (GSE24001), which was derived from 30 newly diagnosed infant acute lymphoblastic leukemia samples. Modules were identified by the Weighted Gene Co-Expression Network Analysis (WGCNA) R package73, setting 3 as the minimum module gene number. Each module was detected based on a hierarchical cluster tree and was labeled by colors (Fig. 1A). The validation results are shown in Fig. 1B,C. Five preserved modules (Zsummary ≥ 2) all had a relatively low medianRank values, and the most strongly preserved module (Zsummary = 13) had the lowest medianRank value. Similarly, the two modules that had the highest medianRank value (medianRank = 10) were both unpreserved.

Figure 1

(A) Hierarchical cluster tree showing coexpression modules identified by WGCNA. Each leaf in the tree represents one gene. The major tree branches constitute 9 modules labeled by different colors. (B) The medianRank preservation statistics (y-axis) of the modules. Each point represents a module, labeled by color and names. Low numbers on the y-axis indicate high preservation. (C) The Zsummary preservation statistics (y-axis) of the modules. The modules are labeled as in panel (B) The dashed blue and green lines indicate the thresholds. A Zsummary value over 2 represents a moderately preserved module, and a value over 10 provides strong evidence of module preservation. (D) Scatter plots showing the correlation between the Zsummary (y-axis) and module size (x-axis). (E) The cluster dendrogram with approximately unbiased (AU) P-values. The AU p-values are displayed in red, and clusters with an AU p-value lower than 0.05 are highlighted by rectangles. The calculations and the drawn figure were performed using the pvclust R package.

Example modules validated by two models of TBA

Among the 5 preserved modules, the turquoise module, which had 250 nodes and 30,833 edges, was most strongly preserved (Zsummary = 13), having the lowest medianRank value (medianRank = 2) (Fig. 2A). The preserved yellow module, which was composed of 64 nodes and 1,970 edges, had a low medianRank value (medianRank = 5) (Fig. 2B). Among the 4 unpreserved modules, the red module, consisting of 23 nodes and 253 edges (Fig. 2C), and the magenta module, consisting of 5 nodes and 9 edges (Fig. 2D), had the highest medianRank values (both medianRank = 10).

Figure 2

The preserved and unpreserved modules.

Each node is a gene, and each edge is the co-expression relationship. (A) The turquoise module. (B) The yellow module. (C) The red module. (D) The magenta module.

Variability of TBA (Zsummary) and SBA (AU P-value) results on the same dataset

Based on the same dataset (GSE24001), we compared the validation results of TBA and SBA, choosing Zsummary and AU p-value as the representative methods for each approach. For Zsummary, 5 preserved modules were validated from 9 modules. The Zsummary value was dependent on the module size, which was consistent with previous studies35 (Fig. 1D). The AU p-value used to access the modular architecture distributional probability was computed to search for significant modules (clusters). As shown in Fig. 1E, 66 modules with 3 or more genes and an AU p-value larger than 0.95 are highlighted by rectangles, which are strongly supported by the gene expression data. Different numbers of detected modules and valid modules were obtained through the two methods.

Multiple comparisons of Zsummary and medianRank on 10 datasets

For the same modules identified by WCGNA, we applied the integrated topological indexes Zsummary and medianRank to 10 datasets to further compare the results of module preservation evaluation. The average size of modules obtained from the 10 datasets is shown in Fig. 3A. Because medianRank was a relative preservation index without a cutoff value, we compared the top 10 ranked modules validated by Zsummary and medianRank (Fig. 3B). We failed to obtain a valid Zsummary value in two datasets (GSE6448, GSE29230) when the minimum module size was set at 3. For the other 8 datasets, overlapping preserved modules validated by Zsummary and medianRank were found in 7 datasets, and consistent ranked modules were observed in 6 datasets, demonstrating the consistency of the two indexes. However, no overlapping modules in the top 10 preserved modules were found in one dataset (GSE4882).

Figure 3

The comparison of TBA (Zsummary and medianRank) and SBA (AU p-value) on 10 datasets.

(A) A mean module size of 10 datasets. The y-axis is the mean module size (nodes), and the x-axis is each dataset number. (B) Comparison of the top 10 preserved modules validated by Zsummary and medianRank. The red spots represent the number of consistently ranked modules. The green spots represent the number of overlapping modules. The blue spots represent the number of non-overlapping modules. (C) The effect of changing the minimum module size setting from 4 to 10 on two datasets. The y-axis is the percentage of preserved modules, and the x-axis is the different cutoff value settings. (D) The percentage of preserved modules validated by Zsummary on 10 datasets. (E) The percentage of significant modules validated by AU P-value on 10 datasets. In (D,E) the blue bars indicate the number of all the modules detected, and the red bars indicate the number of preserved or significant modules. (F) The VSR and FR of Zsummary and AU P-value on 10 datasets. The red bars represent Zsummary, and the blue bars represent the AU P-value.

Zsummary analysis was impeded by small module size

Because we failed to obtain a valid Zsummary value in two datasets (GSE6448, GSE29230) with a minimum module size of 3, we changed this cutoff value from 4 to 10. Then, valid Zsummary values were acquired in both datasets, and the percent of preserved modules was stable (Fig. 3C) due to the too small density or connectivity, leading to invalid Zsummary values when the minimum module size was set at 3.

Comparison of TBA (Zsummary) and SBA (AU P-value) results for 10 datasets

As mentioned above, Zsummary is a TBA index and AU P-value is an SBA index. Based on 10 datasets, we compared the performance of these two types of index. In this application, modules with 3 or more genes were considered as valid modules. The proportions of valid (preserved or significant) modules obtained by these two indexes from 10 datasets are shown in Fig. 3D,E. For different methods and datasets, both the module number and the proportion of valid modules varied greatly. Overall, the Validation Success Ratios (VSR) of Zsummary and AU P-value were 51% and 12.3%, respectively (Fig. 3F). This indicated that Zsummary obtained a higher ratio of valid modules (invalid values from two datasets were deemed as zero). A prior study adopted Zsummary to validate CASTxB6 female liver modules with 9 other expression datasets and revealed an average VSR of 86.44%74. However, Zsummary also had a higher Fluctuation Ratio (FR, 80.92%) than AU P-value (45.84%), indicating that the stability of the AU P-value results was superior to that of Zsummary (Fig. 3F).

Correlation between the network parameters and the ratio of valid modules

To further determine which network parameters influence the ratio of valid modules of the Zsummary and AU P-value methods, we selected 6 main network parameters of the 10 datasets (Supplementary Table 2), i.e., modularity, density, clustering coefficient, characteristic path length, network heterogeneity, and network centralization. Linear regression analysis indicated that none of these 6 parameters was correlated with the valid module ratio of the Zsummary and AU P-value methods (Fig. 4). This implied that the valid module ratio of genomic networks may not be influenced by a single network parameter.

Figure 4

The relationship between the percentage of valid (preserved or significant) modules (y-axis) and the network parameters (x-axis).

The network parameters were calculated by plugins in Cytoscape. (A,B) MCL versus valid module percentage. (C,D) Density versus valid module percentage. (E,F) Clustering coefficient versus valid module percentage. (G,H) Characteristic path length versus valid module percentage. (I,J) Network heterogeneity versus valid module percentage. (K,L) Network centralization versus valid module percentage. The red line added to each plot is the linear regression line with intercept 0 and slope 1.

Impact of gray area variation on 8 datasets

The gray area was the region of gray genes that was not assigned into any module and labeled in gray by WGCNA. Except for the two datasets (GSE6448 and GSE29230) without valid Zsummary values, 8 datasets were simulated by changing the gray area genes’ expression levels to 0.1, 0.5, 0.9, 1.1, 1.5, and 2 times that of the original dataset. Based on each simulated dataset, WGCNA (an R package used to compute Zsummary) and pvclust (an R package used to compute AU p-value) were performed for module identification and validation. For WGCNA, 5 datasets (GSE2283, GSE12148, GSE6738, GSE12520, and GSE4882) had no changes at the 6 simulated levels compared with the original datasets (a change in Zsummary less than the cutoff value was considered as no change). The changes in the number of modules or gray genes (only for WGCNA) and Variation Ratio (VR) for the remaining 3 datasets are shown in Fig. 5A–C. For pvclust, changes were observed in all 8 datasets compared with the original datasets, and the changes in the module number and VR are shown in Fig. 5D–K. No correlation was found between the VR and the simulated levels in the changed datasets.

Figure 5

The changes of modules or gray genes (only for WGCNA) on 8 datasets by Gray area simulation.

The blue spots represent the number of modules identified by WGCNA or pvclust. The red spots represent the number of valid (preserved or significant) modules. The gray spots are the number of gray genes. The numerical value above the spots is the VR at 6 simulated levels compared with the original dataset.

Comparison of TBA and SBA by Gray area simulation

For the 8 datasets, the average VRs of WGCNA and pvclust at the 6 simulated levels are shown in Fig. 6A. With regard to WGCNA, only the VRs in the changed datasets were calculated. The VRs of WGCNA and pvclust in all simulated datasets can be seen in Fig. 6B. WGCNA had a higher VR (8.43%) for module identification but a lower VR (8.10%) for module validation. By contrast, pvclust had a lower VR (1.29%) for module identification and a higher VR (14.06%) for module validation. Thus, the gray area changes had different impacts on the two models in terms of module identification and validation. Moreover, the VSR and FR of TBA (Zsummary) and SBA (AU p-value) were stable at each simulated level (Fig. 6C). Zsummary had a higher VSR and FR at each simulated level (2 datasets with invalid Zsummary values were not included). When data at all 6 simulated levels were aggregated, the VSR of Zsummary and AU p-value was 63.82% and 12.30%, and the FR was 55.84% and 51.42%, respectively.

Figure 6

Comparison of WGCNA (Zsummary) and pvclust (AU p-value) on Grey area simulated datasets.

(A) The average VR of the modules and genes of 8 datasets. (B) The average VR of WGCNA and pvclust of all simulated datasets. The red bars represent WGCNA, and the blue bars represent pvclust. (C) The VSR and FR of Zsummary and AU P-value of 8 datasets at 6 simulated levels. The red bars represent Zsummary, and the blue bars represent AU P-value.

Discussion

Functional enrichment and biological experiments based on module validation methods may not satisfy the rising demands of various omics networks. As an alternative choice, CVAMA can potentially provide an analytical assessment of the structure and stability of modules captured by various partitioning methods and should be considered as a crucial tool in the interpretation of network modules. The feasibility of CVAMA was demonstrated in our applications of TBA and SBA-based module validation methods in genomic network modules. As a TBA, Zsummary had a high VSR (51%) but also a high FR (80.92%) and was impeded by small module size. As an SBA, AU p-value had a low FR (45.84%) but also a low VSR (12.3%). The Gray area simulated study showed that the VSR and FR of both TBA (Zsummary) and SBA (AU p-value) remained stable at each simulated level. Meanwhile, TBA (Zsummary) had a lower VR (8.10%) for module validation at the 6 simulated levels, indicating that the gray gene changes had little impact on the topology-based models. Although different validation results were obtained by the two types of method with different gene expression datasets and module detection methods, one may choose an appropriate validation index based on the topological structure or stability of the modules. For example, if we focus on whether a module is structurally different from the rest of the network, we may use Zsummary to assess its preservation. If we focus on whether a module is stable or robust, we may choose AU P-value to assess its confidence. Taken together, the existing methods are not ideal, and further improvement is justified. Modular analysis in genomic networks is a complicated process that involves various factors. In general, there is no “golden standard” for assessing the validity and quality of modules, and different algorithms for module identification with different parameters may produce disparate module partition results47576. Thus, it is difficult to determine which module is “correct” and which module partition method outperforms others. As our application demonstrated, different types of module validation indexes for the same network may generate different outcomes. Generally speaking, each type of module validation method may have its own advantages and disadvantages, and some methods may require certain conditions (e.g., data type, network pattern, or module identification method), limiting their applicability and flexibility. There is no universally acceptable approach that can perform well for all types of data under all scenarios, which results in challenges to make a clear-cut prescription for genomic module validation. As for the possible discordance between topological criteria and biological meaning in module identification77, methods combining both function and structure have been proposed to identify functional modules78. Similarly, function and topology are also the two aspects of module validation. CVAMA may neglect the biological meaning and directly assess the correlation of modules with known functional annotation, which may deviate from the densely connected property. It is assumed that fusion of functional and topological evaluation may lead to a high quality selection of better modules. However, most of the existing methods in published literature focus on either function or topology, and researchers may only be interested in their own subject or module identification algorithm. Therefore, module validation methods that integrate both functional and topological indexes and are independent of the vagaries of module detection algorithms need to be further explored. For the modules obtained by clustering algorithms, internal and external cluster validation methods for assessing the quality of clustering results have been discussed in previous studies4779. The internal validation attempts to measure how well a given partitioning corresponds to the natural cluster structure of the data, and such indexes include compactness, connectedness, separation, combinations, and stability41. External validation attempts to compare the recovered structure to a priori knowledge and to quantify the match between them, and such indexes include unary measures and binary measures79. In addition to cluster quality assessment, how to estimate the optimal number of clusters has also been discussed4080. Generally, if a module is known to be consistent with the known knowledge, it would show stronger evidence of preservation than a module without a priori evidence, such as a known pathway or co-transcriptional regulation17818283. Gene interactions are dynamic in regulating the functioning of cells and organisms8485. A number of studies have focused on dynamic module identification, as well as the dynamic behavior of modules in networks118687. As such, module validation should not be constrained to a static situation. Modular dynamics may involve time-series molecular interaction88, environment changes89, phenotypic changes9091, and ontogenetic and phylogenetic time9293. Dynamic evaluation of modules in certain dynamic processes may generate more comprehensive results, which cannot be obtained in a static state. The more meaningful consequence is intertwined with greater challenges in dynamic CVAMA. Therefore, to address the challenge of omics-based module validation, computing-based methods are an easy and feasible choice. Despite these challenges, CVAMA, in addition to functional enrichment and biological experiments, offers novel insights into module network research and may become a new paradigm in modular analysis.

Materials and Methods

Datasets and samples

Gene expression datasets were obtained from the GEO database (http://www.ncbi.nlm.nih.gov/geo/). Eleven spotted DNA/cDNA datasets from different organisms and experiment platforms were downloaded, with the sample size ranging from 28 to 592, and the gene number ranging from 448 to 3,520. Because a test dataset was needed for Zsummary and medianRank, we selected half of the samples in each dataset as reference data and the other half as test data. The raw gene-expression information is shown in Supplementary Table 2.

Network construction and network parameter calculation

For each genomic dataset, weighted gene co-expression network analysis (WGCNA)73 was used to construct a network and to detect modules. The freely available WGCNA R package and R tutorials were described in73. After network construction, we exported each network to Cytoscape software94. Network analysis was conducted with the NetworkAnalyzer95 plugin to calculate the network topological parameters. Network modularity was calculated by the CommFinder96 plugin in Cytoscape. The network topological parameters and modularity are listed in Supplementary Table 2.

Zsummary and medianRank module preservation statistics

Zsummary is an integrated statistic implemented in functional module preservation in the WGCNA R package35. It is composed of 4 statistics related to density and 3 statistics related to connectivity that can quantitatively assess whether the density and connectivity patterns of modules defined in a reference dataset are preserved in a test dataset. A Zsummary value between 2 and 10 indicates moderate module preservation, whereas a Zsummary > 10 provides strong support for module preservation35. Another integrated index, medianRank35, is also composed of statistics related to density and connectivity. It is a rank-based measure to compare the relative preservation among multiple modules; a module with lower medianRank tends to exhibit stronger observed preservation.

Approximately unbiased (AU) p-value

The AU p-value, computed by multi-scale bootstrap resampling97, was selected as a representative SBA index. The AU p-value is often used to assess the uncertainty of clustering analysis98. In our application, the AU p-value was computed using the R package pvclust99 with 1,000 times resampling, varying the bootstrap sample size from 0.5 to 1.4-fold the real sample size of the gene expression data. We set clusters with an AU p-value larger than 0.95 as significant modules99.

Validation success ratio (VSR) and fluctuation ratio (FR)

The VSR and FR were defined for comparing the two types of module validation approaches on multiple datasets. The VSR (Eq. 1) was defined as the average percentage of valid modules against all modules available on multiple datasets. The FR (Eq. 2) was defined as the stability or variation degree of the percentage of valid modules on multiple datasets; a lower FR indicated that the percentage of valid modules was more stable. where R is the ratio of valid modules on one dataset, and N is the number of all available datasets.

Simulated comparisons by changing the gray area gene expression level

In WGCNA, genes that were not assigned into any module were labeled in gray. We called these gray gene regions the gray area. The gray area represented genes whose profiles were simulated to be independent (i.e., without any correlation structure). To illustrate the impact of gray area changes on module identification and validation by TBA and SBA, we changed the gray area genes’ expression levels to 0.1, 0.5, 0.9, 1.1, 1.5, and 2 times that of the original datasets to obtain simulated datasets. Based on the simulated datasets, we compared the Variation Ratio (VR, Eq. 3) of TBA (WGCNA) and SBA (pvclust) for module identification and validation. where ΔX is the changed number of modules or genes relative to the original data, and n is the number of the simulated datasets.

Additional Information

How to cite this article: Li, B. et al. Quantitative assessment of gene expression network module-validation methods. Sci. Rep. 5, 15258; doi: 10.1038/srep15258 (2015).

100 in total

1. An approximately unbiased test of phylogenetic tree selection.

Authors: Hidetoshi Shimodaira
Journal: Syst Biol Date: 2002-06 Impact factor: 15.683

2. Weighted rank aggregation of cluster validation measures: a Monte Carlo cross-entropy approach.

Authors: Vasyl Pihur; Susmita Datta; Somnath Datta
Journal: Bioinformatics Date: 2007-05-05 Impact factor: 6.937

3. Identifying functional modules using expression profiles and confidence-scored protein interactions.

Authors: Igor Ulitsky; Ron Shamir
Journal: Bioinformatics Date: 2009-03-17 Impact factor: 6.937

4. Robustness and accuracy of functional modules in integrated network analysis.

Authors: Daniela Beisser; Stefan Brunkhorst; Thomas Dandekar; Gunnar W Klau; Marcus T Dittrich; Tobias Müller
Journal: Bioinformatics Date: 2012-05-11 Impact factor: 6.937

5. Optimization criteria and biological process enrichment in homologous multiprotein modules.

Authors: Luqman Hodgkinson; Richard M Karp
Journal: Proc Natl Acad Sci U S A Date: 2013-06-11 Impact factor: 11.205

6. Evaluation of clustering algorithms for protein-protein interaction networks.

Authors: Sylvain Brohée; Jacques van Helden
Journal: BMC Bioinformatics Date: 2006-11-06 Impact factor: 3.169

7. Co-expression module analysis reveals biological processes, genomic gain, and regulatory mechanisms associated with breast cancer progression.

Authors: Zhiao Shi; Catherine K Derow; Bing Zhang
Journal: BMC Syst Biol Date: 2010-05-27

8. Prediction of human disease-related gene clusters by clustering analysis.

Authors: Peng Gang Sun; Lin Gao; Shan Han
Journal: Int J Biol Sci Date: 2011-01-14 Impact factor: 6.580

9. Modularity detection in protein-protein interaction networks.

Authors: Tejaswini Narayanan; Merril Gersten; Shankar Subramaniam; Ananth Grama
Journal: BMC Res Notes Date: 2011-12-29

10. Gene co-expression network analysis identifies porcine genes associated with variation in Salmonella shedding.

Authors: Arun Kommadath; Hua Bao; Adriano S Arantes; Graham S Plastow; Christopher K Tuggle; Shawn M D Bearson; Le Luo Guan; Paul Stothard
Journal: BMC Genomics Date: 2014-06-09 Impact factor: 3.969

8 in total

1. Network-Wide Screen Identifies Variation of Novel Precise On-Module Targets Using Conformational Modudaoism.

Authors: Bing Li; Jun Liu; Yanan Yu; Pengqian Wang; Yingying Zhang; Xumin Ni; Qiong Liu; Xiaoxu Zhang; Zhong Wang; Yongyan Wang
Journal: CPT Pharmacometrics Syst Pharmacol Date: 2017-11-20

2. Functional homogeneity and specificity of topological modules in human proteome.

Authors: Rama Kaalia; Jagath C Rajapakse
Journal: BMC Bioinformatics Date: 2019-02-04 Impact factor: 3.169

Review 3. Essential Oil Phytocomplex Activity, a Review with a Focus on Multivariate Analysis for a Network Pharmacology-Informed Phytogenomic Approach.

Authors: Alessandro Buriani; Stefano Fortinguerra; Vincenzo Sorrenti; Giada Caudullo; Maria Carrara
Journal: Molecules Date: 2020-04-16 Impact factor: 4.411

4. A framework for stability-based module detection in correlation graphs.

Authors: Mingmei Tian; Rachael Hageman Blair; Lina Mu; Matthew Bonner; Richard Browne; Han Yu
Journal: Stat Anal Data Min Date: 2021-01-08 Impact factor: 1.051

5. Bioinformatics analysis of common key genes and pathways of intracranial, abdominal, and thoracic aneurysms.

Authors: Siwei Bi; Ruiqi Liu; Linfeng He; Jingyi Li; Jun Gu
Journal: BMC Cardiovasc Disord Date: 2021-01-06 Impact factor: 2.298

6. GWENA: gene co-expression networks analysis and extended modules characterization in a single Bioconductor package.

Authors: Gwenaëlle G Lemoine; Marie-Pier Scott-Boyer; Bathilde Ambroise; Olivier Périn; Arnaud Droit
Journal: BMC Bioinformatics Date: 2021-05-25 Impact factor: 3.169

7. Transcriptional Analysis of Nuclear-Encoded Mitochondrial Genes in Eight Neurodegenerative Disorders: The Analysis of Seven Diseases in Reference to Friedreich's Ataxia.

Authors: Muhammad Elsadany; Reem A Elghaish; Aya S Khalil; Alaa S Ahmed; Rana H Mansour; Eman Badr; Menattallah Elserafy
Journal: Front Genet Date: 2021-12-20 Impact factor: 4.599

8. Erosion of Gene Co-expression Networks Reveal Deregulation of Immune System Processes in Late-Onset Alzheimer's Disease.

Authors: John Stephen Malamon; Andres Kriete
Journal: Front Neurosci Date: 2020-03-20 Impact factor: 4.677

8 in total