| Literature DB >> 26618778 |
Abstract
Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|3), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma.Entities:
Mesh:
Year: 2015 PMID: 26618778 PMCID: PMC4664553 DOI: 10.1371/journal.pcbi.1004574
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 11Kaplan-Meier plots of the subgroups defined by median expression of ROPN1 in A) all the patients, B) the ER+ patients and C) the PR+ patients.
Blue and red curves correspond to the lower and higher expression levels of ROPN1, respectively.
Table of best average AUC-ROC across various FDR thresholds.
Each column represents the combination of network inference method and similarity/dissimilarity measure tested, and each row represents gold standard networks from which time series were generated. The best performing methods are highlighted by bold font.
| Data id | ARACNE-MI | PFN-euclid | PFN-MI | PFN-PCC | RF |
|---|---|---|---|---|---|
|
| 0.533405292 | 0.49876496 | 0.545384 |
| 0.530597 |
|
| 0.526151567 | 0.47360654 | 0.535886 |
| 0.525967 |
|
| 0.534447108 | 0.52589657 | 0.540465 |
| 0.530622 |
|
| 0.504651933 | 0.49969052 | 0.508685 | 0.507814 |
|