| Literature DB >> 27168792 |
Hong Yue1, B O Yang1, Fang Yang1, Xiao-Li Hu1, Fan-Bin Kong1.
Abstract
Recent progress in bioinformatics has facilitated the clarification of biological processes associated with complex diseases. Numerous methods of co-expression analysis have been proposed for use in the study of pairwise relationships among genes. In the present study, a combined network based on gene pairs was constructed following the conversion and combination of gene pair score values using a novel algorithm across multiple approaches. Three hippocampal expression profiles of patients with Alzheimer's disease (AD) and normal controls were extracted from the ArrayExpress database, and a total of 144 differentially expressed (DE) genes across multiple studies were identified by a rank product (RP) method. Five groups of co-expression gene pairs and five networks were identified and constructed using four existing methods [weighted gene co-expression network analysis (WGCNA), empirical Bayesian (EB), differentially co-expressed genes and links (DCGL), search tool for the retrieval of interacting genes/proteins database (STRING)] and a novel rank-based algorithm with combined score, respectively. Topological analysis indicated that the co-expression network constructed by the WGCNA method had the tendency to exhibit small-world characteristics, and the combined co-expression network was confirmed to be a scale-free network. Functional analysis of the co-expression gene pairs was conducted by Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis. The co-expression gene pairs were mostly enriched in five pathways, namely proteasome, oxidative phosphorylation, Parkinson's disease, Huntington's disease and AD. This study provides a new perspective to co-expression analysis. Since different methods of analysis often present varying abilities, the novel combination algorithm may provide a more credible and robust outcome, and could be used to complement to traditional co-expression analysis.Entities:
Keywords: Alzheimer's disease; differentially co-expressed genes and links; empirical Bayesian; gene co-expression analysis; search tool for the retrieval of interacting genes/proteins database; weighted gene co-expression network analysis
Year: 2016 PMID: 27168792 PMCID: PMC4840697 DOI: 10.3892/etm.2016.3131
Source DB: PubMed Journal: Exp Ther Med ISSN: 1792-0981 Impact factor: 2.447
Characteristics of the individual datasets included in this study.
| Accession number | Year | Sample size (cases/controls) | Platform |
|---|---|---|---|
| E-GEOD-1297 | 2004 | 31 (22/9) | Affymetrix HG-U133A |
| E-GEOD-28146 | 2011 | 30 (22/8) | Affymetrix HG-U133Plus2 |
| E-GEOD-5281 | 2007 | 23 (10/13) | Affymetrix HG-U133Plus2 |
Figure 1.Graphical representation of co-expression networks identified by four existing method. Genes are denoted as nodes and interactions between gene pairs are presented as edges. (A) Search tool for the retrieval of interacting genes/proteins database, (B) differentially co-expressed genes and links, (C) empirical Bayesian and (D) weighted gene co-expression network analysis.
Figure 2.Combined co-expression network using the novel algorithm and its degree distribution. (A) Combined co-expression network based on the novel scores of each gene pair across four methods. A total of 37 nodes and 57 edges composed this combined network. (B) Scatter-gram of gene degree in this co-expression network. The combined co-expression network was a scale-free network whose degree distribution followed a power law (y=axb, where a=12.464, b=−0.840, R2=0.881).
Topological parameters of co-expression networks constructed using four existing approaches and the new algorithm.
| Measure | STRING | DCGL | EB | WGCNA | Combined |
|---|---|---|---|---|---|
| R2 | 0.786 | 0.037 | 0.477 | 0.071 | 0.810 |
| Clustering coefficient | 0.300 | 0.178 | 0.0 | 0.820 | 0.172 |
| Mean shortest path length | 2.925 | 1.783 | 2.038 | 1.578 | 3.618 |
STRING, search tool for the retrieval of interacting genes/proteins database; DCGL, differentially expressed genes and links; EB, empirical Bayesian; WGCNA, weighted gene co-expression network analysis.
Figure 3.Five most enriched pathways of co-expression gene pairs identified by four existing methods and the novel algorithm. Co-expression gene pairs identified by EB and DCGL methods could not be enriched in any of the identified pathways. The five pathways were proteasome, oxidative phosphorylation, Parkinson's disease, Huntington's disease and Alzheimer's disease. EB, empirical Bayesian; STRING, search tool for the retrieval of interacting genes/proteins database; WGCNA, weighted gene co-expression network analysis; DCGL, differentially co-expressed genes and links.