| Literature DB >> 27556416 |
Zhi Han1,2,3, Jie Zhang3,4, Guoyuan Sun1,2, Gang Liu1,2, Kun Huang5,6.
Abstract
BACKGROUND: Gene co-expression network analysis (GCNA) is widely adopted in bioinformatics and biomedical research with applications such as gene function prediction, protein-protein interaction inference, disease markers identification, and copy number variance discovery. Currently there is a lack of rigorous analysis on the mathematical condition for which the co-expressed gene module should satisfy.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27556416 PMCID: PMC5001231 DOI: 10.1186/s12864-016-2912-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Simulation on the distribution of CCI and its relationship with noise in the data. Top: Relationship between CCI and noise level. The x-axis reflects the effects of the noise on the centralized matrix. Middle: The distribution of CCI calculated from 1000 randomly selected gene lists (with 220 genes) in the 41 lung cancer tumor samples (using GSE18842). Bottom: The distribution of CCI calculated from 1000 random permutation of the data from the correlated gene module
Fig. 2Comparison between CCI and density metrics. a The CCI versus density metrics with the increases of number of outliers under two different noise levels. b The boxplots for the two metrics with different number of outliers and noise levels. The values are normalized to the values with zero outlier. c The CCI versus density metrics with the increasing size of the interfering module. d The boxplots for the two metrics with different number of outliers and noise levels with the values normalized to the values without interfering module
Fig. 3Examples of the gene modules in tumor samples (left column) and control samples (right column). The top two modules show significant difference in co-expression between control and tumor samples with high CCIs and z-scores in tumor and low CCIs as well as low z-scores in control samples. The bottom module has high CCIs and z-scores in both tumor and control samples
Enrichment analysis of the 15 gene modules that are specific to tumor samples in lung cancer
| Module | Size | GO BP term ( | Cytoband ( | TF ( |
|---|---|---|---|---|
| 4 | 162 | Epidermis development ( | 1q21-q22 ( | AP1 ( |
| 2p24.3 ( | AREB6 ( | |||
| 5 | 159 | neuron differentiation ( | 6q14.2 ( | PAX4 ( |
| generation of neurons ( | 5q33 ( | MSX1 ( | ||
| 9 | 98 | Neurogenesis ( | MSX1 ( | |
| central nervous system development ( | RNGTGGGC UNKNOWN ( | |||
| 17 | 62 | meiotic nuclear division ( | 7p15.3-p15.1 ( | |
| meiotic cell cycle ( | 12q22-q24.1 ( | |||
| 19 | 55 | cellular glucuronidation ( | 4q13 ( | |
| uronic acid metabolic process ( | 4q31.3-q32 ( | |||
| 25 | 48 | glutamate decarboxylation to succinate ( | 4q21.22 ( | |
| glutamate catabolic process ( | 8p11.22 ( | |||
| 38 | 36 | calcium ion export ( | 7q21.3 ( | |
| 9p21.3 ( | ||||
| 44 | 33 | vasodilation of artery involved in baroreceptor response to increased systemic arterial blood pressure ( | 7p12.2 ( | RORA1 ( |
| baroreceptor response to increased systemic arterial blood pressure ( | 11p15.2-p15.1 ( | ERR1 ( | ||
| 50 | 30 | fatty acid derivative metabolic process ( | 4q28-q32 ( | WGTTNNNNNAAA UNKNOWN ( |
| icosanoid metabolic process ( | FOXO4 ( | |||
| 66 | 21 | 4p16.3 ( | E2F1 ( | |
| 67 | 21 | 9q21.33 ( | RACTNNRTTTNC UNKNOWN ( | |
| 9q22.32 ( | ||||
| 70 | 20 | 1q22-q23.2 ( | ||
| 8q22-q23 ( | ||||
| 81 | 18 | 21q22.3 ( | ||
| 84 | 18 | 4q31.23 ( | CREB ( | |
| 116 | 13 | Xp11.23 ( | MEIS1 ( |
Fig. 4The OncoPrint plots for different types of mutations on the genes in module 66 in the lung adenocarcinoma patients. Top: OncoPrint for genes in Module 66 (with 21 genes) in the lung adenocarcinoma study in TCGA generated generated by cBioPortal. Bottom: Oncoprint for the same gene module in lung squamous cell tumors in TCGA. The genes circled in red are all on cytobands 4p13-16 and the ones circled in blue are on cytoband 8p11.23
Fig. 5Correlations between the copy number measurements and the gene expression levels (measured using RNA-seq) of gene MRFAP1. Top: The box plot for the expression levels of gene MRFAP1 with respect to inferred copy number variation. Bottom: The correlation between the expression levels of MRFAP1 with the measurement for copy number values is 0.726 (PCC)
Fig. 6Examples of co-expressed genes on the same cytobands from the same gene module. Top: The correlation between expression levels of MRFAP1 and GRPL1 is 0.650 (PCC). Bottom: The correlation between the expression levels of SLBP and GRPL1 is 0.606 (PCC)