| Literature DB >> 31074376 |
Tongxin Wang1, Jie Zhang2, Kun Huang3,4.
Abstract
BACKGROUND: Gene Co-expression Network Analysis (GCNA) helps identify gene modules with potential biological functions and has become a popular method in bioinformatics and biomedical research. However, most current GCNA algorithms use correlation to build gene co-expression networks and identify modules with highly correlated genes. There is a need to look beyond correlation and identify gene modules using other similarity measures for finding novel biologically meaningful modules.Entities:
Keywords: Gene co-expression network analysis; Low-rank representation; Subspace clustering
Mesh:
Substances:
Year: 2019 PMID: 31074376 PMCID: PMC6509871 DOI: 10.1186/s12859-019-2733-5
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Summary of microarray datasets
| Dataset | Disease | Number of samples | Platform |
|---|---|---|---|
| GSE31684 | bladder cancer | 93 | GPL570 |
| GSE54002 | breast cancer | 433 | GPL570 |
| GSE102484 | breast cancer | 683 | GPL570 |
Gene modules identified by LRR, PCC and WGCNA
| Dataset | GSE31684 | GSE54002 | GSE102484 |
|---|---|---|---|
| nMdl LRR | 26 | 76 | 63 |
|
| 0.487 | 0.471 | 0.457 |
| nMdl PCC | 97 | 100 | 81 |
|
| 0.552 | 0.518 | 0.496 |
| Kolmogorov-Smirnov test | 2.43e-02 | 1.09e-04 | 9.22e-04 |
| nMdl WGCNA | 56 | 147 | 81 |
nMdl: number of gene modules identified
: mean of CCI values of gene modules identified
Fig. 1Comparison between LRR21, PCC30 and WGCNA44 in GSE54002. a. Venn diagram of genes in LRR21, PCC30 and WGCNA44; b-d. Heatmaps of expression profiles of LRR21, PCC30 and WGCNA44. Red bars beside gene symbols indicate genes on 9q34.11
Summary of LRR21,PCC30 and WGCNA44 in GSE54002
| Module | LRR21 | PCC30 | WGCNA44 |
|---|---|---|---|
| number of genes | 37 | 21 | 40 |
| CCI | 0.385 | 0.486 | 0.420 |
| number of genes on 9q34.11 | 14 | 2 | 10 |
| 2.01e-26 | 2.38e-3 | 4.57e-17 |
Fig. 2OncoPrint visualization of breast cancer patients with genetic mutations. Genes are from LRR21 which are also on 9q34.11
Fig. 3Number of enriched GO BPs and chromosome bands in microarray datasets. P value cutoff = 0.01
The most significant GO BPs only enriched in our method
| GO ID | Name | Module ID | |
|---|---|---|---|
| GSE31684 | |||
| GO:0043547 | positive regulation of GTPase activity | 3.59e-06 | 4 |
| GO:0036336 | dendritic cell migration | 7.73e-06 | 4 |
| GO:0043087 | regulation of GTPase activity | 2.89e-05 | 4 |
| GO:0006929 | substrate-dependent cell migration | 9.66e-05 | 13 |
| GO:0070232 | regulation of T cell apoptotic process | 1.07e-04 | 4 |
| GSE54002 | |||
| GO:0000082 | G1/S transition of mitotic cell cycle | 1.27e-07 | 6 |
| GO:0031109 | microtubule polymerization or depolymerization | 7.13e-07 | 6 |
| GO:0000083 | regulation of transcription involved in | 8.48e-07 | 6 |
| G1/S transition of mitotic cell cycle | |||
| GO:0007019 | microtubule depolymerization | 6.00e-06 | 6 |
| GO:0051783 | regulation of nuclear division | 7.21e-06 | 6 |
| GSE102484 | |||
| GO:0050766 | positive regulation of phagocytosis | 1.70e-06 | 1 |
| GO:0006271 | DNA strand elongation involved in DNA replication | 1.95e-06 | 10 |
| GO:0022616 | DNA strand elongation | 1.95e-06 | 10 |
| GO:0050764 | regulation of phagocytosis | 3.18e-06 | 1 |
| GO:0070374 | positive regulation of ERK1 and ERK2 cascade | 8.79e-06 | 1 |
P value cutoff = 0.01
The most significant chromosome bands only enriched in our method
| Chromosome band | Module ID | CNVs validated in separate cohorts(TCGA, METABRIC) | |
|---|---|---|---|
| GSE31684 | |||
| 4q13.3 | 2.47e-06 | 13 | Yes,NA |
| Yp11.2 | 8.25e-04 | 12 | No,NA |
| Yq11.223 | 8.25e-04 | 12 | No,NA |
| 4q21.1 | 8.35e-04 | 4 | Yes,NA |
| 19p13.12 | 1.25e-03 | 5 | No,NA |
| GSE54002 | |||
| 11p15.1 | 1.92e-12 | 44 | Yes,Yes |
| 11q12.2 | 1.61e-06 | 19 | Yes,Yes |
| 19q13.31 | 1.03e-05 | 61 | Yes,Yes |
| 9q31.3 | 2.12e-05 | 26 | No,Yes |
| 9q22.33 | 2.28e-05 | 26 | No,Yes |
| GSE102484 | |||
| 8p11.23 | 3.97e-19 | 53 | Yes,Yes |
| 9q34.3 | 1.36e-09 | 14 | Yes,Yes |
| 15q22.31 | 4.21e-09 | 27 | Yes,Yes |
| 13q14.3 | 2.27e-08 | 18 | Yes,Yes |
| 7q36.3 | 4.12e-08 | 7 | Yes,Yes |
P value cutoff = 0.01
Fig. 4Number of overlapping enriched GO BPs and chromosome bands. GSE54002 and GSE102484 are two breast cancer datasets
Examples of modules identified by our method with lower CCIs and higher PPI densities
| Dataset | Module(nGenes) | nGenes Overlap | CCI | PPI density |
|---|---|---|---|---|
| GSE54002 | LRR5(86) | 16 | 0.457 | 3.81e-03 |
| PCC14(28) | 0.498 | 0 | ||
| GSE54002 | LRR5(86) | 54 | 0.457 | 3.81e-03 |
| WGCNA18(94) | 0.467 | 2.38e-03 | ||
| GSE54002 | LRR7(78) | 18 | 0.340 | 3.12e-03 |
| PCC33(20) | 0.534 | 0 | ||
| GSE54002 | LRR7(78) | 38 | 0.340 | 3.12e-03 |
| WGCNA26(77) | 0.389 | 1.42e-03 | ||
| GSE102484 | LRR17(65) | 15 | 0.291 | 2.46e-03 |
| PCC24(28) | 0.400 | 0 | ||
| GSE102484 | LRR17(65) | 28 | 0.291 | 2.46e-03 |
| WGCNA23(47) | 0.382 | 1.03e-03 | ||
| GSE102484 | LRR8(96) | 83 | 0.359 | 5.68e-03 |
| WGCNA12(115) | 0.367 | 4.17e-03 |
nGenes: number of genes in the module nGenes Overlap: number of genes in both modules
Fig. 5Kaplan-Meier curve of two groups of samples stratified using gene module LRR2 in GSE31684. GSE31684 is a bladder cancer dataset
Fig. 6Number of enriched GO BPs and chromosome bands in a melanoma cell dataset GSE72056. P value cutoff = 0.01
The most significant GO BPs only enriched using our method in GSE72056
| GO ID | Name | Module ID | |
|---|---|---|---|
| GO:0006259 | DNA metabolic process | 1.32e-10 | 6 |
| GO:0006260 | DNA replication | 2.59e-07 | 6 |
| GO:0006281 | DNA repair | 3.17e-06 | 6 |
| GO:0006261 | DNA-dependent DNA replication | 5.10e-06 | 6 |
| GO:0046794 | transport of virus | 8.60e-06 | 2 |
P value cutoff = 0.01
The most significant chromosome bands only enriched using our method in GSE72056
| chromosome band | Module ID | |
|---|---|---|
| 5p15.33 | 1.41e-06 | 18 |
| 6p22.1 | 3.44e-05 | 15 |
| 17q11.2 | 8.46e-05 | 11 |
| 17q11-q12 | 1.83e-03 | 11 |
| 6q24.1-q24.2 | 1.83e-03 | 11 |
P value cutoff = 0.01