| Literature DB >> 21619705 |
Abstract
BACKGROUND: Cells must respond to various perturbations using their limited available gene repertoires. In order to study how cells coordinate various responses, we conducted a comprehensive comparison of 1,186 gene expression signatures (gene lists) associated with various genetic and chemical perturbations.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21619705 PMCID: PMC3123203 DOI: 10.1186/1752-0509-5-87
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Top 20 most frequently appearing genes in 1,186 published gene sets in MSigDB
| Gene | Frequency in | #PubMed | Gene | Frequency in | #PubMed |
|---|---|---|---|---|---|
| MYC | 86 | 13323 | TNFAIP3 | 54 | 201 |
| STAT1 | 75 | 2719 | CDC2 | 54 | 4052 |
| ID2 | 66 | 364 | IL8 | 52 | 675 |
| CDKN1A | 64 | 6713 | HMGB2 | 52 | 151 |
| IFITM1 | 62 | 41 | RHOB | 50 | 270 |
| SERPINE1 | 59 | 6486 | GADD45A | 50 | 224 |
| ISG15 | 58 | 192 | PCNA | 49 | 8179 |
| VEGF | 56 | 22283 | CCND1 | 49 | 7243 |
| CTGF | 56 | 1024 | ATF3 | 49 | 207 |
| IL6 | 55 | 2100 | TOP2A | 48 | 136 |
Figure 1A highly connected network of published gene sets. Nodes represent gene sets and edges represent significant overlaps. We identified 7,419 significant overlaps (FDR < 0.001) among 958 published gene sets. Only 2,915 highly significant (FDR < 1.0 × 10-6) overlaps are shown in this figure. We observed a large number of significant overlaps among diverse expression signatures organized in a modular fashion.
Summary of 22 modules consisting of groups of heavily interconnected gene sets
| ID | #Sets | Cluster Density | Representitive GeneSet | Biological Theme | Most freqently shared genes between gene sets | Most significantly enriched GO Term | P values |
|---|---|---|---|---|---|---|---|
| 1a | 31 | 13.5 | P21_P53_ANY_DN_49 (Sup. Figure 1) | Cell cycle, especially M phase | FOXM1,CCNB1,KIF2C,KIF11,CDC2,CCNB2,UBE2C,CDC20,MKI67,DLG7 | Cell Cycle | 2.80E-45 |
| 1b | 29 | 12.8 | DER_IFNB_UP_93 (Figure 3) | Immune response/interferon | MX2,MX1,OAS1,STAT1,OAS2,ISG15,IFITM1,IFIT3,IRF7,IFI27Response to virus | 1.30E-19 | |
| 2 | 15 | 3.7 | IL1_CORNEA_UP_63 | Inflammatory response/IL1 | IL1B,CCL4,PLAUR,CXCL2,CCL5,TNFAIP3,CD44,IER3,NFKBIA,PLEK | Immune system process | 1.00E-15 |
| 3 | 16 | 3.6 | TNFA_NFKB_DEP_UP_18 (Figure 5) | Inflammatory response/TNFa related | GPNMB,CXCL1,SOD2,MMP9,CXCL5,CCL2,CHI3L1,CXCL3,CD9,IL1B | Immune system process | 2.70E-20 |
| 4 | 14 | 3.1 | GENOTOXINS_ALL_4HRS_REG_27 (Figure 4) | Cell cycle, DNA damage response | CKS2,ECT2,MAD2L1,BUB1,AURKA,CCNB2,CKS1B,PRC1,TRIP13,RRM2 | Cell Cycle | 3.10E-22 |
| 5 | 20 | 2.1 | HOHENKIRK_MONOCYTE_DEND_DN_122 | Inflammatory response/blood cells | STAT1,RAB2,CXCL3,HSPH1,SFRS3,S100A4,S100A8,TOP2A,ACTR1B,ANXA1 | inflammatory response | 7.40E-04 |
| 6 | 14 | 2.0 | TGFBETA_ALL_UP_80 | Cell adhesion, differentiation | IGFBP3,COL6A3,THBS2,CSPG2,SERPINE1,COL1A2,COL3A1,COL6A1,LOX,TIMP1 | Cell adhesion | 9.60E-07 |
| 7 | 6 | 2.0 | ESR_FIBROBLAST_DN_18 | cell differentiation | GNPNAT1,LCK,NDRG1,SOX7,TAF1C,TP53I11,ZNF507,CEPT1,GABBR1,HGF | -- | |
| 8 | 35 | 2.0 | HYPOXIA_REG_UP_38 (Figure 6) | Cell cycle arrest | POSTN,MTHFD1,MYC,TFRC,COL6A3,CSPG2,CTPS,SNRPA1,WEE1,ADORA2B | cell cycle arrest | 8.30E-02 |
| 9 | 18 | 1.7 | UVC_XPCS_4HR_DN_242 | Down-regulated by UV, TNFa | ITGB5,AXL,ARL4C,RGS4,ACTA2,ARHGAP1,ARL6IP5,CAP2,COL1A2,CYFIP2 | Signal Transduction | 7.50E-04 |
| 10 | 7 | 1.7 | SCHUMACHER_MYC_UP_54 ( Figure 2) | MYC target genes | NME1,HSPE1,HSPD1,LDHA,TFRC,APEX1,CDK4,EBNA1BP2,ENO1,FKBP4 | Intracellular organelle lumen | 1.80E-02 |
| 11 | 5 | 1.6 | GNATENKO_PLATELET_UP_47 | Platelet genes | PF4,PPBP,TMSB4X,GPX1,HIST2H2AA4,ACTB,B2M,CCL5,CD99,CFL1 | * | |
| 12 | 7 | 1.6 | ADIPOCYTE_PPARG_UP_16 | Lipid metabolic process | ADIPOQ,AQP7,DGAT1,FASN,RETN,CIDEC,COX7B,NDUFS1,NR1H3,SCARB1 | Lipid metabolic process | 7.10E-04 |
| 13 | 27 | 1.6 | CHANG_SERUM_RESPONSE_DN_194 | SC4MOL,ACTB,HMGCS1,BAK1,ZYX,SCD,BAD,CTNNA1,FDPS,ITGB4 | hemopoiesis | 3.60E-03 | |
| 14 | 4 | 1.5 | CISPLATIN_PROBCELL_UP_17 | ABI1,CDKN1A,EI24,LPIN1,TOB1,TP53INP1,TXNIP,ABLIM1,CARHSP1,H2AFJ | * | ||
| 15 | 6 | 1.3 | UVC_LOW_ALL_UP_19 | Response to DNA damage, UV | BTG2,CDKN1A,GDF15,BTG1,DDB2,PLXNB2,FDXR,GPRC5A | * | |
| 16 | 7 | 1.3 | FLECHNER_KIDNEY_TRANSPLANT_WELL_UP_565 | RAB1A,ATP2A2,COL1A2,PGK1,RAB2,SPARC,WEE1,AGL,CFLAR,DNAJA1 | * | ||
| 17 | 5 | 1.2 | CMV_HCMV_TIMECOURSE_ALL_UP_470 | IL6 induced | MX1,IRF4,TNFSF10,CCNC,DNAJB9,GATM,GNA13,HBEGF,ICAM1,JUNB | Response to stimulus | 4.80E-02 |
| 18 | 5 | 1.2 | REN_E2F1_TARGETS_50 | E2F1 target genes | PCNA,TOP2A,KIAA0101,POLA2,RFC4 | * | |
| 19 | 6 | 1.2 | STEMCELL_HEMATOPOIETIC_UP_1452 | Stem cell enriched | ATP5D,BTBD14A,EEF1A1,GLUL,NTAN1,NUDCD2,PCGF2,ATP1A1,CD151,DYNC1H | * | |
| 20 | 4 | 1.0 | ZHAN_MULTIPLE_MYELOMA_VS_NORMAL_DN_4 | Leukemia related | ELA2,BLNK,BZRAP1,CD24,CD7,CEBPD,CST7,DNTT,HGF,KCNE1L | * | |
| 21 | 4 | 1.0 | NADLER_OBESITY_DN_38 | Obesity down, adipocyte up | PPA1,ACSL1,AGT,ALDH2,CFD,CRAT,FABP4,LDHB,PC,PPARG | * |
Cluster density is defined by dividing the number of significant overlaps (edges) by the number of gene sets within the sub-network.
* Less than 20 genes.
A summary of overlaps that were discussed in details in this paper
| Overlapping Gene Sets | Explanation and supporting references | Repressive? |
|---|---|---|
| Chang_Serum_Response_up & Schumacher_MYC_up | The c-Myc oncogene mediates response to serum stimulation and triggers proliferative growth [ | |
| Sana_IFNG_Endothelial_Dn & Zeller_MYC_Up, MYC_Targets | Interferon γ (IFNG) inhibits cell growth through suppression of c-MYC expression [ | Yes |
| Taketa_NUP9_HOXA9_3d_Up and interferon α and β gene sets | Transduction of fusion protein NUP98-HOXA9 induces "up-regulation of IFNβ1 and is accompanied by marked up-regulation of IFN-induced genes" [ | |
| CMV (cytomegalovirus) infection & Various cytokine regulated gene sets | Host cell response to CMV infection might be mediated by these cytokines. | |
| StemCell_Embryonic_up & BRCA_Prognosis_Neg | Aggressive tumors share some expression signature of embryonic stem cells [ | |
| P53_Genes_All & Zeller_MYC_Dn | p53 represses the oncogene MYC possibly through miRNA-145 [ | Yes |
| Gay_YY1_up & P53_Genes_All | YY1 inhibits the activation of p53 [ | Yes |
| Cancer_undifferentiated_Meta_up & IDX_TSA_UP_Cluster3 | Genes involved in TSA-induced differentiation of fibroblasts into adipocytes are also upregulated in undifferentiated tumors. | |
| Peng_Glutamine_Dn & several MYC upregulated gene sets | Glutamine starvation might suppress cell growth by repression of MYC pathway. | Yes |
| Manalo_hypoxia_Dn, StemCell_Embryonic_up, Le_Myelin_up | Cell cycle genes are regulated by hypoxia, stem cells, and growth after wounding. | Partly |
Figure 2Sub-network #10: MYC target genes. A) The sub-network of MYC target genes. Nodes represent gene sets and edges represent significant overlaps. The size of the node indicates the number of genes in the gene set, which is also numerically labelled by the number at the end of each gene set name. The thickness of the edges is proportional to significance (-log10(FDR)). The labels of edges indicate the number of overlapping genes. Red dashed edges represent repressive connections between "X" upregulated genes and "Y" downregulated genes. The same settings were used to generate subsequent figures in this paper. Gene sets explicitly related to MYC oncoprotein are highlighted in yellow. B) List of 15 genes (columns) that appear three times or more in these seven gene sets (rows). Red indicates that a gene is included in a gene set. This figure shows that our approach can identify overlaps between biologically related gene signatures.
Figure 3Sub-network #1b: Expression signature of interferons α and β. Only edges with extremely significant overlaps (FDR <1.0 × 10-20) are shown here. Nodes represent gene sets and edges represent significant overlaps. The size of the node indicates the number of genes in the gene set, which is also numerically labelled by the number at the end of each gene set name. The thickness of the edges is proportional to significance (-log10(FDR)). The labels of edges indicate the number of overlapping genes. This figure shows that expression signatures of interferon α and β are very similar across multiple studies and that pathogen responses rely on a core set of shared genes.
Figure 4Sub-network #4: Stem cell related gene sets and breast cancer prognosis predictors are linked. Nodes represent gene sets and edges represent significant overlaps. The size of the node indicates the number of genes in the gene set, which is also numerically labelled by the number at the end of each gene set name. The thickness of the edges is proportional to significance (-log10(FDR)). The labels of edges indicate the number of overlapping genes. The labels of edges indicate the number of overlapping genes. Red dashed edges represent repressive connections between "X" upregulated genes and "Y" downregulated genes. We observed overlaps between stem cell related expression signature and breast cancer prognosis predictors. Expression of some cell cycle related genes is shared between stem cells and invasive breast cancers.
Figure 5Sub-network #8: Glutamine starvation and c-Myc genes. The glutamine starvation down-regulated gene set is highlighted in yellow, which is connected to sets of MYC target genes. Nodes represent gene sets and edges represent significant overlaps. The size of the node indicates the number of genes in the gene set, which is also numerically labelled by the number at the end of each gene set name. The thickness of the edges is proportional to significance (-log10(FDR)). The labels of edges indicate the number of overlapping genes. The labels of edges indicate the number of overlapping genes. Red dashed edges represent repressive connections between "X" upregulated genes and "Y" downregulated genes. The "Peng_Glutamine_Dn" list significantly overlaps with almost all MYC related gene sets.
Figure 6Expression of c-Myc target genes in B-lymphoma cells upon glutamine, leucine and glucose starvation as well as rapamycin treatment. Red and green denote higher and lower expression, respectively. Glutamine and leucine deficiencies, but not glucose deficiency, strongly downregulate many MYC target genes. The anticancer drug rapamycin has a similar effect on these genes, suggesting that rapamycin mimics amino acid starvation.