| Literature DB >> 26005638 |
Scooter Willis1, Pradip De1, Nandini Dey1, Bradley Long2, Brandon Young1, Joseph A Sparano3, Victoria Wang4, Nancy E Davidson5, Brian R Leyland-Jones1.
Abstract
PURPOSE: Triple negative (TN) breast cancers which lack expression of the estrogen (ER), progesterone (PR), and human epidermal growth factor 2 (HER2) receptors convey a poor prognosis due in part to a lack of targeted therapies.Entities:
Keywords: Breast cancer; ER +; GSEA; NR6A1; Triple negative
Year: 2015 PMID: 26005638 PMCID: PMC4436509 DOI: 10.1016/j.mgene.2015.04.002
Source DB: PubMed Journal: Meta Gene ISSN: 2214-5400
Fig. 1Each cohort consisting of TN and ER+(HER2 −) samples are run using GSEA to determine gene sets that are enriched and share a common promoter motif. The p-value from each enriched gene set is combined and ranked using Stouffer weighted Z to identify gene sets that have consistent enriched gene sets across all cohorts. The transcription factor for each ranked enriched gene set are searched in the STICH 4.0 database for chemical inhibitors or activators. Additionally, common set of genes in each gene set shown to be enriched across maximum number of cohorts are searched for known chemical and genomic perturbation gene set to identify possible inhibitors or activators.
Results of GSEA using gene set c3.tft.v4.0.symbols.gmt in seven different TN vs ER+(HER2 −) cohorts ranked by Stouffer weighted Z score. Each cohort has a range of metadata to classify a sample as TN or ER+(HER2 −). For NNN that indicates the first N = ER − status, second N = PR − status and third N = HER2 − status. An X indicates any value and in Metabric a 1 was used to indicate HER2 − status.
| E2100 NNN vs PXN | E2197 NNN vs PXN | GSE25055 NNN vs PXN | GSE25065 NNN vs PXN | Metabric discovery Nn1 vs PX1 | Metabric validation Nn1 vs PX1 | TCGA BRCA NNN vs PXN | Stouffer weighted Z | |
|---|---|---|---|---|---|---|---|---|
| V$HIF1_Q5 | 0.428 | 0.002 | 0.002 | 0.103 | 0.006 | 0.041 | 0.003 | 4.98E − 09 |
| V$HIF1_Q3 | 0.536 | 0.000 | 0.000 | 0.021 | 0.022 | 0.067 | 0.011 | 1.08E − 08 |
| KTGGYRSGAA_UNKNOWN | 0.810 | 0.000 | 0.024 | 0.031 | 0.034 | 0.008 | 0.030 | 1.86E − 07 |
| V$E2F1_Q6 | 0.777 | 0.000 | 0.000 | 0.008 | 0.111 | 0.043 | 0.040 | 2.16E − 07 |
| V$GNCF_01 | 0.000 | 0.000 | 0.016 | 0.124 | 0.020 | 0.047 | 0.338 | 4.34E − 07 |
| V$HMGIY_Q6 | 0.006 | 0.010 | 0.028 | 0.051 | 0.057 | 0.028 | 0.029 | 9.29E − 07 |
| MCAATNNNNNGCG_UNKNOWN | 0.543 | 0.004 | 0.164 | 0.092 | 0.039 | 0.125 | 0.000 | 3.40E − 06 |
| V$E2F_01 | 0.556 | 0.000 | 0.036 | 0.030 | 0.175 | 0.056 | 0.023 | 3.78E − 06 |
| TTCNRGNNNNTTC_V$HSF_Q6 | 0.175 | 0.008 | 0.022 | 0.004 | 0.070 | 0.089 | 0.085 | 4.11E − 06 |
| CTGRYYYNATT_UNKNOWN | 0.008 | 0.056 | 0.055 | 0.061 | 0.024 | 0.066 | 0.032 | 1.02E − 05 |
| V$MYCMAX_02 | 0.620 | 0.000 | 0.020 | 0.288 | 0.176 | 0.111 | 0.036 | 2.37E − 05 |
| V$MYC_Q2 | 0.402 | 0.014 | 0.004 | 0.099 | 0.116 | 0.175 | 0.052 | 3.61E − 05 |
| V$TAXCREB_02 | 0.570 | 0.032 | 0.037 | 0.160 | 0.047 | 0.032 | 0.051 | 5.53E − 05 |
| CATTGTYY_V$SOX9_B1 | 0.281 | 0.030 | 0.080 | 0.017 | 0.045 | 0.084 | 0.118 | 6.33E − 05 |
| V$E2F_Q3 | 0.606 | 0.002 | 0.029 | 0.059 | 0.249 | 0.167 | 0.071 | 7.11E − 05 |
| V$AP1_Q4 | 0.428 | 0.008 | 0.185 | 0.471 | 0.052 | 0.018 | 0.063 | 9.65E − 05 |
| MYAATNNNNNNNGGC_UNKNOWN | 0.265 | 0.021 | 0.106 | 0.131 | 0.079 | 0.100 | 0.121 | 2.59E − 04 |
| V$NFY_Q6 | 0.800 | 0.010 | 0.029 | 0.157 | 0.091 | 0.133 | 0.168 | 2.71E − 04 |
| V$AP1_01 | 0.333 | 0.019 | 0.113 | 0.358 | 0.056 | 0.057 | 0.180 | 3.90E − 04 |
| V$USF_C | 0.756 | 0.008 | 0.152 | 0.252 | 0.275 | 0.095 | 0.014 | 5.34E − 04 |
| CCAWYNNGAAR_UNKNOWN | 0.713 | 0.028 | 0.048 | 0.189 | 0.454 | 0.119 | 0.003 | 6.90E − 04 |
| V$NFKAPPAB_01 | 0.054 | 0.022 | 0.529 | 0.634 | 0.017 | 0.050 | 0.164 | 7.14E − 04 |
| TGASTMAGC_V$NFE2_01 | 0.702 | 0.055 | 0.061 | 0.151 | 0.052 | 0.028 | 0.318 | 7.44E − 04 |
| V$MYCMAX_01 | 0.823 | 0.015 | 0.187 | 0.413 | 0.246 | 0.069 | 0.008 | 8.60E − 04 |
| TTGCWCAAY_V$CEBPB_02 | 0.420 | 0.010 | 0.097 | 0.181 | 0.120 | 0.293 | 0.180 | 8.64E − 04 |
| V$TCF11MAFG_01 | 0.586 | 0.138 | 0.053 | 0.054 | 0.066 | 0.050 | 0.179 | 9.81E − 04 |
| V$BACH2_01 | 0.230 | 0.035 | 0.173 | 0.243 | 0.097 | 0.164 | 0.120 | 0.001 |
| V$ZIC3_01 | 0.050 | 0.079 | 0.080 | 0.325 | 0.096 | 0.237 | 0.157 | 0.002 |
| RAAGNYNNCTTY_UNKNOWN | 0.657 | 0.088 | 0.067 | 0.205 | 0.082 | 0.090 | 0.326 | 0.004 |
| V$TCF1P_Q6 | 0.148 | 0.042 | 0.067 | 0.332 | 0.228 | 0.305 | 0.169 | 0.004 |
| V$AP1FJ_Q2 | 0.202 | 0.065 | 0.256 | 0.385 | 0.158 | 0.148 | 0.132 | 0.006 |
| V$AP1_Q2 | 0.253 | 0.146 | 0.116 | 0.376 | 0.158 | 0.141 | 0.088 | 0.006 |
Promoter motif gene signatures that are enriched in the most cohorts indicating that genes are consistently enriched when comparing TN vs ER+(HER2 −) expression.
| Gene Set | #Genes | #Cohorts | Genes Enriched in the Most Cohorts |
|---|---|---|---|
| CATTGTYY_V$SOX9_B1 | 21 | 7 | [ANKRD11 BCL11A CDK8 CLIC4 EHBP1 FEN1 FGF9 ILF2 MID1 NASP NDRG2 NXN RBMS1 S100A1 SEPHS1 SFRP1 SLC6A9 TLE1 TRIM2 VLDLR YEATS2] |
| V$HIF1_Q3 | 18 | 7 | [ATP1B3 BCL11A CA9 E2F3 EN1 ENO1 EPHB3 GPM6B IGF2BP3 KIAA0664 KLF11 LRP8 NFIL3 NPR3 ST6GAL1 VEGFA YBX1 YEATS2] |
| V$HMGIY_Q6 | 16 | 7 | [ANKRD11 ATP1B3 BCL11A CXCL10 DLL3 DONSON EN1 FAM98A LMO4 MIA NDRG2 NMU PPP2R3A PTPN22 ST8SIA1 TFAP2C] |
| V$HIF1_Q5 | 13 | 7 | [BCL11A CA9 E2F3 EN1 EPHB3 GPM6B IGF2BP3 KLF11 LRP8 NT5DC2 PIM1 VEGFA YBX1] |
| V$MYC_Q2 | 10 | 7 | [CSDA FOSL1 FXYD6 IL15RA IVNS1ABP LRP8 PABPC4 REXO2 TAGLN2 YBX1] |
| TTCNRGNNNNTTC_V$HSF_Q6 | 8 | 7 | [CRYAB E2F3 H2AFY2 LRP8 SLC15A2 TBPL1 TFCP2L1 VEGFA] |
| MYAATNNNNNNNGGC_UNKNOWN | 7 | 7 | [AMD1 BBOX1 CDC20 KRT9 PTK7 TLE1 TLE4] |
| TTGCWCAAY_V$CEBPB_02 | 7 | 7 | [CEBPB CLDN10 KRT23 SOX10 SPIB SRPK1 SYNCRIP] |
| CTGRYYYNATT_UNKNOWN | 6 | 7 | [CDC42EP1 COL9A1 CRYAB DKK1 SMOX SOX10] |
| V$E2F1_Q6 | 36 | 6 | [ARHGAP11A ATAD5 CDC25A CDK1 DMD DNMT1 E2F3 EFNA5 EHBP1 EPHB1 EZH2 FANCG FBXO5 GAPDH GINS3 GMNN GPRC5B H2AFZ KLF5 MCM3 MCM4 MCM6 MCM7 MYC NASP NCL NUP62 PIM1 POLD1 PRPS2 RANBP1 RBL1 RRM2 SYNCRIP TYRO3 WDR62] |
| V$E2F_Q3 | 28 | 6 | [ARHGAP11A CDC25A CDC45 CDK1 DMD DNMT1 DPYSL2 E2F3 EPHB1 EZH2 FBXO5 GMNN KIF15 KLF5 MCM3 MCM4 MCM6 MCM7 NASP NPR3 NUP153 PIM1 PLK4 POLA2 RANBP1 TCP1 TIPIN USP1] |
| V$NFY_Q6 | 18 | 6 | [ACTL6A BUB1 CDK1 CENPF CKS2 GART LRP8 MYO1E NEK2 PNOC PRR11 PTCH1 SFRP1 TLE4 TOP2A TTK UGP2 ZBTB5] |
| KTGGYRSGAA_UNKNOWN | 12 | 6 | [EZH2 MCM3 MCM6 MCM7 MYC PIM1 POLD1 RANBP1 RFC4 SYNCRIP TUBA4A YARS] |
| V$GNCF_01 | 10 | 6 | [ABTB2 AMD1 BCL11A BCL11B FGF9 ITGB8 MALL NFIB NFIL3 RASAL1] |
| V$E2F_01 | 9 | 6 | [AMD1 DNMT1 E2F3 MCM3 MCM4 MYC NASP RANBP1 SMC4] |
| MCAATNNNNNGCG_UNKNOWN | 8 | 6 | [AMD1 CDCA3 EPHB1 PIGA PPP2R5D SH3BGRL3 SUV39H1 ZFAND5] |
| V$TAXCREB_02 | 2 | 6 | [BCL11A CRLF1] |
| V$NFKAPPAB_01 | 26 | 5 | [CBX2 CCL5 CD40 CD70 CD83 CDK6 CXCL10 CXCL11 CXCL16 IL27RA ITGB4 LIX1L LTA MSN NFKB2 NFKBIB ORAI1 RBMS1 RELB SMOC1 SOX10 TNFRSF9 TRIM47 UBD WNT10A ZBTB5] |
| V$AP1_Q4 | 25 | 5 | [ASS1 CA9 COL27A1 CORO1C EDN1 EPHA2 FGF11 GJB3 GPR3 IRAK1 KCNN4 KIAA0664 LAMB3 LAMC2 MAP2 MMP7 PIM2 REXO2 SLC4A11 STK40 TAGLN2 UBE2C UCHL3 USP13 XIRP1] |
| V$MYCMAX_01 | 22 | 5 | [ATAD3A CSK EIF2C2 ESRRA GCSH GPM6B HMGA1 IVNS1ABP KCNN4 KIAA0664 LRP8 ODC1 PABPC1 PPRC1 PRDX4 RANBP1 REXO2 SLC6A15 SYNCRIP TIMM8A TUBA4A YEATS2] |
| V$AP1_01 | 20 | 5 | [AQP5 CA9 CALB2 DMD ENO1 EPHA2 FGF9 FOSL1 GSTP1 IRAK1 LY6D MAP2 MMP7 PIM1 REXO2 S100A2 SERPINB5 TAGLN2 TINAGL1 TUBA4A] |
| V$TCF11MAFG_01 | 18 | 5 | [CDC45 DYNC1I1 E2F3 ENO1 FGF9 FOSL1 IRX4 JOSD1 LMO4 NUDT11 PLS3 PPARGC1A PTCH1 REXO2 SEL1L3 SERPINB5 SOBP UCHL1] |
| TGASTMAGC_V$NFE2_01 | 12 | 5 | [ANGPTL4 ASS1 CA9 CALB2 E2F3 FOSL1 GSTP1 MYO10 PLS3 S100A2 SERPINB5 TINAGL1] |
| CCAWYNNGAAR_UNKNOWN | 10 | 5 | [AKR1B1 ARL4C CDC45 CKS1B DMD FAM49A RBMS1 TBX19 UQCRH YBX1] |
| V$MYCMAX_02 | 18 | 4 | [CALB2 DZIP1 EN1 FBL FRMD4A HDGF HIF1A IVNS1ABP KIAA0664 NOTCH1 PABPC4 PPRC1 PTCH1 PTK7 SOX10 SYNCRIP TNFRSF21 TYRO3] |
| V$USF_C | 12 | 4 | [ASS1 BATF3 EIF2C2 ETV4 FBL IVNS1ABP LRP8 PABPC4 SLC6A15 ST6GAL1 TNFRSF21 YBX1] |
Promoter motif signatures from Table 2 compared to chemical and genetic perturbation gene sets in c2.cgp.v4.0.symbols.gmt. The identified chemical or genetic perturbation is shown experimentally to down regulate the enriched genes in the Genes found column.
| Chemical or genetic perturbations | CGP gene set | Genes found |
|---|---|---|
| Ras inhibitor Salirasib | BLUM_RESPONSE_TO_SALIRASIB_DN | |
| EGFR inhibitor CL-387785 | KOBAYASHI_EGFR_SIGNALING_24HR_DN | |
| Aminopeptidase inhibitor Tosedostat (CHR-2797) | KRIGE_RESPONSE_TO_TOSEDOSTAT_24HR_DN | |
| SB216763 inhibitor of GSK3B | WANG_RESPONSE_TO_GSK3_INHIBITOR_SB216763_DN | |
| Knockout TLX | ZHANG_TLX_TARGETS_60HR_DN | |
| Knockout BMP2 | LEE_BMP2_TARGETS_DN |
Determination of transcription factors are expressed in many tissues using Gene Expression Barcode 2.0 (McCall et al., 2011). Rows that are bold indicate transcription factors that are not normally expressed in breast tissue.
| Gene | Genbank | Proportion | Entropy | High entropy | Tissues |
|---|---|---|---|---|---|
| HIF1A | 0.916 | 0.289 | Yes | Many | |
| HMGA1 | 0.534 | 0.349 | Yes | Many | |
| HMGA1 | 0 | 0.01 | No | None | |
| MYC | 0.649 | 0.24 | Yes | Many | |
| MYC | 0 | 0.005 | No | None | |
| JUN | 0.947 | 0.087 | No | Many | |
| JUN | 0.16 | 0.251 | Yes | Adipose_tissue breast breast_stroma breast_stroma:tumor bronchus cd31 + _cells cd49a + _cells cervix dorsal_root_ganglia endometrium lung myometrium ovary prostate_gland skeletal_muscle stomach_cardiac thyroid trachea trigeminal_ganglia urethra vagina | |
| JUN | 0.634 | 0.39 | Yes | Many | |
| JUN | 0.092 | 0.178 | No | Breast_stroma breast_stroma:tumor bronchus cd49a + _cells fallopian_tube_epithelium head_and_neck_squamous_cell_carcinoma:tumor lung ovary parotid_gland thyroid trachea trigeminal_ganglia | |
| SOX9 | 0.695 | 0.222 | Yes | Many | |
| SOX9 | 0.679 | 0.293 | Yes | Many | |
| NFKB1 | 0.374 | 0.216 | Yes | Many | |
| NFE2 | 0.084 | 0.06 | No | Blood bone_marrow cd14 + _cells lymphocytes macaca_mulatta_blood monocyte-derived_dendritic_cells monocyte neutrophils pbmc platelets stratagene_reference_rna | |
| BACH2 | 0.076 | 0.121 | No | Blood cd3 + _t_cells cd49a + _cells cd4 + _cells jurkat_cells lymphocytes macaca_mulatta_blood pbmc platelets tonsil | |
| BACH2 | 0 | 0.053 | No | None | |
| CEBPB | 0.977 | 0.162 | No | Many | |
| NFE2L1 | 0.947 | 0.073 | No | Many | |
| NFE2L1 | 0.977 | 0.033 | No | Many | |
| NFE2L1 | 0.924 | 0.093 | No | Many | |
| ZIC3 | 0.008 | 0.006 | No | Cerebellum | |
| STAT1 | 0.916 | 0.333 | Yes | Many | |
| STAT1 | 0.336 | 0.305 | Yes | Many |
Fig. 2E2197 expression levels of ESR1 and NR6A1/GCNF, HNF1A, ZIC3 grouped by IHC determined TN or ER +. Expression levels of ESR1 establish an expected expression range. Box and Whisker plot with expression values outside of the 10–90 percentile indicated as black dots. ESR1 and HIF1A are shown for reference. Other genes are not expected to be expressed in normal tissue as indicated in Table 5. GCNF/NR6A1 has high expression in both TN and ER + samples where expression is only expected in germ cells of gonads.
Transcription factors that target the identified enriched gene sets and drugs associated with inhibition and activation determined by STITCH 4.0.
| Transcription factor | STITCH 4.0 drug inhibition | STITCH 4.0 activation | Gene set |
|---|---|---|---|
| HIF1A | Rapamycin | Oxygen, deferoxamine | [V$HIF1_Q3, V$HIF1_Q5] |
| E2F1 | CD437, camptothecin | V$E2F1_Q6 | |
| NR6A1 GCNF | Androgen | DR-0, retinoic acid | V$GNCF_01 |
| HMGA1 | Mevalonate | V$HMGIY_Q6 | |
| MYC | Estrogen | [V$MYCMAX_02, V$MYCMAX_01] | |
| JUN | TAM67 | Adenosine triphosphate, rapamycin, troglitazone | [V$AP1_Q4,V$AP1_01,V$AP1FJ_Q2, V$AP1_Q2] |
| SOX9 | Retinoic acid | Dexamethasone, androgen, DMOG | CATTGTYY_V$SOX9_B1 |
| NFKB/NFKB1 | 15d-PGJ2, curcumin, aspirin | Adenosine triphosphate, aspirin | V$NFKAPPAB_01 |
| NFE2 | Vanadate | TGASTMAGC_V$NFE2_01 | |
| BACH2 | V$BACH2_01 | ||
| CEBPB | Retinoic acid | Retinoic acid | TTGCWCAAY_V$CEBPB_02 |
| NFE2L1 | Sodium arsenite | V$TCF11MAFG_01 | |
| TCF1/HNF1A | V$TCF1P_Q6 | ||
| ZIC3 | V$ZIC3_01 | ||
| STAT1 | Retinoic acid, V205 | V$ISRE_01 |