| Literature DB >> 27092172 |
Darius Wlochowitz1, Martin Haubrock1, Jetcy Arackal2, Annalen Bleckmann2, Alexander Wolff3, Tim Beißbarth3, Edgar Wingender1, Mehmet Gültas1.
Abstract
Transcription factors (TFs) are gene regulatory proteins that are essential for an effective regulation of the transcriptional machinery. Today, it is known that their expression plays an important role in several types of cancer. Computational identification of key players in specific cancer cell lines is still an open challenge in cancer research. In this study, we present a systematic approach which combines colorectal cancer (CRC) cell lines, namely 1638N-T1 and CMT-93, and well-established computational methods in order to compare these cell lines on the level of transcriptional regulation as well as on a pathway level, i.e., the cancer cell-intrinsic pathway repertoire. For this purpose, we firstly applied the Trinity platform to detect signature genes, and then applied analyses of the geneXplain platform to these for detection of upstream transcriptional regulators and their regulatory networks. We created a CRC-specific position weight matrix (PWM) library based on the TRANSFAC database (release 2014.1) to minimize the rate of false predictions in the promoter analyses. Using our proposed workflow, we specifically focused on revealing the similarities and differences in transcriptional regulation between the two CRC cell lines, and report a number of well-known, cancer-associated TFs with significantly enriched binding sites in the promoter regions of the signature genes. We show that, although the signature genes of both cell lines show no overlap, they may still be regulated by common TFs in CRC. Based on our findings, we suggest that canonical Wnt signaling is activated in 1638N-T1, but inhibited in CMT-93 through cross-talks of Wnt signaling with the VDR signaling pathway and/or LXR-related pathways. Furthermore, our findings provide indication of several master regulators being present such as MLK3 and Mapk1 (ERK2) which might be important in cell proliferation, migration, and invasion of 1638N-T1 and CMT-93, respectively. Taken together, we provide new insights into the invasive potential of these cell lines, which can be used for development of effective cancer therapy.Entities:
Keywords: Wnt pathway; colorectal cancer; master regulator analysis; pathway analysis; promoter analysis
Year: 2016 PMID: 27092172 PMCID: PMC4820448 DOI: 10.3389/fgene.2016.00042
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Workflow for the study of distinct colorectal cancer cell lines. A multi-step workflow is outlined for the comparison of the 1638N-T1 and CMT-93. (A) The analysis begins with the identification of signature genes based on RNA-seq samples using the Trinity platform. This step generates two disjunct lists of signature genes which are further applied to different geneXplain analyses. (B) The signature genes are searched for overrepresented TRANSPATH pathways. Enriched transcription factor binding sites (TFBSs) are searched within the −1 kb/+100 bp promoter regions of the signature genes to obtain transcription factors (TFs). (C) The TFs are then searched for overrepresented TRANSPATH pathways. (D) A master regulatory network is generated by searching for a master regulator (red node) up to 10 steps upstream of the TFs (blue nodes) in TRANSPATH. The master regulator is connected via intermediate molecules (green nodes) with the TFs.
Figure 2Master regulatory network based on the intersection-specific TF set. The color coding red, blue and green represent nodes for master regulators, regulated transcription factors and connecting molecules, respectively.
Overrepresented TRANSPATH pathways for the signature genes of 1638NT-1.
| detoxification and bioactivation of tobacco-derived carcinogen NNK | Cbr3, Ugt1a1, Ugt1a2, Ugt1a6a, Ugt1a7c | 4.755E-4 |
| NNK → NNAL-O-glucuronide, NNAL-N-glucuronide | Cbr3, Ugt1a1, Ugt1a2, Ugt1a6a, Ugt1a7c | 4.755E-4 |
| heme, globin → bilirubin beta-diglucuronide | Ugt1a1, Ugt1a2, Ugt1a6a, Ugt1a7c | 0.00326 |
| hemoglobin oxidation | Ugt1a1, Ugt1a2, Ugt1a6a, Ugt1a7c | 0.00326 |
| IFNgamma → Rap1 | Cybb, Hspa1a, Ifngr1, Ncf4 | 0.01253 |
| Syk → RhoA | Syk, Vav2 | 0.01349 |
| Hck → RhoA | Hck, Vav2 | 0.01349 |
| hepoxilin A3 → Hepoxillin A3-D | Ggt7, Tgm2 | 0.01349 |
| G-alpha-q → IP3 | Cybb, Ncf4, Plcb1 | 0.01385 |
| BCR → p38 | C3,Cybb, Ncf4, Syk | 0.01618 |
| BCR —MLK3 → c-Jun | C3, Cybb, Ncf4, Syk | 0.01618 |
| catabolism of PAF | Enpp2, Pla2g7, Plcb1, Plcg2 | 0.01618 |
| alpha IIb beta3 → Rac1 | Cybb, Fyb, Ncf4, Prkg1, Syk | 0.0211 |
| alpha IIb beta3 pathway | Cybb, Fyb, Ncf4, Prkg1, Syk | 0.0211 |
| IL-8 → ERK2 | Cxcl1, Cybb, Gnai1, Il8, Ncf4 | 0.02495 |
| WAVE2 → Arp2/3 complex | Acta1, Actr3b, Cybb, Cyfip2, Ncf4 | 0.02495 |
| Epo → Syk | Epor, Syk | 0.02577 |
| PMCA4 —/ nNOS | Dmd, Snta1 | 0.02577 |
| Wnt activation of LRP5/6/frizzled/axin complex | Fzd4, Fzd8, Wnt1 | 0.0268 |
| SDF-1 → G-protein | Cxcr4, Cybb, Gnai1, Ncf4, Pik3r5 | 0.02923 |
| BCR → cytoskeletal reorganization | C3, Cybb, Ncf4, Syk | 0.03089 |
| BCR → c-Jun | C3, Cybb, Ncf4, Syk | 0.03089 |
| SLP-65 —/ Raf-1 | Cybb, Ncf4, Plcg2 | 0.03503 |
| dehydroepiandrosterone → estriol 16-glucuronide | Cyp1b1, Cyp4a12a, Cyp4b1, Ugt1a1, Ugt1a2, Ugt1a6a, Ugt1a7c | 0.03888 |
| IL-3 → STAT5 | Csf2rb, Il3ra | 0.04102 |
| beta-glucan —DECTIN1 → IP3, DAG | Plcg2, Syk | 0.04102 |
| metabolism of estrogens | Cyp1b1, Cyp4a12a, Cyp4b1, Ugt1a1, Ugt1a2, Ugt1a6a, Ugt1a7c | 0.04299 |
| Rac1 —p65PAK → Arp2/3 complex | Acta1, Actr3b, Cybb, Ncf4 | 0.04396 |
| Src → Rac1 | Cybb, Ncf4, Vav2 | 0.0444 |
| N-cadherin —Eplin → actin | Acta1, Cdh2, Ctnna2 | 0.0444 |
Overrepresented TRANSPATH pathways for the signature genes of CMT-93.
| beta-catenin:E-cadherin complex phosphorylation and dissociation | Axl, Blk, Cdh1, Epha1, Erbb3, Fes, Kit, Lck, Mertk, Ntrk1, Ret, Tek, Txk | 0.00147 |
| beta-catenin:E-cadherin complex phosphorylation and dephosphorylation | Axl, Blk, Cdh1, Epha1, Erbb3, Fes, Kit, Lck, Mertk, Ntrk1, Ret, Tek, Txk | 0.00147 |
| tyrosine dephosphorylation of plakoglobin | Axl, Blk, Cdh1, Epha1, Erbb3, Fes, Kit, Lck, Mertk, Ntrk1, Ret, Tek, Txk | 0.00166 |
| beta-catenin network | Axl, Blk, Cdh1, Epha1, Erbb3, Fes, Kit, Lck, Magi2, Mertk, Ntrk1, Ret, Tek, Txk | 0.002 |
| NGF —p75NTR → trkA | Ngf, Ntrk1 | 0.00464 |
| VDR network | Cyp27a1, Cyp2r1, Hist1h4i, Hist1h4j, Hist2h3c2, Hist2h4, Hist4h4, Vdr | 0.00541 |
| NGF → trkA | Ngf, Ntrk1 | 0.0133 |
| Tie2 dephosphorylation | Ptprb,Tek | 0.0133 |
| CO2, H2O → spermine | Arg1, Car14, Car2, Car3, Car6 | 0.01419 |
| Angiopoietin/Tie signaling | Dok2, Nos3, Ptprb, Sfn, Tek | 0.01419 |
| creatine biosynthesis and degradation | Car14, Car2, Car3, Car6, Gatm, Mat1a | 0.01625 |
| VDR → RXR-alpha → transcriptional activation | Hist1h4i, Hist1h4j, Hist2h3c2, Hist2h4, Hist4h4, Vdr | 0.01891 |
| sphinganine → ceramide-2,3,6,7 | Cers1, Cers4, Ugcg | 0.01936 |
| urea and aspartate cycles, polyamine and creatine synthesis | Arg1, Car14, Car2, Car3, Car6, Gatm | 0.02184 |
| CO2, L-ornithine → L-arginine | Car14, Car2, Car3, Car6 | 0.02475 |
| p53 → p21Cip1 | Hist1h4i, Hist1h4j, Hist2h4, Hist4h4 | 0.02475 |
| p53 → PUMA | Hist1h4i, Hist1h4j, Hist2h4, Hist4h4 | 0.02475 |
| 7-dehydrocholesterol → calcitriol | Cyp27a1, Cyp2r1 | 0.02542 |
| formation of vitamin D3 and 1alpha,25-dihydroxycholecalciferol | Cyp27a1, Cyp2r1 | 0.02542 |
| Nedd4 → trkA | Ngf, Ntrk1 | 0.02542 |
| PKAc → NR2C | Grin1, Prkaca | 0.02542 |
| NR2A:NR2B —PKAc → Ca | Grin1, Prkaca | 0.02542 |
| Vitamin D metabolism | Cyp27a1, Cyp2r1 | 0.02542 |
| Tie2 —p56Dok-2 → PAK1 | Dok2, Tek | 0.02542 |
| L-tryptophan → 5-hydroxyindoleacetate | Aldh1a7, Maoa, Tph1 | 0.03438 |
| degradation of tryptophan | Acmsd, Aldh1a7, Maoa, Tph1 | 0.03625 |
| Csk, CD45 → Lck | Lck, Ptprc | 0.04048 |
| NR2B:NR2C —CaMKII → c-Fos | Camk2d, Grin1, Prkaca | 0.0436 |
Intersection-specific TF families/subfamilies between 1638N-T1 and CMT-93.
| 1.1.1 | Jun-related factors |
| 1.1.1.1 | Jun factors |
| 1.1.1.2 | NF-E2-like factors |
| 1.1.2 | Fos-related factors |
| 1.1.2.1 | Fos factors |
| 1.1.3 | Maf-related factors |
| 1.1.3.1 | Large Maf factors |
| 1.1.3.2 | Small Maf factors |
| 1.1.8 | C/EBP-related |
| 1.1.8.1 | C/EBP |
| 1.2.1 | E2A-related factors |
| 1.2.2 | MyoD / ASC-related factors |
| 1.2.2.1 | Myogenic transcription factors |
| 1.2.3.1 | Tal / HEN-like factors |
| 1.2.6 | bHLH-ZIP factors |
| 1.2.6.1 | TFE3-like factors |
| 1.2.6.2 | USF factors |
| 1.2.6.5 | Myc / Max factors |
| 1.2.6.7 | Mad-like factors |
| 2.1.2 | Thyroid hormone receptor-related factors (NR1) |
| 2.1.2.4 | Vitamin D receptor (NR1I) |
| 2.1.3 | RXR-related receptors (NR2) |
| 2.1.3.1 | Retinoid X receptors (NR2B) |
| 2.1.3.2 | HNF-4 (NR2A) |
| 3.1.10 | POU domain factors |
| 3.1.10.2 | POU2 (Oct-1/2-like factors) |
| 3.1.4 | TALE-type homeo domain factors |
| 3.1.4.4 | PBX |
| 6.4.1 | Runt-related factors |
| 7.1.1 | SMAD factors |
| 7.1.1.1 | Regulatory Smads (R-Smad) |
| 7.1.1.3 | Repressor-Smads (I-Smad) |
Overrepresented TRANSPATH pathways based on the intersection-specific TF set of 1638N-T1 and CMT-93.
| Endothelin-1 gene regulation | Fos, Jun, Smad3, Smad4 | 2.8851480825350153E-8 |
| Transcriptional Regulation of ECM components | Smad2, Smad3, Smad4, Tfe3 | 4.2462045176114295E-7 |
| PPAR pathway | Ppara, Pparg, Rxra, Rxrb, Smad2, Smad3 | 4.401281772213993E-7 |
| BMP7 → Smad1, Smad5, Smad8 | Smad1, Smad4, Smad5, Smad9 | 9.814052909861147E-7 |
| TGFbeta pathway | Fos, Jun, Pparg, Smad1, Smad2, Smad3, Smad4, Smad5, Smad7, Smad9, Tfe3 | 1.1668767789940478E-6 |
| SMAD7, SIK1 gene induction | Smad2, Smad3, Smad4 | 2.3427402430218484E-6 |
| MIC2 signaling | Fosb, Jun, Jund, Srf | 8.907929442840803E-6 |
| Smad2/3, PPARgamma, regulation of bioavailability | Pparg, Smad2, Smad3 | 9.284406529601274E-6 |
| MIC2-isoform2 —JNK, JunD → MMP9 | Fosb, Jun, Jund | 4.556873521617853E-5 |
| TGFbeta1 → Smad1, Smad2, Smad5 | Smad1, Smad2, Smad5 | 7.900923266022097E-5 |
| MIC2-isoform2 —FosB → MMP9 | Fosb, Jund, Srf | 1.2524823045846698E-4 |
| mammalian Hippo network | Smad2, Smad3, Smad4, Smad7, Tead1 | 1.609311066368507E-4 |
| RA, 15d-PGJ2 → RXR-beta, PPAR-gamma | Pparg, Rxrb | 1.830040551373434E-4 |
| RXR-beta, VDR heterodimerization | Rxrb, Vdr | 1.830040551373434E-4 |
| Smad2/3 —TAZ → cytoplasmic retention | Smad2, Smad3, Smad4 | 3.5891749789138236E-4 |
| Sox9 —Smad3 → COL2A1 | Smad2, Smad3 | 5.443266849267895E-4 |
| MyoD regulation | Myod1, Tcf3 | 5.443266849267895E-4 |
| MKK4 —/ PPAR-gamma | Pparg, Rxrb | 0.0010793689633243411 |
| Ctbp1 —/ Smad3 | Smad3, Smad4 | 0.0010793689633243411 |
| ERK1 → NQO1 | Mafk, Nfe2l2 | 0.0010793689633243411 |
| E2F —/ Smad4 | Smad3, Smad4 | 0.0017836171536470523 |
| Nrf2 → HMOX1 | Mafk, Nfe2l2 | 0.0017836171536470523 |
| stress-associated pathways | Jun, Mitf, Myf6, Pparg, Rxra, Rxrb | 0.0023269497130444508 |
| PRIC complex → PPAR-alpha | Ppara, Rxra | 0.002652641362864685 |
| TGFbeta1 → Smad2/3 | Smad2, Smad3 | 0.002652641362864685 |
| MEK → EZR | Fos, Jun | 0.002652641362864685 |
| p38 pathway | Jun, Mitf, Myf6, Pparg | 0.0034193798154062926 |
| 15-Keto-PGE2 → TP63 | Pparg, Smad2 | 0.003682094214938695 |
| TGFbetaR-I —pak2, ERK1 → SMAD7, SERPINE1 | Smad2, Smad3, Smad4 | 0.003904012718560197 |
| 15d-PGJ2 → PPAR-gamma | Pparg, Rxra | 0.009320059910591498 |
| Regulation of mesendoderm differentiation genes | Smad2, Smad4 | 0.012994431232912744 |
| IRAK-1 —MKK3 → TNF | Fos, Jun | 0.015031819490714783 |
| JNK pathway | Jun, Pparg, Rxra, Rxrb | 0.016302058148038395 |
| VDR network | Rxra, Rxrb, Vdr | 0.01758202640044028 |
| TGFbetaR-I → ERK | Smad2, Smad3 | 0.04188993264127895 |
1638N-T1-specific TF families/subfamilies.
| 3.1.1.9 | CDX (Caudal type homeobox) |
| 3.1.9 | HD-CUT factors |
| 3.1.9.1 | ONECUT |
| 3.1.10.7 | HNF1-like factors |
| 3.3.1 | Forkhead box (FOX) factors |
| 3.3.1.1 | FOXA |
| 3.3.1.6 | FOXF |
| 3.5.3 | Interferon-regulatory factors |
| 4.1.1 | SOX-related factors |
| 4.1.1.3 | Sox-related factors, Group C |
| 4.1.1.4 | Sox-related factors, Group D |
| 4.1.1.5 | Sox-related factors, Group E |
| 6.1.3 | NFAT-related factors |
| 8.2.1 | HMGA factors |
Overrepresented TRANSPATH pathways based on the 1638N-T1-specific TF set.
| dsRNA → IRF-7:IRF-3:CBP:p300 | Irf3, Irf7 | 3.947146928237511E-4 |
| LPS → IRF-3:IRF-7:CBP:p300 | Irf3, Irf7 | 0.0014260355781928803 |
| wnt → beta-catenin | Ctnnb1, Tbp | 0.005286143325503229 |
| TLR9 pathway | Irf1, Irf7 | 0.0106622039857671 |
| TLR3 pathway | Irf3, Irf7 | 0.01598971078797065 |
| wnt pathway | Ctnnb1, Tbp | 0.02936961872680831 |
| TLR4 pathway | Irf3, Irf7 | 0.03155510304218106 |
CMT-93-specific TF families/subfamilies.
| 1.1.2 | Fos-related factors |
| 1.2.6.3 | SREBP factors |
| 2.3.3.1 | GLI-like factors |
| 3.5.2 | Ets-related factors |
| 3.5.2.1 | Ets-like factors |
| 3.5.2.2 | Elk-like factors |
| 3.5.2.3 | Elf-1-like factors |
| 6.1.1 | NF-kappaB-related factors |
| 6.1.5 | Early B-Cell Factor-related factors |
| 6.2.1 | STAT factors |
Overrepresented TRANSPATH pathways based on the CMT-93-specific TF set.
| PDGF B → STATs | Stat3, Stat5a, Stat5b | 6.272149884041184E-7 |
| STAT5 → Ccnd1 | Stat5a, Stat5b | 3.1953088992325076E-5 |
| STAT5 → CISH | Stat5a, Stat5b | 3.1953088992325076E-5 |
| STAT5 → CSN2 | Stat5a, Stat5b | 3.1953088992325076E-5 |
| PDGF B → STAT1alpha, STAT5 | Stat5a, Stat5b | 9.554461322021479E-5 |
| importin-alpha3 → NFkappaB | Nfkb1, Rela | 9.554461322021479E-5 |
| Pin1 → p50:RelA-p65 | Nfkb1, Rela | 9.554461322021479E-5 |
| Epo —Jak2 → STAT5 | Stat5a, Stat5b | 1.9046201145220444E-4 |
| Epo → STAT5 | Stat5a, Stat5b | 1.9046201145220444E-4 |
| IL-3 → STAT5 | Stat5a, Stat5b | 3.1639480397985514E-4 |
| LXR —/ IL1B | Nfkb1, Rela | 3.1639480397985514E-4 |
| SOCS-1 → p50:RelA-p65 | Nfkb1, Rela | 4.730345816689199E-4 |
| TLR8 —Btk → NF-kappaB | Nfkb1, Rela | 4.730345816689199E-4 |
| TLR9 —Btk → NF-kappaB | Nfkb1, Rela | 4.730345816689199E-4 |
| p50:RelA-p65 → SELE | Nfkb1, Rela | 4.730345816689199E-4 |
| IFNalpha/beta pathway | Stat3, Stat5a, Stat5b | 6.440779144960161E-4 |
| fMLP → NFkappaB | Nfkb1, Rela | 6.600749950470129E-4 |
| IL-2 → STAT5 | Stat5a, Stat5b | 6.600749950470129E-4 |
| IFNalpha, IFNbeta → STAT5 | Stat5a, Stat5b | 6.600749950470129E-4 |
| LXR network | Nfkb1, Rela | 6.600749950470129E-4 |
| IL-2 - STAT5 pathway | Stat5a, Stat5b | 8.772117434506635E-4 |
| cPKC —CARD9 → TRAF6 | Nfkb1, Rela | 8.772117434506635E-4 |
| mannan, Dectin2 | Nfkb1, Rela | 8.772117434506635E-4 |
| EDA-A2 —TRAF3 → p50:RelA-p65 | Nfkb1, Rela | 0.0011241425642107344 |
| EDA-A1 → p50:RelA-p65 | Nfkb1,Rela | 0.0011241425642107344 |
| IL-1 pathway | Elk1, Nfkb1, Rela | 0.0012748952830245175 |
| neurotrophic signaling | Elk1, Nfkb1, Rela, Trp53 | 0.0012979014272467505 |
| NGF —p75NTR → p50:RelA-p65 | Nfkb1, Rela | 0.001400567221887963 |
| CH000000333 | Nfkb1, Rela | 0.0017061874975544628 |
| EDAR → NF-kappaB | Nfkb1, Rela | 0.0017061874975544628 |
| TNF-alpha → p50:RelA-p65 | Nfkb1, Rela | 0.0024038320457071337 |
| PDGF pathway | Stat3, Stat5a, Stat5b | 0.004038573581262634 |
| TBK1:TRIF:IKK-i → p50:RelA | Nfkb1, Rela | 0.004136568270131099 |
| dsRNA → p50:RelA | Nfkb1, Rela | 0.004136568270131099 |
| RANKL → p38 | Nfkb1, Rela | 0.004638374899213325 |
| LAT → p50:RelA | Nfkb1, Rela | 0.005722351182439321 |
| EDAR pathway | Nfkb1, Rela | 0.009597224599851443 |
| T-cell antigen receptor pathway | Elk1, Nfkb1, Rela | 0.009985935589865625 |
| LPS → NF-kappaB | Nfkb1, Rela | 0.011871624770568048 |
| NF-kappaB → genes encoding endothelial adhesion molecules | Nfkb1,Rela | 0.011871624770568048 |
| Epo pathway | Stat5a, Stat5b | 0.012677905293747096 |
| TLR9 pathway | Nfkb1, Rela | 0.012677905293747096 |
| IL-1beta → p50:RelA | Nfkb1, Rela | 0.01436112389208374 |
| TLR3 pathway | Nfkb1, Rela | 0.018969411877391994 |
| TNFR1 signaling | Nfkb1, Rela | 0.019957669453188952 |
| diacyl lipopeptide, TLR2 | Nfkb1, Rela | 0.019957669453188952 |
| p38 pathway | Stat3, Trp53 | 0.029796036656231952 |
| PRL pathway | Stat5a, Stat5b | 0.03343465884776058 |
| p50:RelA-p65 → IL8 | Nfkb1, Rela | 0.03343465884776058 |
| IL-3 signaling | Stat5a, Stat5b | 0.03343465884776058 |
| TLR4 pathway | Nfkb1, Rela | 0.037242517103675384 |
| TLR2-mediated signaling | Nfkb1, Rela | 0.04394876054564345 |
Top three master regulators for three TF sets: Intersection-specific TFs of the two cell lines, 1638N-T1-specific TFs and CMT-93-specific TFs.
| 1 | Rad23A | MLK3 | Aebp1 (ACLP) |
| 2 | Smad3 | TBK1 | Il2rg (gamma-c) |
| 3 | Melk | Siah2 | Mapk1 (ERK2) |
Figure 3Master regulatory network based on the 1638NT-1-specific TF set. The color coding red, blue and green represent nodes for master regulators, regulated transcription factors and connecting molecules, respectively.
Figure 4Master regulatory network based on the CMT-93-specific TF set. The color coding red, blue and green represent nodes for master regulators, regulated transcription factors and connecting molecules, respectively.
Figure 5Schema for potential state of canonical Wnt signaling pathway in mouse models. (A) Wnt signaling is activated in 1638N-T1. (B) Wnt signaling is inhibited in CMT-93 through cross-talks with VDR- and/or LXR-induced pathways. Interaction of tumor cells with the microenvironment has an impact on cell proliferation, invasion, and metastasis in mouse models. Signature genes and transcription factors/cofactors, whose binding sites were found to be enriched in promoters, are indicated by a red asterisk or a yellow asterisk, respectively.