| Literature DB >> 23251436 |
Christian J Gröger1, Markus Grubinger, Thomas Waldhör, Klemens Vierlinger, Wolfgang Mikulits.
Abstract
The epithelial to mesenchymal transition (EMT) represents a crucial event during cancer progression and dissemination. EMT is the conversion of carcinoma cells from an epithelial to a mesenchymal phenotype that associates with a higher cell motility as well as enhanced chemoresistance and cancer stemness. Notably, EMT has been increasingly recognized as an early event of metastasis. Numerous gene expression studies (GES) have been conducted to obtain transcriptome signatures and marker genes to understand the regulatory mechanisms underlying EMT. Yet, no meta-analysis considering the multitude of GES of EMT has been performed to comprehensively elaborate the core genes in this process. Here we report the meta-analysis of 18 independent and published GES of EMT which focused on different cell types and treatment modalities. Computational analysis revealed clustering of GES according to the type of treatment rather than to cell type. GES of EMT induced via transforming growth factor-β and tumor necrosis factor-α treatment yielded uniformly defined clusters while GES of models with alternative EMT induction clustered in a more complex fashion. In addition, we identified those up- and downregulated genes which were shared between the multitude of GES. This core gene list includes well known EMT markers as well as novel genes so far not described in this process. Furthermore, several genes of the EMT-core gene list significantly correlated with impaired pathological complete response in breast cancer patients. In conclusion, this meta-analysis provides a comprehensive survey of available EMT expression signatures and shows fundamental insights into the mechanisms that are governing carcinoma progression.Entities:
Mesh:
Year: 2012 PMID: 23251436 PMCID: PMC3519484 DOI: 10.1371/journal.pone.0051136
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Gene expression studies of EMT used for meta-analysis.
| First author | Acc. | Ref. | Cell type | Cell origin | Treatment modality | Platform | Samples |
| Ke | E-TABM-949 |
| EP156T/EPT2 | Prostate | high cell density | Agil WHG 4×44K G4112F | 2 |
| Andarawewa | GSE8240 |
| MCF10A | Breast | TGF-β+irradiation | Affy HTU133A | 3 |
| Takahashi | GSE12548/GSE15205 |
| ARPE19 | Retinal pigment | TGF-β+TNF-α | Affy U133Plus2 | 3 |
| Tay | GSE13759 |
| HCT116/E1 | Colon | serial transplantation | Affy U133A | 3 |
| Drake | GSE14405 |
| PC-3/TEM4-18 | Prostate | transendothelial migration | Affy U133Plus2 | 2 |
| Hwang | GSE14773 |
| CRC | Colon | spheroid formation | Affy U133Plus2 | 2 |
| Sartor | GSE17708 |
| A549 | Lung | TGF-β | Affy U133Plus2 | 3 |
| Papageorgis | GSE18070 |
| MCF10CA1h | Breast | H-Ras+carcinoma | Affy U133Plus2 | 3 |
| Hills | GSE20247 |
| HK2 | Kidney | TGF-β+Cpep | Illum HWG-6 v3.0 | 3 |
| Leshem | GSE22010 |
| PrEC-hTERT | Prostate | AR+T/ERG | Affy HG 1.0 ST | 4 |
| Micalizzi | GSE23655 |
| MCF7 | Breast | Six1 vector | Affy HTU133A | 6 |
| Maupin | GSE23952 |
| Panc-1 | Pancreas | TGF-β | Affy U133Plus2 | 3 |
| Taubed | GSE24202 |
| HMLE | Breast | TGF-β1; Snail1, Twist, Gsc vectors; siRNA against E-Cadherin | Affy HTU133A | 3 |
| Baniwal | GSE24261 |
| PCa C4-2B/Rx2dox | Prostate | Runx2 vector | Illum HR-8 v3.0 | 4 |
| van Zijl | GSE26391 |
| 3p/3sp | Liver | tumor cell recovery | Affy HG 1.0 ST | 2 |
| Ohashi | GSE27424 |
| EPC2-hTERT | Esophagus | Notch3 knock-down (shRNA) | Affy U133Plus2 | 3 |
| Hesling | GSE28448 |
| HMEC-TR | Breast | TGF-β+siRNA against TIFγ | Affy U133Plus2 | 2 |
| Wang | GSE28799 |
| OVCAR-3 | Ovary | spheroid formation | Affy U133Plus2 | 3 |
, lowest number of samples per class (control or test subject).
, in vitro;
, in vivo;
, consists of two studies with three datasets in total; d, consists of five datasets.
Abbreviations: Affy, Affymetrix; Agil, Agilent; AR, androgen receptor; Illum, Illumina; sh, small hairpin; si, small interfering; T/ERG, TMPRSS2/ERG; TGF, transforming growth factor; TNF, tumor necrosis factor.
Figure 1Cluster analysis of genes shared between at least 10 GES datasets shows distinguishable and significant clusters.
Genes shared between at least 10 out of 24 datasets were used for Manhattan hierarchical clustering. The type of regulation within a particular study was visualized via heatmap. Columns: genes shared between at least 10 datasets (n = 365); rows: analyzed GES (24 datasets in total); green: downregulated genes; red: upregulated genes; black: genes not regulated. GSE: Gene expression omnibus (GEO) series record; E.TABM: ArrayExpress (AE) series record; TGF, transforming growth factor; TNF, tumor necrosis factor.
Figure 2Gene expression studies cluster according to the mode of EMT initiation rather than to cell type.
The cell type and treatment modality of EMT was annotated and revealed clustering according to the mode of EMT induction. The clustering persisted when genes shared between at least 14 GES datasets were used for the analysis. (A) Hierarchical clustering of 365 genes shared between at least 10 datasets. (B) Hierarchical clustering of 41 genes shared between at least 14 datasets. The legend indicates cell type and treatment modality (right panel). *, Transcription factor vectors: Runx2, Six1, Snail, Twist and Goosecoid. GSE: Gene expression omnibus (GEO) series record; E.TABM: ArrayExpress (AE) series record; TGF, transforming growth factor; TNF, tumor necrosis factor.
EMT-core list of 130 genes shared between at least 10 GES datasets.
| Upregulated | Downregulated | |
|
| ADAM12, CDH11, CDH2, COL1A1, COL3A1, COL5A1, COL6A1, COL6A3, CTGF, CYP1B1, DLC1, FBLN1, FBLN5, FGF2, FGFR1, FN1, HAS2, LUM, MMP2, MYL9, NID2, NR2F1, NRP1, PLAT, PPAP2B, PRKCA, RECK, SERPINE1, SERPINE2, SPOCK1, TGM2, TNFAIP6, TPM1, VCAN, WNT5A | CD24, CDH1, CXADR, CXCL16, DSG3, ELF3, EPCAM, EPHA, JUP, MPZL2, OVOL2, PLXNB1, S100P, SLC7A5, SYK |
|
| CDKN2C, EMP3, FBN1, IGFBP3, IL1R1, LTBP1, MME, PMP22, PTGER2, PTX3, SRGN, SULF1, SYNE1, TAGLN, TUBA1A, VIM, ZEB1 | ABLIM1, ADRB2, ALDH1A3, ANK3, BIK CA2, CTSL2, FGFR2, FGFR3, FST, GJB3, IFI30, IL18, KLK7, KRT15, KRT17, LSR, MAP7, MBP, OCLN, PKP2, PPL, PRSS8, RAPGEF5, SPINT1 |
|
| DCN, LOX, TFPI |
|
|
| ABCA1, GALNT10, SLC22A4 | GPX3, SLC27A2, SMPDL3B, SORL1, ST6GALNAC2 |
|
| C5orf13, CDK14, EML1, FSTL1, LTBP2, MAP1B, RGS4, SYT11, TMEM158 | AGR2, C10orf10, CDS1, FAM169A, FXYD3, KLK10, LAD1, MTUS1, PLS1, PRRG4, RHOD, SERPINB1, SLPI, TMEM30B, TPD52L1, TSPAN1, ZHX2, ZNF165 |
Categories have been chosen according to the GO classifications of the enrichment tools. Genes may be present in more than one category.
see Table S3 for more information.
Number of enriched terms and pathways in all lists detected by the enrichment tools.
| Tool | 130 gene list | 365 gene list | GSE13195 core list | GSE24202 core list | ||||||||
| BP | MF | KEGG | BP | MF | KEGG | BP | MF | KEGG | BP | MF | KEGG | |
| ConsensusPathDB | 305 | 31 | 9 | 558 | 61 | 31 | 62 | 10 | 6 | 247 | 34 | 8 |
| FatiGO | 178 | 28 | 9 | 452 | 72 | 36 | 0 | 0 | 2 | 172 | 28 | 10 |
| GeneCodis | 34 | 16 | 8 | 155 | 45 | 46 | 59 | 17 | 4 | 240 | 48 | 7 |
| ToppFun | 241 | 21 | 1 | 610 | 45 | 5 | 0 | 0 | 1 | 127 | 14 | 0 |
| WebGestalt | 40 | 28 | 6 | 40 | 40 | 37 | 5 | 4 | 4 | 40 | 30 | 8 |
The numbers of enriched terms and pathways found by the particular enrichment tools are displayed. BP, GO biological process; MF, GO molecular function; KEGG, KEGG pathway. GSE13195 core list of Choi et al., GSE24202 core list of Taube et al. [13], [39].
Consistently enriched GO terms and KEGG pathways and their occurrence in the analyzed gene lists.
| Term ID | Category | Term size | 130 gene list | 365 gene list | GSE13915 core list | GSE24202 core list | ||||
| Tools | Genes | Tools | Genes | Tools | Genes | Tools | Genes | |||
|
| ||||||||||
| GO:0048646 | anatomical structure formation involved in morphogenesis | 390 | 4 | 24 | 4 | 62 | 0 | - | 4 | 22 |
| GO:0001525 | angiogenesis | 189 | 4 | 16 | 4 | 38 | 0 | - | 4 | 14 |
| GO:0007596 | blood coagulation | 182 | 4 | 13 | 4 | 29 | 0 | - | 3 | 13 |
| GO:0001568 | blood vessel development | 288 | 4 | 25 | 5 | 54 | 0 | - | 5 | 20 |
| GO:0007155 | cell adhesion | 953 | 5 | 36 | 5 | 76 | 2 | 19 | 5 | 41 |
| GO:0016049 | cell growth | 226 | 4 | 13 | 4 | 34 | 0 | - | 4 | 14 |
| GO:0016477 | cell migration | 405 | 5 | 32 | 5 | 67 | 1 | 13 | 5 | 35 |
| GO:0048870 | cell motility | 484 | 4 | 33 | 4 | 69 | 1 | 13 | 5 | 35 |
| GO:0006928 | cellular component movement | 666 | 4 | 36 | 5 | 73 | 1 | 16 | 5 | 41 |
| GO:0009790 | embryo development | 619 | 4 | 18 | 4 | 46 | 0 | - | 3 | 20 |
| GO:0008544 | epidermis development | 218 | 5 | 16 | 4 | 32 | 2 | 6 | 5 | 26 |
| GO:0007507 | heart development | 230 | 5 | 15 | 4 | 28 | 1 | 6 | 3 | 10 |
| GO:0009887 | organ morphogenesis | 800 | 5 | 21 | 5 | 54 | 0 | - | 5 | 34 |
| GO:0042127 | regulation of cell proliferation | 823 | 4 | 28 | 5 | 81 | 1 | 14 | 5 | 37 |
| GO:0050793 | regulation of developmental process | 1005 | 4 | 34 | 4 | 88 | 0 | - | 4 | 32 |
| GO:0009611 | response to wounding | 776 | 5 | 31 | 5 | 85 | 0 | - | 4 | 34 |
| GO:0001501 | skeletal system development | 394 | 4 | 14 | 4 | 35 | 0 | - | 5 | 20 |
| GO:0009888 | tissue development | 808 | 4 | 38 | 4 | 93 | 1 | 12 | 5 | 52 |
| GO:0001944 | vasculature development | 294 | 4 | 25 | 4 | 56 | 0 | - | 5 | 20 |
| GO:0042060 | wound healing | 270 | 4 | 20 | 5 | 50 | 0 | - | 3 | 19 |
|
| ||||||||||
| GO:0005509 | calcium ion binding | 1033 | 4 | 22 | 4 | 55 | 0 | - | 4 | 34 |
| GO:0030246 | carbohydrate binding | 380 | 4 | 15 | 4 | 29 | 1 | 7 | 4 | 14 |
| GO:0005518 | collagen binding | 40 | 4 | 5 | 5 | 12 | 0 | - | 0 | - |
| GO:0004866 | endopeptidase inhibitor activity | 179 | 4 | 9 | 4 | 19 | 0 | - | 4 | 9 |
| GO:0004857 | enzyme inhibitor activity | 327 | 4 | 10 | 4 | 26 | 2 | 8 | 4 | 13 |
| GO:0005201 | ECM constituent | 105 | 5 | 7 | 5 | 12 | 0 | - | 4 | 7 |
| GO:0005539 | glycosaminoglycan binding | 146 | 4 | 13 | 5 | 24 | 0 | - | 5 | 10 |
| GO:0019838 | growth factor binding | 127 | 4 | 13 | 4 | 26 | 0 | - | 5 | 14 |
| GO:0008201 | heparin binding | 108 | 4 | 9 | 4 | 17 | 0 | - | 3 | 7 |
| GO:0005178 | integrin binding | 57 | 4 | 6 | 5 | 9 | 0 | - | 4 | 7 |
| GO:0030414 | peptidase inhibitor activity | 192 | 5 | 9 | 5 | 20 | 0 | - | 4 | 9 |
| GO:0030247 | polysaccharide binding | 165 | 4 | 14 | 4 | 27 | 0 | - | 5 | 13 |
| GO:0032403 | protein complex binding | 199 | 4 | 11 | 4 | 20 | 0 | - | 2 | 8 |
| GO:0004867 | serine-type endopeptidase inhibitor activity | 118 | 4 | 9 | 4 | 14 | 0 | - | 3 | 7 |
| GO:0005200 | structural constituent of cytoskeleton | 92 | 5 | 8 | 5 | 10 | 0 | - | 5 | 13 |
|
| ||||||||||
| map04360 | axon guidance | 126 | 4 | 6 | 4 | 11 | 0 | - | 4 | 6 |
| map04512 | ECM-receptor interaction | 92 | 5 | 7 | 5 | 18 | 0 | - | 1 | 5 |
| map04510 | focal adhesion | 207 | 4 | 9 | 5 | 23 | 0 | - | 3 | 8 |
| map04010 | MAPK signaling pathway | 289 | 3 | 7 | 4 | 15 | 0 | - | 0 | - |
| map05200 | pathways in cancer | 329 | 4 | 11 | 5 | 28 | 1 | 5 | 2 | 8 |
| map04810 | regulation of actin cytoskeleton | 209 | 4 | 7 | 4 | 16 | 0 | - | 2 | 6 |
According to FatiGO category size in genome.
The maximum number of genes from the category present in the input list is displayed. ID, identity; GO, gene ontology; KEGG, Kyoto encyclopedia of genes and genomes. GSE13195 core list of Choi et al., GSE24202 core list of Taube et al. [13], [39].
Figure 3The 130 genes EMT-core list and the 365 genes list exhibit comparable enrichment ratios of GO biological processes and KEGG pathways.
The enrichment ratio is the number of observed genes divided by the number of expected genes for a given term or pathway. Enrichment ratios were obtained from WebGestalt or calculated with data from FatiGO. GO, gene ontology; BP, biological process; KEGG, Kyoto encyclopedia of genes and genomes.