| Literature DB >> 28651527 |
Princy Parsana1, Sarah R Amend2, James Hernandez2, Kenneth J Pienta2, Alexis Battle3.
Abstract
BACKGROUND: Epithelial to mesenchymal transition (EMT) is the process by which stationary epithelial cells transdifferentiate to mesenchymal cells with increased motility. EMT is integral in early stages of development and wound healing. Studies have shown that EMT could be a critical early event in tumor metastasis that is involved in acquisition of migratory and invasive properties in multiple carcinomas.Entities:
Keywords: C1orf116; EMT; Metastasis; Multi-study integration; Prostate cancer
Mesh:
Substances:
Year: 2017 PMID: 28651527 PMCID: PMC5485747 DOI: 10.1186/s12885-017-3413-3
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Fig. 1Epithelial to Mesenchymal Transition. During EMT, non-motile epithelial cells trans differentiate to mesenchymal cells with increased migratory potential. During this, cells show decreased expression epithelial specific genes that include E-cadherin, OVOL1 and ESRP1. At the same time, expression of mesenchymal genes such as N-cadherin, VIM and ZEB1 increases
Dataset information
| GEO ID | Platform ID | Disease Type | Cell line | Samples* | Ref |
|---|---|---|---|---|---|
| GSE12811 | GPL7319 | Breast | MCF10A | 3 | [ |
| GSE13915 | GPL7785 | Breast | BT549, EFM19 | 4 | [ |
| GSE18070 | GPL570 | Breast | MCF10CA1h | 9 | [ |
| GSE28569 | GPL6480 | Breast | MCF10A | 8 | [ |
| GSE39356 | GPL6480 | Breast | MCF-7 | 4 | [ |
| GSE8240 | GPL3921 | Breast | MCF10A | 11 | [ |
| GSE12203 | GPL2700 | Colon | Caco-2 | 4 | [ |
| GSE14773 | GPL570 | Colon | HT29, SW480 | 8 | [ |
| GSE27424 | GPL570 | Esophageal | EPc2-hTERT | 12 | [ |
| GSE26391 | GPL6244 | Liver | HCC-1.1, HCC-1.2 | 8 | [ |
| GSE14405 | GPL570 | Prostate | PC3, TEM4, TEM2 | 6 | [ |
| GSE22010 | GPL6244 | Prostate | PrEC-hTERT | 2 | [ |
| GSE22764 | GPL6884 | Prostate | PC3 | 6 | [ |
| GSE43489 | GPL570 | Prostate | PC3 | 4 | [ |
| GSE12548 | GPL570 | Retinal pigment | ARPE19 | 6 | [ |
*Indicates the number of samples included in our study
Fig. 2Workflow for multi-study data integration, normalization and identification of candidate universal EMT genes
Association of C1orf116 expression in lung and prostate cancer patients
| Test group | Wilcoxon rank sum | Bonferroni adjusted |
|---|---|---|
| Lung cancer (Director’s Lung Challenge): grade [ | ||
| Grade1 vs Grade 2 | 1.4191e-06 | 3.27E-05 |
| Grade 2 vs Grade 3 | 1.1481e-10 | 2.65E-09 |
| Grade 1 vs Grade 3 | 2.6121e-17 | 6.00E-16 |
| Lung cancer (Director’s Lung Challenge): Smoking Status [ | ||
| Never vs Past | 0.006 | 1.38E-01 |
| Past vs Current | 0.006 | 1.38E-01 |
| Never vs Current | 0.0002 | 4.60E-03 |
| Lung cancer (Okayama): Smoking status [ | ||
| Never smoker vs ever smoker | 0.0586 | 1E + 00 |
| Prostate cancer (Taylor): Tumor type [ | ||
| Primary vs Metastatic | 0.0340 | 7.82E-01 |
Fig. 3Consistency in differential expression signal across normalization methods. a Correlation heatmap showing concordance (Spearman rho) among ranks of differentially expressed genes using the four normalization methods (n = 7276). Genes were ranked by raw t-test p-values. b Correlation heatmap showing concordance (Spearman rho) among fold-change of differentially expressed genes using the four normalization methods (n = 7276). c Hierarchical Clustering of top 200 differentially expressed genes with uncorrected data shows strong clustering of samples by study rather than by phenotype. d Hierarchical Clustering of top 200 differentially expressed genes with QN + SVA (Quantile Normalized + SVA) corrected data clusters by epithelial and mesenchymal phenotype
Evaluation of sample grouping (with 7276 genes) using Baker Hubert index and phenotype information
| No normalization | Quantile Normalization (QN) | Surrogate Variable Analysis (SVA) | QN + SVA | Median Centered Column Scaled | |
|---|---|---|---|---|---|
| Baker Hubert Index | 0.0001 | 0.047 | 0.864 | 0.7995 | 0.0705 |
Enriched MsigDB Hallmark genesets
| Geneset |
| oddsratio | FDR | Genes in set |
|---|---|---|---|---|
| HALLMARK Epithelial mesenchymal transition | 9.84E-33 | 18.3575636 | 4.92E-31 |
|
| HALLMARK Estrogen response late | 9.36E-06 | 4.332224532 | 0.00019652 |
|
| HALLMARK Apical junction | 1.18E-05 | 4.516129032 | 0.00019652 |
|
| HALLMARK UV response dn | 8.16E-05 | 4.23768997 | 0.001019448 |
|
| HALLMARK Estrogen response early | 0.000247578 | 3.495078664 | 0.002475779 |
|
| HALLMARK Hypoxia | 0.000436298 | 3.276838008 | 0.003635818 |
|
| HALLMARK Inflammatory response | 0.000679488 | 3.786760716 | 0.004246802 |
|
| HALLMARK KRAS signaling up | 0.00061698 | 3.554348835 | 0.004246802 |
|
| HALLMARK Angiogenesis | 0.003822541 | 7.2 | 0.02123634 |
|
| HALLMARK Complement | 0.00451196 | 3.068992514 | 0.022559801 |
|
| HALLMARK Myogenesis | 0.00594623 | 2.929880329 | 0.027028319 |
|
| HALLMARK TGF beta signaling | 0.010673511 | 4.097902098 | 0.044472964 |
|
Rank of known epithelial and mesenchymal specific genes and DE genes found in Hallmark Epithelial to mesenchymal transition [73]
| Gene Symbol | Order in average rank | Gene Symbol | Order in average rank | Gene Symbol | Order in average rank |
|---|---|---|---|---|---|
|
| 3 |
| 45 |
| 144 |
|
| 4 |
| 54 |
| 147 |
|
| 6 |
| 73 |
| 150 |
|
| 11 |
| 81 |
| 155 |
|
| 12 |
| 84 |
| 159 |
|
| 13 |
| 90 |
| 161 |
|
| 14 |
| 95 |
| 168 |
|
| 15 |
| 98 |
| 171 |
|
| 21 |
| 106 |
| 174 |
|
| 22 |
| 108 |
| 181 |
|
| 24 |
| 118 |
| 187 |
|
| 33 |
| 120 |
| 191 |
|
| 36 |
| 124 |
| 162 |
|
| 42 |
| 127 | ||
|
| 43 |
| 129 |
*commonly used EMT marker genes
Fig. 4Expression of EMT associated genes in prostate cancer EMT. a qPCR: mRNA expression of known epithelial and mesenchymal specific genes in PC3 prostate cancer EMT model cell line. b qPCR: mRNA expression of epithelial and mesenchymal specific genes in PC3 prostate cancer cell lines previously unknown in prostate cancer EMT. * P < 0.05; ** P < 0.005; *** P < 0.0005. c Immunoblot: Protein expression of epithelial and mesenchymal specific genes in PC3 prostate cancer cell lines previously unknown in prostate cancer EMT (LSR, DPYSL3, S100A14, C1orf116, and β-actin were all probed on the same blot, so the β-actin loading control is appropriate for both Fig. 4c (LSR, DPYSL2, S100A14) and Fig. 5b (C1orf116). Data were separated into two figures for clarity)
Fig. 5C1orf116: a novel EMT regulator. a qPCR: mRNA expression of C1orf116 in EMT model prostate cancer cell lines PC3-Epi, PC3-EMT and PC3-TxR * P < 0.1; ** P < 0.05; *** P < 0.005. b Immunoblot: Protein expression of C1orf116 in EMT model prostate cancer cell lines PC3-Epi, PC3-EMT and PC3-TxR (LSR, DPYSL3, S100A14, C1orf116, and β-actin were all probed on the same blot, so the β-actin loading control is appropriate for both Fig. 4c (LSR, DPYSL2, S100A14) and Fig. 5b (C1orf116). Data were separated into two figures for clarity). c qPCR: mRNA expression of C1orf116 and other known epithelial (OVOL1, ESRP1 and CDH1) and mesenchymal (CDH2) gene in PC3-Epi cells transfected with C1orf116-siRNA relative to empty vector control * P < 0.1; ** P < 0.05; *** P < 0.005
Fig. 6C1orf116 associated genes in weighted gene correlation network module. This correlation network shows association of C1orf116 module genes obtained from WGCNA. Node size is a function of correlation with C1orf116 expression. Yellow nodes represent genes that have been previously studied in multiple (greater than 3) cancer types. Bright green nodes are the genes that have been studied in 3 or less cancer types. Light green nodes are genes that have not been specifically studied in cancer. Gray nodes were genes that were not significantly associated with expression of C1orf116
Fig. 7C1orf116 expression in cancer patient data. a Decreased expression of C1orf116 is seen in metastatic tumor type compared to primary prostate cancer (Taylor dataset); unadjusted P = 0.0340, Bonferroni adjusted P = 0.51. b Expression of C1orf116 decreases in high grade lung cancer (Director’s challenge dataset); Bonferroni adjusted P < 0.0005. c C1orf116 is downregulated in lung cancer patients with increased smoking habits (Director’s challenge dataset); unadjusted P < 0.01, Bonferroni adjusted P < 0.1. d C1orf116 is downregulated in lung cancer patients with smoking habits in comparison to non-smokers (Okayama dataset); unadjusted P = 0.0586, Bonferroni corrected P = 0.879