A core matrisome gene signature predicts cancer outcome.

Arseniy E Yuzhalin, Tomas Urbonas, Michael A Silva, Ruth J Muschel, Alex N Gordon-Weeks.

Abstract

BACKGROUND: Accumulating evidence implicates the tumour stroma as an important determinant of cancer progression but the protein constituents relevant for this effect are unknown. Here we utilised a bioinformatics approach to identify an extracellular matrix (ECM) gene signature overexpressed in multiple cancer types and strongly predictive of adverse outcome.
METHODS: Gene expression levels in cancers were determined using Oncomine. Geneset enrichment analysis was performed using the Broad Institute desktop application. Survival analysis was performed using KM plotter. Survival data were generated from publicly available genesets.
RESULTS: We analysed ECM genes significantly upregulated across a large cohort of patients with ovarian, lung, gastric and colon cancers and defined a signature of nine commonly upregulated genes. Each of these nine genes was considerably overexpressed in all the cancers studied, and cumulatively, their expression was associated with poor prognosis across all data sets. Further, the gene signature expression was associated with enrichment of genes governing processes linked to poor prognosis, such as EMT, angiogenesis, hypoxia, and inflammation.
CONCLUSIONS: Here we identify a nine-gene ECM signature, which strongly predicts outcome across multiple cancer types and can be used for prognostication after validation in prospective cancer cohorts.

Year:  2018        PMID: 29360819      PMCID: PMC5808042          DOI: 10.1038/bjc.2017.458

Source DB:  PubMed          Journal:  Br J Cancer        ISSN: 0007-0920            Impact factor:   7.640


The extracellular matrix (ECM) is a multi-molecular substance that serves functions ranging from cellular adhesion and motility to cell signalling. The extracellular matrix proteins are constructed from a relatively small repertoire of phylogenetically conserved amino-acid domains and genome-wide in silico analysis has led to the categorisation and indexing of the known protein constituents of the ECM. This ECM protein inventory, termed the core matrisome (CM) (Hynes and Naba, 2012), provides a platform for the analysis of physiological and disease-specific patterns of ECM protein expression. Many solid tumours are characterised by the production of a dense, collagen-rich matrix, the deposition of which is associated with adverse outcome (Erler ; Levental ; Lu ; Naba ; Acerbi ). The alteration in ECM protein constituents resulting from the development of a cancer causes reciprocal changes in the cancer cell and activates pathways responsible for cell migration, inhibition of apoptosis and proliferation (Pickup ). The cancer ECM also promotes the formation of an unstable and chaotic vascular bed, poor oxygen delivery and hypoxia (Gilkes ); factors that favour disease progression and metastasis. Although excessive ECM deposition is a recognised hallmark of cancer, the specific proteins comprising the cancer ECM and their potential contribution to cancer biology and prognosis are less well studied. Here we address these issues using bioinformatics to compare the expression of CM genes in tumours and their normal tissue counterparts. We identify a nine-gene CM signature common to a range of cancers, the expression of which predicts poor prognosis in several solid cancer types. These data provide an impetus to further study proteins of the ECM in order to gain a greater understanding of cancer biology and develop clinical tools for prognostication.

Materials and methods

Ethics

All bioinformatics data were anonymised and required no ethical approval. Commercially available tissue microarrays were produced by US Biomax Inc. under the highest ethical standards, with donors fully informed and giving their consent. No ethical approval was required for this study.

Identification of the CM gene signature

All genes encoding proteins of the CM (http://matrisomeproject.mit.edu/) were included. Gene expression levels were determined in studies comparing lung, breast, ovarian, gastric, oesophageal or colorectal adenocarcinoma with normal tissue in Oncomine (www.oncomine.org/resource/login.html). Here gene expression data are normalised across studies, enabling summative gene expression comparisons. Median gene rank (cancer vs normal analysis) was meta-analysed across studies of the same cancer type using Oncomine statistical algorithms. P values of the difference in gene rank were corrected for multiple hypothesis testing using the false discovery rate (FDR) method as described by Storey and Tibshirani (2003). Venn diagrams were generated and analysed using InteractiVenn (Heberle ).
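The selection logic described above (FDR correction within each cancer type, then intersection across cancer types) can be sketched as follows. This is a minimal illustration rather than the Oncomine pipeline: it uses the Benjamini–Hochberg procedure as a simpler stand-in for the Storey and Tibshirani q-value method cited in the text, and the gene names, P values and the 0.05 cut-off are hypothetical.

```python
from typing import Dict, Set


def benjamini_hochberg(pvalues: Dict[str, float]) -> Dict[str, float]:
    """Return Benjamini-Hochberg adjusted P values keyed by gene.

    Assumption: BH is used here in place of Storey-Tibshirani q-values.
    """
    ranked = sorted(pvalues.items(), key=lambda kv: kv[1])
    m = len(ranked)
    adjusted: Dict[str, float] = {}
    running_min = 1.0
    # Walk from the largest P value down so that adjusted values stay monotone.
    for rank in range(m, 0, -1):
        gene, p = ranked[rank - 1]
        running_min = min(running_min, p * m / rank)
        adjusted[gene] = running_min
    return adjusted


def significant_genes(pvalues: Dict[str, float], alpha: float = 0.05) -> Set[str]:
    """Genes whose adjusted P value is at or below alpha (hypothetical cut-off)."""
    return {g for g, q in benjamini_hochberg(pvalues).items() if q <= alpha}


def common_signature(per_cancer: Dict[str, Dict[str, float]]) -> Set[str]:
    """Intersect the significant gene sets of every cancer type (the Venn core)."""
    sets = [significant_genes(p) for p in per_cancer.values()]
    return set.intersection(*sets) if sets else set()


# Hypothetical P values for three genes in two cancer types.
example = {
    "lung": {"COL11A1": 0.001, "SPP1": 0.002, "GENE_X": 0.40},
    "colon": {"COL11A1": 0.003, "SPP1": 0.20, "GENE_X": 0.01},
}
signature = common_signature(example)
```

In this toy input only the gene that passes the threshold in both cancer types remains in `signature`, which mirrors how the central region of the Venn diagram defines the shared gene set.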

Immunohistochemistry

Immunohistochemistry for ECM proteins was performed on colon cancer and matched normal tissue microarrays from 20 patients (US Biomax Inc). Frozen tissues were immediately fixed in ice-cold acetone and blocked in normal serum of the species in which the secondary antibody was raised. Primary antibodies for col11a1 (ab64883), col1a1 (ab34710), col10a1 (ab58632) and spp1 (ab69498) were all obtained from ABCAM. Fluorochrome-conjugated secondary antibodies were obtained from Invitrogen. Tile-scanned images were taken at × 10 magnification using the Nikon Eclipse 90i epifluorescence microscope and were analysed using ImageJ.

Determination of the effect of the nine-gene CM signature on cancer outcome

Survival analysis was performed using KM plotter for ovarian (Gyorffy ), gastric (Szász ) and lung (Győrffy ) cancers or GraphPad Prism for colorectal, renal, bladder or prostate cancer data derived from cBioportal (Cerami ; Gao ) or data set GSE17538. KM plotter is a manually curated, biannually updated database enabling survival analysis across multiple GEO data sets simultaneously. The GEO data sets used here are shown in Table 1. We used JetSet probes throughout (Li ) and patients were divided into two groups on the basis of median expression of the nine-gene signature. For the analysis of colorectal cancer data sets, a z-score threshold of +1 for gene expression was used to define patients. Kaplan–Meier survival curves were constructed and compared using the log-rank method to generate hazard ratios and P values. Survival curves were generated using GraphPad Prism Version 7. Multivariate analysis was performed in lung, gastric and colorectal cancer data sets using SPSS (2017).
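The grouping and curve-construction steps described above can be illustrated with a short sketch. It is not the KM plotter or GraphPad Prism implementation: summarising the signature as the mean expression of its genes is an assumption, and all function names and input values are hypothetical.

```python
from statistics import mean, median, pstdev
from typing import Dict, List, Sequence, Tuple


def signature_score(expression: Dict[str, float], genes: Sequence[str]) -> float:
    """Mean expression of the signature genes in one tumour (assumed summary)."""
    return mean(expression[g] for g in genes)


def split_by_median(scores: Sequence[float]) -> List[bool]:
    """True marks the high-expression group (score above the cohort median)."""
    cut = median(scores)
    return [s > cut for s in scores]


def split_by_zscore(scores: Sequence[float], threshold: float = 1.0) -> List[bool]:
    """True marks tumours whose z-score exceeds the threshold (+1 in the text)."""
    mu = mean(scores)
    sd = pstdev(scores)
    if sd == 0:
        return [False] * len(scores)
    return [(s - mu) / sd > threshold for s in scores]


def kaplan_meier(times: Sequence[float],
                 events: Sequence[bool]) -> List[Tuple[float, float]]:
    """Return (time, survival probability) at each distinct event time.

    events[i] is True for an observed event and False for a censored patient.
    """
    at_risk = len(times)
    survival = 1.0
    curve: List[Tuple[float, float]] = []
    for t in sorted(set(times)):
        deaths = sum(1 for ti, e in zip(times, events) if ti == t and e)
        leaving = sum(1 for ti in times if ti == t)
        if deaths:
            survival *= 1.0 - deaths / at_risk
            curve.append((t, survival))
        # Everyone observed at time t (event or censored) leaves the risk set.
        at_risk -= leaving
    return curve


# Hypothetical follow-up times (months) and event indicators.
curve = kaplan_meier([5, 8, 8, 12, 15], [True, False, True, True, False])
```

The median split yields two groups of similar size, whereas the z-score threshold isolates a smaller group of clear overexpressers, so the two rules answer slightly different questions about the same cohort.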
Table 1

Details of data sets used for survival analysis

Cancer type | Tool for survival analysis | Data sets in analysis | Total patients in overall survival analysis | Total patients in progression-free survival analysis
Ovarian cystadenocarcinoma | KM plotter | GSE14764, 15622, 18520, 19829, 23554, 26193, 26712, 27651, 30161, 3149, 51373, 63885, 65986, 9891, TCGA | 655 | 614
Lung AC | KM plotter | GSE14814, 19188, 29013, 30219, 31210, 3141, 31908, 37745, 43580, 4573, 50081, 8894, TCGA, CAARRAY | 673 | 443
Gastric AC | KM plotter | GSE14210, 15459, 22377, 29272, 51105, 62254 | 631 | 522
Colorectal AC | GraphPad Prism | TCGA | 374 | 329
Colorectal AC | GraphPad Prism | GSE17538 | 232 | 203

Abbreviation: AC=adenocarcinoma.

Geneset enrichment analysis

Geneset enrichment analysis (GSEA) (Subramanian ) was performed using the Broad Institute desktop application (http://software.broadinstitute.org/gsea/downloads.jsp) on RNA-Seq expression data from TCGA colorectal, gastric, ovarian and lung cancer data sets. Phenotypes were defined on the basis of expression of the nine-gene CM signature, with samples divided into high or low expression, again using a z-score of +1 to define groups. Genesets were identified in the molecular signatures database (http://software.broadinstitute.org/gsea/msigdb/index.jsp), with the exception of the angiogenesis geneset, which was identified in a recent publication describing a meta-analytical approach that identified a transcriptional programme for angiogenesis in human cancers (Masiero ), and the EMT geneset (Gröger ).
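For readers unfamiliar with the statistic behind GSEA, the running-sum enrichment score can be sketched as below. This is a simplified illustration of the weighted Kolmogorov–Smirnov-like score, not the Broad Institute application: it omits the permutation step that produces the normalised enrichment score and its significance, and the gene names and correlation values are hypothetical.

```python
from typing import Sequence, Set


def enrichment_score(ranked_genes: Sequence[str],
                     correlations: Sequence[float],
                     gene_set: Set[str],
                     weight: float = 1.0) -> float:
    """Maximum deviation from zero of the GSEA running sum.

    ranked_genes must already be sorted by correlation with the phenotype
    (here: high vs low signature expression); correlations holds the
    matching ranking metric for each gene.
    """
    hits = [g in gene_set for g in ranked_genes]
    n_hits = sum(hits)
    n_miss = len(ranked_genes) - n_hits
    if n_hits == 0 or n_miss == 0:
        raise ValueError("gene set must contain some, but not all, ranked genes")
    hit_weights = [abs(r) ** weight if h else 0.0
                   for r, h in zip(correlations, hits)]
    total = sum(hit_weights)
    if total == 0:
        raise ValueError("ranking metric is zero for every gene in the set")
    running = 0.0
    best = 0.0
    for w, h in zip(hit_weights, hits):
        # Step up at a hit (weighted by the metric), step down at a miss.
        running += w / total if h else -1.0 / n_miss
        if abs(running) > abs(best):
            best = running
    return best


# Hypothetical ranked list: positive values track high signature expression.
genes = ["A", "B", "C", "D", "E", "F"]
metric = [3.0, 2.0, 1.0, -1.0, -2.0, -3.0]
es_top = enrichment_score(genes, metric, {"A", "B"})
es_bottom = enrichment_score(genes, metric, {"E", "F"})
```

A gene set concentrated at the top of the ranked list gives a positive score and one concentrated at the bottom gives a negative score, which corresponds to the enriched and depleted genesets reported in the Results.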

Results

Identification of a CM gene signature expressed by adenocarcinomas

In order to identify ECM proteins important for cancer progression, we compared the expression of genes comprising the CM in adenocarcinomas (ACs) and normal tissues. We focused specifically on ACs because in organs where both squamous cell carcinomas (SCCs) and ACs develop, these tumour types may originate from different cell lineages (Yan ; Yuan ), are characterised by different genetic landscapes (Contag ; Gao ) and demonstrate differences in their sensitivity to various treatment modalities (Katanyoo ; Chen ). These differences may relate in part to differences in the composition of the tumour ECM or regulation of ECM expression, and we did not want this to act as a confounder in the identification of a CM gene signature. We identified a large number of CM genes expressed at significantly higher levels in ACs compared with their normal tissue counterparts (Figure 1A). Cancers of the oesophagus and lung in particular were highly different from their parent organs, with 110 and 97 of 274 CM genes significantly upregulated in these cancers, respectively. In comparison, ovarian cancers demonstrated less of a difference with only 43 of 274 CM genes significantly upregulated in the tumour (Supplementary Table 1).
Figure 1

Development of a core matrisome gene signature from multiple cancer data sets. (A) CM gene expression based on gene rank for cancer vs normal tissue in various tumour types. Red squares indicate high rank in the cancer relative to the normal tissue. Grey indicates that the gene was not measured. Genes are listed in order of median rank across the analysis of included studies for that particular cancer type. (B) Venn diagram used to identify common CM genes that are significantly overrepresented throughout all cancer types identified in A. (C) The gene signature derived from the Venn diagram in B displaying the nine common, significantly upregulated genes identified across the analyses of all cancer types from A. (D) The nine-gene CM signature showing median gene rank (red=high expression) in cancer compared with normal tissue for each included study and FDR-corrected P values for the meta-analytical comparison. (E) Fluorescence immunohistochemistry for SPP1, Col10a1, Col1a1 and Col11a1 in colon cancers and matched normal colon with quantification of the area (%) of the microarray core demonstrating positive staining (n=20 per analysis). A full colour version of this figure is available at the British Journal of Cancer journal online.

We next identified genes that were significantly upregulated across all cancer types studied and defined a signature of nine such genes (Figure 1B–D). There was a significant correlation in the expression level of most of these genes in TCGA data sets of colon, gastric, lung and ovarian ACs (Supplementary Figure 1A and B), suggesting that the expression of these genes results from a common regulatory element. Finally, immunohistochemistry demonstrated a significant increase in the expression of col11a1, col10a1 and spp1 proteins in colon cancer compared with matched normal colon tissues (Figure 1E). The expression of col1a1 was increased in cancer tissues compared with normal colon but this did not reach significance. Importantly, within colon cancers, each protein was identified within the stroma indicating deposition within the ECM. In normal colon, col11a1 was virtually undetectable and col10a1 was identified within the cytoplasm of colonic epithelial cells rather than within the stromal tissue compartment.

The nine-gene CM signature predicts long-term outcome in various cancer types

Given the widespread overexpression of the nine-gene CM signature in ACs compared with normal tissues, we hypothesised that the expression of these genes may be a requirement for cancer progression. Combined comparison of normalised gene expression data from multiple GSE data sets supported this hypothesis, as patients with cancers demonstrating overexpression of the nine-gene signature displayed reduced overall and progression-free survival for gastric, lung and ovarian cancers (Figure 2A). In three large colorectal cancer data sets, overexpression of the nine-gene signature was similarly associated with adverse outcome (Figure 2A). It was also associated with reduced progression-free survival in a large TCGA breast cancer data set and in cancers not used to generate the nine-gene signature, such as those of the prostate and bladder (Supplementary Figure 2). Interestingly, there was no correlation between expression of the CM gene signature and survival in squamous cell carcinoma of the lung, head and neck or oesophagus, or in oesophageal AC (Supplementary Figure 2). Multivariate analysis in gastric, lung and colorectal data sets identified the nine-gene CM signature as a factor significantly associated with disease-free survival independent of disease stage or grade (Table 2).
Figure 2

Expression of the core matrisome gene signature predicts survival in various cancer types. (A) Overall survival (top row) and recurrence-free survival (bottom row) for cohorts of patients whose tumours demonstrate overexpression (red) or normal expression (blue) of the nine-gene CM gene signature. Numbers represent hazard ratios (95% confidence intervals). (B) GSEA analysis of colorectal and gastric cancer TCGA data sets analysed for EMT, angiogenesis, hypoxia, inflammation, oxidative phosphorylation, apoptotic regulation and genomic instability geneset enrichment in patients with high or normal expression of the nine-gene CM signature. NES, normalised enrichment score. A full colour version of this figure is available at the British Journal of Cancer journal online.
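The group comparison underlying survival curves of this kind is the log-rank test. The sketch below is a plain two-group implementation for illustration only, not the KM plotter or GraphPad Prism code; it returns the chi-square statistic and P value but does not estimate the hazard ratio, and the follow-up times are hypothetical.

```python
import math
from typing import Sequence, Tuple


def logrank_test(times_a: Sequence[float], events_a: Sequence[bool],
                 times_b: Sequence[float], events_b: Sequence[bool]
                 ) -> Tuple[float, float]:
    """Two-group log-rank test; returns (chi-square on 1 df, P value)."""
    data = [(t, e, 0) for t, e in zip(times_a, events_a)]
    data += [(t, e, 1) for t, e in zip(times_b, events_b)]
    event_times = sorted({t for t, e, _ in data if e})
    observed_a = expected_a = variance = 0.0
    for t in event_times:
        n_a = sum(1 for ti, _, g in data if g == 0 and ti >= t)
        n_b = sum(1 for ti, _, g in data if g == 1 and ti >= t)
        d_a = sum(1 for ti, e, g in data if g == 0 and ti == t and e)
        d_b = sum(1 for ti, e, g in data if g == 1 and ti == t and e)
        n, d = n_a + n_b, d_a + d_b
        observed_a += d_a
        # Expected events in group A if both groups shared one hazard.
        expected_a += d * n_a / n
        if n > 1:
            variance += d * (n_a / n) * (n_b / n) * (n - d) / (n - 1)
    if variance == 0:
        raise ValueError("no informative event times")
    chi2 = (observed_a - expected_a) ** 2 / variance
    # Upper tail of a chi-square distribution with one degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Hypothetical cohorts: the high-expression group fails earlier.
high_times, low_times = [1, 2, 3, 4], [5, 6, 7, 8]
all_events = [True, True, True, True]
chi2, p_value = logrank_test(high_times, all_events, low_times, all_events)
```

Because the test compares observed with expected events at every event time, it uses the whole curve rather than survival at a single time point.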

Table 2

Multivariate analysis of factors relevant for disease-free and overall survival

Variable | Disease-free survival HR (95% CI) | Disease-free survival P | Overall survival HR (95% CI) | Overall survival P
GSE62254, gastric AC
TNM stage | 2.22 (1.76–2.81)* | <0.001 | Not reported |
Lymphovascular invasion | 1.55 (0.92–2.61) | 0.10 | Not reported |
Lauren classification | 1.16 (0.85–1.58) | 0.36 | Not reported |
Matrisome signature | 1.59 (1.10–2.31)* | 0.02 | Not reported |
GSE50081, lung AC
AJCC T-stage | 2.88 (1.32–6.27)* | 0.008 | 2.28 (1.21–4.28)* | 0.01
AJCC N-stage | 1.35 (0.74–2.47) | 0.33 | 1.47 (0.87–2.5) | 0.15
Smoking history | 1.94 (0.95–3.97) | 0.07 | 0.81 (0.38–1.7) | 0.57
Matrisome signature | 2.12 (1.17–3.84)* | 0.01 | 1.18 (0.71–1.96) | 0.51
GSE40967, colorectal AC
TNM stage | 2.6 (1.89–3.58)* | <0.001 | 2.12 (1.58–2.84)* | <0.001
Proximal vs distal tumour location | 1.27 (0.89–1.81) | 0.19 | 1.04 (0.75–1.43) | 0.83
Adjuvant chemotherapy | 1.10 (0.75–1.61) | 0.64 | 0.59 (0.41–0.83)* | 0.003
Nodal stage | 0.87 (0.64–1.17) | 0.36 | 1.02 (0.77–1.34) | 0.88
Matrisome signature | 1.45 (1.01–2.07)* | 0.03 | 1.22 (0.87–1.71) | 0.24

Abbreviations: AC=adenocarcinoma, CI=confidence interval, HR=hazard ratio.

Independently significant variables are marked with an asterisk (shown in bold in the original).
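As a reading aid for Table 2, a hazard ratio and its 95% confidence interval can be converted back into an approximate Wald statistic and P value. The sketch below assumes the interval is symmetric on the log scale, as for a standard Cox model coefficient; it is not the SPSS procedure used by the authors, and because the table values are rounded the recovered P value is only approximate. The example takes the disease-free survival row for the matrisome signature in gastric AC, with HR 1.59 and bounds 1.10 and 2.31.

```python
import math
from typing import Tuple


def wald_from_hr(hr: float, lower: float, upper: float,
                 z_crit: float = 1.959964) -> Tuple[float, float, float, float]:
    """Recover (beta, standard error, z, two-sided P) from HR and 95% CI.

    Assumes CI = exp(beta +/- z_crit * SE), i.e. symmetry on the log scale.
    """
    beta = math.log(hr)
    se = (math.log(upper) - math.log(lower)) / (2 * z_crit)
    z = beta / se
    # Two-sided P value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return beta, se, z, p_value


# Matrisome signature, gastric AC, disease-free survival (values from Table 2).
beta, se, z, p_value = wald_from_hr(1.59, 1.10, 2.31)
```

An interval that excludes 1 corresponds to a two-sided P value below 0.05, which is why the rows flagged as independently significant are those whose intervals do not cross 1.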

GSEA analysis identifies biological traits associated with expression of the CM signature

Epithelial–mesenchymal transition (EMT), angiogenesis, hypoxia, inflammation and glycolysis are all features of cancer that have been associated with poor prognosis. Several of these processes have also been linked to ECM deposition (Lu ). To gain an insight into the biological mechanisms through which the CM gene signature may define poor prognosis cancers, we performed GSEA analysis to determine whether genes governing these processes are overrepresented in cancers from patients overexpressing the nine-gene CM signature. Strikingly, colorectal, gastric (both Figure 2B), lung and ovarian cancers (both Supplementary Figure 3) expressing the CM gene signature were significantly enriched in EMT, hypoxia, angiogenesis and inflammation genesets, but showed reduced expression of the oxidative phosphorylation geneset. Importantly, several molecular signatures defining other cancer-related processes, including those for apoptosis or genomic instability, were not enriched in cancers expressing the CM signature (Figure 2B and Supplementary Figure 3).

Discussion

Here we present a comprehensive analysis of the difference in expression of CM genes in cancers and normal tissues in order to identify key constituents of the cancer ECM. We have identified commonality in the significant upregulation of nine CM genes across multiple cancer types, suggesting a potential requirement for these CM genes throughout solid tumours. Expression of the nine-gene signature predicted outcome in a broad range of cancers, including those not initially used to generate the gene signature. These proteins are therefore associated with cancer progression and their combination may represent a useful biomarker for prognostication. Interestingly, the CM signature failed to predict overall or disease-free survival in squamous cell cancer data sets, suggesting that the ECM genes in the CM signature are not of relevance for the progression of SCC. Col11a1 has previously been linked to cancer progression (Fischer ; Cheon ; Jia ; Li ) and is expressed by cancer stromal cells (Galván ; Jia ) and by tumour cells that have undergone EMT, where it promotes migration and invasion (Sok ; Wu ). Expression of secreted phosphoprotein 1 (SPP1, osteopontin) is also reported in cancer (Shevde and Samant, 2014) and is driven by cancer-related signalling pathways including Hedgehog, Wnt/β-catenin and NFκB (Shevde and Samant, 2014). SPP1 is also expressed by tumour-associated macrophages and fibroblasts, where it is linked to angiogenesis (Kale ) and the metastatic cascade (Mi ), respectively. The protein products of several genes in our signature have not been thoroughly studied in relation to cancer; however, several are associated with functions of relevance to cancer progression. BGN, for example, interacts with toll-like receptors on the surface of macrophages to promote the synthesis of TNFα and CCL2 (Schaefer ; Moreth ), both cytokines important for cancer (Balkwill, 2006; Lim ).
BGN and MXRA5 expression are promoted by the activity of TGFβ1 (Heegaard ; Poveda ), a key driver of the adverse stromal response in many cancers (Pickup ), and COMP binds TGFβ1, enhancing its biological activity (Haudenschild ). In support of these findings, our GSEA analyses link the nine-gene signature to biological processes common to poor prognosis cancers, including EMT, angiogenesis, hypoxia, inflammation and a shift away from oxidative phosphorylation as a means of energy generation. Interestingly, we failed to demonstrate enrichment of gene signatures linked to other cancer-relevant processes including apoptotic regulation and genomic instability. The association with specific cancer-relevant gene signatures may implicate the nine-gene CM signature in their regulation. Nonetheless, it should be noted that the data presented here are only correlative and from their analysis we cannot provide a mechanistic link between the expression of ECM genes and specific biological processes. Moving forward, it will be important to confirm the prognostic relevance of the gene signature in prospective cancer cohorts and look to preclinical models to investigate potential mechanisms through which these genes might regulate cancer progression.
References (43 in total; first 10 listed)

1. Lalita A Shevde, Rajeev S Samant. Role of osteopontin in the pathophysiology of cancer. Matrix Biol. 2014-03-19. (Review)

2. Janine T Erler, Kevin L Bennewith, Monica Nicolau, Nadja Dornhöfer, Christina Kong, Quynh-Thu Le, Jen-Tsan Ashley Chi, Stefanie S Jeffrey, Amato J Giaccia. Lysyl oxidase is essential for hypoxia-induced metastasis. Nature. 2006-04-27.

3. Dongyu Jia, Zhenqiu Liu, Nan Deng, Tuan Zea Tan, Ruby Yun-Ju Huang, Barbie Taylor-Harding, Dong-Joo Cheon, Kate Lawrenson, Wolf R Wiedemeyer, Ann E Walts, Beth Y Karlan, Sandra Orsulic. A COL11A1-correlated pan-cancer gene signature of activated fibroblasts for the prioritization of therapeutic targets. Cancer Lett. 2016-09-05.

4. Kanyarat Katanyoo, Sompol Sanguanrungsirikul, Sumonmal Manusirivithaya. Comparison of treatment outcomes between squamous cell carcinoma and adenocarcinoma in locally advanced cervical cancer. Gynecol Oncol. 2012-01-28.

5. Aravind Subramanian, Pablo Tamayo, Vamsi K Mootha, Sayan Mukherjee, Benjamin L Ebert, Michael A Gillette, Amanda Paulovich, Scott L Pomeroy, Todd R Golub, Eric S Lander, Jill P Mesirov. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005-09-30.

6. Stephen A Contag, Bobbie S Gostout, Amy C Clayton, Melanie H Dixon, Renee M McGovern, Eric S Calhoun. Comparison of gene expression in squamous cell carcinoma and adenocarcinoma of the uterine cervix. Gynecol Oncol. 2004-12.

7. Anne-Marie Heegaard, Zhongjian Xie, Marian Frances Young, Karina Lishmann Nielsen. Transforming growth factor beta stimulation of biglycan gene expression is potentially mediated by sp1 binding factors. J Cell Biochem. 2004-10-15.

8. Daniele M Gilkes, Gregg L Semenza, Denis Wirtz. Hypoxia and the extracellular matrix: drivers of tumour metastasis. Nat Rev Cancer. 2014-05-15. (Review)

9. Jenny Ling-Yu Chen, Chao-Yuan Huang, Yu-Sen Huang, Ruey-Jien Chen, Chun-Wei Wang, Yu-Hsuan Chen, Jason Chia-Hsien Cheng, Ann-Lii Cheng, Sung-Hsin Kuo. Differential clinical characteristics, treatment response and prognosis of locally advanced adenocarcinoma/adenosquamous carcinoma and squamous cell carcinoma of cervix treated with definitive radiotherapy. Acta Obstet Gynecol Scand. 2014-04-22.

10. Frances Balkwill. TNF-alpha in promotion and progression of cancer. Cancer Metastasis Rev. 2006-09. (Review)
