| Literature DB >> 35707723 |
Morten Beck Rye1,2,3,4, Sebastian Krossa5, Martina Hall6,7, Casper van Mourik1,8, Tone F Bathen5, Finn Drabløs1, May-Britt Tessem2,5, Helena Bertilsson1,2.
Abstract
High secretion of the metabolites citrate and spermine is a unique hallmark for normal prostate epithelial cells, and is reduced in aggressive prostate cancer. However, the identity of the genes controlling this biological process is mostly unknown. In this study, we have created a gene signature of 150 genes connected to citrate and spermine secretion in the prostate. We have computationally integrated metabolic measurements with multiple transcriptomics datasets from the public domain, including 3826 tissue samples from prostate and prostate cancer. The accuracy of the signature is validated by its unique enrichment in prostate samples and prostate epithelial tissue compartments. The signature highlights genes AZGP1, ANPEP and metallothioneins with zinc-binding properties not previously studied in the prostate, and the expression of these genes are reduced in more aggressive cancer lesions. However, the absence of signature enrichment in common prostate model systems can make it challenging to study these genes mechanistically.Entities:
Keywords: Bioinformatics; Cancer; Transcriptomics
Year: 2022 PMID: 35707723 PMCID: PMC9189124 DOI: 10.1016/j.isci.2022.104451
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Figure 1Correlations between metabolites citrate and spermine
(A) Correlations between all 23 metabolite measurements across 129 prostate normal and cancer samples from the Bertilsson dataset (Dataset ID 1). The correlation is particularly strong for metabolites citrate and spermine.
(B) Correlations between citrate and spermine across 129 prostate normal and cancer samples from Bertilsson. The correlation is high also when extreme values are excluded (highlighted inside the green boundary).
(C) Similarity between the citrate, spermine and the average citrate-spermine metabolite profiles across 129 prostate normal and cancer samples from Bertilsson.
Figure 2Integrity of initial and refined citrate – spermine (CS) gene signatures
(A) CMS (Correlation Module Scores) for the 150 genes most positively correlated with the CS metabolite profile for cancer samples in the Bertilsson dataset (Dataset ID 1). The CMS for was statistically significant when compared to CMSs using random metabolite profiles (lognorm test) (B) CMS for the initial cancer CS gene module from Bertilsson (red dots) evaluated in prostate cancer (red circles) and normal (blue circles) samples from 11 additional datasets. The CMS was statistically significant in all datasets (lognorm test, Tables S1 and S2).
(C) Fraction of new genes in the refined CS gene signature that replaced genes in the initial CS gene signature. The genes is sorted by importance (rank) on the bottom axis. Of the 150 top ranked genes in the initial CS signature, 74 were replaced in the refined CS signature.
(D) Increase in CMS after refinement of the initial CS signature for prostate normal and cancer samples in 11 datasets. The CMS increased in the normal samples, even though the normal samples were not used to create or refine the signature. Cohort ID 4, 5 and 8 did not include normal samples. See also Figure S1 and Tables S1–S4.
Figure 3Validation of citrate – spermine (CS) gene signature
(A) CS signature ssGSEA scores for 11,093 cancer and normal samples in 33 tumor types from the TCGA-complete dataset. We have used standard TCGA cancer type abbreviations [https://gdc.cancer.gov/resources-tcga-users/tcga-code-tables/tcga-study-abbreviations] where PRAD is the abbreviation for prostate cancer.
(B) CS signature ssGSEA scores for 53 averaged human normal tissue profiles from the GTex dataset.
(C) CS ssGSEA signature scores for 1829 human cell-line and tissue samples, including one prostate adult tissue sample, from the FANTOM dataset.
(D) Correlations between CS signature and stroma signature ssGSEA scores for cancer and normal samples in prostate cancer tissue datasets 1–12. The negative correlations in the normal samples are stronger due to less confounding from cancer tissue. Data from Mortensen (dataset ID 9) contains laser dissected prostate normal epithelium and cancer, which probably lack significant amounts of stroma.
(E) CS signature ssGSEA scores on spatial transcriptomics data from tissue slice 3.2 in Berglund (dataset ID 20). The colorbar indicates enrichment of the ssGSEA score in each pixel. The corresponding pathological tissue image can be found for image 3.2 in Supplementary Information –Figure S1B from Berglund et al. (Berglund et al., 2018), and comparison shows a strong overlap of high and low CS signature scores with epithelial/lumen and stroma tissue compartments, respectively. Results from all 12 images in Berglund are shown in Figure S2.
Figure 4Citrate – spermine (CS) signature, tumor grade and metastasis
(A) CS signature ssGSEA scores for high-grade (Gleason higher or equal to 4 + 3) and low-grade (Gleason less than or equal to 3 + 4) cancers in 9 datasets. The samples compared correspond to the HG1 and LG1 groups in Table 1. The scores were centered in each cohort before plotting to visualize similarities between datasets better. See also Table S7B.
(B) CS signature ssGSEA scores for metastatic, cancer and normal samples in 7 datasets. The scores were centered and normalized to range 0-1 before plotting to visualize similarities between datasets better. The low scores in normal samples is due to the higher content of stroma in normal samples (see Figure S3).
(C) CS signature ssGSEA scores from five laser dissected prostate and prostate cancer tissue types from Tomlins (datasets Id 18). The GSEA scores illustrate the loss in CS secretion from normal epithelium through PIN, Cancer and finally Metastasis. The CS signature scores from metastatic samples are comparable to the non-secreting stroma tissue.
(D) CS signature ssGSEA scores in 96 metastatic samples (9 from prostate and 87 from other organs) from the Hsu (dataset ID 19). Metastatic samples from prostate origin are more enriched for the CS signature than metastatic samples originating from other sites.
p-values from differential CS signature ssGSEA score analysis between high grade (HG) and low grade (LG) cancer samples in 9 datasets
| Dataset ID | Dataset abbreviation | Number of HG1 samples | Number of LG1 samples | Number of HG2 samples | Number of LG2 samples | p-value HG1 vs LG1 | p-value HG2 vs LG2 | Correlation to tumor content |
|---|---|---|---|---|---|---|---|---|
| 1 | Bertilsson | 56 | 60 | 36 | 80 | 4.7e-10 | 3.6e-5 | −0.13 |
| 2 | Chen | 15 | 50 | 11 | 54 | 0.0040 | 0.0011 | 0.14 |
| 3 | Taylor | 36 | 94 | 15 | 115 | 0.015 | 0.010 | NA |
| 4 | Sboner | 119 | 162 | 81 | 200 | 3.7e-7 | 1.7e-4 | −0.30 |
| 5 | Erho | NA | NA | 211 | 334 | NA | 0.060 | NA |
| 6 | TCGA | 296 | 200 | 200 | 296 | 2.5e-10 | 5.3e-13 | −0.04 |
| 7 | CMBR | 30 | 82 | 9 | 103 | 0.014 | 0.09 | 0.11 |
| 8 | STCK | 34 | 57 | 15 | 76 | 0.021 | 0.007 | NA |
| 12 | Kuner | 27 | 32 | NA | NA | 0.021 | NA | NA |
Two comparisons are made: First HG1 vs LG1, where HG1 is classified as samples with Gleason score 4 + 3 or higher, and LG1 as samples with Gleason score 3 + 4 or lower. Second HG2 vs LG2, where HG2 is classified as samples with Gleason score 8 or higher, and LG2 as samples with Gleason score 7 or lower. The correlation of CS signature GSEA score to tumor content is also included when available, and show, in general, that decreased CS scores in high-grade samples is not due to increased tumor content in these samples.
p-values from comparing CS signature ssGSEA scores from metastatic prostate cancer samples to cancer and normal samples and in 8 datasets
| Dataset ID | Dataset abbreviation | Number of metastatic samples | Number of cancer samples | Number of normal samples | p-value metastatic vs cancer | p-value metastatic vs normal |
|---|---|---|---|---|---|---|
| 3 | Taylor | 19 | 131 | 29 | 7.4e-15 | 2.8e-7 |
| 10 | Prensner | 12 | 78 | 38 | 3.4e-11 | 2.9e-10 |
| 13 | Aryee | 18 | NA | 21 | NA | 0.024 |
| 14 | Chandran | 25 | 65 | 81 | 8.8e-6 | 6.9e-6 |
| 15 | Cai | 29 | 22 | NA | 5.8e-13 | NA |
| 16 | Poisson | 13 | 12 | 16 | 1.5e-4 | 0.09 |
| 17 | Monzon | 21 | 10 | NA | 7.4e-11 | NA |
| 18 | Tomlins | 20 | 32 | 27 | 0.007 | 1.2e-10 |
Figure 5Genes and ontologies
(A) Principal Component Analysis (PCA) for Gene Ontology terms associated with CS signature genes. The GO terms are based on a consensus analysis over datasets 1–12 (see STAR Methods). The only observable cluster is formed by the six Metallothioneins from the CS signature.
(B) Correlation of ssGSEA signature with four gene signatures from distinct cell types in a single cell sequencing dataset (basal and luminal from the prostate epithelium, fibroblasts and smooth muscle cells from stroma) (Henry et al., 2018) in 9 datasets with normal prostate samples. The CS gene signature correlates with luminal epithelial cells (the cell-type mainly responsible for secreting citrate and spermine) whereas the correlation to stroma tissue types is negative. Dataset 9 is from laser dissected epithelium, which lack stroma tissue. See also Figure S4–S6 and Table S5. Table of the 150 Citrate-spermine signature genes with manually curated functional annotations, related to Figure 5, Table S6. Top 20 Gene Ontology terms for all citrate-spermine signature genes, the six Metallothioneins and the 10 network Hub-genes, related to Figure 5, Table S7. Differential expression results for citrate-spermine target genes, related to Figures 4 and 5.
Figure 6Model systems
(A) CS signature ssGSEA scores for all sample types in Prensner (dataset ID 10), including 58 prostate normal and cancer cell-line samples. Cell-line samples, in general, have very low CS signatures scores.
(B) Average CS signature ssGSEA scores for different cell-types in Prensner. The ssGSEA scores were highly reproducable within samples from the same cell-type. Androgen responsive cell-types (MDA-Pca-2b, LNCaP and VCaP) have somewhat higher ssGSEA scores than androgen resistant cell-types (DU145 and PC3). Prostate normal cell-types are similar to cancer cell-types. Number of samples of each cell-type are: MDA-Pca-2b (1), DU145(10), PC3(2), LNCaP(7), VCaP(10), RWPE(9), PrEC(4).
(C) CS signature ssGSEA scores in 1019 cancer cell-types from CCLE (dataset ID 24). Androgen responsive prostate cancer cell-types score higher than cancer cell-types from other cancers whereas androgen resistant cell-lines score similar to cell-types from other cancers.
(D) CS Signature ssGSEA scores in four datasets with prostate and prostate cancer derived organoids and model systems (dataset IDs 27–30) and one dataset with prostate tumors from mouse models (Aytes, dataset ID 31). The ssGSEA scores were adapted and normalized to CS signature GSEA scores from Prensner. All model systems have consistently low GSEA scores, similar to prostate cancer cell-lines. In total 64 model system samples and 384 mouse tumor samples were analyzed. Dataset descriptions: 27: 2D and 3D cultures derived from prostate cancer cell-line LNCaP (9 samples) and normal cell-line RWPE (9 samples), in total 18 samples. 28: Organoids from primary prostate epithelial cells in mono and co-culture with stromal cells (7 samples), stromal cells in co-culture (4 samples) and macrodissected tumor tissue (2 samples), in total 13 samples. 29: Cells from benign prostatic bulk (3 + 3 samples), basal (3 + 3 samples) and luminal (3 + 3 samples) tissue cultured in two different media, in total 18 samples. 30: Mouse prostate organoids with WT (6 samples) and mutated (6 samples) gene SPOP, in total 12 samples. 31: Prostate tumor tissue from mouse models with different genetic modifications, in total 384 samples. See also Figures S7–S10.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Dataset ID 1 – | Array Express: E-MTAB-1041 | |
| Dataset ID 2 – | Gene Expression Omnibus: GEO: | |
| Dataset ID 3 – | Gene Expression Omnibus: GEO: | |
| Dataset ID 4 – | Gene Expression Omnibus: GEO: | |
| Dataset ID 5 – | Gene Expression Omnibus: GEO: | |
| Dataset ID 6 – | ||
| Dataset ID 7 - | Gene Expression Omnibus: GEO: | |
| Dataset ID 8 - | Gene Expression Omnibus: GEO: | |
| Dataset ID 9 – | Gene Expression Omnibus: GEO: | |
| Dataset ID 10 – | Gene Expression Omnibus: GEO: | |
| Dataset ID 11 – | Gene Expression Omnibus: GEO: | |
| Dataset ID 12 – | Gene Expression Omnibus: GEO: | |
| Dataset ID 13 – | Gene Expression Omnibus: GEO: | |
| Dataset ID 14 – | Gene Expression Omnibus: GEO: | |
| Dataset ID 15 – | Gene Expression Omnibus: GEO: | |
| Dataset ID 16 – | Gene Expression Omnibus: GEO: | |
| Dataset ID 17 – | Gene Expression Omnibus: GEO: | |
| Dataset ID 18 – | Gene Expression Omnibus: GEO: | |
| Dataset ID 19 – | Gene Expression Omnibus: GEO: | |
| Dataset ID 20 – | ||
| Dataset ID 21 – | ||
| Dataset ID 22 – | ||
| Dataset ID 23 – | ||
| Dataset ID 24 – | ||
| Dataset ID 25 – | Array Express: E-MTAB-2706 | |
| Dataset ID 26 – | Array Express: E-MTAB-4858 | |
| Dataset ID 27 – | No article reference found | Gene Expression Omnibus: GEO: |
| Dataset ID 28 – | Gene Expression Omnibus: GEO: | |
| Dataset ID 29 – | Gene Expression Omnibus: GEO: | |
| Dataset ID 30 – | Gene Expression Omnibus: GEO: | |
| Dataset ID 31 – | Gene Expression Omnibus: GEO: | |
| Python version 2.7 | Python Software Foundation | |
| Limma 3.26.9 with edgeR 3.12.1 | Bioconductor 3.1 R 3.2.2 | |
| Code for gene signature refinement procedure | This paper | |