| Literature DB >> 31451731 |
Stephan M Tirier1,2,3, Jeongbin Park4,2, Friedrich Preußer1,2,5, Lisa Amrhein6,7, Zuguang Gu2,8, Simon Steiger1,2, Jan-Philipp Mallm1,3,8, Teresa Krieger4,1,2, Marcel Waschow1,2, Björn Eismann1,2, Marta Gut9,10, Ivo G Gut9,10, Karsten Rippe1,3, Matthias Schlesner2,11, Fabian Theis6,7, Christiane Fuchs6,7,12, Claudia R Ball13, Hanno Glimm13,14, Roland Eils4,1,2,8,15, Christian Conrad16,17,18,19.
Abstract
Patient-derived 3D cell culture systems are currently advancing cancer research since they potentiate the molecular analysis of tissue-like properties and drug response under well-defined conditions. However, our understanding of the relationship between the heterogeneity of morphological phenotypes and the underlying transcriptome is still limited. To address this issue, we here introduce "pheno-seq" to directly link visual features of 3D cell culture systems with profiling their transcriptome. As prototypic applications breast and colorectal cancer (CRC) spheroids were analyzed by pheno-seq. We identified characteristic gene expression signatures of epithelial-to-mesenchymal transition that are associated with invasive growth behavior of clonal breast cancer spheroids. Furthermore, we linked long-term proliferative capacity in a patient-derived model of CRC to a lowly abundant PROX1-positive cancer stem cell subtype. We anticipate that the ability to integrate transcriptome analysis and morphological patho-phenotypes of cancer cells will provide novel insight on the molecular origins of intratumor heterogeneity.Entities:
Mesh:
Year: 2019 PMID: 31451731 PMCID: PMC6710272 DOI: 10.1038/s41598-019-48771-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Pheno-seq directly links visual phenotypes and gene expression in 3D culture systems at high-throughput. (a) Workflow overview for the culture and recovery of clonal spheroids for inference of morphology-specific gene expression. (b) Pheno-seq workflow based on automated dispensing and confocal imaging of recovered spheroids stained by CellTrackerRed in barcoded nanowells. (c) 2D tSNE visualization of 210 pheno-seq 3′-end RNA-seq profiles with coloring based on image feature ‘circularity’. For better visualization, all circularity values below 0.8 were set to minimum in the color code scheme. (d) Spheroid circularity plotted per cluster (k-means clustering, k = 2) as shown in (c). Violin-plot center-line: median; box limits: first and third quartile; whiskers: ±1.5 IQR. Indicated P-value from unpaired two-tailed Students t-test. (e) Same 2D tSNE visualization as shown in (c) with coloring based on PAGODA’s PC scores for HALLMARK_EMT gene set derived from the Molecular Signature Database (MSigDB)[38]. (f,g) Same 2D tSNE visualization as shown in (c) with coloring based on expression magnitude for EMT marker VIM (f) and epithelial marker KRT15 (g). (h,i) Gene set enrichment analysis based on Hallmark gene sets[35] for ‘aberrant’ (h) and ‘round’ (i) phenotype specific genes identified by differential expression analysis[37] derived from the MSigDB. Bar plots show top three enriched gene sets ranked by FDR q-values. Example genes are VIM, TGFA, FAP for ‘aberrant’ and KRT15, CA2 and KRT16 for ‘round’ phenotypes. (j,k) Validation of phenotype-specific expression for VIM (aberrant) and KRT15 (round) by whole mount immunofluorescence (IF). Plotted values reflect mean pixel intensity per classified spheroid. Box plot center-line: median; box limits: first and third quartile; whiskers: min/max values. Numbers of samples indicated on x-axis under respective phenotype class. Indicated are P-values from unpaired two-tailed Students t-test.
Figure 2Pheno-seq with a 3D model of colorectal cancer links heterogeneous proliferative phenotypes to expression signatures enriched for lineage-specific markers. (a) Clonal 3D-culture in inverse pyramidal shaped microwells and recovery strategy for HT-pheno-seq of patient-derived CRC spheroids isolated from a liver metastasis. Yellow and purple indicate heterogeneous subpopulations with functional differences in proliferative capacity[4]. (b) 2D tSNE visualization of 95 HT-pheno-seq expression profiles. Coloring by sphere size (pixel). (c) Spheroid size plotted per cluster. Violin-plot center-line: median; box limits: first and third quartile; whiskers: ±1.5 IQR). Indicated P-value calculated from unpaired two-tailed Students t-test. (d) Heatmap reflecting differential expression analysis[37] of identified clusters in (b). Selected genes are listed beside the heatmap; Fold change >1.5; adjusted P-value < 0.05; *P < 0.05, **P < 0.01, ***P < 0.001; ‘small’ cluster1: 313 differentially expressed genes; ‘big’ cluster: 130 differentially expressed genes. (e) PAGODA RNA-seq analysis heatmap of CRC spheroid pheno-seq data. Dendrogram reflects overall clustering and the rows below represent top two significant aspects of heterogeneity based on HALLMARK/GO gene sets derived from the MSigDB[38] and on de-novo identified gene sets. High PC Scores correspond to high expression of associated gene sets. Expression patterns below reflect top 10 loading genes for selected gene sets that are associated with respective aspects. Bottom: Expression pattern of genes most highly correlated with intestinal stem cell marker LGR5 (Pearson’s correlation). (f) Validation of pheno-seq by quantitative RNA-FISH for size-dependent differentiation marker TFF3 and cancer stem cell markers CD44/MYC. Plotted values reflect the pixel fraction that exceeds the background threshold per spheroid (Box plot center-line: median; box limits: first and third quartile; whiskers: min/max values; P-values from unpaired Students t-test. Numbers of samples n indicated on x-axis under respective class).
Figure 3Single-cell deconvolution of CRC spheroid pheno-seq data by maximum likelihood inference. (a) Concept of adapted maximum likelihood approach[45] based on estimated cell numbers and transformed pheno-seq data (n = 95): (1) Acquired and transformed pheno-seq data based on estimated cell numbers build a distribution of measurements for inference by the model. Coloring of cells in spheroids: red = stem-like; cyan = differentiated. (2) Assumptions on single cell distributions: Model of heterogeneous gene regulation in which single cells are supposed to exhibit gene expression at low (Pop I) or high (Pop II) levels with a common coefficient of variation. The four parameters of the model are the log-mean expression for each subpopulation (𝜇1 and 𝜇2), the proportion of cells in the high subpopulation (𝐹), and the common log-SD of expression (σ). (3) Based on the model in step 2, a likelihood function is derived that takes different numbers of cells per spheroid into account. The likelihood function is then maximized by searching through the four parameters of the model to identify those that are most likely given the experimental observations. 4) These four parameters define the inferred single cell distributions of the low and high-level populations. (b) 1,012 genes show an improved two-population fit compared to a one population fit (BIC: Bayesian information criterion). Densities of the means of the first (Pop I: low regulatory state) and second population (Pop II: high regulatory state) for all identified 1,012 genes. (c) Gene set enrichment analysis for two-population genes based on Hallmark gene sets[35] derived from the MSigDB[38]. Bar plot showing top enriched gene sets ranked by FDR q-values. (d) Selected human colonic stem and differentiation markers[46] that have been identified by pheno-seq deconvolution. (e) Scatter plots for relations of PROX1 expression and estimated cell numbers (lower) and between PROX1 expression and expression of the major intestinal stem cell marker LGR5 (upper) as well as associated Pearson’s correlation coefficients (r). (f) RNA-FISH staining of CRC spheroids for PROX1 (Atto550) and DAPI counterstaining for visualization of DNA. Merged images: DNA: cyan; PROX1: red. Images represent Z-projections (scale bar 30 µm and 10 µm for magnified merged image).
Figure 4Scoring of pheno-seq data for subtype-specific signatures links long-term proliferative capacity to a stem-like subtype in CRC. (a) Strategy to define lineage-specific expression signatures. Stem: PROX1 correlated genes (top 20); Transit-amplifying (TA): Ribosomal genes (n = 24); Terminally differentiated (Tdiff): TFF3 correlated genes (top 20). (b) Violin Plots showing (cluster-specific) pheno-seq expression profiles scored for subtype signatures (see Methods). Violin-plot center-line: median; box limits: first and third quartile; whiskers: ±1.5 IQR). Indicated P-value calculated from unpaired two-tailed Students t-test.