| Literature DB >> 36077264 |
Adriano Cuccu1, Federica Francescangeli1, Maria Laura De Angelis1, Alessandro Bruselles1, Alessandro Giuliani2, Ann Zeuner1.
Abstract
Quiescent cancer cells (QCCs) are a common feature of solid tumors, representing a major obstacle to the long-term success of cancer therapies. We isolated QCCs ex vivo from non-small cell lung cancer (NSCLC) and colorectal cancer (CRC) xenografts with a label-retaining strategy and compared QCCs gene expression profiles to identify a shared "quiescence signature". Principal Component Analysis (PCA) revealed a specific component neatly discriminating quiescent and replicative phenotypes in NSCLC and CRC. The discriminating component showed significant overlapping, with 688 genes in common including ZEB2, a master regulator of stem cell plasticity and epithelial-to-mesenchymal transition (EMT). Gene set enrichment analysis showed that QCCs of both NSCLC and CRC had an increased expression of factors related to stemness/self renewal, EMT, TGF-β, morphogenesis, cell adhesion and chemotaxis, whereas proliferating cells overexpressed Myc targets and factors involved in RNA metabolism. Eventually, we analyzed in depth by means of a complex network approach, both the 'morphogenesis module' and the subset of differentially expressed genes shared by NCSLC and CRC. This allowed us to recognize different gene regulation network wiring for quiescent and proliferating cells and to underpin few genes central for network integration that may represent new therapeutic vulnerabilities. Altogether, our results highlight common regulatory pathways in QCCs of lung and colorectal tumors that may be the target of future therapeutic interventions.Entities:
Keywords: colorectal cancer; dormancy; lung cancer; quiescence
Mesh:
Year: 2022 PMID: 36077264 PMCID: PMC9456317 DOI: 10.3390/ijms23179869
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 6.208
PCA results. (A) Decomposition of the correlation matrix expressed as eigenvalues, difference between adjacent eigenvalues, proportion of variance explained by each component and cumulative variance. (B) Component pattern (loadings). Each loading corresponds to the correlation coefficient between original variables (samples) and corresponding component.
|
| ||||
|
|
|
|
|
|
| 1 | 5.970 | 4.816 | 0.498 | 0.498 |
| 2 | 1.154 | 0.248 | 0.096 | 0.594 |
| 3 | 0.906 | 0.307 | 0.076 | 0.670 |
| 4 | 0.599 | 0.049 | 0.050 | 0.720 |
|
| ||||
|
|
|
|
|
|
| p136.1 | 0.273 | 0.364 | 0.300 | 0.053 |
| p136.2 | 0.230 | 0.495 | 0.224 | −0.037 |
| p136.3 | 0.275 | 0.364 | 0.275 | 0.068 |
| m136.1 | 0.305 | −0.040 | −0.225 | 0.358 |
| m136.2 | 0.213 | 0.298 | −0.600 | −0.689 |
| m136.3 | 0.246 | 0.268 | −0.462 | 0.531 |
| p229.1 | 0.320 | −0.200 | 0.178 | −0.160 |
| p229.2 | 0.318 | −0.178 | 0.226 | −0.166 |
| p229.3 | 0.316 | −0.159 | 0.201 | −0.140 |
| m229.1 | 0.318 | −0.285 | −0.176 | 0.133 |
| m229.2 | 0.325 | −0.307 | −0.052 | 0.068 |
| m229.3 | 0.296 | −0.235 | −0.064 | −0.113 |
PC1 = ‘tissue attractor’ [30]. This component is by far the most relevant in terms of percent of variance explained (around 50%) and has all positive loadings; thus, it can be considered as a ‘size’ component [31] stemming from the largely invariant tissue-specific average genome expression profile. PC2 = ‘cell line effect’ this component accounts for approximately 10% of total variance; it is a ‘shape’ component [31] having both positive and negative loading correspondent to a directionality of effect having as poles (opposite loadings, see Table 1B) the lsc136 and lsc229 cell lines. This implies the presence of a neat ‘effect line’ (no exception to this loading sign rule, Table 1B) pointing to the presence of genes with a correlated expression with opposite up (down) expression in the two lines independently of the PKH26+/PKH26− phenotype. PC3 = ‘quiescent/replicative effect’ this component accounts for 7.5% of total variance and discriminates PKH26+ and PKH26− (p vs. m) phenotypes in terms of loading sign. Again, this is a shape component pointing to the presence of a clear opposite pattern of gene activation/repression discriminating the two conditions. It is worth noting that principal components are each other orthogonal by construction, thus implying the above effects (tissue attractor, line effect, phenotype effect) do not superimpose. This means that the quiescent/dormant discrimination happens through the same underlying mechanism in both lines.
Figure 1PCA and limma results. (A) Samples projection on the loading space of the second and third components. Each point corresponds to a sample of the log-transformed original expression data matrix, whose coordinates are given by the PC2 and PC3 loadings of that sample (see Table 1B). It is worth noting the cell line discrimination on PC2 and the quiescent/proliferative separation on PC3. (B) Mean difference plot reporting average log expression values versus log2 fold changes (logFC). LogFC are obtained performing differential expression analysis on the log-transformed original expression data matrix. Significantly differentially expressed genes are obtained by sorting in decreasing order the LogFC distribution and keeping the top 6000 values (highlighted in red—PKH26+ specific) and the bottom 6000 (highlighted in blue—PKH26− specific), which resulted to be associated to |LogFC| > 1.95. (C,D) Venn diagrams reporting the number of significantly up-regulated genes in the PKH26+ state identified by the limma and PCA approaches highlighting the relevant superposition between the two methods. Duplicated genes are removed. Statistical significance of the overlap: (C) p-value < 1 × 10−4; RR = 24.0; (D) p-value < 1 × 10−4; RR = 23.4.
Typical ‘stemness’ genes up-regulated in the lung cancer PKH26+ state being part of the top 6000 genes selected by limma analysis. Differential expression evaluated through limma as previously described.
| Gene Name | logFC | |
|---|---|---|
| KLF4 | 2.1154 | 0.0770 |
| PLAUR/CD87 | 2.3351 | 0.0837 |
| CD44 | 3.7835 | 0.0186 |
| ALCAM/CD166 | 6.4852 | 0.0000 |
PCA results. (A) Decomposition of the correlation matrix expressed as eigenvalues, difference between adjacent eigenvalues, proportion of variance explained by each component and cumulative variance. (B) Component pattern (loadings). Each loading corresponds to the correlation coefficient between original variables (samples) and corresponding component.
|
| ||||
|
|
|
|
|
|
| 1 | 9.874 | 9.832 | 0.987 | 0.987 |
| 2 | 0.042 | 0.017 | 0.004 | 0.991 |
| 3 | 0.025 | 0.010 | 0.003 | 0.994 |
| 4 | 0.015 | 0.003 | 0.002 | 0.996 |
|
| ||||
|
|
|
|
|
|
| PKH26+1 | −0.317 | 0.156 | −0.272 | 0.336 |
| PKH26+2 | −0.317 | 0.065 | −0.151 | 0.255 |
| PKH26+3 | −0.317 | 0.100 | −0.221 | 0.166 |
| PKH26+4 | −0.316 | −0.038 | 0.694 | 0.478 |
| PKH26+5 | −0.314 | −0.799 | −0.040 | 0.087 |
| PKH26−1 | −0.316 | 0.347 | 0.353 | −0.396 |
| PKH26−2 | −0.317 | 0.143 | −0.363 | −0.013 |
| PKH26−3 | −0.317 | 0.121 | −0.262 | −0.157 |
| PKH26−4 | −0.317 | 0.233 | 0.208 | −0.163 |
| PKH26−5 | −0.316 | −0.336 | 0.057 | −0.591 |
Figure 2Differential expression analysis performed with Principal Component Analysis is consistent with that performed with the standard limma. (A) Samples projection on the loading space of the first and fourth components. Each point corresponds to a sample of the log-transformed original expression data matrix, whose coordinates correspond to PC2 and PC3 loadings of that sample (see Table 2B). (B) Mean Difference plot reporting average log expression values versus log2 fold changes (logFC). LogFC are obtained performing differential expression analysis on the log-transformed original expression data matrix. Significantly differentially expressed genes are achieved sorting in decreasing order the LogFC distribution and keeping the top 6000 values (highlighted in red—PKH26+ specific) and the bottom 6000 (highlighted in blue—PKH26− specific), which resulted to be associated to |LogFC| > 0.2. (C,D) Venn diagrams reporting the number of significantly up-regulated genes in the PKH26+ state identified by the limma and PCA approaches highlighting the relevant superposition between the two methods. Duplicated genes were removed. Statistical significance of the overlap: (C) p-value < 1 × 10−4; RR = 12.9; (D) p-value < 1 × 10−4; RR = 11.8.
Typical ‘stemness’ genes up-regulated in the colorectal cancer PKH26+ state. Differential expression evaluated through the limma approach.
| Gene Name | logFC | |
|---|---|---|
| KLF4 | 0.3838 | 0.0078 |
| AXIN2 | 0.4281 | 0.0481 |
| LGR5 | 0.8000 | 0.0095 |
| BMI1 | 0.3926 | 0.0015 |
Figure 3Shared genes resulting from differential expression analysis performed with the standard limma. (A) Venn diagram reporting the number of significantly up-regulated genes in the PKH26+ state identified by the limma approach in the lung (3661 genes) and colon (2539 genes) systems, and in both systems (688 genes). Statistical significance of the overlap: p-value < 1 × 10−4; RR = 7.2. Genes are considered significant if they appear in the first 6000 of the ranking built on the logFC, with a nominal p-value < 0.05. Duplicated genes are removed. (B) Venn diagram reporting the number of significantly down-regulated genes in the PKH26+ state identified by the limma approach in the lung (3672 genes) and colon (2898 genes) systems and in both systems (599 genes). Statistical significance of the overlap: p-value < 1 × 10−4; RR = 5.9. Genes are considered significant if they appear in the last 6000 of the ranking built on the logFC, with a nominal p-value < 0.05. Duplicated genes are removed. (C) Barplot reporting the number of significantly up- and down-regulated genes in the PKH26+ state, identified by the limma approach and shared by the lung and colon systems, belonging to some interesting KEGG categories (reported on the x-axis) found using Panther. Abbreviations: num_genes = number of genes; down = down-regulated; up = up-regulated.
Relevant pathways shared by the two lung and colon systems. In yellow are highlighted the pathways enriched in PKH26−; in red are highlighted those enriched in PKH26+.
| Database | Pathway |
|---|---|
| Hallmark | MYC_TARGETS_V1 |
| Hallmark | MYC_TARGETS_V2 |
| Hallmark | EPITHELIAL_MESENCHYMAL_TRANSITION |
| Gene Ontology | NCRNA_METABOLIC_PROCESS |
| Gene Ontology | TRNA_METABOLIC_PROCESS |
| Gene Ontology | RRNA_METABOLIC_PROCESS |
| Gene Ontology | MITOTIC_CELL_CYCLE |
| Gene Ontology | MITOTIC_CELL_CYCLE_PROCESS |
| Gene Ontology | MRNA_PROCESSING |
| Gene Ontology | RNA_PROCESSING |
| Gene Ontology | MESENCHIME_DEVELOPMENT |
| Gene Ontology | REGULATION_OF_CHEMOTAXIS |
| Gene Ontology | CELL_CHEMOTAXIS |
| Gene Ontology | CYTOKINE_PRODUCTION |
| Gene Ontology | RESPONSE_TO_TRANSFORMING_GROWTH_FACTOR_BETA |
| Gene Ontology | NEGATIVE_REGULATION_OF_CELL_POPULATION_PROLIFERATION |
| Gene Ontology | TRANSFORMING_GROWTH_FACTOR_BETA_RECEPTOR_SIGNALING_PATHWAY |
| Gene Ontology | POSITIVE_REGULATION_OF_CATALYTIC_ACTIVITY |
| Gene Ontology | REGULATION_OF_CELL_ADHESION |
Figure 4Lung QCCs and colorectal QCCs share some interesting significantly enriched pathways. (A) RNA sequencing-generated gene set enrichment analysis for Mesenchyme Development (nominal p-value = 0.032, FDR q-value = 0.212), regulation of chemotaxis (nominal p-value = 0.012, FDR q-value = 0.127), cell chemotaxis (nominal p-value = 0.005, FDR q-value = 0.105), cytokine production (nominal p-value = 0.050, FDR q-value = 0.271), response to transforming growth factor beta (nominal p-value < 1 × 10−10, FDR q-value = 0.058), negative regulation of cell population proliferation (nominal p-value = 0.018, FDR q-value = 0.248), transforming growth factor beta receptor signaling (nominal p-value < 1 × 10−10, FDR q-value = 0.057), positive regulation of catalytic activity (nominal p-value = 0.019, FDR q-value = 0.256), regulation of cell adhesion (nominal p-value < 1 × 10−10, FDR q-value = 0.064) and epithelial mesenchymal transition (nominal p-value = 0.005, FDR q-value = 0.051) in PKH26+ lung cancer cells (compared with PKH26− cells). (B) RNA sequencing-generated gene set enrichment analysis for Mesenchyme Development (nominal p-value < 1 × 10−10, FDR q-value = 0.015), regulation of chemotaxis (nominal p-value < 1 × 10−10, FDR q-value = 0.026), cell chemotaxis (nominal p-value = 0.002, FDR q-value = 0.038), cytokine production (nominal pvalue = 0.002, FDR q-value = 0.069), response to transforming growth factor beta (nominal p-value = 0.011, FDR q-value = 0.108), negative regulation of cell population proliferation (nominal p-value = 0.002, FDR q-value = 0.105), transforming growth factor beta receptor signaling (nominal p-value = 0.046, FDR q-value = 0.178), positive regulation of catalytic activity (nominal p-value = 0.030, FDR q-value = 0.230), regulation of cell adhesion (nominal p-value = 0.047, FDR q-value = 0.250) and epithelial mesenchymal transition (nominal p-value < 1 × 10−10, FDR q-value = 0.002) in PKH26+ colon cancer cells (compared with PKH26− cells). (C) Venn diagram reporting the number of significantly enriched gene sets identified in lung PKH26+ cells (282 enriched pathways, p-value < 0.05), colon PKH26+ cells (233 enriched pathways, p-value < 0.05) and in both lung and colon PKH26+ cells (137 enriched pathways, p-value < 0.05). Statistical significance of the overlap: p-value < 1 × 10−4; RR = 3.4. (D) Venn diagram reporting the number of significantly enriched gene sets identified in lung PKH26− cells (91 enriched pathways, p-value < 0.05), colon PKH26− cells (166 enriched pathways, p-value < 0.05) and in both lung and colon PKH26− cells (22 enriched pathways, p-value < 0.05). Statistical significance of the overlap: p-value = 0.003; RR = 4.4.
Figure 5Network built on the genes up-regulated in PKH26+ shared by the lung and colon systems. (A) Network visualization. It was built considering as connected only pairs of nodes with extremely elevated correlations (r > |0.98|). (B) Genes’ projection in the Clustering coefficient/betweenness space. It refers to the network built considering the usual threshold of Pearson r = |0.7|. Genes linked to the TGF-β pathway are highlighted in blue.
Morphogenesis pathways, significantly enriched in the PKH26+ state, shared by the two lung and colon systems.
| Database | Pathway |
|---|---|
| Gene Ontology | ANATOMICAL_STRUCTURE_FORMATION_INVOLVED_IN_MORPHOGENESIS |
| Gene Ontology | ANIMAL_ORGAN_MORPHOGENESIS |
| Gene Ontology | BLOOD_VESSEL_MORPHOGENESIS |
| Gene Ontology | EMBRYONIC_MORPHOGENESIS |
| Gene Ontology | HEART_MORPHOGENESIS |
| Gene Ontology | MORPHOGENESIS_OF_AN_EPITHELIUM |
| Gene Ontology | REGULATION_OF_ANATOMICAL_STRUCTURE_MORPHOGENESIS |
| Gene Ontology | TISSUE_MORPHOGENESIS |
| Gene Ontology | TUBE_DEVELOPMENT |
| Gene Ontology | TUBE_MORPHOGENESIS |
Figure 6Network built on the PKH26+, PKH26− and all-samples datasets. (A) Betweenness centrality scores in both PKH26+ (x-axis) and PKH26− (y-axis) networks. “High traffic” genes behaving differently between PKH26+ and PKH26− are highlighted in red. (B) Network visualization from the PKH26+ dataset. Network was built considering as connected only pairs of nodes with extremely elevated correlations (r > |0.98|). See Table S14 for more details. (C) Network visualization from the PKH26− dataset. Network was built considering as connected only pairs of nodes with extremely elevated correlations (r > |0.98|). See Table S15 for more details. (D) Network visualization from the all-samples dataset. Network was built considering as connected only pairs of nodes with extremely elevated correlations (r > |0.98|). See Table S16 for more details. (E) Venn diagram reporting the number of genes identified in the PKH26+ (40 genes) and in the PKH26− network (61 genes), and in both PKH26+ and PKH26− networks (49 genes). Networks were built considering as connected only pairs of nodes with extremely elevated correlations (r > |0.98|). Statistical significance of the overlap: p-value < 1 × 10−4; RR = 3.6.
Pathway distribution across the classes. PKH26+, PKH26− and all-samples datasets refer to the genes composing the giant components reported in Figure 6B–E.
| Pathway | PKH26+ | PKH26− | All_Samples |
|---|---|---|---|
| ANATOMICAL_STRUCTURE_FORMATION_INVOLVED_IN_MORPHOGENESIS | 16 | 28 | 18 |
| ANIMAL_ORGAN_MORPHOGENESIS | 17 | 23 | 15 |
| BLOOD_VESSEL_MORPHOGENESIS | 7 | 17 | 7 |
| EMBRYONIC_MORPHOGENESIS | 7 | 12 | 8 |
| HEART_MORPHOGENESIS | 7 | 4 | 1 |
| MORPHOGENESIS_OF_AN_EPITHELIUM | 9 | 12 | 6 |
| REGULATION_OF_ANATOMICAL_STRUCTURE_MORPHOGENESIS | 12 | 24 | 14 |
| TISSUE_MORPHOGENESIS | 12 | 15 | 8 |
| TUBE_DEVELOPMENT | 14 | 23 | 11 |
| TUBE_MORPHOGENESIS | 11 | 22 | 9 |
Correlation matrix for Table 5.
| PKH26+ | PKH26− | All_Samples | |
|---|---|---|---|
|
| 1.00 | 0.85 | 0.93 |
|
| 0.85 | 1.00 | 0.80 |
|
| 0.93 | 0.80 | 1.00 |