| Literature DB >> 32264880 |
Jordan Mandel1, Raghunandan Avula2, Edward V Prochownik3,4,5,6.
Abstract
BACKGROUND: Long-term survival in numerous cancers often correlates with specific whole transcriptome profiles or the expression patterns of smaller numbers of transcripts. In some instances, these are better predictors of survival than are standard classification methods such as clinical stage or hormone receptor status in breast cancer. Here, we have used the method of "t-distributed stochastic neighbor embedding" (t-SNE) to show that, collectively, the expression patterns of small numbers of functionally-related transcripts from fifteen cancer pathways correlate with long-term survival in the vast majority of tumor types from The Cancer Genome Atlas (TCGA). We then ask whether the sequential application of t-SNE using the transcripts from a second pathway improves predictive capability or whether t-SNE can be used to refine the initial predictive power of whole transcriptome profiling.Entities:
Keywords: Dimensionality reduction; Signal transduction; T-SNE; Transcriptional profiling; Tumor metabolism
Mesh:
Substances:
Year: 2020 PMID: 32264880 PMCID: PMC7140376 DOI: 10.1186/s12885-020-06756-x
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Fig. 1Sequential t-SNE analysis of KIRC and LGG. a. t-SNE-generated patterns of KIRC tumor Pyrimidine Biosynthesis Pathway transcripts showing two distinct clusters. n = number of tumors in each group b. Kaplan-Meier survival curves of the patient groups corresponding to the tumor clusters in A. M = median survival (in days) of each of the groups. c. t-SNE patterns of Notch Pathway transcripts. d. Kaplan-Meier survival curves of the patient groups corresponding to the tumor clusters in C. e. Kaplan-Meier survival of the favorable survival Notch Pathway cluster from D after t-SNE analysis using transcripts from the Pyrimidine Biosynthesis Pathway. Two t-SNE clusters similar to those depicted in A were observed (not shown). f. Kaplan-Meier survival of the unfavorable survival Notch Pathway cluster from D. g. t-SNE-generated patterns of LGG Notch Pathway transcripts showing two distinct clusters. h. Kaplan-Meier survival curves of each of the patient groups corresponding to the tumor clusters in (G). i. t-SNE-generated patterns of LGG Wnt Pathway transcripts showing four distinct clusters. j. Kaplan-Meier survival curves of each of the patient groups corresponding to the tumor clusters in (I). k. Sequential t-SNE Clustering. The favorable survival t-SNE Wnt Cluster 1 tumors from Fig. 1i were re-analyzed using Notch Pathway transcripts, which generated the expected two t-SNE clusters (not show) with significant survival differences. (l). The unfavorable survival Cluster 2 tumors from Fig. 1i were similarly re-analyzed using Notch Pathway transcripts and were shown to be comprised of two sub-clusters with significant differences in long-term survival
Fig. 2Sequential t-SNE analysis of SARC and KIRP. a. t-SNE-generated patterns of SARC Myc Pathway transcripts showing two distinct clusters. b. Kaplan-Meier survival curves of each of the tSNE clusters depicted in A. c. t-SNE analysis of SARC TGF-β Pathway transcripts. d. Kaplan-Meier survival curves of patients from each of the clusters shown in C. e. Kaplan-Meier curves of patients from the unfavorable survival TGF-β Pathway cluster from C after t-SNE analysis with Myc Pathway transcripts. Two t-SNE clusters similar to those depicted in A were generated (not shown). F. Kaplan-Meier survival of patients from the favorable survival TGF-β Pathway cluster shown in C after t-SNE analysis with Myc Pathway transcripts. Two t-SNE groups similar to those shown in A were generated (not shown). g. t-SNE analysis of KIRP Cell Cycle Pathway transcripts showing two major tumor clusters and a third comprised of only seven tumors. h. Kaplan-Meier survival curves of patients from Clusters 1 and 2 depicted in G. i. t-SNE-generated patterns of KIRP Pentose Phosphate Pathway transcripts showing two major clusters. j Kaplan-Meier survival curves of each of patients from the tumor clusters depicted in I. k. Kaplan-Meier curves of individuals from the unfavorable survival Pentose Phosphate Pathway cluster from I after t-SNE analysis using transcripts from the Cell Cycle Pathway. Two t-SNE clusters similar to those depicted in A were generated (not shown). l. Kaplan-Meier curves of individuals from the favorable survival Pentose Phosphate Pathway cluster from K after t-SNE analysis using transcripts from the Myc Pathway. Two t-SNE clusters similar to those depicted in A were generated (not shown)
Fig. 3Sequential t-SNE analysis of OV and UCEC. a. t-SNE-generated patterns of Pyrimidine Biosynthesis Pathway transcripts showing four OV tumor clusters. b. Kaplan-Meier survival curves of patients from each of the Clusters depicted in A. c. t-SNE-generated patterns of OV Cell Cycle Pathway transcripts. d. Kaplan-Meier survival curves of patients from each of the Clusters shown in C. e. Kaplan-Meier survival curves of patients from unfavorable survival Cell Cycle Pathway Cluster 3 from C after t-SNE re-analysis using transcripts from the Pyrimidine Biosynthesis Pathway. Four t-SNE clusters similar to those depicted in A were generated (not shown) with two of these (Clusters 2 and 4) showing significant survival differences. f. Favorable survival Cell Cycle Pathway t-SNE Cluster 2 from (D) was analyzed with Pyrimidine Biosynthesis Pathway transcripts. Of these, Cluster 1 (median survival 1736 days) had a more favorable long-term survival than either Cluster 3 (1336 days) or Cluster 4 (1213 days) (P = 0.017 and P = 0.004, respectively. g. t-SNE-generated patterns of UCEC Myc Pathway transcripts. h. Kaplan-Meier survival of patients from the groups depicted in G. (I). t-SNE-generated patterns of UCEC Wnt Pathway transcripts. j. Kaplan-Meier survival curves of patients from each of the groups depicted in I. k. Kaplan-Meier survival curves of individuals from favorable survival Wnt Pathway t-SNE Cluster 2 following subsequent t-SNE profiling with Myc Pathway transcripts. l. Kaplan-Meier survival curves of patients from the unfavorable survival Wnt Pathway Cluster 1 cohort in K following repeat t-SNE profiling with Myc Pathway transcripts
Fig. 4Sequential hierarchical clustering/t-SNE profiling of LGG. a. Hierarchical clustering of 534 LGG transcriptomes from TCGA showing four distinct groups (“Dendros’). At the bottom of the panel, the rows of colored bars represent the clusters into which each tumor was grouped following t-SNE analysis with that pathway’s transcripts. b. Kaplan-Meier survival of each Dendro (D1-D4) and the P values for each pair-wise comparison. c. Kaplan-Meier survival of all 534 LGGs based on the t-SNE Clusters to which they were assigned after profiling with Pyrimidine Biosynthesis Pathway transcripts. The number of tumors in each Cluster and the median survival are indicated as are the P values for significant pair-wise comparisons. d. Kaplan-Meier survival of patients from the 149 member Dendro 4 group based on their Pyrimidine Biosynthesis Pathway t-SNE Cluster identities. e. Kaplan-Meier survival for all LGG patients based on the t-SNE Clusters to which they were assigned after profiling with Hippo Pathway transcripts f. Kaplan-Meier survival of patients from the 115 Dendro 2 group based on their Hippo Pathway t-SNE Cluster identities. g. Kaplan-Meier survival of patients from the Dendro 4 group based on their Hippo Pathway t-SNE Cluster identities. h. Kaplan-Meier survival of all LGG patients based on the t-SNE Clusters to which they were assigned after profiling with PI3-kinase Pathway transcripts. i. Kaplan-Meier survival of patients from Dendro 4 based on their PI3-kinase Pathway t-SNE Cluster identities. j. Kaplan-Meier survival of all LGG based on the t-SNE Clusters to which they were assigned after profiling with Wnt Pathway Pathway transcripts. k. Kaplan-Meier survival of patients from Dendro 4 based on their Wnt Pathway t-SNE Cluster identities
Fig. 5Sequential hierarchical clustering/t-SNE profiling of HCC. a. Hierarchical clustering of 374 HCC transcriptomes from TCGA showing six Dendros. At the bottom of the panel, the colored bars represent the results of t-SNE profiling performed with the four indicated transcript pathways. Each HCC was assigned a t-SNE Cluster identity within the indicated family as described in the legend to Fig. 4. b. Kaplan-Meier survival of patients from each Dendro (D1-D6). The only significant difference among the six groups was D1 vs. D4 (P = 0.021). c. Kaplan-Meier survival of all 374 HCC patients based on the three Clusters generated by t-SNE profiling of tumors with Purine Biosynthesis Pathway transcripts. The number of tumors in each Cluster and the median survival are indicated as are the P values for significant pair-wise comparisons. d. Kaplan-Meier survival of the 58 patients from Dendro 3 based on the Purine Biosynthesis Pathway t-SNE Cluster identities of their tumors. e. Kaplan-Meier survival for all patients based on the two Clusters generated by t-SNE profiling of tumors with Pyrimidine Biosynthesis Pathway transcripts. f. Kaplan-Meier survival of patients from Dendro 6 based on the two Pyrimidine Biosynthesis Pathway t-SNE Cluster identities of their tumors. g. Kaplan-Meier survival of all patients based on the three t-SNE Clusters to which their tumors were assigned after profiling with PI-3 Kinase Pathway transcripts. h. Kaplan-Meier survival of patients from the Dendro 2 group based on the three PI3-Kinase Pathway t-SNE Clusters to which their tumors were assigned. i. Kaplan-Meier survival of patients from the Dendro 3 group based on the three PI3-Kinase Pathway t-SNE Clusters to which their tumors were assigned. j. Kaplan-Meier survival for all patients based on the three TGF-β Pathway transcript t-SNE Clusters to which their tumors were assigned. k. Kaplan-Meier survival of patients from Dendro 4 based on the three TGF-β Pathway t-SNE Clusters to which their tumors were assigned. Small discrepancies in numbers of patients are due to slight differences in which patients were hierarchically clustered, and/or to missing or incomplete survival data