| Literature DB >> 31002564 |
Arunima Shilpi1, Manoj Kandpal1, Yanrong Ji1, Brandon L Seagle1, Shohreh Shahabi1, Ramana V Davuluri1.
Abstract
PURPOSE: Molecular cancer subtyping is an important tool in predicting prognosis and developing novel precision medicine approaches. We developed a novel platform-independent gene expression-based classification system for molecular subtyping of patients with high-grade serous ovarian carcinoma (HGSOC).Entities:
Mesh:
Substances:
Year: 2019 PMID: 31002564 PMCID: PMC6873993 DOI: 10.1200/CCI.18.00096
Source DB: PubMed Journal: JCO Clin Cancer Inform ISSN: 2473-4276
Number of Up- and Downregulated Genes or Transcript Variants Identified in the TCGA Ovarian Cohort Exon Array Data
FIG 1.(A) Unsupervised non-negative matrix factorization clustering of 569 patients with high-grade serous ovarian carcinoma (HGSOC) on the basis of highly variable (930) isoform-level signatures. The clusters were identified and assigned into four subgroups on the basis of original The Cancer Genome Atlas core grouping. The color code for samples in each cluster are as follows: differentiated (D), black; immunoreactive (I), red; mesenchymal (M), pink; and proliferative (P), blue. (B) The concordance table shows the agreement of sample assignment with The Cancer Genome Atlas subgroups (gene-based subtypes) and our isoform-based subgroups. Although the P subgroup showed good agreement, the I subgroup showed the worst agreement followed by the M subgroup. (C and D) Kaplan-Meier survival curves plotted to determine the prognostic difference among four the subtypes identified using gene- and isoform-level expression-based clustering, respectively. The statistical significance in overall survival was determined at a threshold P = .05.
Cox Proportional Hazards Regression Model by Gene- and Isoform-Based Subgroups
FIG 2.The classifier was generated on the basis of the selection subset of (A) isoform/transcript variants and (B) genes. Out-of-bag (OOB) error rate was plotted, where the x-axis denotes the selection of features/variables and the y-axis represents the error rate.
Concordance Between the True-Class Labels and Predicted Calls by the Isoform-Level Classifier That Was Trained on the Exon Array Data Set and Applied on RNA-Seq Data Set
Overlap in Cluster Membership of 245 Ovarian Serous Samples Between Our Predicted Isoform-Based Subgroups and AOCS Clusters (Gene Expression Omnibus GSE9891)
FIG 3.The prediction of four classes in 378 high-grade serous ovarian carcinoma samples obtained from Gene Expression Omnibus (GSE9891 and GSE26712) data set was clinically evaluated for prognostic determinates. Kaplan-Meir survival plot of the four subtypes shows significant difference in overall survival rate (P < .001).