| Literature DB >> 34356119 |
Min-Hwan Sohn1,2,3, Se Ik Kim4, Jong-Yeon Shin2, Hee Seung Kim4, Hyun Hoon Chung4, Jae-Weon Kim4, Maria Lee4,5, Jeong-Sun Seo1,2,3,6.
Abstract
High-grade serous ovarian cancer (HGSOC) is one of the deadliest cancers that can occur in women. This study aimed to investigate the molecular characteristics of HGSOC through integrative analysis of multi-omics data. We used fresh-frozen, chemotherapy-naïve primary ovarian cancer tissues and matched blood samples of HGSOC patients and conducted next-generation whole-exome sequencing (WES) and RNA sequencing (RNA-seq). Genomic and transcriptomic profiles were comprehensively compared between patients with germline BRCA1/2 mutations and others with wild-type BRCA1/2. HGSOC samples initially divided into two groups by the presence of germline BRCA1/2 mutations showed mutually exclusive somatic mutation patterns, yet the implementation of high-dimensional analysis of RNA-seq and application of epithelial-to-mesenchymal (EMT) index onto the HGSOC samples revealed that they can be divided into two subtypes; homologous recombination repair (HRR)-activated type and mesenchymal type. Patients with mesenchymal HGSOC, characterized by the activation of the EMT transcriptional program, low genomic alteration and diverse cell-type compositions, exhibited significantly worse overall survival than did those with HRR-activated HGSOC (p = 0.002). In validation with The Cancer Genome Atlas (TCGA) HGSOC data, patients with a high EMT index (≥the median) showed significantly worse overall survival than did those with a low EMT index (<the median) (p = 0.030). In conclusion, through a comprehensive multi-omics approach towards our HGSOC cohorts, two distinctive types of HGSOC (HRR-activated and mesenchymal) were identified. Our novel EMT index seems to be a potential prognostic biomarker for HGSOC.Entities:
Keywords: epithelial-to-mesenchymal transition; gene signature; high-grade serous carcinoma; homologous recombination repair; ovarian cancer
Mesh:
Substances:
Year: 2021 PMID: 34356119 PMCID: PMC8303300 DOI: 10.3390/genes12071103
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Patients’ clinicopathologic characteristics.
| Characteristics | All |
| ||
|---|---|---|---|---|
| Age, years | ||||
| Mean ± SD | 52.8 ± 8.4 | 54.2 ± 9.4 | 51.4 ± 7.4 | 0.705 |
| Family Hx of breast cancer | 1 (5.0) | 1 (10.0) | 0 | >0.999 |
| Family Hx of ovarian cancer | 1 (5.0) | 1 (10.0) | 0 | >0.999 |
| FIGO stage | 0.779 | |||
| IIIA | 2 (10.0) | 1 (10.0) | 1 (10.0) | |
| IIIB | 1 (5.0) | 1 (10.0) | 0 | |
| IIIC | 11 (55.0) | 5 (50.0) | 6 (60.0) | |
| IV | 6 (30.0) | 3 (30.0) | 3 (30.0) | |
| CA-125, IU/mL | ||||
| Median (range) | 798.5 (5.1–3545.0) | 798.0 (5.1–3545.0) | 798.5 (47.0–2433.0) | 0.940 |
| Lymph node metastasis | 12 (60.0) | 6 (60.0) | 6 (60.0) | >0.999 |
| Residual tumor after surgery | 0.139 | |||
| No gross | 14 (70.0) | 9 (90.0) | 5 (50.0) | |
| <1 cm | 5 (25.0) | 1 (10.0) | 4 (40.0) | |
| ≥1 and <2 cm | 1 (5.0) | 0 | 1 (10.0) | |
| Chemotherapy at primary treatment | 0.628 | |||
| 6 cycles of paclitaxel–carboplatin | 14 (70.0) | 6 (60.0) | 8 (80.0) | |
| 9 cycles of paclitaxel–carboplatin | 6 (30.0) | 4 (40.0) | 2 (20.0) | |
| Recurrence | 16 (80.0) | 9 (90.0) | 7 (70.0) | 0.582 |
| Treatment-free interval, months | ||||
| Median (range) | 20.4 (3.0–73.0) | 20.9 (13.5–73.0) | 19.6 (3.0–67.9) | 0.496 |
| Germline | 0.033 | |||
| Wild-type | 15 (75.0) | 5 (50.0) | 10 (100.0) | |
| Mutation | 5 (25.0) | 5 (50.0) | 0 | |
| Germline | 0.033 | |||
| Wild-type | 15 (75.0) | 5 (50.0) | 10 (100.0) | |
| Mutation | 5 (25.0) | 5 (50.0) | 0 |
Abbreviations: CA-125, cancer antigen 125; FIGO, International Federation of Gynecology and Obstetrics; Hx, history; SD, standard deviation.
Figure 1Comparisons of survival outcomes between germline BRCA1/2 mutation and wild-type groups. (A) Progression-free survival. (B) Overall survival.
Figure 2Genomic mutational characterization of 20 HGSOC samples. The distribution of somatic mutations among three categories of samples. Each column displayed here represents an individual case. LN, LVSI, TMB, and SCNA stand for lymph node, lymphovascular space invasion, tumor mutational burden, and somatic copy number alteration, respectively.
Figure 3Two distinctive patterns of molecular subtype identified through RNA-seq data analysis. (A) Transcriptional landscape of HGSOC samples through principal component analysis. Samples are represented by different shapes and colors by their origin and grouped according to K-means clustering with k = 2 (cluster A and cluster B). (B) Hierarchical clustering of samples represents the expression profile of 30 HRR genes. (C) Hierarchical clustering of samples with the expression profile of 38 EMT-TFs reproduced the result from the PCA analysis. (D) Violin plots showing difference in EMT index and gene expressions of CDH1, VIM, and TGFB1 between cluster A and cluster B. Each p value was calculated via Mann–Whitney U test. (E) A violin plot-view of HRD score distribution between cluster A and cluster B (left), and relationship between EMT-index and HRD sum scores (right). HRD scores between cluster A and cluster B were compared using Mann–Whitney U test. Statistical dependence between EMT index and HRD scores were computed through Spearman’s rank correlation coefficients. LoH, NtAI, and LST stand for loss of heterozygosity, number of telomeric allelic imbalances, and large-scale transition, respectively.
Figure 4Co-expression gene module identification and cell-type enrichment. (A) Interaction network of identified gene modules and over representation analysis using HALLMARK gene sets. (B) EMT index and cell-type enrichment analysis results across 20 HGSOC samples divided by cluster A and cluster B and by order of increasing EMT-index. * Mann–Whitney U test p < 0.05 between cluster A and cluster B.
Figure 5Application of the EMT index to TCGA HGSOC data. (A) Distribution of EMT index of TCGA HGSOC displayed on a box plot. (B) Scatter plots illustrating relationship between the EMT index and EMT-related gene expression in the cohort. Each dot represents each sample analyzed, and red lines are a linear trend representation of the scatter plots. (C) Kaplan–Meier plot depicting overall survival of TCGA HGSOC samples falling into EMT-high (red) and -low (blue) groups. (D) EMT index for four TCGA subtypes was compared and the mesenchymal subtype exhibited the highest EMT index (one-way ANOVA test p < 0.001; Tukey’s HSD adjusted p < 0.005 ** and < 0.05 *). Red dots and blue dots inside the violin plots represent EMT-high and -low samples, respectively.