| Literature DB >> 35360863 |
Zhihang Wang1, Lili Yang2, Zhenyu Huang1, Xuan Li1, Juan Xiao1, Yinwei Qu1, Lan Huang1, Yan Wang1,3.
Abstract
In this paper, high-grade serous ovarian cancer (HGSOC) is studied, which is the most common histological subtype of ovarian cancer. We use a new analytical procedure to combine the bulk RNA-Seq sample for ovarian cancer, mRNA expression-based stemness index (mRNAsi), and single-cell data for ovarian cancer. Through integrating bulk RNA-Seq sample of cancer samples from TCGA, UCSC Xena and single-cell RNA-Seq (scRNA-Seq) data of HGSOC from GEO, and performing a series of computational analyses on them, we identify stemness markers and survival-related markers, explore stem cell populations in ovarian cancer, and provide potential treatment recommendation. As a result, 171 key genes for capturing stem cell characteristics are screened and one vital cancer stem cell subpopulation is identified. Through further analysis of these key genes and cancer stem cell subpopulation, more critical genes can be obtained as LCP2, FCGR3A, COL1A1, COL1A2, MT-CYB, CCT5, and PAPPA, are closely associated with ovarian cancer. So these genes have the potential to be used as prognostic biomarkers for ovarian cancer.Entities:
Keywords: cancer stem cells; gene biomarkers; high-grade serous ovarian cancer; mRNAsi; ovarian cancer; single-cell
Year: 2022 PMID: 35360863 PMCID: PMC8964092 DOI: 10.3389/fgene.2022.861954
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Holistic approach. (A) Data collection. (B) mRNAsi of ovarian cancer bulk RNA-Seq sample is calculated and analyzed. (C) Access to stemness-related key genes. (D) Single-cell data of ovarian cancer are analyzed using Seurat and SingleR package. (E) Annotation cell types for sub-population.
FIGURE 2The correlation of mRNAsi index with ovarian cancer. (A) The scatter plot shows that the mRNAsi expression in 416 tumor cases is higher than that in 88 normal cases (p-value < 0.05). (B) The tumor case is divided into two groups based on their mRNAsi score. The Kaplan Meier survival curve shows that the low group enjoyed a lower survival probability. And it is significant statistical differences as a whole (p-value < 0.05). (C) The tumor cases are divided into two groups based on their age (median age = 59), it is no statistical differences (p = 0.07). (D) The distribution of the mRNAsi for the clinical grade. The mRNAsi scores increase in more advanced clinical grades, and extremely so in G4 (p = 0.07). (E) The distribution of mRNAsi scores for stage of ovarian cancer cases.
FIGURE 3The summary of differential expression genes in ovary cancer samples vs. controls. (A) The heatmap shows the top 50 differentially expressed genes. (B) The volcano shows a log-fold change of expression of each gene. The red dots represent the up-regulated genes and the green ones represent the down-regulated genes. The black dots represent the gene without significant differential expression in cancer vs. controls.
FIGURE 4The gene modules are identified by weighted gene co-expression network analysis (WGCNA) and related to the mRNAsi in ovary cancer. (A) The indexes are used to determine the power of weight in the co-expression network. (B) The branches of the cluster for the different gene modules. (C) The correlation between the gene modules and the mRNAsi. (D,E) Scatter plot showing the filter of key genes. Each scatter represents a gene. The gene correlated with a module (module membership) and mRNAsi together is considered as the key gene.
Results of stemness-related key gene enrichment analysis.
| Pathway |
|
|---|---|
| Extracellular matrix organization | 1.06E-09 |
| Cell-substrate adhesion | 3.73E-06 |
| Wound healing | 3.78E-06 |
| Collagen fibril organization | 6.48E-06 |
| Collagen catabolic process | 1.53E-05 |
| Positive regulation of epithelial cell migration | 5.69E-05 |
| Epithelial cell migration | 3.79E-04 |
| Cell-cell signaling by Wnt | 5.67E-04 |
| Maintenance of location | 8.43E-04 |
FIGURE 5(A) Using the STRING (https://string-db.org/) to build the protein interaction network of key genes. (B) The number of edges of key genes through the protein interaction network. The X-axis represents the total number of edges connected by genes, and the Y-axis represents the gene name.
FIGURE 6Visualizing each cell population using UMAP. The distribution of individual cell populations in single-cell data. (A,C) The main distribution of stemness-related key genes in all cell populations. (B,D) The proportion of each cell type in each cell population.
Stemness-related high-risk genes obtained from the tumor cell population.
| Cell population | Reported(cor) |
|---|---|
| Tissue stem cells population | COL1A2(0.55) |
| COL1A1(0.53) | |
| MT-CYB(0.54) | |
| CCT5(0.50) | |
| PAPPA(0.50) |
Each gene is followed by a correlation coefficient(cor) with the stemness-related key gene.
Stemness-related high-risk genes affect stem cell-related pathways.
| Stem cell-related pathways |
|---|
| BEIER_GLIOMA_STEM_CELL_UP |
| GAL_LEUKEMIC_STEM_CELL_UP |
| PECE_MAMMARY_STEM_CELL_UP |
| PECE_MAMMARY_STEM_CELL_DN |
| HOEBEKE_LYMPHOID_STEM_CELL_UP |
| GO_HEMATOPOIETIC_STEM_CELL_HOMEOSTASIS |
| GO_NEGATIVE_REGULATION_OF_STEM_CELL_DIFFERENTIATION |
| GO_POSITIVE_REGULATION_OF_HEMATOPOIETIC_STEM_CELL_PROLIFERATION |