| Literature DB >> 35361267 |
Siyu Chen1,2, Yong Wu1,2, Simin Wang1,2, Jiangchun Wu1,2, Xiaohua Wu3,4, Zhong Zheng5,6.
Abstract
BACKGROUND: Ovarian cancer (OC) is the deadliest tumor in the female reproductive tract. And increased resistance to platinum-based chemotherapy represents the major obstacle in the treatment of OC currently. Robust and accurate gene expression models are crucial tools in distinguishing platinum therapy response and evaluating the prognosis of OC patients.Entities:
Keywords: Biomarkers; Ovarian cancer; Platinum response; Prognostic model
Mesh:
Substances:
Year: 2022 PMID: 35361267 PMCID: PMC8973612 DOI: 10.1186/s13048-022-00969-3
Source DB: PubMed Journal: J Ovarian Res ISSN: 1757-2215 Impact factor: 4.234
Fig. 1Flowchart of this study
Clinicopathological characteristics of patients with OC in this study
| Variables | Training set( | Validation set( | ||||
|---|---|---|---|---|---|---|
| Resistance ( | Sensitive ( | Resistance ( | Sensitive ( | |||
| Age (years, mean ± sd) | 61.77 ± 11.47 | 58.55 ± 11.32 | 0.0524a | – | – | – |
| Neoplasm subdivision (Bilateral/Left/Right/−) | 51/7/7/4 | 109/27/14/11 | 0.4066b | – | – | – |
| Stage (II / III / IV) | 1/60/8 | 10/127/24 | 0.2167b | – | – | – |
| Histologic grade (G2/G3/G4/−) | 7/60/1/1 | 25/133/0/3 | 0.3213b | – | – | – |
| Lymphatic invasion (Yes/No/−) | 17/7/45 | 35/30/96 | 0.282b | – | – | – |
| Recurrence (Yes/No) | 61/8 | 122/5/34 | 0.06702b | – | – | – |
| Death (Dead/Alive/−) | 58/11 | 82/79 | 1.674E-6b | 33/1 | 33/3/5 | 0.6145b |
Overall survival (months, mean ± sd) | 29.71 ± 14.34 | 48.59 ± 25.33 | 1.342E-11a | 27.01 ± 14.51 | 51.82 ± 27.97 | 1.915E-5a |
Notes: “-”:not know; a T-test; b Fisher test
Fig. 2Identification of DEGs. A The CN signal of the TCGA samples. The horizontal axis represents the detection area on each chromosome, the vertical axis represents the 230 ovarian cancer samples included in the analysis. 1–22 and X, Y indicates the chromosome number, and blue indicates log2 (CN) < 0, while red indicates log2 (CN) level > 0. B Volcano plots of the DEGs in gene expression. C Volcano plots of the DEGs in CN signals. D The Venn diagram of the DEGs and 108 genes as overlapping genes in both expression and CN signal levels. E Gene Ontology (GO) functional enrichment analysis of the 48 DEGs in the biological process subsection of GO (BP); molecular function subsection of GO (MF); a cellular component subsection of GO (CC)
DEGs significantly related KEGG pathways
| Term | Count | Genes | |
|---|---|---|---|
| hsa04514: Cell adhesion molecules (CAMs) | 2 | 0.021093 | CD274, NLGN1 |
| hsa04020: Calcium signaling pathway | 2 | 0.027187 | NOS2, CACNA1C |
| hsa05200: Pathways in cancer | 2 | 0.045151 | NOS2, MMP1 |
Fig. 3Construction and evaluation of the SVM classifier. A OOB error calculated by the RandomForest algorithm and 10 genes were selected optimally when the OOB error is the smallest. B The receiver operating characteristic (ROC) curve (area under the curve (AUC) of the TCGA training set (solid line) and GSE63885 validation set (dotted line)
10 optimal feature genes contained in the SVM classifier
| Gene | DE mRNA | DE CN | Variant | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| logFC | logFC | Chrom | Variant Position | Type | Variant Classification | Amino acids | Codons | Reference Allele | Tumor Allele | dbSNP_RS | |||
| CD209 | 0.358543 | 0.0142 | −0.42485 | 0.012717 | 19 | 7,807,996 | SNP | Missense | R | Agg/Cgg | C | T | rs11465393 |
| CD274 | 0.437529 | 0.00976 | 0.460916 | 0.002227 | 9 | 5,462,907 | SNP | Missense | G/V | gGg/gTg | G | T | novel |
| HIST1H3I | −0.43754 | 0.00255 | 0.30903 | 0.010138 | 6 | 27,839,796 | SNP | Missense | P/LX | cca/cTca | A | G | novel |
| HIST1H4L | −0.43335 | 0.0143 | 0.30903 | 0.010138 | 6 | 27,841,216 | SNP | Missense | T | acA/acG | C | T | novel |
| NLGN1 | 1.019701 | 0.00143 | 0.351201 | 0.004621 | 3 | 173,525,589 | SNP | Missense | T/M | aCg/aTg | G | C | novel |
| NTRK3 | −0.36699 | 0.0247 | 0.295746 | 0.007258 | 15 | 88,576,170 | SNP | Silent | L | Ctg/Ttg | C | G | novel |
| 88,483,904 | SNP | Nonsense | T | acG/acC | C | A | novel | ||||||
| 88,522,597 | SNP | Intron | R/W | Cgg/Tgg | G | A | novel | ||||||
| 88,423,602 | SNP | Missense | T | acG/acA | G | A | novel | ||||||
| 88,420,205 | SNP | Missense | D/X | Gat/at | C | A | novel | ||||||
| 88,476,380 | SNP | Missense | Y | taC/taT | A | T | novel | ||||||
| PNLDC1 | −0.85134 | 0.000718 | −0.34754 | 0.003742 | 6 | 160,240,313 | SNP | Silent | I/M | atC/atG | G | C | novel |
| SLC22A3 | −0.51215 | 0.0225 | −0.30286 | 0.005976 | 6 | 160,872,083 | SNP | Missense | Y/* | taC/taG | C | T | novel |
| SLC5A1 | 0.298462 | 0.00642 | 0.441319 | 0.001623 | 22 | 32,495,226 | SNP | Nonsense | V | gtC/gtT | C | G | novel |
| SYNM | 0.558712 | 0.00288 | 0.448385 | 0.001034 | 15 | 99,672,483 | SNP | Silent | T/R | aCg/aGg | A | G | novel |
| 99,669,768 | SNP | Silent | T | acC/acT | G | A | novel | ||||||
| 99,671,917 | SNP | Missense | R | cgG/cgC | G | C | novel | ||||||
| 99,672,905 | SNP | Missense | T/N | aCc/aAc | C | G | novel | ||||||
| 99,653,864 | SNP | Silent | D/E | gaT/gaA | C | T | novel | ||||||
| 99,672,085 | SNP | Missense | Y/H | Tac/Cac | G | T | novel | ||||||
Fig. 4Survival analysis of the DEGs and clinical factors. A The Venn diagram of DEGs related to prognosis. B The K-M curve of the overall survival of the patients with different platinum response statuses
Univariate cox regression analysis of clinicopathological characteristics
| Variables | Univariate analysis | ||
|---|---|---|---|
| HR | 95% CI | ||
| Age | 1.008 | 0.9929–1.024 | 0.2972 |
| Platinum response status (Sensitive/Resistant) | 0.2168 | 0.1498–0.3137 | 5.55E-16 |
| Neoplasm subdivision (Bilateral/Left/Right) | 0.8438 | 0.6318–1.127 | 0.249 |
| Stage (II / III / IV) | 1.052 | 0.7266–1.524 | 0.787 |
| Lymphatic invasion (Yes/No) | 0.9587 | 0.5293–1.736 | 0.889 |
| Histologic grade(G1-G2/G3-G4) | 1.272 | 0.8054–2.009 | 0.3015 |
| Recurrence (Yes/No) | 0.6948 | 0.3519–1.372 | 0.2913 |
Fig. 5Construction and validation of the prognostic risk model. A. Cross-validation likelihood filters the lambda parameter (20.88803) when cvl takes the maximum value (− 771.2244). B Based on the L1-penalized regularized regression algorithm, the optimal prognostic gene coefficient distribution line for Cox-PH model screening. C Prognostic prediction by the prognostic risk model in the TCGA training dataset. D Prognostic prediction by the prognostic risk model in the GSE63885 validation set. E The correlation between the K-M curve of the platinum response status and the prognostic model prediction
Optimal prognostic-related genes used to construct the prognostic risk model
| Gene | coef | Hazard Ratio | 95%CI | |
|---|---|---|---|---|
| GJA8 | −0.42542 | 0.8324 | 0.5970–1.1605 | 0.00068 |
| PNLDC1 | 0.430375 | 1.2266 | 0.9523–1.5799 | 0.00225 |
| SLC5A1 | −0.20707 | 0.9885 | 0.8578–1.1391 | 0.01138 |
| VSTM2L | 1.169891 | 1.1904 | 1.0645–1.3313 | 0.01152 |
| CACNA1C | 1.195075 | 2.2235 | 1.4023–3.5256 | 0.027923 |
| SEZ6L | −1.64918 | 0.6669 | 0.4555–0.9763 | 0.03024 |
| GDF3 | 0.442726 | 1.4139 | 1.0336–1.9340 | 0.03722 |
| SYNM | −1.78725 | 0.5982 | 0.4016–0.8912 | 0.04873 |
Fig. 6Screening and validation of the hub genes. A The intersection of the genes between the SVM classifier and the prognostic risk model. B The expression of PNLDC1, SLC5A1, and SYNM in the training TCGA database. C The expression of PNLDC1, SLC5A1, and SYNM in the validation set GSE63885. D Representative immunohistochemistry images of PNLDC1, SLC5A1, and SYNM. E The expression of PNLDC1, SLC5A1, and SYNM in the FUSCC cohort
The CNV information of the 3 hub genes
| Gene | Variat | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Chrom | Variant Position | Type | Variant Classification | Amino acids | Codons | Reference Allele | Tumor Allele | dbSNP_RS | |
| PNLDC1 | 6 | 160,240,313 | SNP | Silent | I/M | atC/atG | G | C | novel |
| SLC5A1 | 22 | 32,495,226 | SNP | Nonsense | V | gtC/gtT | C | G | novel |
| SYNM | 15 | 99,672,483 | SNP | Silent | T/R | aCg/aGg | A | G | novel |
| 99,669,768 | SNP | Silent | T | acC/acT | G | A | novel | ||
| 99,671,917 | SNP | Missense | R | cgG/cgC | G | C | novel | ||
| 99,672,905 | SNP | Missense | T/N | aCc/aAc | C | G | novel | ||
| 99,653,864 | SNP | Silent | D/E | gaT/gaA | C | T | novel | ||
| 99,672,085 | SNP | Missense | Y/H | Tac/Cac | G | T | novel | ||
Fig. 7Survival analysis of the hub genes. A PFS analysis of PNLDC1. B PFS analysis of SLC5A1. C PFS analysis of SYNM
| Observed | |||
| positive | negative | ||
| Predicted | positive | A | B |
| negative | C | D | |
Sensitivity = A/(A + C); Specificity = D/(B + D); PPV (Positive Predictive Value) = A/(A + B); NPV (Negative Predictive Value) = D/(D + C)