| Literature DB >> 31681404 |
Zhitong Bing1,2,3, Yuxiang Yao4, Jie Xiong5, Jinhui Tian1,2, Xiangqian Guo6, Xiuxia Li1,2,7, Jingyun Zhang1,2, Xiue Shi8, Yanying Zhang9, Kehu Yang1,2,8,9.
Abstract
Different analytical methods or models can often find completely different prognostic biomarkers for the same cancer. In the study of prognostic molecular biomarkers of ovarian cancer (OvCa), different studies have reported a variety of prognostic gene signatures. In the current study, based on geometric concepts, the linearity-clustering phase diagram with integrated P-value (LCP) method was used to comprehensively consider three indicators that are commonly employed to estimate the quality of a prognostic gene signature model. The three indicators, namely, concordance index, area under the curve, and level of the hazard ratio were determined via calculation of the prognostic index of various gene signatures from different datasets. As evaluation objects, we selected 13 gene signature models (Cox regression model) and 16 OvCa genomic datasets (including gene expression information and follow-up data) from published studies. The results of LCP showed that three models were universal and better than other models. In addition, combining the three models into one model showed the best performance in all datasets by LCP calculation. The combination gene signature model provides a more reliable model and could be validated in various datasets of OvCa. Thus, our method and findings can provide more accurate prognostic biomarkers and effective reference for the precise clinical treatment of OvCa.Entities:
Keywords: Cox regression; gene signature; ovarian cancer; prognosis index; robust prognostic model
Year: 2019 PMID: 31681404 PMCID: PMC6798149 DOI: 10.3389/fgene.2019.00931
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1The study flowchart. The chart describes the process of filtering available dataset and method of calculation and analysis.
The 13 published gene signatures for the prognosis of ovarian cancer selected for meta-analysis.
| Gene signature | Number of genes | Number of samples | Form of parameter | TNM stage | Grade | Dataset |
|---|---|---|---|---|---|---|
|
| 57 | 195 | β | III, IV | High grade | GSE26712 |
|
| 200 | 53 | HR | III, IV | 3 | GSE18520 |
|
| 300 | 80 | β | III, IV | 2, 3 | GSE14764 |
|
| 86 | 110 | HR | III, IV | 1, 2, 3 | GSE13876 |
|
| 88 | 157 | β | III, IV | 1, 2, 3 | GSE17260, GSE9891 |
|
| 7 | 35 | HR | I, III, IV | 1, 2, 3, NI | – |
|
| 193 | 489 | β | II, III, IV | 1, 2, 3 | TCGA,2011 |
|
| 126 | 300 | β | III, IV | 2, 3 | GSE32062 |
|
| 37 | 1287 | HR | I, II, III, IV | 1, 2, 3 | TCGA, GSE14764, GSE15622, GSE19829, |
|
| 100 | 489 | β | II, III, IV | High grade | TCGA, GSE9899 |
|
| 200 | 1525 | β | I, II, III, IV | High grade | TCGA, E.MTAB.386, GSE12418, |
|
| 32 | 1757 | HR | III, IV | High grade | TCGA, GSE14764, GSE15622, GSE19829, |
|
| 19 | 484 | HR and β | I, II, III, IV | 1, 2, 3 | TCGA, GSE9899 |
Figure 2Overlapping of 13 gene signature models that were tested by Jaccard index. The numbers on edges represent the number of genes in each model. (A) The numbers in lattices represent the number of genes overlapping between different models. (B) The relationship of function among the 13 gene signature models. The blue lines represent the connection between two gene signature models. The purple lines represent at least three models associated with each other. (C) The overlapping of GO enrichment and KEGG among 13 gene signature models.
Figure 3The boxplots of the three indicators and p-value. The three indicators and p-value, HR (A), C-index (B), AUC (C), and P-value (D) of each model of the 16 datasets are depicted by boxplot, respectively. The dashed line represents the threshold for each indicator (HR = 1 as the threshold, C-index = 0.5 as threshold, AUC = 0.5 as threshold, and P-value = 0.05 as significant threshold). The P-values of log-rank (D) from comparison of high-risk and low-risk cohorts are also considered.
Figure 4Two-dimensional linearity-clustering phase diagram of models. Tags in the figure denote the various models as described in . The coordinates show the reciprocals of residual of a line fitting and gathering degree of scatter points of one model in 3D space repetitively. The color of point reflects the overall confidence level. (A) Distribution of the 13 gene signature models in linearity-clustering phase diagram. (B) Distribution of the combination models in linearity-clustering phase diagram.
Figure 5Three indicators clustered across 16 different datasets: (A) HR, (B) C-index, (C) AUC of 5 years. (D) Log-rank p-value of the 13 models clustered in 16 different datasets. (E) The overlapping of the four indicators.
Figure 6GO enrichment of WRT model. (A) Risky and protective gene enrichment in biological processes. (B) Cellular component and (C) molecular function.