| Literature DB >> 25587357 |
Syed Haider1, Jun Wang1, Ai Nagano1, Ami Desai2, Prabhu Arumugam2, Laurent Dumartin1, Jude Fitzgibbon3, Thorsten Hagemann4, John F Marshall2, Hemant M Kocher2, Tatjana Crnogorac-Jurcevic1, Aldo Scarpa5, Nicholas R Lemoine1, Claude Chelala1.
Abstract
BACKGROUND: Improved usage of the repertoires of pancreatic ductal adenocarcinoma (PDAC) profiles is crucially needed to guide the development of predictive and prognostic tools that could inform the selection of treatment options.Entities:
Year: 2014 PMID: 25587357 PMCID: PMC4293116 DOI: 10.1186/s13073-014-0105-3
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Figure 1mRNA abundance patterns of 36-gene signature. Three genes (RFX8, RPSAP58 and GTF2IRD2B) were removed as over 50% of the validation cohort samples did not have corresponding expression profiles available given the annotation libraries used at the time of this study, and were therefore deemed unsuitable for clustering. The annotations on the right represent HGNC gene symbols along with asterisks indicating the significance of the mRNA-based differential expression between the predicted risk groups (***P adjusted < 0.001, **P adjusted < 0.01 and *P adjusted < 0.05). The covariates along the horizontal axis show a patient’s predicted risk group (black is low risk and red is high risk) and underlying dataset (red = Badea, green = Biankin, blue = Collisson, cyan = Donahue, magenta = Grutzmann, yellow = Pei, grey = Stratford and white = Winter). These results show two clusters of differential gene expression between the patient groups that demonstrate significantly different overall survival. HGNC, HUGO Gene Nomenclature Committee.
Figure 2Performance assessment of PDAC classifier. (A) Kaplan–Meier survival analysis of patient risk groups identified with the training cohort using the 36-gene signature. (B) Kaplan–Meier survival analysis of the predicted risk group of patients in the merged validation cohort using the 36-gene signature. The hazard ratio (HR) was estimated using a Cox proportional hazards model, and curves were compared using a log-rank test. (C-F) Independent validation across all the individual datasets that make up the validation cohort. (G) Assessment of classification accuracy using sensitivity and specificity estimates. Patients in the validation cohort were dichotomised by a median survival of 20 months (grey line), and classed into low- and high-risk groups, dashed black and red curves respectively. The solid curves represent a patient’s predicted risk group. Comparison between the two sets of groups indicates an overall 64.68% classification accuracy. (H) Forest plots of the genes in the prognostic signature. A Cox proportional hazards model was fitted to the signature genes in a univariate context. The horizontal axis represents HR (black squares) and 95% CIs (solid line). The asterisks represent the significance of the difference in patient outcome between the low- and high-expression groups (***P < 0.001, **P < 0.01 and *P < 0.05; Wald test). CI, confidence interval; HR, hazard ratio; PDAC, pancreatic ductal adenocarcinoma.
Figure 3Overlap among PDAC gene signatures. (A) Venn diagram detailing overlaps between the 36-gene signature and existing PDAC gene signatures. (B) Same as (A) except all the candidate prognostic genes used to derive the 36-gene signature were assessed for overlap with the existing PDAC gene signatures. PDAC, pancreatic ductal adenocarcinoma.
Figure 4Random prognostic gene signatures for PDAC. (A–E) Randomisation results of 5 million gene signatures with the initial feature set of 225 genes. All signatures were trained and validated on the same datasets as used for the derivation of the 36-gene signature. A χ2 statistic was used a performance metric for comparing random signatures with the 36-gene signature. The dashed lines are the χ2 values for the 36-gene signature. Overall, only 0.19% of the random signatures outperformed the 36-gene signature across all datasets. (F) Percentage inclusion of genes in the 36-gene signature for the randomly selected gene signatures. Thirty-one out of 36 genes were amongst the top 5% genes with highest inclusion frequency, with SEMA3A ranked highest (overall ranked 11th). PDAC, pancreatic ductal adenocarcinoma.