| Literature DB >> 24098079 |
Abstract
PURPOSE: Personalized medicine is predicated on the concept of identifying subgroups of a common disease for better treatment. Identifying biomarkers that predict disease subtypes has been a major focus of biomedical science. In the era of genome-wide profiling, there is controversy as to the optimal number of genes as an input of a feature selection algorithm for survival modeling. PATIENTS AND METHODS: The expression profiles and outcomes of 544 patients were retrieved from The Cancer Genome Atlas. We compared four different survival prediction methods: (1) 1-nearest neighbor (1-NN) survival prediction method; (2) random patient selection method and a Cox-based regression method with nested cross-validation; (3) least absolute shrinkage and selection operator (LASSO) optimization using whole-genome gene expression profiles; or (4) gene expression profiles of cancer pathway genes.Entities:
Keywords: TCGA; brain; feature selection; glioblastoma; personalized medicine; survival modeling
Mesh:
Substances:
Year: 2013 PMID: 24098079 PMCID: PMC3790279 DOI: 10.2147/IJN.S40733
Source DB: PubMed Journal: Int J Nanomedicine ISSN: 1176-9114
Survival prediction comparison between the 1-NN survival prediction method and the random patient selection method
| Measure | Type of prediction
| |
|---|---|---|
| 1-NN survival prediction | Random patient selection | |
| MAD | 386.2 | 455.8 |
Note:
The average of MAD values for the random patient selection method was computed by repeating the simulation of survival prediction 100 times.
Abbreviations: 1-NN, 1-nearest neighbor survival prediction method; MAD, mean absolute difference between observed survival (in days) and predicted survival (in days).
Figure 1Histogram of absolute difference between observed survival (in days) and survival (in days) predicted by the 1-NN method.
Abbreviation: 1-NN, 1-nearest neighbor survival prediction method.
Survival prediction comparisons based on Pearson’s correlation coefficient
| Measure | Type of prediction
| ||
|---|---|---|---|
| 1-NN survival prediction | Coxnet with whole genome | Coxnet with cancer pathway genes | |
| Correlation | 0.18 | −0.22 | −0.24 |
Notes:
Pearson’s correlation coefficient between observed survival and predicted survival;
Pearson’s correlation coefficient between observed survival and predicted relative risks for nested tenfold cross-validation, where an inner loop was used for LASSO parameter determination.
Abbreviations: 1-NN, 1-nearest neighbor survival prediction method; Coxnet, regularized Cox regression.12
Annotation to biological pathways of the top 32 genes (among 300 preselected cancer genes) used for building the models selected by Coxnet12
| Pathway | Genes selected by Coxnet |
|---|---|
| Wnt pathway | |
| JNK pathway | |
| Apoptosis | |
| ECM receptor interaction | |
| ERBB pathway | |
| HIF pathway | |
| AKT pathway | |
| NFkB pathway | |
| Retinoic acid receptor | |
| Hedgehog pathway | |
| Inflammation | |
| Resistance to chemotherapy | |
| Cell cycle | |
| Gene expression during myeloid and B-lymphoid cell development |
Note:
Pathways with more than three genes selected by Coxnet.
Abbreviations: Coxnet, regularized Cox regression; JNK, C-Jan N-terminal kinase; ECM, extracellular matrix; HIF, hypoxia-inducible factor; NFkB, nuclear factor-kappa B; IL, interleukin; EGF, epidermal growth factor.