| Literature DB >> 22548963 |
Yupeng Cun1, Holger Fröhlichholger Fröhlich.
Abstract
BACKGROUND: Stratification of patients according to their clinical prognosis is a desirable goal in cancer treatment in order to achieve a better personalized medicine. Reliable predictions on the basis of gene signatures could support medical doctors on selecting the right therapeutic strategy. However, during the last years the low reproducibility of many published gene signatures has been criticized. It has been suggested that incorporation of network or pathway information into prognostic biomarker discovery could improve prediction performance. In the meanwhile a large number of different approaches have been suggested for the same purpose.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22548963 PMCID: PMC3436770 DOI: 10.1186/1471-2105-13-69
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Employed breast cancer data sets
| GSE2034 | 286 | 93 | 183 | Wang et al. 2005 [ |
| GSE1456 | 159 | 34 | 119 | Pawitan et al. 2005 [ |
| GSE2990 | 187 | 42 | 116 | Sotiriou et al. 2006 [ |
| GSE4922 | 249 | 69 | 159 | Ivshina et al. 2006 [ |
| GSE7390 | 198 | 56 | 135 | Desmedt et al. 2007 [ |
| GSE11121 | 200 | 28 | 154 | Schmidt et al. 2008 [ |
Figure 1Prediction performance in terms of area under ROC curve (AUC) PAM (prediction analysis of microarray data), sigGenNB (SAM + Naïve Bayes), sigGenSVM (SAM + SVM),SCADSVM, HHSVM (Huberized Hinge loss SVM), RFE (Recursive Feature Elimination), RRFE (Reweighted Recursive Feature Elimination), graphK (graph diffusion kernels for SVMs), graphKp (p-step random walk graph kernel for SVMs), networkSVM (Network-based SVM), PAC (Pathway Activity Classification), aveExpPath (average pathway expression), HubClassify (classification by significant hub genes), pathBoost.
Figure 2Signature stability. The y-axis shows the fraction of genes, being selected between 91 and 100 times.
ANOVA analysis for prediction performance (AUC)
| PAM | 4 | No |
| sigGenNB | 3 | No |
| sigGenSVM | 2 | No |
| SCAD | 6 | No |
| No | ||
| RFE | 1 | No |
| RRFE | 6 | Yes |
| graphK | 2 | Yes |
| graphkKp | 1 | Yes |
| networkSVM | 1 | Yes |
| PAC | 0 | Yes |
| Yes | ||
| HubClassify | 6 | Yes |
| pathBoost | 4 | Yes |
| | ||
PAM (prediction analysis of microarray data), sigGenNB (SAM + Naïve Bayes), sigGenSVM (SAM + SVM),SCAD-SVM, HHSVM (Huberized Hinge loss SVM), RFE (Recursive Feature Elimination), RRFE (Reweighted Recursive Feature Elimination), graphK (graph diffusion kernels for SVMs), graphKp (p-step random walk graph kernel for SVMs), networkSVM (Network-based SVM), PAC (Pathway Activity Classification), aveExpPath (average pathway expression), HubClassify (classification by significant hub genes), pathBoost.
Gene selection stability according to stability index (lower = better)
| PAM | 0.237 | 0.282 | 0.259 | 0.302 | 0.281 | 0.277 | 0.279 |
| sigGenNB | 0.209 | 0.193 | 0.173 | 0.289 | 0.208 | 0.272 | 0.208 |
| sigGenSVM | 0.209 | 0.193 | 0.173 | 0.289 | 0.208 | 0.272 | 0.208 |
| SCAD | 0.245 | 0.265 | 0.268 | 0.232 | 0.229 | 0.251 | 0.247 |
| HHSVM | 0.212 | 0.191 | 0.210 | 0.199 | 0.197 | 0.205 | 0.202 |
| RFE | 0.285 | 0.298 | 0.295 | 0.287 | 0.293 | 0.291 | 0.292 |
| RRFE | 0.224 | 0.240 | 0.211 | 0.246 | 0.209 | 0.248 | 0.232 |
| graphK | 0.276 | 0.290 | 0.295 | 0.285 | 0.283 | 0.285 | 0.285 |
| graphkKp | 0.269 | 0.281 | 0.276 | 0.271 | 0.273 | 0.276 | 0.274 |
| networkSVM | |||||||
| PAC | 0.249 | 0.257 | 0.245 | 0.259 | 0.158 | 0.181 | 0.248 |
| aveExpPath | 0.189 | 0.192 | 0.156 | 0.294 | 0.190 | 0.237 | 0.191 |
| HubClassify | 0.215 | 0.106 | 0.138 | 0.120 | 0.113 | ||
| pathBoost | 0.200 | 0.206 | 0.247 | 0.199 | 0.235 | 0.213 | 0.210 |
Figure 3Interpretability of signatures (enriched disease genes). For aveExpPath and PAC the enrichment of the particular disease category within selected pathway genes is shown. A represents data GSE2034 [34]; B represents data GSE11121 [39]; C represents data GSE1456 [35]; D represents data GSE2990 [36]; E represents data GSE4922 [37]; F represents data GSE7390 [38].
Figure 4Interpretability of signatures (enriched KEGG-pathways). For aveExpPath the adjusted p-value for differential expression from the SAM-test is shown. For all other methods we tested pathway enrichment within the set of selected genes.
Figure 5Interpretability of signatures (enriched drug targets). For aveExpPath and PAC the enrichment of drug targets within selected pathway genes is shown.