| Literature DB >> 25352096 |
Riccardo De Bin1, Tobias Herold, Anne-Laure Boulesteix.
Abstract
BACKGROUND: In the last years, the importance of independent validation of the prediction ability of a new gene signature has been largely recognized. Recently, with the development of gene signatures which integrate rather than replace the clinical predictors in the prediction rule, the focus has been moved to the validation of the added predictive value of a gene signature, i.e. to the verification that the inclusion of the new gene signature in a prediction model is able to improve its prediction ability.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25352096 PMCID: PMC4271356 DOI: 10.1186/1471-2288-14-117
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Acute myeloid leukemia: REMARK-like profile of the analysis performed on the dataset
| a) Patients, treatment and variables | ||||
|---|---|---|---|---|
|
|
| |||
| Marker | OS = 86-probe-set gene-expression signature | |||
| Further variables | v1 = | |||
| Reference | Metzeler et al. (2008) | |||
| Source of the data | GEO (reference: GSE12417) | |||
|
|
|
| ||
| Training set | Assessed for eligibility | 163 |
| |
|
| ||||
| Excluded | 0 | |||
| Included | 163 |
| ||
|
| ||||
| With outcome events | 105 |
| ||
| Validation set | Assessed for eligibility | 79 |
| |
|
| ||||
| Excluded | 0 | |||
| Included | 79 |
| ||
|
| ||||
| With outcome events | 33 |
| ||
|
| ||||
| Data source | Same research group, different time (see above) | |||
| Follow-up time | Much shorter in the validation set (see text) | |||
| Survival rate | Higher in the validation set (see Figure | |||
|
| ||||
|
|
|
|
|
|
|
| ||||
| A1: univariate | 163 | 105 | v1 to v4 | Kaplan-Meier curves (Figure |
| 79 | 33 | |||
|
| ||||
| B1: overall prediction | Prediction error curves (Figure | |||
| Integrated Brier score (text) | ||||
| Training | Comparison of Kaplan-Meier curves for risk groups: | |||
| 163 | 105 | - Medians as cutpoints (Figure | ||
| B2: discriminative ability | OS, v1 to v4 | - K-mean clustering (data not shown - see text) | ||
| C-index (text) | ||||
| Validation | K-statistic (text) | |||
| B3: calibration | 79 | 33 | Kaplan-Meier curve vs average individual survival curves for risk groups (Figure | |
| Calibration slope (text) | ||||
|
| ||||
| C1: significance | 79 | 33 | OS, v1 to v4 | Multivariate Cox model (Table |
|
| ||||
| D1: overall prediction | 79 | 33 | OS, v1 to v4 | Prediction error curves based on repeated cross-validation (Figure |
| Prediction error curves based on repeated subsampling (data not shown - see text) | ||||
| Prediction error curves based on repeated bootstrap resampling (data not shown - see text) | ||||
| Integrated Brier score based on cross-validation (text) | ||||
|
| ||||
| E1: overall prediction | Female | OS, v1 to v4 | Prediction error curves (Figure | |
| E2: discriminative ability | t.: 88 54 | C-index (text) | ||
| v.: 46 16 | K-statistic (text) | |||
| E3: calibration | Male | Calibration slope (text) | ||
| E4: significance | t.: 74 51 | Multivariate Cox model (text) | ||
| E5: overall prediction | v.: 33 17 | Prediction error curves based on cross-validation (Figure | ||
Chronic lymphocytic leukemia: REMARK-like profile of the analysis performed on the dataset
| a) Patients, treatment and variables | ||||
|---|---|---|---|---|
|
|
| |||
| Marker | OS = 8-probe-set gene-expression signature | |||
| Further variables | v1 = | |||
| Reference | Herold et al. (2011) | |||
| Source of the data | GEO (reference: GSE22762) | |||
|
|
|
| ||
| Assessed for eligibility | 151 |
| ||
|
| ||||
| Training set | Excluded | 0 | ||
| Included | 151 |
| ||
|
| ||||
| With outcome events | 41 |
| ||
| Assessed for eligibility | 149 |
| ||
|
| ||||
| Validation set | Excluded | 18 | Due to missing clinical information | |
| Included | 131 |
| ||
|
| ||||
| With outcome events | 40 |
| ||
|
| ||||
| Data source | Same institution, different time (see above) | |||
| Measurement of gene expressions | Affymetrix HG-U133 vs. TaqMan LDA (see text) | |||
| Survival rate | Lower in the validation set (see Figure 4) | |||
|
| ||||
|
|
|
|
|
|
|
| ||||
| F1: univariate | 151 | 41 | v1 to v4 | Kaplan-Meier curves (Figure |
| 131 | 40 | |||
|
| ||||
| G1: significance | 131 | 40 | OS, v1 to v4 | Multivariate Cox model (Table |
|
| ||||
| H1: Overall prediction | 131 | 40 | OS, v1 to v4 | Prediction error curves based on cross-validation (Figure |
| Integrated Brier score based on cross-validation (text) | ||||
Figure 1AML: univariate Kaplan-Meier curves. Acute myeloid leukemia: Kaplan-Meier estimation of the survival curves in subgroups based on age (first row), sex (second row), FLT3-ITD (third row) and NPM1 (fourth row), computed in the training (first column) and in the validation (second column) sets.
Figure 2AML: Kaplan-Meier curves. Acute myeloid leukemia: comparison between the Kaplan-Meier estimation of the survival curves computed in the training (red line) and in the validation (green line) sets.
Figure 3CLL: univariate Kaplan-Meier curves. Chronic lymphocytic leukemia: Kaplan-Meier estimation of the survival curves in subgroups based on age (first row), sex (second row), FISH (third row) and IGVH (fourth row), computed in the training (first column) and in the validation (second column) sets. The values of FISH (third row) represent: 0 = deletion of 13q14 only; 1 = deletion of 11q22-23 but no deletion of 17p13; 2 = deletion of 17p13, 3 = trisomy 12q13 but no deletion of 17p13 or 11q22-23, 4 = no previously mentioned chromosomal aberration.
Figure 4CCL: Kaplan-Meier curves. Chronic lymphocytic leukemia: comparison between the Kaplan-Meier estimation of the survival curves computed in the training (red line) and in the validation (green line) sets.
Characteristics of the measures implemented to evaluate the prediction ability of a model
| Aspect | Measure | Characteristics |
|---|---|---|
| Discriminative ability | Kaplan-Meier curves for risk groups | Better with greater distance between the Kaplan-Meier curves for the low- and high risk groups |
| C-index | Estimates the concordance probability, i.e. the probability that the score correctly orders two patients with respect to their survival time; higher values correspond to better prediction | |
| K-statistic | Alternative to the C-index; works only under the proportional hazards assumption | |
| Calibration | Survival curves | Compares the observed survival function with the average predicted curve |
| Calibration slope | Computes the regression coeffcient of the prognostic score as unique predictor; the best values are those close to 1; related to overfitting issues | |
| Overall prediction | Prediction error curves | Presents the Brier score versus time; the closer the curves are to the X-axis, the better the prediction |
| Integrated Brier score | Computes the area under the prediction error curves; the smaller is the value, the better the prediction |
Acute myeloid leukemia: estimates of the log-hazard in a multivariate Cox model fitted on the validation data, with the standard deviations and the p-values related to the hypothesis of nullity of the coefficients (simple null hypothesis)
| Variable | Coeff | Sd(coeff) | P-value |
|---|---|---|---|
|
| 0.523 | 0.243 | 0.0312 |
|
| 0.022 | 0.015 | 0.1340 |
|
| 0.643 | 0.404 | 0.1114 |
|
| 0.436 | 0.440 | 0.3220 |
|
| -0.377 | 0.404 | 0.3497 |
Figure 5AML: prediction error curves. Acute myeloid leukemia: prediction error curves based on the Bier score computed in the validation set for the null (black line), the clinical (red line) and the combined (green line) models fitted on the training data.
Figure 6AML: Kaplan-Meier curves for low and high-risk groups. Acute myeloid leukemia: Kaplan-Meier curves computed in the validation set for risks groups based on the clinical (red) and the combined (green) scores derived in the training set: the curves below represent the survival curves for observations belonging to the high risk group, the two above the low risk group.
Figure 7AML: comparison between observed and average predicted survival curves in the validation set. Acute myeloid leukemia: comparison between the observed survival curve (Kaplan-Meier, black line) and the average predictive survival curves computed in the validation set using the clinical (red line) and combined (green line) models fitted on the training data. Continuous lines represent the average predictive survival curves computed interpolating the baseline survival curve derived in the training set. Dashed lines represent the same curves computed using an estimation of the baseline survival curve derived in the validation set. For the dotted curves, the estimates of the regression coefficients are shrunk toward 0.
Acute myeloid leukemia: differences in the estimates of the log-hazard ratio when the combined model is fitted on the training (first column) or on the validation (second column) data
| Log-hazard ratios | ||
|---|---|---|
| Variable | Training | Validation |
|
| 0.642 (0.172) | 0.523 (0.243) |
|
| 0.021 (0.008) | 0.022 (0.015) |
|
| -0.024 (0.208) | 0.643 (0.404) |
|
| 0.448 (0.253) | 0.436 (0.440) |
|
| -0.370 (0.215) | -0.377 (0.404) |
Standard deviations are reported between brackets.
Figure 8AML: prediction error curves based on 10-fold cross-validation. Acute myeloid leukemia: prediction error curves based on Brier score computed via 10-fold cross-validation (100 replications). The null (black line), the clinical (red line) and the combined (green line) models are considered. Only the validation set is used.
Figure 9AML: prediction error curves in female and male populations. Acute myeloid leukemia: prediction error curves based on the Bier score computed in the validation set for the null (black line), the clinical (red line) and the combined (green line) models, fitted on the training data, for both the female (left) and the male (right) populations.
Figure 10AML: prediction error curves based on 10-fold cross-validation in female and male populations. Acute myeloid leukemia: prediction error curves based on 10-fold cross-validation (100 replications) for the null (black lines), clinical (red lines) and combined (green lines) models in the female (left) and in the male (right) populations. Only the validation set is used.
Chronic lymphocytic leukemia: estimates of the log-hazard in a multivariate Cox model fitted on the validation data, with the standard deviations and the p-values related to the hypothesis of nullity of the coefficients (simple null hypothesis)
| Variable | Coeff | Sd (coeff) | P-value |
|---|---|---|---|
|
| -0.589 | 0.150 | 8.65×10-05 |
|
| 0.113 | 0.023 | 6.82×10-07 |
|
| 0.157 | 0.343 | 0.6472 |
|
| 0.171 | 0.459 | 0.7092 |
|
| 1.352 | 0.590 | 0.0219 |
|
| -0.195 | 0.665 | 0.7694 |
|
| -0.459 | 0.427 | 0.2823 |
|
| 0.695 | 0.416 | 0.0949 |
Figure 11CLL: prediction error curves based on 10-fold cross-validation in female and male populations. Chronic lymphocytic leukemia: prediction error curves based on 10-fold cross-validation (100 replications).