| Literature DB >> 24732597 |
S B Amin1, W-K Yip2, S Minvielle3, A Broyl4, Y Li5, B Hanlon6, D Swanson2, P K Shah7, P Moreau3, B van der Holt4, M van Duin4, F Magrangeas3, P Pieter Sonneveld3, K C Anderson8, C Li7, H Avet-Loiseau3, N C Munshi9.
Abstract
With advent of several treatment options in multiple myeloma (MM), a selection of effective regimen has become an important issue. Use of gene expression profile (GEP) is considered an important tool in predicting outcome; however, it is unclear whether such genomic analysis alone can adequately predict therapeutic response. We evaluated the ability of GEP to predict complete response (CR) in MM. GEP from pretreatment MM cells from 136 uniformly treated MM patients with response data on an IFM, France led study were analyzed. To evaluate variability in predictive power due to microarray platform or treatment types, additional data sets from three different studies (n=511) were analyzed using same methods. We used several machine learning methods to derive a prediction model using training and test subsets of the original four data sets. Among all methods employed for GEP-based CR predictive capability, we got accuracy range of 56-78% in test data sets and no significant difference with regard to GEP platforms, treatment regimens or in newly diagnosed or relapsed patients. Importantly, permuted P-value showed no statistically significant CR predictive information in GEP data. This analysis suggests that GEP-based signature has limited power to predict CR in MM, highlighting the need to develop comprehensive predictive model using integrated genomics approach.Entities:
Mesh:
Year: 2014 PMID: 24732597 PMCID: PMC4198516 DOI: 10.1038/leu.2014.140
Source DB: PubMed Journal: Leukemia ISSN: 0887-6924 Impact factor: 11.528
Characteristics and details of studies used for gene expression profile-based response prediction.
| IFM I | IFM II | HOVON | Mulligan et al. | |
|---|---|---|---|---|
| Study | IFM 2005 | IFM 2005 | HOVON 65 | APEX / |
| Number of Samples | 136 | 67 | 282 | 162 |
| Platform | Affymetrix Exon 1.0 ST array | Affymetrix Exon 1.0 ST array | Affymetrix U133 Plus 2.0 array | Affymetrix U133 Plus 2.0 array |
| Patient Population | Newly-diagnosed | Newly-diagnosed | Newly-diagnosed | Relapsed |
| Treatment Protocol | VAD, ASCT | Bortezomib, ASCT | VAD/PAD, ASCT | Bortezomib |
| Time of Response Measurement | Post-transplant | Post-Induction | Post-transplant | Post Salvage therapy |
| Complete response | 44 (32%) | 24 (36 % %) | 76 (27 %) | 73 (43%) |
: NCBI GEO accession IDs: IFM I: GSE39754, IFM II: GSE55145.
: Broyl A, et al. Blood 2010
: Includes patients with partial response; 13 patients with CR and 60 with PR.
Figure 1Flow-chart showing major steps to develop response (CR) prediction model.
Figure 2Summary of CR prediction performance
Performance of various class prediction methods in predicting CR in the validation dataset using IFM I dataset (a) and HOVON dataset (b). CCP: Compound covariate predictor; LDA: Linear discriminant analysis; KNN: K-nearest neighbor; NC: Nearest centroid; SVM: Support vector machine.
Summary of CR Prediction Analysis
Maximum achievable performance (MAP) in both training and validation sets for all four datasets using seven different class prediction methods. MAP is defined as highest accuracy achieved by one of seven classifiers methods used for a given dataset.
| a: Class Prediction Results – Maximum Achievable Accuracy – Training Set | ||||||
|---|---|---|---|---|---|---|
| Prediction | Sensitivity | Specificity | PPV | NPV | Accuracy | |
| IFM I | KNN | 24 | 77 | 33 | 69 | 60 |
| IFM II | KNN | 72 | 56 | 65 | 64 | 65 |
| HOVON | SVM | 17 | 92 | 48 | 71 | 70 |
| Mulligan et al. | KNN | 71 | 67 | 68 | 71 | 69 |
SVM: Support Vector Machines; KNN: K-nearest neighbors (n=1)
Permuting class labels to assess the power of predicting CR
During the training of classifiers, the treatment response labels were permuted 1000 times to measure the statistical significance of prediction performance. P-values indicate the proportion of total 1000 permutations giving better prediction performance than the original analysis using real treatment response labels. Hence, higher p-value suggests lower confidence in prediction analysis results using gene expression data.
| Real CR % | Predicted CR: | |
|---|---|---|
| IFM I | 32 (44/136) | 25 (0.36) |
| IFM II | 36 (24/67) | 27 (0.11) |
| HOVON | 27 (76/282) | 45 (0.13) |
| Mulligan et al. | 43 (73/162) | 43 (0.49) |
Predicted CR or Positive Predictive Value is derived from the test or validation dataset analysis using criteria giving the best possible overall model accuracy.