| Literature DB >> 23185353 |
Kui Shen1, Nan Song, Youngchul Kim, Chunqiao Tian, Shara D Rice, Michael J Gabrin, W Fraser Symmans, Lajos Pusztai, Jae K Lee.
Abstract
Previous studies have reported conflicting assessments of the ability of cell line-derived multi-gene predictors (MGPs) to forecast patient clinical outcomes in cancer patients, thereby warranting an investigation into their suitability for this task. Here, 42 breast cancer cell lines were evaluated by chemoresponse tests after treatment with either TFAC or FEC, two widely used standard combination chemotherapies for breast cancer. We used two different training cell line sets and two independent prediction methods, superPC and COXEN, to develop cell line-based MGPs, which were then validated in five patient cohorts treated with these chemotherapies. This evaluation yielded high prediction performances by these MGPs, regardless of the training set, chemotherapy, or prediction method. The MGPs were also able to predict patient clinical outcomes for the subgroup of estrogen receptor (ER)-negative patients, which has proven difficult in the past. These results demonstrated a potential of using an in vitro-based chemoresponse data as a model system in creating MGPs for stratifying patients' therapeutic responses. Clinical utility and applications of these MGPs will need to be carefully examined with relevant clinical outcome measurements and constraints in practical use.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23185353 PMCID: PMC3504014 DOI: 10.1371/journal.pone.0049529
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Two breast cancer cell line sets for MGP model training.
| Database source | GEO/EBI accession number | Microarray platform | number of breast cancer cell lines | |
| TFAC | FEC | |||
| Hoeflich | GSE12777 | HG-U133 Plus 2.0 | 41 | 39 |
| Neve | E-TABM-157 | HG-U133A | 30 | 30 |
| Common cell lines between Neve and Hoeflich data sets | 28 | 27 | ||
The TFAC chemoresponse test for one cell line (SW527) did not pass quality control; therefore, the AUC values for 41 cell lines were available for further analysis.
The FEC chemoresponse test for three cell lines (HCC1419, HCC1569, and HCC1806) did not pass quality control; therefore, the AUC values for 39 cell lines were available for further analysis.
The number of cell lines common to the Hoeflich data set and the 42 cell lines and whose chemoresponses were measured is 30.
There are 28 TFAC-treated and 27 FEC-treated cell lines common to the Neve and Hoeflich data sets.
Summary information for the gene expression and clinical outcome test sets for five clinical trials in the GEO database.
| MAQC-training | MAQC-validation | Tabchy-TFAC | Tabchy-FEC | Iwamoto | |
|
| GSE20194 | GSE20194 | GSE20271 | GSE20271 | GSE22093 |
|
| TFAC | TFAC | TFAC | FAC/FEC | FAC/FEC |
|
| 130 | 100 | 91 | 87 | 82 |
|
| 80 | 61 | 49 | 49 | 41 |
|
| 50 | 39 | 41 | 37 | 41 |
|
| 33 | 15 | 19 | 7 | 24 |
|
| 97 | 85 | 72 | 80 | 58 |
|
| 34.00% | 17.60% | 26.40% | 8.80% | 42.40% |
The Tabchy-TFAC data set (GSE20271) has 31 samples that overlap with the Iwamoto data set (GSE22093); therefore, these two data sets are not completely independent.
Figure 1Chemoresponse-derived AUD values for each cell line, labeled by ER status, for both TFAC (top) and FEC (bottom) treatments.
Figure 2Correlation between the prediction scores calculated by the superPC and COXEN methods from the Neve training set for each test set.
Figure 3Distributional difference between prediction scores calculated by the superPC or COXEN methods from the Neve training set for responders (pCR, pathologic complete response) and non-responders (RD, residual disease), with p-value and AUC.
In the top row, the prediction scores are calculated by the superPC method, and in the bottom row, the prediction scores are calculated by the COXEN method. Red lines represent median prediction scores in each group.
Figure 4ROC curve validation of MGPs from the Neve training set in 5 clinical trials.
In each figure, blue lines represent MGPs developed using the superPC method, while red lines represent MGPs developed using the COXEN method.
Prediction results for the superPC and COXEN methods in all breast cancer cell lines evaluated by AUC scores.
| Neve-based prediction | Hoeflich-based prediction | |||||
| Clinical trial test set | ER(+/−) | SuperPC | COXEN | SuperPC | COXEN | |
|
|
| All | 0.749 | 0.778 | 0.717 | 0.780 |
| ER+ | 0.791 | 0.732 | 0.664 | 0.725 | ||
| ER− | 0.605 | 0.564 | 0.523 | 0.57 | ||
|
| All | 0.717 | 0.764 | 0.733 | 0.746 | |
| ER+ | 0.449 | 0.542 | 0.390 | 0.424 | ||
| ER− | 0.524 | 0.547 | 0.515 | 0.577 | ||
|
| All | 0.731 | 0.707 | 0.682 | 0.703 | |
| ER+ | 0.750 | 0.678 | 0.728 | 0.622 | ||
| ER− | 0.644 | 0.605 | 0.556 | 0.615 | ||
|
|
| All | 0.784 | 0.793 | 0.664 | 0.688 |
| ER+ | 0.804 | 0.819 | 0.717 | 0.587 | ||
| ER− | 0.902 | 0.811 | 0.583 | 0.788 | ||
|
| All | 0.769 | 0.738 | 0.687 | 0.682 | |
| ER+ | 0.794 | 0.643 | 0.777 | 0.668 | ||
| ER− | 0.730 | 0.706 | 0.453 | 0.539 | ||
: P<0.05,
: P<0.1.
The AUC values are grouped by ER status: All (cells of both ER status), ER+ (ER− positive cells), and ER– (ER-negative cells) and are separated based on the cell line expression database used to create the cell line MGPs. Note that these five validation datasets (except Tabchy-TFAC and Iwamoto) were independent for the superPC prediction method, because this predictor was not pre-optimized or optimized using any of these data sets. For the COXEN prediction method, MAQC-training and Tabchy-FEC datasets were used for optimization, and therefore the remaining three datasets were truly independent validation sets for this method.
Prediction results for the COXEN model using either ER+ (ER-positive) or ER– (ER-negative) breast cancer cell lines, evaluated by area under receiver operator characteristic (AU-ROC) scores.
| Neve based prediction | Hoeflich based prediction | |||||
| Drug | Clinical trial test set | ER(+/−) | ER+(14) | ER−(16) | ER+(16) | ER−(23) |
|
|
| All | 0.663 | 0.631 | 0.443 | 0.785 |
| ER+ | 0.741 | 0.617 | 0.588 | 0.676 | ||
| ER− | 0.644 | 0.678 | 0.349 | 0.721 | ||
|
| All | 0.599 | 0.562 | 0.536 | 0.677 | |
| ER+ | 0.432 | 0.441 | 0.508 | 0.347 | ||
| ER− | 0.624 | 0.533 | 0.536 | 0.503 | ||
|
| All | 0.582 | 0.598 | 0.464 | 0.774 | |
| ER+ | 0.478 | 0.472 | 0.594 | 0.778 | ||
| ER− | 0.615 | 0.699 | 0.415 | 0.726 | ||
|
|
| All | 0.646 | 0.636 | 0.504 | 0.650 |
| ER+ | 0.703 | 0.529 | 0.761 | 0.428 | ||
| ER− | 0.515 | 0.720 | 0.295 | 0.818 | ||
|
| All | 0.647 | 0.749 | 0.301 | 0.769 | |
| ER+ | 0.735 | 0.655 | 0.471 | 0.765 | ||
| ER− | 0.488 | 0.733 | 0.260 | 0.706 | ||
: P<0.05,
: P<0.1.
The MAQC-Validation and Tabchy-TFAC were truly independent validation sets here because the MAQC-training and Tabchy-FEC were used to optimize the COXEN model.