| Literature DB >> 25338319 |
Timothy R Wilson1, Yuanyuan Xiao, Jill M Spoerke, Jane Fridlyand, Hartmut Koeppen, Eloisa Fuentes, Ling Y Huw, Ilma Abbas, Arjan Gower, Erica B Schleifman, Rupal Desai, Ling Fu, Teiko Sumiyoshi, Joyce A O'Shaughnessy, Garret M Hampton, Mark R Lackner.
Abstract
Breast cancers are categorized into three subtypes based on protein expression of estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor-2 (HER2/ERBB2). Patients enroll onto experimental clinical trials based on ER, PR, and HER2 status and, as receptor status is prognostic and defines treatment regimens, central receptor confirmation is critical for interpreting results from these trials. Patients enrolling onto experimental clinical trials in the metastatic setting often have limited available archival tissue that might better be used for comprehensive molecular profiling rather than slide-intensive reconfirmation of receptor status. We developed a Random Forests-based algorithm using a training set of 158 samples with centrally confirmed IHC status, and subsequently validated this algorithm on multiple test sets with known, locally determined IHC status. We observed a strong correlation between target mRNA expression and IHC assays for HER2 and ER, achieving an overall accuracy of 97 and 96%, respectively. For determining PR status, which had the highest discordance between central and local IHC, incorporation of expression of co-regulated genes in a multivariate approach added predictive value, outperforming the single, target gene approach by a 10% margin in overall accuracy. Our results suggest that multiplexed qRT-PCR profiling of ESR1, PGR, and ERBB2 mRNA, along with several other subtype associated genes, can effectively confirm breast cancer subtype, thereby conserving tumor sections and enabling additional biomarker data to be obtained from patients enrolled onto experimental clinical trials.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25338319 PMCID: PMC4223539 DOI: 10.1007/s10549-014-3163-8
Source DB: PubMed Journal: Breast Cancer Res Treat ISSN: 0167-6806 Impact factor: 4.872
HER2, ER, and PR status by local and central IHC for 158 USO 01062 study Samples
| HER2 | Central IHC | ER | Central IHC | PR | Central IHC | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| − | + | − | + | − | + | |||||||
| Local IHC | Local IHC | Local IHC | ||||||||||
| − | 132 | 2 | NPV 99 % | − | 50 | 8 | NPV 86 % | − | 50 | 23 | NPV 68 % | |
| + | 8 | 16 | PPV 67 % | + | 4 | 96 | PPV 96 % | + | 3 | 82 | PPV 96 % | |
| Spec 94 % | Sens 88 % | Acc 94 % | Spec 93 % | Sens 92 % | Acc 92 % | Spec 94 % | Sens 78 % | Acc 84 % | ||||
Spec specificity, Sens sensitivity, PPV positive predictive value, NPV negative predictive value, Acc accuracy
Fig. 1Target gene expression of a ERBB2 b ESR1 c PGR by central IHC status. Left panel: boxplot of target gene expression by central HER2 status. IHC positive group is colored in gray. Right panel: ROC analysis for predicting IHC positivity defined by different cutoffs using target gene expression. Figure legend indicates cutoffs and AUCs with 95 % confidence intervals in parentheses
Fig. 2Target gene prediction. A bimodal, 2-component Gaussian mixture distribution fit was superimposed to the actual data summarized in the histogram. The two mixture distributions are depicted in red and black lines. The dotted line indicates the cutoff between the positive and negative groups
HER2, ER, and PR status by central IHC and TGP for the training set
| HER2 | Central IHC | ER | Central IHC | PR | Central IHC | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| − | + | − | + | − | + | ||||||
| ERBB2 | ESR1 | PGR | |||||||||
| − | 139 | 4 | NPV 97 % | − | 53 | 5 | NPV 90 % | − | 39 | 16 | NPV 71 % |
| + | 0 | 14 | PPV 100 % | + | 1 | 99 | PPV 99 % | + | 14 | 89 | PPV 86 % |
| Spec 100 % | Sens 77 % | Acc 97 % | Spec 98 % | Sens 95 % | Acc 96 % | Spec 74 % | Sens 84 % | Acc 81 % | |||
Spec specificity, Sens sensitivity, PPV positive predictive value, NPV negative predictive value, Acc accuracy
Performance of multivariate prediction methods for the training set
| Accuracy | Specificity | Sensitivity | Genes picked (number of times) | |
|---|---|---|---|---|
|
| ||||
| | 0.98 | 1(140/140) | 0.83(15/18) | |
| | 0.99 | 1(140/140) | 0.94(17/18) | ERBB2(10) GRB7(1) |
| | 0.94 | 1(140/140) | 0.5(9/18) | |
|
| ||||
| | 0.95 | 0.98(53/54) | 0.93(97/104) | |
| | 0.95 | 0.96(52/54) | 0.94(98/104) | ESR1(10) GATA3(8) TFF1(4) FOXA1(3) SCUBE2(3) PGR(2) LYN(1) VAV3(1) |
| | 0.95 | 0.94(51/54) | 0.95(99/104) | |
|
| ||||
| | 0.91 | 0.92(49/53) | 0.90(95/105) | |
| | 0.91 | 0.91(48/53) | 0.91(96/105) | ESR1(10) GATA3(7) PGR(7) FOXA1(6) SCUBE2(6) TFF1(6) IGF1R(3) BCL2(2) BUB1(2) XBP1(2) CTSL2(1) ERBB3(1) IRS1(1) |
| | 0.91 | 0.91(48/53) | 0.90(95/105) | |
Fig. 3Multivariate variable importance measures (VIM) by RF for HER2, ER, and PR prediction. Y axes are –log10 based P values of the two group t-test between central IHC positive and negative groups, and (Bonferroni) adjusted P value 0.05 is marked with gray lines. Genes with two sample t-test adjusted P values ≤0.05 and fold change ≥2 were marked with gene symbols
HER2, ER and PR status by local IHC and RFP for test set 1, an additional set of USO 01062 study samples
| HER2 | Local IHC | ER | Local IHC | PR | Local IHC | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| − | + | − | + | − | + | ||||||
| RFP | RFP | RFP | |||||||||
| − | 538 | 42 | NPV 93 % | − | 249 | 20 | NPV 93 % | − | 246 | 21 | NPV 92 % |
| + | 7 | 41 | PPV 85 % | + | 28 | 337 | PPV 92 % | + | 94 | 273 | PPV 74 % |
| Spec 99 % | Sens 49 % | Acc 92 % | Spec 90 % | Sens 94 % | Acc 92 % | Spec 72 % | Sens 93 % | Acc 82 % | |||
Spec specificity, Sens sensitivity, PPV positive predictive value, NPV negative predictive value, Acc accuracy
Fig. 4Boxplots of ERBB2, ESR1, and PGR mRNA by local IHC and RFP results for test set 1, an additional set of USO 01062 study samples with local IHC status. Black points are local IHC and RFP negatives, red points are local IHC positives and RFP negatives, green points represent local IHC negatives and RFP positives, and blue points represent local IHC and RFP positives
Fig. 5Kaplan–Meier curves showing disease-free survival for disease subtypes by local IHC (a) and RFP (b)