| Literature DB >> 22619502 |
Jason B Nikas1, Walter C Low, Paul A Burgio.
Abstract
Pertaining to the female population in the USA, breast cancer is the leading cancer in terms of annual incidence rate and, in terms of mortality, the second most lethal cancer. There are currently no biomarkers available that can predict which breast cancer patients will respond to chemotherapy with both sensitivity and specificity > 80%, as mandated by the latest FDA requirements. In this study, we have developed a prognostic biomarker model (complex mathematical function) that-based on global gene expression analysis of tumor tissue collected during biopsy and prior to the commencement of chemotherapy-can identify with a high accuracy those patients with breast cancer (clinical stages I-III) who will respond to the paclitaxel-fluorouracil-doxorubicin-cyclophosphamide chemotherapy and will experience pathological complete response (Responders), as well as those breast cancer patients (clinical stages I-III) who will not do so (Non-Responders). Most importantly, both the application and the accuracy of our breast cancer prognostic biomarker model are independent of the status of the hormone receptors ER, PR, and HER2, as well as of the ethnicity and age of the subjects. We developed our prognostic biomarker model with 50 subjects [10 responders (R) and 40 non-responders (NR)], and we validated it with 43 unknown (new and different) subjects [10 responders (R) and 33 non-responders (NR)]. All 93 subjects were recruited at five different clinical centers around the world. The overall sensitivity and specificity of our prognostic biomarker model were 90.0% and 91.8%, respectively. The nine most significant genes identified, which comprise the input variables to the mathematical function, are involved in regulation of transcription; cell proliferation, invasion, and migration; oncogenesis; suppression of immune response; and drug resistance and cancer recurrence.Entities:
Keywords: biomarkers; breast cancer; global gene expression analysis; prognostic biomarker models; systems biology; treatment response
Year: 2012 PMID: 22619502 PMCID: PMC3355866 DOI: 10.4137/BMI.S9387
Source DB: PubMed Journal: Biomark Insights ISSN: 1177-2719
Clinical information pertaining to the stage and receptor status of all 93 subjects, including that of the subjects misclassified by the F1 prognostic biomarker model.
| Group | No. of subjects | Stage I | Stage II | Stage III | Unknown stage | ER(+) | ER(−) | PR(+) | PR(−) | HER2(+) | HER2(−) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| R | 10 | 0 | 2 | 5 | 3 | 2 | 8 | 1 | 9 | 3 | 7 |
| NR | 40 | 6 | 16 | 14 | 4 | 27 | 13 | 25 | 15 | 4 | 36 |
| R | 10 | 0 | 1 | 7 | 2 | 2 | 8 | 1 | 9 | 4 | 6 |
| NR | 33 | 5 | 13 | 10 | 5 | 20 | 13 | 17 | 16 | 4 | 29 |
| Discovery | |||||||||||
| R | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 1 | 0 | 1 |
| NR | 4 | 1 | 1 | 2 | 0 | 2 | 2 | 2 | 2 | 0 | 4 |
| Validation | |||||||||||
| R | 1 | 0 | 0 | 1 | 0 | 0 | 1 | 0 | 1 | 1 | 0 |
| NR | 2 | 0 | 0 | 0 | 2 | 0 | 2 | 0 | 2 | 0 | 2 |
Notes: The information and the results shown in Table 1 provide evidence that the accuracy of the F1 prognostic biomarker model is independent of the receptor status of the subject. A. Clinical information pertaining to the stage and receptor status of the 50 subjects [10 responders (R) and 40 nonresponders (NR)] used in the discovery study. B. Clinical information pertaining to the stage and receptor status of the 43 unknown subjects [10 responders (R) and 33 non-responders (NR)] used in the validation study. C. Clinical information pertaining to the stage and receptor status of all 8 subjects (5 in the discovery study and 3 in the validation study) misclassified by the F1 prognostic biomarker model.
The 9 genes (constituent variables) of the F1 prognostic biomarker model, ranked according to their ROC AUC value.
| Rank | ROC AUC | Affymetrix transcript no. | Gene symbol | Gene name | Gene function process | Gene signif. diff. expr. non-responders (NR) | Known interactions | Known drugs/chemicals/hormones |
|---|---|---|---|---|---|---|---|---|
| 1 | 0.80342 | 208712_at | CCND1 | Cyclin D1 | G1/S transition of mitotic cell cycle, positive regulation of cyclin-dependent protein kinase activity, response to estrogen stimulus—(Mutations, amplification and overexpression of this gene, which alters cell cycle progression, are observed frequently in a variety of tumors and may contribute to tumorigenesis. [provided by RefSeq, Jul 2008]) | ↑ | ESR1, BRCA1, CDK4, CDK6, TP53, RB1, MCM10 | Beta-estradiol, tretinoin, fulvestrant, sirolimus, troglitazone |
| 2 | 0.80342 | 210150_s_at | LAMA5 | Laminin, alpha 5 | Angiogenesis, cell proliferation, cytoskeleton organization, cell migration | ↑ | MYOC, ACTB, MAPK1, MAPK8, TNF, CD44 | Heparin, L-glutamic acid |
| 3 | 0.80068 | 204231_s_at | FAAH | Fatty acid amide hydrolase | Fatty acid catabolic process | ↑ | DINP, PXN | Progesterone, beta-estradiol, Fsh, bucladesine |
| 4 | 0.79658 | 216300_x_at | RARA | Retinoic acid receptor, alpha | Cell proliferation, transcription from RNA polymerase II promoter, estrogen receptor signaling pathway, response to estradiol stimulus | ↑ | ESR1, CDK7, SRC, SLC10A1, HOXB1, HOXA1, CD38, MED1 | Tretinoin, beta-estradiol, retinoid, arsenic trioxide |
| 5 | 0.79452 | 41660_at | CELSR1 | Cadherin, EGF LAG seven-pass G-type receptor 1 | G-protein coupled receptor protein signaling pathway, signal transduction | ↑ | ESR1, PSAP, EGFR | Interferon alpha |
| 6 | 0.78425 | 214768_x_at | IGKV1-5 | Immunoglobulin kappa variable 1–5 | Immune response, innate immune response, complement activation pathway | ↓ | AGTR1, AGT, MDH1, IGHA1 | |
| 7 | 0.78151 | 207470_at | DKFZp566H0824 | Hypothetical LOC54744 | ↑ | |||
| 8 | 0.77877 | 217825_s_at | UBE2J1 | Ubiquitin-conjugating enzyme E2, J1, U | Post-translational protein modification, ATP binding, adaptive immunity signaling | ↓ | BRCA1, NFKBIA, BIRC2, TRAF7, TRAF6 | Dexamethasone |
| 9 | 0.77329 | 202780_at | OXCT1 | 3-oxoacid CoA transferase 1 | CoA-transferase activity, adipose tissue development, cellular lipid metabolic process | ↓ | HRAS, CNTF, CFTR, PRKAG3, PPARGC1A | Beta-estradiol, bromobenzene, lipopolysaccharide |
Notes: The following properties of those 9 genes are shown here: Affymetrix transcript number; gene symbol; gene name; gene function and/or process; significant differential expression [over-expression (↑) or under-expression (↓)] as observed in the NR group (Non-Responders) relative to the R group (Responders); known gene interactions; and known drug/chemical/ hormone interactions.
Figure 1Scatter plot and bar graph of all 50 original subjects [10 responders (R) and 40 non-responders (NR)] used in the discovery study in connection with the F1 prognostic biomarker model.
Notes: As can be seen, 9/10 R subjects [Responders (green color)] had F1 scores lower than the determined cut-off score of 4.6683 and were therefore identified correctly by the F1 prognostic biomarker model [sensitivity = 9/10 = 0.900]. Regarding the NR group [Non-Responders (red color)], 36/40 subjects had F1 scores greater than the determined cut-off score of 4.6683 and were therefore identified correctly by the F1 prognostic biomarker model [specificity = 36/40 = 0.900]. For the discovery study, the mean F1 score of the 10 R subjects (Responders) was 4.4528 (top of the green bar) and their standard deviation (whiskers above or below the top of the green bar) was 0.3156. The mean F1 score of the 40 NR subjects (Non-Responders) was 5.2833 (top of the red bar) and their standard deviation (whiskers above or below the top of the red bar) was 0.5074. The significance level was set at α = 0.001 (two-tailed), and the probability of significance for the F1 was P = 1.06 × 10−5 (independent t-Test with T-value = 4.9195). The F1 is parametrically distributed with respect to both groups.
Statistical results of the F1 prognostic biomarker model in the discovery study {identification of the 50 original subjects [10 responders (R) and 40 non-responders (NR)]}.
| Prognostic test | ROC AUC | 95% CI of ROC AUC | T-value | R group [99.99% CI of mean] (SD) | NR group [99.99% CI of mean] (SD) | |
|---|---|---|---|---|---|---|
| F1 | 0.9425 | [0.8159, 0.9829] | 4.9195 | 1.06 × 10−5 | [4.1611, 4.7981] (0.3156) | [5.0103, 5.5300] (0.5074) |
Notes: The ROC AUC value, the 95% confidence interval of the ROC AUC Value, the T value and probability of significance (P) of the independent t-Test, the 99.99% confidence interval for the mean score of the R group (Responders) and that of the NR group (Non-Responders), along with their respective standard deviations, of the F1 prognostic biomarker model in the discovery study are shown.
Figure 2Scatter plot and bar graph of all 43 unknown (new and different) subjects [10 responders (R) and 33 non-responders (NR)] used in the validation study in connection with the F1 prognostic biomarker model.
Notes: As can be seen, 9/10 R subjects [Responders (green color)] had F1 scores lower than the determined cut-off score of 4.6683 and were therefore identified correctly by the F1 prognostic biomarker model [sensitivity = 9/10 = 0.900]. Regarding the NR group [Non-Responders (red color)], 31/33 subjects had F1 scores greater than the determined cut-off score of 4.6683 and were therefore identified correctly by the F1 prognostic biomarker model [specificity = 31/33 = 0.939]. For the validation study, the mean F1 score of the 10 R subjects (Responders) was 4.3766 (top of the green bar) and their standard deviation (whiskers above or below the top of the green bar) was 0.2041. The mean F1 score of the 33 NR subjects (Non-Responders) was 5.3028 (top of the red bar) and their standard deviation (whiskers above or below the top of the red bar) was 0.5476. The significance level was set at α = 0.001 (two-tailed), and the probability of significance for the F1 was P = 5.83 × 10−6 (independent t-Test with T-value = 5.2029). The F1 is parametrically distributed with respect to both groups.
Statistical results of the F1 prognostic biomarker model in the validation study {identification of the 43 unknown subjects [10 responders (R) and 33 non-responders (NR)], which were new and different from the 50 original subjects}.
| Prognostic test | ROC AUC | Sensitivity | Specificity | T-value | R group | NR group | |
|---|---|---|---|---|---|---|---|
| F1 | 0.9788 | 0.9000 | 0.9394 | 5.202 | 95.83 × 10−6 | 4.3766 ± 0.2041 | 5.3028 ± 0.5476 |
Notes: The ROC AUC value, the sensitivity, the specificity, the T value and probability of significance (P) of the independent t-Test, and the mean score of the R group (Responders) and that of the NR group (Non-Responders), along with their respective standard deviations, of the F1 prognostic biomarker model in the validation study are shown. As can be seen, both of those group mean scores, as observed in the validation study with the 43 unknown subjects, fall within the 99.99% confidence intervals of the respective group mean scores as predicted in the discovery study (Table 3).
Figure 3Scatter plot and bar graph of all 93 subjects [20 responders (R) and 73 non-responders (NR)] used in the entire study (discovery and validation) in connection with the F1 prognostic biomarker model.
Notes: As can be seen, 18/20 R subjects [Responders (green color)] had F1 scores lower than the determined cut-off score of 4.6683 and were therefore identified correctly by the F1 prognostic biomarker model [sensitivity = 18/20 = 0.900]. Regarding the NR group [Non-Responders (red color)], 67/73 subjects had F1 scores greater than the determined cut-off score of 4.6683 and were therefore identified correctly by the F1 prognostic biomarker model [specificity = 67/73 = 0.918]. The mean F1 score of the 20 R subjects (Responders) was 4.4148 (top of the green bar) and their standard deviation (whiskers above or below the top of the green bar) was 0.2617. The mean F1 score of the 73 NR subjects (Non-Responders) was 5.2921 (top of the red bar) and their standard deviation (whiskers above or below the top of the red bar) was 0.5223. The significance level was set at α = 0.001 (two-tailed), and the probability of significance for the F1 was P = 1.36 × 10−10 (independent t-Test with T-value = 7.2454). The F1 is parametrically distributed with respect to both groups.
Overall statistical results of the F1 prognostic biomarker model with respect to both the discovery and the validation studies combined {identification of 93 subjects [20 responders (R) and 73 non-responders (NR)]}.
| Prognostic test | ROC AUC | Sensitivity | Specificity | T-value | R group | NR group | |
|---|---|---|---|---|---|---|---|
| F1 | 0.9616 | 0.9000 | 0.9178 | 7.2454 | 1.36 × 10−10 | 4.4148 ± 0.2617 | 5.2921 ± 0.5223 |
Notes: The overall ROC AUC value, the overall sensitivity, the overall specificity, the T value and probability of significance (P) of the independent t-Test, and the mean score of the R group (Responders) and that of the NR group (Non-Responders), along with their respective standard deviations, of the F1 prognostic biomarker model with respect to both the discovery and the validation studies combined are shown.