| Literature DB >> 30683142 |
Youdinghuan Chen1,2, Yue Wang2, Lucas A Salas1,2, Todd W Miller2, Kenneth Mark2, Jonathan D Marotti3, Arminja N Kettenbach2,4, Chao Cheng5,6,7, Brock C Christensen8,9,10.
Abstract
BACKGROUND: BRCA1-mutated cancers exhibit deficient homologous recombination (HR) DNA repair, resulting in extensive copy number alterations and genome instability. HR deficiency can also arise in tumors without a BRCA1 mutation. Compared with other breast tumors, HR-deficient, BRCA1-like tumors exhibit worse prognosis but selective chemotherapeutic sensitivity. Presently, patients with triple negative breast cancer (TNBC) who do not respond to hormone endocrine-targeting therapy are given cytotoxic chemotherapy. However, more recent evidence showed a similar genomic profile between BRCA1-deficient TNBCs and hormone-receptor-positive tumors. Characterization of the somatic alterations of BRCA1-like hormone-receptor-positive breast tumors as a group, which is currently lacking, can potentially help develop biomarkers for identifying additional patients who might respond to chemotherapy.Entities:
Keywords: BRCA1; BRCAness; Breast cancer; DNA methylation; Homologous recombination; Precision medicine
Mesh:
Substances:
Year: 2019 PMID: 30683142 PMCID: PMC6347811 DOI: 10.1186/s13058-018-1090-z
Source DB: PubMed Journal: Breast Cancer Res ISSN: 1465-5411 Impact factor: 6.466
Data sets used in this study. Number of samples (n) for a given data set indicates those with support vector machine (SVM)-predicted BRCA1-like status based on copy number data
| Data set ( | Percentage ER positivitya | Percentage Predicted BRCA1-like | Purpose | Data types analyzed | Accession (if applicable) | Refs. |
|---|---|---|---|---|---|---|
| Joosse et al. (74) | 42.9 (27/63) | 47.3 (35/74) | BRCA1-like classifier training | Copy number | GSE9021, GSE9114 | [ |
| BRCAx (106) | 70.7 (41/58) | 19.8 (21/106) | BRCA1-like classifier validation | Copy number | GSE18626 | [ |
| CCLE breast cancer cell lines (10) | 10.0 (1/10) | 60.0 (6/10) | BRCA1-like classifier experimental validation | Copy number, MLPA | Additional file | [ |
| TCGA breast cancer (957) | 77.1 (704/913) | 32.2 (308/957) | BRCA1-like differential analyses | Copy number, mutation, gene expression, DNA methylation, clinical |
| [ |
| METABRIC (1968) | 76.3 (1501/1968) | 17.4 (343/1968) | BRCA1-like differential analyses | Copy number, gene expression, clinical |
| [ |
ER estrogen receptor, TCGA The Cancer Genome Atlas, CCLE Cancer Cell Line Encyclopedia, METABRIC Molecular Taxonomy of Breast Cancer International Consortium
aER status is not reported, is unknown, or is equivocal in a subset of Joosse et al., BRCAx and TCGA breast tumors. Such tumors were excluded from percentage ER positivity calculation
Fig. 1Workflow for developing a support vector machine (SVM) BRCA1-like classifier and application to publicly available datasets for biological discovery. In step 1, a new SVM-based BRCA1-like classifier is trained on re-processed and normalized array copy number data. In step 2, the receiver-operation characteristic (ROC) curves were used for evaluating our BRCA1-like classifier in a training and test set (AUC training = 1.00, AUC test = 0.75). In step 3, we applied the SVM classifier to tumors in the The Cancer Genome Atlas (TCGA) and Molecular Taxonomy of Breast Cancer International Consortium (METABRIC) data sets. Finally, in step 4, we performed bioinformatics and statistical analyses attempting to understand the biological characteristics of hormone-receptor-positive breast tumors predicted to be BRCA1-like by our SVM classifier
Prevalence of BRCA1-like phenotype in the large-scale breast cancer cohorts, TCGA and METABRIC
| TCGA | METABRIC | |||||
|---|---|---|---|---|---|---|
| BRCA1-like | non-BRCA1-like | BRCA1-like | non-BRCA1-like | |||
| n | 308 | 649 | 343 | 1625 | ||
| Age, years (mean (sd)) | 56.57 (13.26) | 59.33 (13.06) | 0.002 | 56.74 (13.93) | 62.02 (12.56) | < 0.001 |
| Stage (%) | 0.22 | 0.35 | ||||
| Stage I-II | 238 (77.3) | 473 (72.9) | 220 (64.1) | 1106 (68.1) | ||
| Stage III-IV | 63 (20.5) | 165 (25.4) | 26 (7.6) | 102 (6.3) | ||
| (Missing) | 7 (2.3) | 11 (1.7) | 97 (28.3) | 417 (25.7) | ||
| ER (%) | < 0.001 | < 0.001 | ||||
| Positive | 129 (41.9) | 575 (88.6) | 108 (31.5) | 1393 (85.7) | ||
| Negative | 166 (53.9) | 43 (6.6) | 235 (68.5) | 232 (14.3) | ||
| (Missing) | 13 (4.2) | 31 (4.8) | 0 (0.0) | 0 (0.0) | ||
| PR (%) | < 0.001 | < 0.001 | ||||
| Positive | 97 (31.5) | 514 (79.2) | 57 (16.6) | 980 (60.3) | ||
| Negative | 195 (63.3) | 104 (16.0) | 286 (83.4) | 645 (39.7) | ||
| NA | 16 (5.2) | 31 (4.8) | 0 (0.0) | 0 (0.0) | ||
| HER2 (%) | 0.59 | 0.98 | ||||
| Positive | 50 (16.2) | 93 (14.3) | 43 (12.5) | 200 (12.3) | ||
| Negative | 161 (52.3) | 333 (51.3) | 300 (87.5) | 1425 (87.7) | ||
| (Missing) | 97 (31.5) | 223 (34.4) | 0 (0.0) | 0 (0.0) | ||
| Any ER, PR, or HER2 positivity (%) | < 0.001 | < 0.001 | ||||
| Yes | 159 (51.6) | 595 (91.7) | 143 (41.7) | 1508 (92.8) | ||
| No | 88 (28.6) | 12 (1.8) | 200 (58.3) | 117 (7.2) | ||
| Cannot be determined | 61 (19.8) | 42 (6.5) | 0 (0.0) | 0 (0.0) | ||
TCGA The Cancer Genome Atlas, METABRIC Molecular Taxonomy of Breast Cancer International Consortium, ER estrogen receptor, PR progesterone receptor, HER2 human epidermal growth factor receptor 2
§For any continuous measure (i.e. age), the two-tailed t-test was used. For any categorical measure, the two-tailed Fisher’s exact test was used
Fig. 2Distribution of BRCA1-like probability scores in The Cancer Genome Atlas (TCGA) (top panel) and Molecular Taxonomy of Breast Cancer International Consortium (METABRIC) (bottom panel) data sets, stratified by hormone-receptor-based subtypes. Each vertical bar represents a patient. The height of the bar represents the probability score of being BRCA1-like assigned by our support vector machine (SVM) copy number classifier. TNBC, triple negative breast cancer
Fig. 3Molecular characteristics of support vector machine (SVM) BRCA1-like hormone-receptor-positive tumors in The Cancer Genome Atlas (TCGA). Comparison of whole exome sequencing-based mutation rates per mega base-pair (Mb) (a) and Somatic Mutational Signature 3 related with HR-deficiency (b), and Ki-67 (MKI67) gene expression as a surrogate marker for cellular proliferation (c). Variations in the number of tumors were due to data availability (see “Materials and methods”). *P < 0.05, **P < 0.01, ***P < 0.001
Fig. 4Differential DNA methylation in groups with BRCA1-like tumors relative to groups with non-BRCA1-like tumors defined by the support vector machine (SVM) BRCA1-like classifier among hormone-receptor-positive tumors in The Cancer Genome Atlas (TCGA). a Genomic context enrichment analysis of hypermethylated and hypomethylated CpGs. Solid dots and horizontal segments indicate odds ratios and 95% confidence intervals. P values were from the two-tailed Fisher’s exact test. b Correlation of the Horvath methylation age (“epigenetic clock”) with patient chronological age, colored by BRCA1-like status. c Comparison of chronological age or the Horvath methylation age between BRCA1-like and non-BRCA1-like hormone-receptor-positive tumors. P value indicates statistical significance from the covariate-adjusted linear model. n.s., not significant
Gene Ontology: Biological Processes for genes associated with hypermethylated gene regions
| Accession | Description | Number of genes | Number of input genes | Expected ratio | Observed ratio | Bonferroni | |
|---|---|---|---|---|---|---|---|
| GO:0071103 | DNA conformation change | 248 | 10 | 1.39 | 7.20 | 1.24E-06 | 0.011 |
| GO:0006323 | DNA packaging | 165 | 8 | 0.92 | 8.66 | 3.95E-06 | 0.017 |
| GO:0031497 | Chromatin assembly | 128 | 7 | 0.72 | 9.77 | 7.44E-06 | 0.021 |
| GO:0065004 | Protein-DNA complex assembly | 197 | 8 | 1.10 | 7.26 | 1.45E-05 | 0.031 |
| GO:0006333 | Chromatin assembly or disassembly | 153 | 7 | 0.86 | 8.17 | 2.38E-05 | 0.041 |
| GO:0006334 | Nucleosome assembly | 111 | 6 | 0.62 | 9.66 | 3.68E-05 | 0.045 |
| GO:0071824 | Protein-DNA complex subunit organization | 224 | 8 | 1.25 | 6.38 | 3.65E-05 | 0.045 |
The complete list of pathways can be found in Additional file 1: Table S6A