| Literature DB >> 29158497 |
Latha Kadalayil1,2, Sofia Khan3, Heli Nevanlinna3, Peter A Fasching4, Fergus J Couch5, John L Hopper6, Jianjun Liu7,8, Tom Maishman9, Lorraine Durcan9, Sue Gerty9, Carl Blomqvist10, Brigitte Rack11, Wolfgang Janni11, Andrew Collins1, Diana Eccles12, William Tapper13.
Abstract
To identify genetic variants associated with breast cancer prognosis we conduct a meta-analysis of overall survival (OS) and disease-free survival (DFS) in 6042 patients from four cohorts. In young women, breast cancer is characterized by a higher incidence of adverse pathological features, unique gene expression profiles and worse survival, which may relate to germline variation. To explore this hypothesis, we also perform survival analysis in 2315 patients aged ≤ 40 years at diagnosis. Here, we identify two SNPs associated with early-onset DFS, rs715212 (P meta = 3.54 × 10-5) and rs10963755 (P meta = 3.91 × 10-4) in ADAMTSL1. The effect of these SNPs is independent of classical prognostic factors and there is no heterogeneity between cohorts. Most importantly, the association with rs715212 is noteworthy (FPRP <0.2) and approaches genome-wide significance in multivariable analysis (P multivariable = 5.37 × 10-8). Expression quantitative trait analysis provides tentative evidence that rs715212 may influence AREG expression (P eQTL = 0.035), although further functional studies are needed to confirm this association and determine a mechanism.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29158497 PMCID: PMC5696339 DOI: 10.1038/s41467-017-01775-y
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Clinical characteristics of patient cohorts
| ABCFS | HEBCS | POSH stage 1 | SUCCESS-A | POSH stage 2 |
| |
|---|---|---|---|---|---|---|
| SNPs passing QC | ||||||
| Observed | 508,505 | 501,882 | 503,568 | 566,645 | 116 | |
| Imputed | 5,150,529 | 5,395,529 | 5,196,034 | 5,006,474 | 0 | |
| No. of cases passing QC | 202 | 798 | 556 | 3183 | 1303 | |
| Age at diagnosis median (range) | — | 53 (22–87) | 36 (18–40) | 54 (19–85) | 37 (20–40) | < 2.2 × 10−16 |
| No. of cases aged ≤ 40 at diagnosis | 202 | 119 (15%) | 556 (100%) | 337 (11%) | 1303 (100%) | |
| Deceased (all cause) | 67 (33%) | 317 (40%) | 268 (48%) | 171 (5%) | 278 (21%) | |
| Deceased and aged ≤ 40a | 67 (33%) | 45 (38%) | 268 (48%) | 13 (4%) | 278 (21%) | |
| OS median (IQR), years | 15.8 (14.2) | 8.0 (5.2) | 4.8 (4.5) | 4.8 (2.4) | 6.8 (3.2) | < 2.2 × 10−16 |
| Disease progression | — | 368 (46%) | 288 (52%) | 335 (11%) | 325 (25%) | |
| Progressed and aged ≤ 40a | — | 65 (55%) | 288 (52%) | 38 (11%) | 325 (25%) | |
| DFS median (IQR), years | — | 5.0 (2.7) | 3.1 (5.5) | 4.8 (2.7) | 6.3 (3.7) | < 2.2 × 10−16 |
| Oestrogen receptor (ER) | ||||||
| Positive | 91 (45%) | 515 (65%) | 193 (35%) | 2189 (69%) | 1014 (78%) | |
| Negative | 75 (37%) | 225 (28%) | 362 (65%) | 975 (31%) | 283 (22%) | 1.00 × 10−13 |
| Missing | 36 (18%) | 58 (7%) | 1 (<1%) | 19 (1%) | 6 (< 1%) | |
| Progesterone receptor (PR) | ||||||
| Positive | — | — | 112 (20%) | 2018 (63%) | 713 (55%) | |
| Negative | — | — | 369 (66%) | 1143 (36%) | 328 (25%) | 1.00 × 10−13 |
| Missing | 202 (100%) | 798 (100%) | 75 (14%) | 22 (1%) | 262 (20%) | |
| HER2 | ||||||
| Positive | — | 86 (11%) | 105 (19%) | 952 (30%) | 347 (27%) | |
| Negative | — | 400 (50%) | 407 (73%) | 2166 (68%) | 776 (59%) | 1.49 × 10−11 |
| Missing | 202 (100%) | 312 (39%) | 44 (8%) | 65 (2%) | 180 (14%) | |
| Triple negative (ER, PR, HER2) | — | — | 283 (51%) | 518 (16%) | 104 (8%) | 1.00 × 10−13 |
| Grade | ||||||
| 1 | — | 140 (18%) | 13 (2%) | 144 (5%) | 97 (7%) | |
| 2 | — | 306 (38%) | 90 (16%) | 1523 (48%) | 508 (39%) | |
| 3 | — | 275 (34%) | 435 (78%) | 1494 (47%) | 670 (51%) | 1.00 × 10−13 |
| Missing | 202 (100%) | 77 (10%) | 18 (3%) | 22 (1%) | 28 (2%) | |
| Tumour size | ||||||
| Size (mm) average (range) | — | 25.2 (1–100) | 29.7 (0–160) | 25.1 (1–220) | 25.7 (0.5–170) | 0.038 |
| T stage | ||||||
| 1 | — | 381 (48%) | 227 (41%) | 1302 (41%) | 654 (50%) | |
| 2 | — | 297 (37%) | 228 (41%) | 1645 (52%) | 505 (39%) | |
| 3 | — | 50 (6%) | 45 (8%) | 172 (5%) | 80 (6%) | |
| 4 | — | 47 (6%) | 4 (1%) | 43 (1%) | 1 ( < 1%) | 1.00 × 10−13 |
| Missing | 202 (100%) | 23 (3%) | 52 (9%) | 21 (1%) | 63 (5%) | |
| Nodal metastasis | ||||||
| Positive | — | 441 (55%) | 271 (49%) | 2044 (64%) | 679 (52%) | |
| Negative | — | 326 (41%) | 258 (46%) | 1115 (35%) | 613 (47%) | 1.00 × 10−13 |
| Missing | 202 (100%) | 31 (4%) | 27 (5%) | 24 (1%) | 11 (1%) | |
aPercentages using number of cases aged ≤ 40 years at diagnosis as the denominator
P-value for comparison between all cohorts with data (n = 3 to 5), Pearson’s Χ 2 tests were used for categorical variables and Kruskal–Wallis rank sum tests were used to compare continuous traits
Fig. 1Genome-wide analysis of breast cancer survival. The Manhattan plot shows the result of the stage-1 meta-analysis. Results are plotted as –log10 of the P-value from Cox regression. For each SNP the most significant P-value is selected from the analysis of either overall survival (OS) or disease-free survival (DFS) in all patients or the subset with early onset. The four most significant SNPs after meta-analysis of stages 1 and 2 are highlighted in green (rs410155 and rs12302097 associated with OS and DFS respectively in the whole cohort and rs715212 and rs10963755 associated with DFS in patients with early onset). This plot was produced using the qqman R package
Summary of the most significant SNPs from meta-analysis of stages 1 and 2 and their relationship with age of onset
| End point | SNPa | Allelesb | MAFc | Flanking genes (distance to SNP) | All patients | Age of onset | FPRPd | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ≤ 40 | > 40 | |||||||||||||
|
| HR (CI) |
|
| HR (CI) |
|
| HR (CI) |
| ||||||
| DFS | rs715212 | C/A | 0.270 | ADAMTSL1 | 0.0041 | 1.13 (1.04–1.23) | 0.214 | 3.54 × 10−5 | 1.27 (1.13–1.42) | 0.773 | 0.7757 | 0.98 (0.87–1.11) | 0.960 | 0.188 |
| DFS | rs10963755 | C/G | 0.245 | ADAMTSL1 | 0.0147 | 1.11 (1.02–1.21) | 0.108 | 3.91 × 10−4 | 1.22 (1.09–1.37) | 0.818 | 0.6834 | 0.97 (0.86–1.11) | 0.103 | 0.837 |
| DFS | rs12302097 | G/A | 0.061 | TXNRD1 (68.5 kb) CHST1 (38 kb) | 7.54 × 10−5 | 1.30 (1.14–1.49) | 0.594 | 6.77 × 10−5 | 1.45 (1.21–1.74) | 0.145 | 0.0742 | 1.19 (0.98–1.44) | 0.992 | 0.665 |
| OS | rs410155 | C/T | 0.062 | MT3 (8.7 kb) MT4 (11.6 kb) | 1.28 × 10−4 | 1.34 (1.15–1.55) | 0.994 | 0.0049 | 1.32 (1.09–1.60) | 0.993 | 0.0066 | 1.32 (1.08–1.62) | 0.346 | 0.599 |
aThe rs identifier from dbSNP
bMinor/major alleles
cMinor allele frequency (MAF) from 1000 genomes
dFalse positive report probability (FPRP)
P meta, P-values from fixed effects meta-analysis; HR, hazard ratio; CI 95% confidence interval; Q, Cochran P-values for heterogeneity test
Fig. 2Forest plot and meta-analysis for the four most significant SNPs associated with overall survival (OS) or disease-free survival (DFS). Forest plot showing the event rate, hazard ratio (HR), 95% confidence interval (CI) and significance level (P-value) from Cox regression in each cohort and the combined analysis for the most significant SNPs associated with DFS and OS. ABCFS*: evidence for association with OS in the ABCFS cohort is shown for each SNP but these results are excluded from the meta-analyses of DFS. The SNP subtotal rows show the result for a fixed effects meta-analysis across four studies for rs715212, rs10963755 and rs12302097 and five studies for rs410155 using I 2 and Cochran Q-statistic to assess heterogeneity in effect sizes between cohorts
Fig. 3Kaplan–Meir survival plots for the four most significant SNPs identified by meta-analyses. Kaplan–Meier plots from univariate analysis of the most significant SNP associated with disease-free survival (DFS) in cases with early onset (a, rs715212 and b, rs10963755), DFS in all cases (c, rs12302097) and overall survival (OS) in all cases (d, rs410155). For OS, the data from all five cohorts (ABCFS, HEBCS, POSH stages 1 and 2 and SUCCESS-A) was pooled whereas for DFS data were pooled across four cohorts because DFS was not recorded in the ABCFS cohort. HR: hazard ratio with 95% confidence interval
Univariable and multivariable analysis of pooled data
| Covariates at diagnosis | Univariable | Multivariable | |||
|---|---|---|---|---|---|
| HR (95% CI) |
| Events/cases | HR (95% CI) |
| |
|
| |||||
| rs715212 | 1.28 (1.14–1.43) | 1.94 × 10−5 | 698/2293 | 1.38 (1.23–1.56) | 5.37 × 10−8 |
| ER status | 0.78 (0.67–0.91) | 0.001 | 702/2295 | 1.09 (0.90–1.31) | 0.377 |
| Grade | 1.53 (1.33–1.75) | 1.92 × 10−9 | 684/2260 | 1.37 (1.17–1.61) | 9.05 × 10−5 |
| Tumour size (mm) | 1.02 (1.01–1.02) | 1.00 × 10−13 | 667/2258 | 1.01 (1.01–1.02) | 1.01 × 10−13 |
| Nodal status | 2.54 (2.16–3.00) | 1.00 × 10−13 | 676/2271 | 2.37 (1.98–2.83) | 1.00 × 10−13 |
| Cohort | 0.63 (0.59–0.67) | 1.00 × 10−13 | 705/2315 | 0.62 (0.58–0.67) | 1.00 × 10−13 |
|
| |||||
| rs10963755 | 1.21 (1.09–1.36) | 0.00065 | 689/2286 | 1.27 (1.13–1.43) | 4.51 × 10−5 |
| ER status | 0.78 (0.67–0.91) | 0.00117 | 702/2295 | 1.10 (0.92–1.33) | 0.301 |
| Grade | 1.53 (1.33–1.75) | 1.92 × 10−9 | 684/2260 | 1.38 (1.18–1.62) | 7.33 × 10−5 |
| Tumour size (mm) | 1.02 (1.01–1.02) | 1.00 × 10−13 | 667/2258 | 1.01 (1.01–1.02) | 1.00 × 10−13 |
| Nodal status | 2.54 (2.16–3.00) | 1.00 × 10−13 | 676/2271 | 2.31 (1.93–2.77) | 1.00 × 10−13 |
| Cohort | 0.63 (0.59–0.67) | 1.00 × 10−13 | 705/2315 | 0.63 (0.58–0.68) | 1.00 × 10−13 |
|
| |||||
| rs12302097 | 1.28 (1.13–1.46) | 1.65 × 10−4 | 1312/5825 | 1.26 (1.10–1.44) | 0.001 |
| ER status | 0.66 (0.59–0.74) | 7.11 × 10−13 | 1298/5756 | 0.80 (0.70–0.91) | 0.001 |
| Grade | 1.50 (1.37–1.65) | 1.00 × 10−13 | 1261/5695 | 1.56 (1.40–1.73) | 1.01 × 10−13 |
| Tumour size (mm) | 1.02 (1.02–1.02) | 1.00 × 10−13 | 1256/5711 | 1.02 (1.01–1.02) | 1.00 × 10−13 |
| Nodal status | 1.98 (1.75–2.24) | 1.00 × 10−13 | 1274/5747 | 2.10 (1.84–2.41) | 1.00 × 10−13 |
| Cohort | 0.61 (0.58–0.64) | 1.00 × 10−13 | 1316/5840 | 0.58 (0.55–0.62) | 1.00 × 10−13 |
|
| |||||
| rs410155 | 1.20 (1.04–1.39) | 0.0154 | 1063/5945 | 1.28 (1.09–1.51) | 0.0023 |
| ER status | 0.60 (0.53–0.68) | 1.00 × 10−13 | 1081/5919 | 0.66 (0.57–0.77) | 5.92 × 10−8 |
| Grade | 1.72 (1.54–1.91) | 1.00 × 10−13 | 990/5692 | 1.65 (1.46–1.87) | 1.06 × 10−13 |
| Tumour size (mm) | 1.02 (1.02–1.02) | 1.00 × 10−13 | 981/5708 | 1.02 (1.02–1.02) | 1.00 × 10−13 |
| Nodal status | 2.12 (1.85–2.44) | 1.00 × 10−13 | 998/5744 | 2.22 (1.90–2.59) | 1.00 × 10−13 |
| Cohort | 0.79 (0.76–0.83) | 1.00 × 10−13 | 1101/6039 | 0.67 (0.63–0.71) | 1.00 × 10−13 |
Univariable and multivariable analyses were performed with pooled data from ABCFS, HEBCS, POSH (stages 1 and 2) and SUCCESS-A for overall survival (OS). The ABCFS cohort was excluded from the analysis of disease-free survival (DFS). For univariate analyses, the number of events and cases is slightly different for each covariate due to variation in the number of cases with missing data. For multivariable analyses, the following number of events and cases were used: model 1: 642/2172, model 2: 634/2166, model 3: 1201/5538, model 4: 911/5482. In each model the hazard ratios (HRs) and 95% confidence interval (CI) were adjusted for oestrogen receptor status (ER), grade, maximum tumour size, nodal status, SNP and cohort. P univariable, P-values from univariable Cox regression; P multivariable, P-values from multivariable Cox regression.
Fig. 4Regional plots of association with survival (OS or DFS) at stage-1 meta-analysis, recombination rate and gene context for the most significant SNPs. Results from the stage-1 meta-analyses in a region surrounding the most significant SNP associated with DFS in patients with early onset (a, rs715212 and rs10963755), DFS in all patients (b, rs12302097) and OS in all patients (c, rs410155). In each plot, a purple diamond identifies the index SNP and the colour of other SNPs represent their linkage disequilibrium (r 2) with the index SNP from light blue (r 2 ≤ 0.4) to red (r 2 ≥ 0.8). The middle panel displays the 15 state chromatin segmentation track (ChromHMM) in breast variant human mammary epithelial cells (vHMECs, E028), mammary epithelial primary cells (HMECs, E119) and breast myoepithelial primary cells (E027) using data from the HapMap ENCODE Project. The lower panels show genes and their direction of transcription (arrows). Physical positions are relative to build 37 (hg19) of the human genome