| Literature DB >> 30572910 |
Carolyn Nickson1,2, Pietro Procopio3,4, Louiza S Velentzis3,4, Sarah Carr3, Lisa Devereux5, Gregory Bruce Mann6,7, Paul James8,9, Grant Lee3, Cameron Wellard3, Ian Campbell8,10.
Abstract
BACKGROUND: There is a growing interest in delivering more personalised, risk-based breast cancer screening protocols. This requires population-level validation of practical models that can stratify women into breast cancer risk groups. Few studies have evaluated the Gail model (NCI Breast Cancer Risk Assessment Tool) in a population screening setting; we validated this tool in a large, screened population.Entities:
Keywords: Breast Cancer Risk Assessment Tool; Breast cancer screening; Gail model; Invasive breast cancer; Machine learning; Risk stratification; Validation
Mesh:
Year: 2018 PMID: 30572910 PMCID: PMC6302513 DOI: 10.1186/s13058-018-1084-x
Source DB: PubMed Journal: Breast Cancer Res ISSN: 1465-5411 Impact factor: 6.466
Fig. 1Flow chart demonstrating how the cohort used in analyses was derived from the original lifepool cohort
Summary characteristics of the lifepool participants by age groups
| Characteristic | Age at reference screen (years) | ||
|---|---|---|---|
| 50–69 | 50–59 | 60–69 | |
| Number of subjects ( | 40,158 | 20,216 | 19,942 |
| Dates | |||
| Reference screen (range) | 1 Jul 2010 to 6 Oct 2014 | 1 Jul 2010 to 6 Oct 2014 | 1 Jul 2010 to 6 Oct 2014 |
| Questionnaire completion (range) | 1 Jul 2010 to 1 Oct 2014 | 1 Jul 2010 to 23 Sep 2014 | 31 Jul 2010 to 1 Oct 2014 |
| Observation time, years (median, range) | 4.3 (0.3–6.5) | 4.3 (0.3–6.5) | 4.3 (0.3–6.5) |
| Age | |||
| Reference screen, years (median, range) | 59 (50–69) | 55 (50–59) | 64 (60–69) |
| Diagnosis, years (median, range) | 63 (51–73) | 58 (51–64) | 67 (61–73) |
| Reference screening round (median, range) | 5 (1–18) | 4 (1–18) | 7 (1–18) |
| Diagnoses (invasive breast cancer) | |||
| Number (%) | 564 (1.4%) | 244 (1.2%) | 320 (1.6%) |
| Follow-up period | |||
| Time from reference screen to diagnosis, days (median, range) | 813 (28–1938) | 818 (28–1714) | 807 (73–1938) |
| Women screened during the follow-up period ( | 38,060 (95%) | 19,141 (95%) | 18,919 (95%) |
| Of women screened during the follow-up period, number of screens per 2 years of follow-up (median, range) | 0.8 (0.3–2.0) | 0.8 (0.3–2.0) | 0.8 (0.3–1.9) |
Risk factors used to generate the Gail model scores among cases of women with invasive breast cancer and non-cases (i.e. women without invasive breast cancer), aged 50 to 69 years within the lifepool cohorta
| Gail model variable | Group | Non-cases | Cases | |
|---|---|---|---|---|
| Age at questionnaire (years) | 50–54 | 8820 (22) | 91 (16) | < 0.001 |
| 55–59 | 11,152 (28) | 153 (27) | ||
| 60–64 | 10,968 (28) | 158 (28) | ||
| 65–69 | 8654 (22) | 162 (29) | ||
| Age at menarche (years) | ≤ 11 | 6709 (17) | 105 (19) | 0.038 |
| 12–13 | 19,995 (53) | 300 (55) | ||
| ≥ 14 | 11,224 (30) | 143 (26) | ||
| Missing | 1666 | 16 | ||
| Age at first live birthc | < 20 | 3468 (9) | 57 (10) | 0.22 |
| 20–24 | 12,011 (30) | 156 (28) | ||
| 25–29 | 12,090 (30) | 175 (31) | ||
| ≥ 30 | 6413 (16) | 111 (20) | ||
| Missing | 2330 | 11 | ||
| Nulliparous | 3282 (8) | 54 (10) | ||
| Number of first-degree relatives (mother, sisters, daughters) who have had breast cancer | 0 (or not reported) | 30,531 (77) | 390 (69) | < 0.001 |
| 1 | 8067 (20) | 158 (28) | ||
| 2 | 933 (2) | 16 (3) | ||
| 3 | 61 (0) | 0 (0) | ||
| 4 | 2 (0) | 0 (0) | ||
| Breast biopsy | No | 33,048 (86) | 446 (82) | 0.001 |
| Yes | 5158 (14) | 100 (18) | ||
| Missing | 1388 | 18 | ||
| Number of breast biopsies | 0 | 32,824 (87) | 444 (82) | 0.001 |
| 1 | 3949 (10) | 77 (14) | ||
| 2 | 1040 (3) | 21 (4) | ||
| Missing | 1781 | 22 | ||
| Biopsy with atypical hyperplasia | No | 1490 (88) | 23 (92) | 0.76 |
| Yes | 209 (12) | 2 (8) | ||
| Missingd | 3459 | 75 | ||
| Race/ethnicity | Mixede | 38,428 (97) | 555 (98) | 0.059 |
| Asian | 1166 (3) | 9 (2) |
aThe distribution of values for each variable is presented without inclusion of missing values
bExcluding missing. P-values for binary categories are from chi-square tests or Fisher’s exact test as appropriate; variables with three or more categories were assessed using a non-parametric test for trend (Stata ‘nptrend’)
cData for assessment of this variable were not directly available; we used age at first full-term pregnancy for all women who had at least one live birth
dMissing shown only for women who responded ‘Yes’ to breast biopsy
eOther: women of non-Asian ethnicity
Fig. 2Expected and observed outcomes according to Gail scores generated by baseline questionnaires. Overall chi-squared test, p < 0.0001 (D1–D9 categories only; p = 0.57). D decile
Comparison of expected and observed cases of invasive breast cancer, and hazard ratios for observed cases, according to Gail model predicted 5-year risk for all women by age group, and for group level risk quintiles (Q1 to Q5) and, within Q5, the upper two deciles of risk (D9 and D10)
| Age (years) | Quantile group | Predicted 5-year | No. of women | Observed (O) breast cancers | Expected (E) breast cancers | Person-years (PY) | O per 10,000 PY | E/0 (95% CI)a | HR (95% CI), p value | HR (95% CI), p value |
|---|---|---|---|---|---|---|---|---|---|---|
| 50–69 | Q1 | 0.6–1.1% | 8041 | 72 | 65 | 34,078 | 21 | 0.90 (0.71–1.15) | 0.59 (0.44–0.79), | Referent |
| Q2 | 1.1–1.4% | 8096 | 77 | 86 | 34,160 | 23 | 1.12 (0.89–1.42) | 0.63 (0.47–0.84), | 1.07 (0.78–1.48), | |
| Q3 | 1.4–1.7% | 8124 | 122 | 105 | 34,264 | 36 | 0.86 (0.72–1.04) | Referent | 1.70 (1.27–2.27), | |
| Q4 | 1.7–2.3% | 7902 | 132 | 131 | 33,254 | 40 | 0.99 (0.84–1.19) | 1.11 (0.87–1.43), | 1.89 (1.42–2.52), | |
| Q5 | 2.3–22.0% | 7995 | 161 | 225 | 33,690 | 48 | 1.40 (1.20–1.64) | 1.34 (1.06–1.70), | 2.28 (1.73–3.02), | |
| D9 | 2.3–3.0% | 3980 | 78 | 88 | 16,790 | 46 | 1.13 (0.91–1.43) | 1.30 (0.98–1.73), | 2.21 (1.61–3.05), | |
| D10 | 3.0–22.0% | 4015 | 83 | 137 | 16,900 | 49 | 1.65 (1.33–2.07) | 1.39 (1.05–1.83), | 2.35 (1.72–3.23), | |
| Total | 0.2–21.7% | 40,158 | 564 | 612 | 169,445 | 33 | 1.09 (1.00–1.18) | |||
| 50–59 | Q1 | 0.6–1.1% | 4046 | 35 | 29 | 17,131 | 20 | 0.83 (0.60–1.19) | 0.83 (0.53–1.30), | Referent |
| Q2 | 1.1–1.4% | 4054 | 33 | 38 | 17,167 | 19 | 1.14 (0.81–1.66) | 0.78 (0.49–1.23), | 0.94 (0.58–1.51), | |
| Q3 | 1.4–1.7% | 4062 | 42 | 46 | 17,117 | 25 | 1.10 (0.81–1.52) | Referent | 1.21 (0.77–1.89), | |
| Q4 | 1.7–2.3% | 4013 | 67 | 58 | 16,917 | 40 | 0.87 (0.68–1.12) | 1.61 (1.10–2.37), | 1.95 (1.29–2.93), | |
| Q5 | 2.3–13.9% | 4041 | 67 | 94 | 17,057 | 39 | 1.40 (1.10–1.80) | 1.60 (1.09–2.35), | 1.93 (1.28–2.90), | |
| D9 | 2.1–2.5% | 2028 | 30 | 38 | 8567 | 35 | 1.28 (0.89–1.89) | 1.43 (0.89–2.28), | 1.72 (1.06–2.80), | |
| D10 | 2.5–13.9% | 2013 | 37 | 55 | 8490 | 44 | 1.49 (1.08–2.12) | 1.77 (1.14–2.76), | 2.14 (1.35–3.40), | |
| Total | 0.2–21.7% | 20,216 | 244 | 264 | 85,388 | 29 | 1.08 (0.96–1.23) | |||
| 60–69 | Q1 | 0.9–1.1% | 4026 | 41 | 39 | 17,022 | 24 | 0.96 (0.71–1.34) | 0.64 (0.43–0.95), | Referent |
| Q2 | 1.1–1.4% | 3992 | 55 | 48 | 16,833 | 33 | 0.88 (0.68–1.17) | 0.87 (0.61–1.25), | 1.36 (0.91–2.04), | |
| Q3 | 1.4–1.7% | 4041 | 64 | 59 | 17,069 | 37 | 0.92 (0.72–1.19) | Referent | 1.56 (1.06–2.31), | |
| Q4 | 1.7–2.3% | 3946 | 74 | 75 | 16,573 | 45 | 1.02 (0.81–1.30) | 1.19 (0.85–1.67), | 1.86 (1.27–2.73), | |
| Q5 | 2.3–22.0% | 3937 | 86 | 126 | 16,560 | 52 | 1.47 (1.19–1.84) | 1.40 (1.01–1.93), | 2.18 (1.50–3.16), | |
| D9 | 2.8–3.3% | 1953 | 40 | 50 | 8192 | 49 | 1.24 (0.91–1.74) | 1.31 (0.88–1.94), | 2.04 (1.32–3.16), | |
| D10 | 3.3–22.0% | 1984 | 46 | 77 | 8368 | 55 | 1.66 (1.25–2.27) | 1.48 (1.01–2.16), | 2.31 (1.52–3.53), | |
| Total | 0.2–21.7% | 19,942 | 320 | 348 | 84,057 | 38 | 1.09 (0.97–1.22) |
CI confidence interval, D, decile, HR hazard ratio, Q quintile
a Chi-squared test across Q1–Q5 (O vs E) were: χ2= 23.0, p = 0.0001 for women 50–69 years old; χ2 = 11.0, p = 0.0262 for women 50–59 years old; and χ2 = 14.4, p = 0.0063 for women 60–69 years old
b Log rank tests for trend across hazard functions Q1–Q5 were χ2 = 52, p < 0.0001 for women 50–69 years old; χ2 = 20, p < 0.0001 for women 50–59 years old; and χ2 = 21, p < 0.0001 for women 60–69 years old
Hazard ratios for incident invasive breast cancer in women 50–69 years old, according to reduced Gail model (5-year risk) quintiles and area under the curve (AUC) for each model using continuous Gail scores
| Gail Subset (model) | Frequency in rank (%) | Risk score quintile group (5-year risk) | Test for trend | AUC (95% CI) | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Q1 | Q2 | Q3 | Q4 | Q5 | D9 | D10 | ||||
| Age (M1) | 100 | Referent | 1.25 (0.95–1.64), | 1.37 (1.03–1.83), | 1.38 (1.06–1.81), | 1.96 (1.51–2.56), | 2.06 (1.52–2.79), | 1.85 (1.33–2.58), | χ2 = 23, | 0.56 (0.53–0.58) |
| + first live birth age (M2) | 62 | Referent | 0.94 (0.70–1.26), | 1.18 (0.90–1.54), | 1.38 (1.05–1.80), | 1.64 (1.27–2.12), | 1.50 (1.09–2.06), | 1.77 (1.32–2.38), | χ2 = 21, | 0.56 (0.54–0.58) |
| + age at menarche (M3) | 62 | Referent | 1.12 (0.83–1.51), | 1.50 (1.13–1.98), | 1.49 (1.13–1.97), | 1.89 (1.44–2.47), | 1.76 (1.28–2.43), | 2.01 (1.48–2.73), | χ2 = 26, | 0.56 (0.54–0.59) |
| + number of first-degree relatives (M4) | 84 | Referent | 1.22 (0.90–1.67), | 1.43 (1.06–1.93), | 2.05 (1.55–2.71), | 2.15 (1.63–2.84), | 1.82 (1.30–2.54), | 2.49 (1.82–3.39), | χ2 = 45, | 0.58 (0.56–0.60) |
| + number of biopsies (M5) | 79 | Referent | 1.06 (0.77–1.46), | 1.69 (1.27–2.27), | 1.91 (1.43–2.54), | 2.28 (1.73–3.01), | 2.22 (1.61–3.06), | 2.34 (1.71–3.21), | χ2 = 52, | 0.59 (0.56–0.61) |
| + had biopsy (M6) | 91 | Referent | 1.07 (0.78–1.48), | 1.70 (1.27–2.27), | 1.90 (1.43–2.52), | 2.29 (1.74–3.02), | 2.24 (1.63–3.08), | 2.33 (1.70–3.20), | χ2 = 52, | 0.59 (0.56–0.61) |
| + ethnicity (M7) | 98 | Referent | 1.07 (0.78–1.48), | 1.70 (1.27–2.27), | 1.90 (1.43–2.52), | 2.29 (1.74–3.02), | 2.24 (1.63–3.08), | 2.33 (1.70–3.20), | χ2 = 52, | 0.59 (0.56–0.61) |
| + had hyperplasia (full model) (M8) | 100 | Referent | 1.07 (0.78–1.48), | 1.70 (1.27–2.27), | 1.89 (1.42–2.52), | 2.28 (1.73–3.02), | 2.21 (1.61–3.05), | 2.35 (1.72–3.23), | χ2 = 52, | 0.59 (0.56–0.61) |
Values are shown as hazard ratio (95% CI), p value
CI confidence interval, D decile, Q quintile
a Tests for trend for each model across hazard functions for risk score groups Q1 to Q5
Fig. 3Observed incident cancers per 10,000 women according to quantile groups for the reduced BCRAT models (M1–8) assessed. M model, Q quintile