| Literature DB >> 30526533 |
Carina Forsare1, Martin Bak2, Anna-Karin Falck3, Dorthe Grabau4, Fredrika Killander5,6, Per Malmström5,6, Lisa Rydén7,8, Olle Stål9, Marie Sundqvist10, Pär-Ola Bendahl5, Mårten Fernö5.
Abstract
BACKGROUND: Prognostic factors in breast cancer are often measured on a continuous scale, but categorized for clinical decision-making. The primary aim of this study was to evaluate if accounting for continuous non-linear effects of the three factors age at diagnosis, tumor size, and number of positive lymph nodes improves prognostication. These factors will most likely be included in the management of breast cancer patients also in the future, after an expected implementation of gene expression profiling for adjuvant treatment decision-making.Entities:
Keywords: Breast cancer; Categorical; Continuous; Fractional polynomials; Prognostic; Splines
Mesh:
Year: 2018 PMID: 30526533 PMCID: PMC6286551 DOI: 10.1186/s12885-018-5123-x
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Fig. 1Flow diagram of the 4568 eligible patients, inclusion periods, patients excluded, and number of patients included for the different patient materials that constituted the derivation set. Patients were excluded due to missing information on follow-up, number of positive lymph nodes, and/or tumor size. One hundred and one patients were excluded and the final number of patients included was 4477
Patient and tumor characteristics for the derivation and validation sets
| Factor | Derivation set | Validation set |
|---|---|---|
|
| 4477 | 1132 |
|
| 1315 (29)b | 289 (26) |
| 60 | 64 | |
| 25–93 | 28–99 | |
| Age < 35 | 69 (2) | 11 (1) |
| Age 35–50 | 1023 (23) | 202 (18) |
| Age > 50 | 3385 (76) | 919 (81) |
| 22 | 20 | |
| 1–120 | 1–160 | |
| T1 (≤20 mm) | 1942 (43) | 590 (52) |
| T2 (21–50 mm) | 2460 (55) | 506 (45) |
| T3 (> 50 mm) | 75 (2) | 36 (3) |
| 1 | 0 | |
| 0–47 | 0–23 | |
| Negative | 1783 (40) | 659 (58) |
| 1–3 positive | 1781 (40) | 305 (27) |
| 4–9 positive | 649 (14) | 127 (11) |
| ≥ 10 positive | 264 (6) | 41 (4) |
|
| ||
| Endocrine therapy | 2662 (59) | 673 (59) |
| Chemotherapy | 460 (10) | 52 (5) |
| Chemo-endocrine | 74 (2) | 38 (3) |
| None | 1279 (29) | 369 (33) |
| Missing | 2 | 0 |
aFollow-up truncated at 10 years
bNumbers in parentheses are percentages
Fig. 2Univariable analyses with the predictors dichotomized (< 35 vs. ≥35 years, > 20 vs. ≤20 mm, and positive vs. negative lymph nodes; 2a–c) and categorized accordingly: age at diagnosis in three categories (> 50, 35–50, and < 35 years at diagnosis), tumor size in three (T1, T2, and T3), and lymph nodes in four categories (N0, N1–3, N4–9, and N ≥ 10; 2d–f)
Fig. 3Univariable analysis of non-linear effects for age, tumor size, and number of positive lymph nodes using MFP (3a–c) and RCS (3d–f). For each factor X, an estimate of the relative hazard is shown as a function of X. The estimate is based on a Cox model with fractional polynomial transformation of X. A reference value was chosen for each factor (35 for age, 20 for tumor size, and zero for number of positive lymph nodes). The relative hazard for this value will be 1.00 per definition. For other values of each factor the relative hazard will be an estimate with a corresponding 95% CI shown as a band around the point estimate. The shaded area in the background is a kernel density estimate of the distribution of each factor and the dots represent the values observed
Fig. 4Histograms showing the distributions of the prognostic indices (PI) corresponding to the final MFP model in the derivation set and the validation set. The indices for both sets have been centered by subtracting the mean PI in the derivation set. The vertical lines represent the 16th, 50th, and 84th percentiles of the PI-distribution in the derivation set – cut-offs which identify groups of different relative sizes in the validation set
Hazard ratios from analysis of D-RFi for Cox-models with MFP- and RCStransformations of the predictors
| Model | Derivation | Validation |
|---|---|---|
|
|
|
|
| G2 vs. G1 | 2.46 (1.89–3.20) | 1.16 (0.78–1.72) |
| G3 vs. G1 | 4.28 (3.31–5.54) | 3.36 (2.43–4.65) |
| G4 vs. G1 | 10.5 (8.13–13.7) | 6.92 (4.85–9.86) |
|
| ||
| G2 vs. G1 | 2.36 (1.81–3.06) | 1.46 (1.00–2.13) |
| G3 vs. G1 | 4.15 (3.23–5.34) | 3.10 (2.20–4.35) |
| G4 vs. G1 | 10.4 (8.06–13.4) | 7.97 (5.57–11.4) |
For each method of covariate transformation, the prognostic index (PI) in the derivation set was categorized at the 16th, the 50th, and the 84th percentiles forming four risk groups named G1 to G4. The parameter estimates from the derivation set for MFP and RCS were used to calculate the PIs for each patient in the validation set. Each of these two indices was thereafter categorized into four groups using the percentile based cut-off values from the derivation set
Fig. 5Kaplan-Meier estimates of distant recurrence-free interval (D-RFi) for risk groups G1 to G4 formed by categorization of the prognostic index (PI) for the MFP model in the derivation set at the 16th, 50th and 84th percentiles. Solid lines were used for estimates in the derivation set and dashed lines for the corresponding estimates in the validation set. Note that the actual values of the PI at the cutoffs in the derivation set were used as cutoffs for the PI in the validation set, leading to different relative sizes of the four groups in the two datasets. Abbreviations: Der. = Derivation set, Val. = Validation set