| Literature DB >> 28811593 |
Zhu Zhu1,2, Ling Li2, Zhong Ye2, Tong Fu1, Ye Du1, Aiping Shi1, Di Wu1, Ke Li3, Yifan Zhu2, Chun Wang4,5, Zhimin Fan6.
Abstract
The prognostic value of routine laboratory variables in breast cancer has been largely overlooked. Based on laboratory tests commonly performed in clinical practice, we aimed to develop a new model to predict disease free survival (DFS) after surgical removal of primary breast cancer. In a cohort of 1,596 breast cancer patients, we analyzed the associations of 33 laboratory variables with patient DFS. Based on 3 significant laboratory variables (hemoglobin, alkaline phosphatase, and international normalized ratio), together with important demographic and clinical variables, we developed a prognostic model, achieving the area under the curve of 0.79. We categorized patients into 3 risk groups according to the prognostic index developed from the final model. Compared with the patients in the low-risk group, those in the medium- and high-risk group had a significantly increased risk of recurrence with a hazard ratio (HR) of 1.75 (95% confidence interval [CI] 1.30-2.38) and 4.66 (95% CI 3.54-6.14), respectively. The results from the training set were validated in the testing set. Overall, our prognostic model incorporating readily available routine laboratory tests is powerful in identifying breast cancer patients who are at high risk of recurrence. Further study is warranted to validate its clinical application.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28811593 PMCID: PMC5557903 DOI: 10.1038/s41598-017-08240-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Diagram of Study Population Selection. All of patients were histologically confirmed as having breast cancer and were diagnosed and/or treated in the Kimmel Cancer Center at the Thomas Jefferson University Hospital. A cohort of 1,596 breast cancer patients was included in this study based on the selection criteria.
Characteristics of patients in the training and testing sets.
| Patients characteristic | Training set N = 1,064 (%) | Testing set N = 532 (%) |
|
|---|---|---|---|
| Age, mean ± SD, (years) | 59.0 ± 14.0 | 59.6 ± 12.9 | 0.35 |
| Race/ethnicity | 0.91 | ||
| Caucasian | 777 (73.0) | 390 (73.3) | |
| African American | 214 (20.1) | 112 (21.0) | |
| Asian | 42 (4.0) | 17 (3.2) | |
| Others | 12 (1.1) | 5 (0.9) | |
| Unknown | 19 (1.8) | 8 (1.5) | |
| Smoking status | 0.55 | ||
| Never smoking | 571 (53.7) | 265 (49.8) | |
| Current smoking | 124 (11.6) | 67 (12.6) | |
| Former smoking | 215 (20.2) | 116 (21.8) | |
| Unknown | 154 (14.5) | 84 (15.8) | |
| Drinking status | 0.99 | ||
| Never drinking | 482 (45.3) | 237 (44.5) | |
| Current drinking | 380 (35.7) | 192 (36.1) | |
| Former drinking | 6 (0.6) | 3 (0.6) | |
| Unknown | 196 (18.4) | 100 (18.8) | |
| Tumor stage | 0.73 | ||
| Stage I | 566 (53.2) | 275 (51.7) | |
| Stage II | 355 (33.4) | 190 (35.7) | |
| Stage III | 112 (10.5) | 53 (10.0) | |
| Stage IV | 31 (2.9) | 14 (2.6) | |
| Tumor grade | 0.04 | ||
| Well differentiated | 121 (11.4) | 87 (16.4) | |
| Moderately differentiated | 410 (38.5) | 201 (37.8) | |
| Poorly differentiated | 395 (37.1) | 178 (33.5) | |
| Not determined | 138 (13.0) | 66 (12.4) | |
| Tumor histology | 0.80 | ||
| Invasive ductal carcinoma | 868 (81.6) | 438 (82.3) | |
| Invasive lobular carcinoma | 74 (6.9) | 38 (7.1) | |
| Mixed carcinoma | 106 (10.0) | 46 (8.7) | |
| Othersa | 16 (1.5) | 10 (1.9) | |
| Tumor size | 0.47 | ||
| 2–9 mm | 181 (17.0) | 98 (18.4) | |
| 10–29 mm | 447 (42.0) | 232 (43.6) | |
| 30–49 mm | 106 (10.0) | 54 (10.1) | |
| 50–99 mm | 71 (6.7) | 28 (5.3) | |
| ≥10 cm | 13 (1.2) | 2 (0.4) | |
| Unknown/not found | 246 (23.1) | 118 (22.2) | |
| Lymph nodes metastatic rate | 0.50 | ||
| 0% | 591 (55.5) | 289 (54.3) | |
| 1–20% | 128 (12.0) | 77 (14.5) | |
| 20–49% | 55 (5.2) | 31 (5.8) | |
| 50–79% | 32 (3.0) | 23 (4.3) | |
| 80–100% | 34 (3.2) | 15 (2.8) | |
| Not determined | 176 (16.5) | 76 (14.3) | |
| Unknown | 48 (4.5) | 21 (4.0) | |
| Estrogen receptor status | 0.77 | ||
| Negative | 223 (21.0) | 104 (19.6) | |
| Positive | 734 (69.0) | 371 (69.7) | |
| Unknown | 107 (10.0) | 57 (10.7) | |
| Progesterone receptor status | 0.76 | ||
| Negative | 313 (29.5) | 154 (28.9) | |
| Positive | 642 (60.3) | 317 (59.6) | |
| Unknown | 109 (10.2) | 61 (11.5) | |
| Chemotherapy | 0.97 | ||
| No | 618 (58.1) | 307 (57.7) | |
| Yes | 414 (38.9) | 208 (39.1) | |
| Unknown | 32 (3.0) | 17 (3.2) | |
| Radiation therapy | 0.12 | ||
| No | 603 (56.7) | 290 (54.5) | |
| Yes | 427 (40.1) | 214 (40.2) | |
| Unknown | 34 (3.2) | 28 (5.3) | |
| Hormone therapy | 0.64 | ||
| No | 652 (61.3) | 323 (60.7) | |
| Yes | 339 (31.9) | 178 (33.5) | |
| Unknown | 73 (6.9) | 31 (5.8) |
Abbreviations: SD, standard deviation.
aOthers include colloid, medullary, tubular, papillary carcinoma, and Paget’s disease.
Candidate laboratory variables selected by univariate analysis in the training set.
| Variablesa | % of missing value | No. of patients disease free/recurrence | HR (95% CI) | Cox | Log-rank | Brootstrap % (<0.05) |
|---|---|---|---|---|---|---|
| HCT | 3.57 | <0.0001 | 98.1 | |||
| ≤37.48% | 295/218 | 1.00 | ||||
| >37.48% | 358/155 | 0.65 (0.53–0.80) | <0.0001 | |||
| HGB | 3.48 | <0.0001 | 99.7 | |||
| ≤12.50 T/L | 306/211 | 1.00 | ||||
| >12.50 T/L | 348/162 | 0.62 (0.51–0.76) | <0.0001 | |||
| RBC | 3.01 | <0.0001 | 97.9 | |||
| ≤4.19 T/L | 300/216 | 1.00 | ||||
| >4.19 T/L | 358/158 | 0.67 (0.54–0.82) | 0.0001 | |||
| RDW | 3.57 | <0.0001 | 100.0 | |||
| ≤13.40% | 371/160 | 1.00 | ||||
| >13.40% | 282/213 | 1.85 (1.51–2.27) | <0.0001 | |||
| Albumin | 36.09 | 0.03 | 70.1 | |||
| ≤4.30 g/dL | 266/115 | 1.00 | ||||
| >4.30 g/dL | 191/108 | 0.74 (0.57–0.97) | 0.03 | |||
| ALP | 30.64 | 0.002 | 87.4 | |||
| ≤69 IU/L | 268/111 | 1.00 | ||||
| >69 IU/L | 224/135 | 1.48 (1.15–1.91) | 0.002 | |||
| INR | 39.85 | <0.0001 | 99.9 | |||
| ≤1.02 | 234/99 | 1.00 | ||||
| >1.02 | 175/132 | 2.00 (1.54–2.60) | <0.0001 | |||
| PT | 45.77 | <0.0001 | 100.0 | |||
| ≤13.55 | 214/75 | 1.00 | ||||
| >13.55 | 147/141 | 2.14 (1.61–2.85) | <0.0001 |
Abbreviations: RBC, red blood cell; HGB, hemoglobin; HCT, hematocrit; RDW, red cell distribution width; INR, international normalized ratio; ALP, alkaline phosphatase; HR, hazard ratio; CI, confidence interval.
aVariables were categorized by the median value in the study population.
Parameter estimates and standard errors in the final model.
| Variables | Coefficients | SE |
|---|---|---|
| Age | 0.03 | 0.004 |
| Race (Caucasian as reference) | ||
| African American | 0.21 | 0.13 |
| others | 0.15 | 0.23 |
| Stage (stage I as reference) | ||
| II | 0.29 | 0.17 |
| III | 1.07 | 0.23 |
| IV | 1.87 | 0.28 |
| Tumor size (2–9 mm as reference) | ||
| 10–29 mm | −0.07 | 0.20 |
| 30–49 mm | −0.11 | 0.15 |
| 50–99 mm | 0.06 | 0.21 |
| ≥10 cm | 0.08 | 0.21 |
| Unknown/not found | 0.07 | 0.40 |
| Lymph nodes metastatic rate (0% as reference) | ||
| 1–20% | −0.11 | 0.20 |
| 20–49% | −0.15 | 0.26 |
| 50–79% | −0.15 | 0.31 |
| 80–100% | 0.38 | 0.26 |
| Not determined | 0.08 | 0.17 |
| Unknown | 0.06 | 0.23 |
| ER status (Negative as reference) | ||
| Positive | −0.18 | 0.20 |
| Unknown | 0.14 | 1.05 |
| PR status (Negative as reference) | ||
| Positive | −0.15 | 0.18 |
| Unknown | −0.13 | 1.04 |
| Chemotherapy (without chemotherapy as reference) | ||
| With chemotherapy | 0.08 | 0.42 |
| Unknown | 0.02 | 0.14 |
| Radiation therapy (without radiation therapy as reference) | ||
| With radiation | 0.26 | 0.35 |
| Unknown | −0.43 | 0.12 |
| Hormone therapy (without hormone therapy as reference) | ||
| With hormone | −0.30 | 0.27 |
| Unknown | 0.002 | 0.13 |
| Square of HGB* | −0.004 | 0.001 |
| Nature Logarithm of ALP* | 0.35 | 0.17 |
| Inverse cube of INR* | −0.47 | 0.20 |
Abbreviations: ER, estrogen receptor; PR, progesterone receptor; HGB, hemoglobin; ALP, alkaline phosphatase; INR, international normalized ratio; SE, standard error.
*In order to calculate the prognostic index for the testing set, the missing data of HGB, ALP, and INR in the testing set were imputed as the mean values from the training set (after normality transformation, 154.57 for HGB, 4.24 for ALP, and 0.93 for INR, respectively).
Figure 2Assessment of model performance. The receiver operating characteristics curves were used to assess the performance of the final model in the training and testing sets.
Summary of disease free survival by risk category.
| Groups | No. of patients disease free/recurrence | Median DFS time (year) | HR (95% CI) | Cox | Log-rank |
|---|---|---|---|---|---|
|
| |||||
| Low-risk | 287/68 | NR | 1.00 | <0.0001 | |
| Medium-risk | 247/108 | 10.00 (8.14–11.93) | 1.75 (1.30–2.38) | 0.0003 | |
| High-risk | 149/205 | 4.26 (3.32–4.92) | 4.66 (3.54–6.14) | <0.0001 | |
|
| |||||
| Low-risk | 151/26 | NR | 1.00 | <0.0001 | |
| Medium-risk | 133/45 | 10.77 (8.77–12.51) | 1.98 (1.22–3.21) | 0.006 | |
| High-risk | 86/91 | 5.66 (3.72–7.43) | 5.33 (3.44–8.27) | <0.0001 | |
Abbreviations and definitions: DFS, disease free survival; HR, hazard ratio; CI, confidence interval; NR, not reached.
Figure 3Disease free survival of different risk groups stratified by the final model. Kaplan-Meier survival estimates were used to characterize patients of different risk groups classified by the final model in the training (A) and testing (B) sets.