| Literature DB >> 35296284 |
Hyunmin Jung1, Mingshan Lu1,2, May Lynn Quan2,3,4, Winson Y Cheung1,3, Shiying Kong1,2,3, Sasha Lupichuk4, Yuanchao Feng2,5, Yuan Xu6,7,8,9.
Abstract
BACKGROUND: In cancer survival analyses using population-based data, researchers face the challenge of ascertaining the timing of recurrence. We previously developed algorithms to identify recurrence of breast cancer. This is a follow-up study to detect the timing of recurrence.Entities:
Keywords: Breast cancer; Identification algorithm; Real-world data; Survival analysis; Timing of recurrence
Mesh:
Year: 2022 PMID: 35296284 PMCID: PMC8925135 DOI: 10.1186/s12885-022-09333-6
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
The comparison of characteristics between chart review and algorithm determined cohort
| Variable | Chart-review determined | Algorithm estimated recurrence | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Median (IQR) | 40 (36–53) | 40 (36–50) | 0.545 | 40 (37–53) | 0.436 | 40 (37–50) | 0.883 | 40 (36–53) | 0.405 | |
| 0-I | 95 (15.9) | 9 (7.4) | < 0.0001 | 5 (4.3) | 0.423 | 6 (6.1) | 0.598 | 10 (6.9) | 0.921 | |
| II | 311 (52) | 41 (33.9) | 39 (33.6) | 30 (30.3) | 51 (35.4) | |||||
| III | 192 (32.1) | 71 (58.7) | 72 (62.1) | 63 (63.6) | 83 (57.6) | |||||
| Negative | 223 (37.3) | 51 (42.1) | 0.216 | 54 (46.6) | 0.332 | 41 (41.4) | 0.89 | 63 (43.8) | 0.688 | |
| Positive | 375 (62.7) | 70 (57.9) | 62 (53.4) | 58 (58.6) | 81 (56.3) | |||||
| Negative | 159 (26.6) | 37 (30.6) | 0.266 | 40 (34.5) | 0.364 | 31 (31.3) | 0.878 | 47 (32.6) | 0.596 | |
| Positive | 439 (73.4) | 84 (69.4) | 76 (65.5) | 68 (68.7) | 97 (67.4) | |||||
| Negative | 437 (73.1) | 98 (81) | 0.028 | 93 (80.2) | 0.82 | 79 (79.8) | 0.761 | 112 (77.8) | 0.324 | |
| Positive | 161 (26.9) | 23 (19) | 23 (19.8) | 20 (20.2) | 32 (22.2) | |||||
| Ductal | 538 (90) | 106 (87.6) | 0.349 | 98 (84.5) | 0.595 | 83 (83.8) | 0.478 | 125 (86.8) | 0.949 | |
| Lobular | 27 (4.5) | 5 (4.1) | 6 (5.2) | 6 (6.1) | 6 (4.2) | |||||
| Others | 33 (5.5) | 10 (8.3) | 12 (10.3) | 10 (10.1) | 13 (9.1) | |||||
| 1 | 45 (7.5) | 5 (4.1) | 0.284 | 6 (4.2) | 0.024 | 4 (4.0) | 0.04 | 4 (3.5) | 0.038 | |
| 2 | 211 (35.3) | 44 (36.3) | 46 (31.9) | 30 (30.3) | 35 (30.2) | |||||
| 3 | 342 (57.2) | 72 (59.5) | 92 (63.9) | 65 (65.7) | 77 (66.4) | |||||
| No surgery | 5 (0.8) | 1 (0.8) | 0.002 | 1 (0.9) | 0.370 | 1 (1) | 0.052 | 5 (3.5) | 0.135 | |
| BCS | 159 (26.6) | 18 (14.9) | 13 (11.2) | 7 (7.1) | 23 (16) | |||||
| Mastectomy | 434 (72.6) | 102 (84.3) | 102 (87.9) | 91 (91.9) | 116 (80.6) | |||||
| No | 51 (8.5) | 10 (8.3) | 0.614 | 7 (6) | 0.377 | 7 (7.1) | 0.658 | 15 (10.4) | 0.357 | |
| Yes | 547 (91.5) | 111 (91.7) | 109 (94) | 92 (92.9) | 129 (89.6) | |||||
| No | 192 (32.1) | 43 (35.5) | 0.396 | 36 (31) | 0.397 | 32 (32.3) | 0.63 | 49 (34) | 0.762 | |
| Yes | 214 (35.8) | 37 (30.6) | 34 (29.3) | 29 (29.3) | 42 (29.2) | |||||
| Unknown | 192 (32.1) | 41 (33.9) | 46 (39.7) | 38 (38.4) | 53 (36.8) | |||||
| No | 209 (34.9) | 40 (33.1) | 0.587 | 36 (31) | 0.658 | 31 (31.3) | 0.718 | 48 (33.3) | 0.573 | |
| Yes | 194 (32.4) | 44 (36.4) | 40 (34.5) | 34 (34.3) | 47 (32.6) | |||||
| Unknown | 195 (32.6) | 37 (30.6) | 40 (34.5) | 34 (34.3) | 49 (34) | |||||
* indicates the statistically significant difference (P < 0.05) between the recurrent and non-recurrent patients by univariate analysis based on chart review data
^ The p-values is for the statistical comparison between algorithm-estimated recurrence cohort and the chart review determined recurrence cohort
The difference between algorithm-estimated and real chart review determined date of recurrence (N = 121)*
| Absolute difference from chart review (month) | High-sensitivity algorithm | High-PPV algorithm | High-accuracy algorithm | |||
|---|---|---|---|---|---|---|
| 51 (42.1) | 51 (42.1) | 41 (33.9) | 41 (33.9) | 47 (38.8) | 47 (38.8) | |
| 24 (19.8) | 75 (62.0) | 22 (18.2) | 63 (52.0) | 22 (18.2) | 69 (57.0) | |
| 11 (9.1) | 86 (71.1) | 10 (8.3) | 73 (60.3) | 9 (7.4) | 78 (64.5) | |
| 12 (9.9) | 98 (81.0) | 16 (13.2) | 89 (73.5) | 14 (11.5) | 92 (76.0) | |
| 23 (19.0) | 121 (100.0) | 32 (26.5) | 121 (100.0) | 29 (24.0) | 121 (100.0) | |
RFS recurrence-free survival, PPV positive predictive value
* This was determined based on chart review
Fig. 1The comparison between estimated and chart review derived K-M curves for RFS. A shows the comparison of K-M curves between the high sensitivity algorithm estimated and chart review data with logrank p-value = 0.117; B shows the comparison of K-M curves between the high PPV algorithm estimated and chart review data with logrank p-value = 0.111; C shows the comparison of K-M curves between the high accuracy algorithm estimated and chart review data with logrank p-value = 0.729
The comparison of Cox regression of RFS between chart review and algorithm determined cohort
| Age (year) | < = 35 | Reference | 0.124 | Reference | 0.123 | Reference | 0.125 | Reference | 0.162 |
| 36–40 | 1.51 (0.89–2.57) | 1.62 (0.92–2.84) | 1.42 (0.78–2.57) | 1.39 (0.85–2.27) | |||||
| 41–55 | 0.92 (0.51–1.66) | 0.87 (0.47–1.6) | 0.83 (0.44–1.6) | 0.75 (0.44–1.29) | |||||
| > = 56 | 0.66 (0.33–1.32) | 0.7 (0.35–1.4) | 0.53 (0.25–1.16) | 0.72 (0.4–1.31) | |||||
| Tumor stage* | 0-I | Reference | < 0.0001 | Reference | < 0.0001 | Reference | < 0.0001 | Reference | < 0.0001 |
| II | 1.83 (0.82–4.06) | 2.92 (1.08–7.91) | 1.84 (0.71–4.78) | 2.3 (1.09–4.87) | |||||
| III | 7.19 (3.24–15.92) | 11.83 (4.38–31.91) | 8.62 (3.37–22.03) | 8.19 (3.85–17.43) | |||||
| ER status | Negative | Reference | 0.622 | Reference | 0.181 | Reference | 0.397 | Reference | 0.229 |
| Positive | 0.87 (0.5–1.52) | 0.69 (0.41–1.19) | 0.77 (0.43–1.4) | 0.74 (0.45–1.21) | |||||
| PR status | Negative | Reference | 0.199 | Reference | 0.074 | Reference | 0.672 | Reference | 0.31 |
| Positive | 0.71 (0.43–1.19) | 0.63 (0.38–1.05) | 0.89 (0.51–1.55) | 0.79 (0.49–1.25) | |||||
| HER2 status* | Negative | Reference | 0.003 | Reference | 0.003 | Reference | 0.007 | Reference | 0.016 |
| Positive | 0.48 (0.3–0.77) | 0.48 (0.3–0.77) | 0.5 (0.3–0.83) | 0.6 (0.4–0.91) | |||||
| Histology | Ductal | Reference | 0.182 | Reference | 0.058 | Reference | 0.102 | Reference | 0.102 |
| Lobular | 0.52 (0.2–1.36) | 0.9 (0.37–2.2) | 0.97 (0.39–2.43) | 0.57 (0.24–1.38) | |||||
| Other | 1.51 (0.75–3.01) | 2.17 (1.13–4.17) | 2.16 (1.06–4.41) | 1.67 (0.9–3.08) | |||||
| Tumor grade | 1 | Reference | 0.379 | Reference | 0.052 | Reference | 0.065 | Reference | 0.101 |
| 2 | 1.92 (0.75–4.93) | 2.19 (0.76–6.3) | 1.68 (0.58–4.88) | 1.83 (0.77–4.38) | |||||
| 3 | 1.92 (0.75–4.93) | 3.12 (1.0–8.86) | 2.66 (0.93–7.63) | 2.38 (0.99–5.66) | |||||
| Hormone therapy* | No | Reference | 0.013 | Reference | 0.002 | Reference | 0.016 | Reference | 0.05 |
| Yes | 0.74 (0.4–1.36) | 1.22 (0.64–2.36) | 0.84 (0.42–1.67) | 0.99 (0.57–1.72) | |||||
| Unknown | 3.65 (1.35–9.81) | 5.14 (2.06–12.83) | 4.34 (1.47–12.79) | 2.72 (1.16–6.35) | |||||
| Radiotherapy* | No | Reference | 0.017 | Reference | 0.002 | Reference | 0.030 | Reference | 0.058 |
| Yes | 1.01 (0.55–1.87) | 1.03 (0.55–1.96) | 0.88 (0.44–1.75) | 0.97 (0.56–1.69) | |||||
| Unknown | 0.25 (0.1–0.66) | 0.22 (0.09–0.52) | 0.24 (0.09–0.7) | 0.38 (0.17–0.85) | |||||
| Chemotherapy | No | Reference | 0.158 | Reference | 0.216 | Reference | 0.258 | Reference | 0.001 |
| Yes | 0.57 (0.27–1.24) | 0.57 (0.24–1.38) | 0.59 (0.24–1.47) | 0.33 (0.17–0.63)^ | |||||
| Year of diagnosis | 2007–2009 | Reference | 0.784 | Reference | 0.228 | Reference | 0.394 | Reference | 0.188 |
| 2010–2012 | 1.29 (0.63–2.63) | 1.7 (0.82–3.52) | 1.07 (0.48–2.39) | 1.78 (0.96–3.31) | |||||
| 2013–2015 | 1.27 (0.56–2.88) | 2.07 (0.9–4.75) | 1.55 (0.63–3.79) | 1.71 (0.83–3.54) |
HR hazard ratio, CI confidence interval, PPV positive predictive value, ER estrogen receptor, PR progesterone receptor, HER2 Human Epidermal growth factor Receptor 2
* indicates that the variable was statistically significant (p-value < 0.05) based on chart review data
^ indicates that the variable was statistically significant (p-value < 0.05) based on high-sensitivity algorithm estimated data