| Literature DB >> 34017679 |
Ji-Yeon Kim1, Yong Seok Lee2, Jonghan Yu3, Youngmin Park2, Se Kyung Lee3, Minyoung Lee2, Jeong Eon Lee3, Seok Won Kim3, Seok Jin Nam3, Yeon Hee Park1, Jin Seok Ahn1, Mira Kang4, Young-Hyuck Im1.
Abstract
Several prognosis prediction models have been developed for breast cancer (BC) patients with curative surgery, but there is still an unmet need to precisely determine BC prognosis for individual BC patients in real time. This is a retrospectively collected data analysis from adjuvant BC registry at Samsung Medical Center between January 2000 and December 2016. The initial data set contained 325 clinical data elements: baseline characteristics with demographics, clinical and pathologic information, and follow-up clinical information including laboratory and imaging data during surveillance. Weibull Time To Event Recurrent Neural Network (WTTE-RNN) by Martinsson was implemented for machine learning. We searched for the optimal window size as time-stamped inputs. To develop the prediction model, data from 13,117 patients were split into training (60%), validation (20%), and test (20%) sets. The median follow-up duration was 4.7 years and the median number of visits was 8.4. We identified 32 features related to BC recurrence and considered them in further analyses. Performance at a point of statistics was calculated using Harrell's C-index and area under the curve (AUC) at each 2-, 5-, and 7-year points. After 200 training epochs with a batch size of 100, the C-index reached 0.92 for the training data set and 0.89 for the validation and test data sets. The AUC values were 0.90 at 2-year point, 0.91 at 5-year point, and 0.91 at 7-year point. The deep learning-based final model outperformed three other machine learning-based models. In terms of pathologic characteristics, the median absolute error (MAE) and weighted mean absolute error (wMAE) showed great results of as little as 3.5%. This BC prognosis model to determine the probability of BC recurrence in real time was developed using information from the time of BC diagnosis and the follow-up period in RNN machine learning model.Entities:
Keywords: adjuvant cohort; breast cancer; machine learning; real time prediction; recurrence model; surveillance
Year: 2021 PMID: 34017679 PMCID: PMC8129587 DOI: 10.3389/fonc.2021.596364
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Study cohort.
Characteristics of the study population.
| Mean ± SD | 49.2 ± 9.9 | 49.4 ± 9.9 | 46.7 ± 10.5 | <0.001 |
| Median (interquartile range) | 48.0 [43.0; 55.0] | 48.0 [43.0; 55.0] | 46.0 [39.0; 53.0] | <0.001 |
| 0.001 | ||||
| Pre-menopausal | 7,576 (57.8%) | 6,816 (57.3%) | 760 (62.6%) | |
| Post-menopausal | 5,481 (41.8%) | 5,033 (42.3%) | 448 (36.9%) | |
| Unknown | 60 (0.5%) | 54 (0.5%) | 6 (0.5%) | |
| 0.783 | ||||
| Underweight | 455 (3.5%) | 414 (3.5%) | 41 (3.4%) | |
| Normal | 6,011 (45.8%) | 5,450 (45.8%) | 561 (46.2%) | |
| Overweight | 3,032 (23.1%) | 2,756 (23.2%) | 276 (22.7%) | |
| Obesity | 3,136 (23.9%) | 2,852 (24.0%) | 284 (23.4%) | |
| High obesity | 477 (3.6%) | 425 (3.6%) | 52 (4.3%) | |
| Unknown | 6 (0.0%) | 6 (0.1%) | 0 (0.0%) | |
| 0.999 | ||||
| Yes | 13 (0.1%) | 12 (0.1%) | 1 (0.1%) | |
| No | 13,104 (99.9%) | 11,891 (99.9%) | 1,213 (99.9%) | |
| Baseline CA 15-3 (Mean ± SD) | 10.5 ± 11.2 | 10.4 ± 10.8 | 12.5 ± 14.2 | <0.001 |
| <0.001 | ||||
| T1 | 7,947 (60.6%) | 7,415 (62.3%) | 532 (43.8%) | |
| T2 | 4,594 (35.0%) | 4,026 (33.8%) | 568 (46.8%) | |
| T3 | 509 (3.9%) | 417 (3.5%) | 92 (7.6%) | |
| T4 | 17 (0.1%) | 11 (0.1%) | 6 (0.5%) | |
| Unknown | 50 (0.4%) | 34 (0.3%) | 16 (1.3%) | |
| <0.001 | ||||
| N0 | 8,304 (63.3%) | 7,756 (65.2%) | 548 (45.1%) | |
| N1 | 3,390 (25.8%) | 3,044 (25.6%) | 346 (28.5%) | |
| N2 | 871 (6.6%) | 715 (6.0%) | 156 (12.9%) | |
| N3 | 487 (3.7%) | 335 (2.8%) | 152 (12.5%) | |
| Unknown | 65 (0.5%) | 53 (0.4%) | 12 (1.0%) | |
| <0.001 | ||||
| Well | 3,186 (24.3%) | 3,067 (25.8%) | 119 (9.8%) | |
| Moderate | 5,431 (41.4%) | 4,949 (41.6%) | 482 (39.7%) | |
| Poorly | 4,036 (30.8%) | 3,490 (29.3%) | 546 (45.0%) | |
| Unknown | 464 (3.5%) | 397 (3.3%) | 67 (5.5%) | |
| <0.001 | ||||
| Yes | 3,846 (29.3%) | 3,315 (27.9%) | 531 (43.7%) | |
| No | 8,749 (66.7%) | 8,147 (68.4%) | 602 (49.6%) | |
| Unknown | 522 (4.0%) | 441 (3.7%) | 81 (6.7%) | |
| <0.001 | ||||
| Yes | 1,279 (9.8%) | 1,142 (9.6%) | 137 (11.3%) | |
| No | 7,682 (58.6%) | 6,836 (57.4%) | 846 (69.7%) | |
| Unknown | 4,156 (31.7%) | 3,925 (33.0%) | 231 (19.0%) | |
| <0.001 | ||||
| Positive | 9,853 (75.1%) | 9,083 (76.3%) | 770 (63.4%) | |
| Negative | 3,219 (24.5%) | 2,784 (23.4%) | 435 (35.8%) | |
| Unknown | 45 (0.3%) | 36 (0.3%) | 9 (0.7%) | |
| <0.001 | ||||
| Positive | 8,981 (68.5%) | 8,302 (69.7%) | 679 (55.9%) | |
| Negative | 4,089 (31.2%) | 3,563 (29.9%) | 526 (43.3%) | |
| Unknown | 47 (0.4%) | 38 (0.3%) | 9 (0.7%) | |
| 0.005 | ||||
| Positive | 2,729 (20.8%) | 2,434 (20.4%) | 295 (24.3%) | |
| Negative | 10,079 (76.8%) | 9,192 (77.2%) | 887 (73.1%) | |
| Unknown | 309 (2.4%) | 277 (2.3%) | 32 (2.6%) | |
| <0.001 | ||||
| ER or PR+, HER2– | 8,321 (63.4%) | 7,704 (64.7%) | 617 (50.8%) | |
| ER or PR+, HER2+ | 1,471 (11.2%) | 1,310 (11.0%) | 161 (13.3%) | |
| ER and PR–, HER2- | 1,758 (13.4%) | 1,488 (12.5%) | 270 (22.2%) | |
| ER and PR–, HER2+ | 1,258 (9.6%) | 1,124 (9.4%) | 134 (11.0%) | |
| Unknown | 309 (2.4%) | 277 (2.3%) | 32 (2.6%) | |
| <0.001 | ||||
| Positive | 1,571 (12.0%) | 1,380 (11.6%) | 191 (15.7%) | |
| Negative | 8,633 (65.8%) | 8,150 (68.5%) | 483 (39.8%) | |
| Unknown | 2,913 (22.2%) | 2,373 (19.9%) | 540 (44.5%) | |
| <0.001 | ||||
| Positive | 2,090 (15.9%) | 1,850 (15.5%) | 240 (19.8%) | |
| Negative | 8,111 (61.8%) | 7,677 (64.5%) | 434 (35.7%) | |
| Unknown | 2,916 (22.2%) | 2,376 (20.0%) | 540 (44.5%) | |
| <0.001 | ||||
| 0–9 | 2,344 (17.9%) | 2,252 (18.9%) | 92 (7.6%) | |
| 10–19 | 2,057 (15.7%) | 1,912 (16.1%) | 145 (11.9%) | |
| 20–29 | 1,263 (9.6%) | 1,146 (9.6%) | 117 (9.6%) | |
| 30–39 | 956 (7.3%) | 853 (7.2%) | 103 (8.5%) | |
| 40–49 | 446 (3.4%) | 399 (3.4%) | 47 (3.9%) | |
| 50–59 | 489 (3.7%) | 419 (3.5%) | 70 (5.8%) | |
| 60–69 | 422 (3.2%) | 370 (3.1%) | 52 (4.3%) | |
| 70–79 | 285 (2.2%) | 251 (2.1%) | 34 (2.8%) | |
| 80–89 | 321 (2.4%) | 276 (2.3%) | 45 (3.7%) | |
| 90–99 | 190 (1.4%) | 158 (1.3%) | 32 (2.6%) | |
| Unknown | 4,344 (33.1%) | 3,867 (32.5%) | 477 (39.3%) | |
| <0.001 | ||||
| Yes | 9,496 (72.4%) | 8,684 (73.0%) | 812 (66.9%) | |
| No | 3,427 (26.1%) | 3,043 (25.6%) | 384 (31.6%) | |
| Stop by patient | 2 (0.0%) | 1 (0.0%) | 1 (0.1%) | |
| Unknown | 192 (1.5%) | 175 (1.5%) | 17 (1.4%) | |
| <0.001 | ||||
| Yes | 8,572 (65.4%) | 7,596 (63.8%) | 976 (80.4%) | |
| No | 4,367 (33.3%) | 4,141 (34.8%) | 226 (18.6%) | |
| Stop by patient | 36 (0.3%) | 32 (0.3%) | 4 (0.3%) | |
| Unknown | 142 (1.1%) | 134 (1.1%) | 8 (0.7%) | |
| <0.001 | ||||
| Yes | 9,747 (74.3%) | 8,995 (75.6%) | 752 (61.9%) | |
| No | 3,162 (24.1%) | 2,717 (22.8%) | 445 (36.7%) | |
| Unknown | 208 (1.6%) | 191 (1.6%) | 17 (1.4%) | |
| <0.001 | ||||
| Yes | 1,446 (17.9%) | 1,333 (18.9%) | 113 (10.8%) | |
| No | 6,303 (77.8%) | 5,409 (76.7%) | 894 (85.1%) | |
| Unknown | 349 (4.3%) | 306 (4.3%) | 43 (4.1%) | |
*For comparison of non-recurrent vs. recurrent patients.
ER, estrogen receptor; HER2, human epidermal growth factor receptor 2; PR, progesterone receptor; EGFR, epidermal growth factor receptor; SD, standard deviation.
Figure 2Mean numbers of follow-up visits per year after surgery. Error bar indicates standard deviation.
Figure 3Performance of the final model in terms of the receiver operating characteristic (ROC) curves. (A) The ROC and area under the ROC curve (AUC) were evaluated using the separated test group at each 2-, 5-, and 7-year point. The comparison of ROC and AUC of logistic regression (LR), random forest (RF), gradient boosting (GB), and our deep-learning (DL) models at (B) 2 year, (C) 5 year, and (D) 7 year point.
Figure 4Performance of the final model in terms of model error (predicted–observed) over each of the 2/5/7 years. Patients were grouped by breast cancer features. (A) Pathologic T stage. (B) Pathologic N stage. (C) ER/PR/HER2 status. (D) EGFR status. (E) CK56 status. For model specificity and accuracy, Median Absolute Error (MAE) and weighted Mean Absolute Error (wMAE) were calculated at each group or bin. Solid lines indicate observed recurrence proportion, and dashed lines indicate predicted recurrence proportion.