| Literature DB >> 31475114 |
Ming Li1, Cheng Zhan1, Xizhao Sui2, Wei Jiang1, Yu Shi1, Xiaodong Yang1, Mingxiang Feng1, Jun Wang2, Qun Wang1.
Abstract
Objective: To propose modifications to refine prognostication over anatomic extent of the current tumor, node, and metastasis (TNM) staging system of non-small cell lung cancer (NSCLC) for a better distinction, and reflect survival differences of lung adenocarcinoma and squamous cell carcinoma. Study Design: Three large cohorts were included in this study. The training cohort consisted of 124,788 patients in the Surveillance, Epidemiology, and End Results (SEER) database (2006-2015). The validation cohort consisted of 4,247 patients from the Zhongshan Hospital, Fudan University (FDZSH; 2005-2014), and People's Hospital, Peking University (PKUPH; 2000-2017). The algorithm generated a hierarchical clustering model based on the unsupervised learning for survival data using Kaplan-Meier curves and log-rank test statistics for recursive partitioning and selection of the principal groupings.Entities:
Keywords: TNM staging system; machine learning; modification; non-small cell lung cancer; survival difference
Year: 2019 PMID: 31475114 PMCID: PMC6702456 DOI: 10.3389/fonc.2019.00771
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Clinical and histopathologic characteristics of patients.
| Gender | |||
| Male | 64,448 (51.6) | 918 (54.8) | 1,391 (54.1) |
| Female | 60,340 (48.4) | 757 (45.2) | 1,181 (45.9) |
| Age, years | |||
| <65 | 46,348 (37.1) | 1,149 (68.6) | 1,471 (57.2) |
| ≥65 | 78,440 (62.9) | 526 (31.4) | 1,101 (42.8) |
| Location | |||
| Upper lobe | 75,130 (60.2) | 882 (52.7) | 1,313 (51.0) |
| Middle lobe | 5,888 (4.7) | 114 (6.8) | 174 (6.8) |
| Lower lobe | 37,072 (29.7) | 518 (30.9) | 763 (29.7) |
| Overlapping lesion of lung | 1,396 (1.1) | 155 (9.2) | 133 (5.2) |
| Unknown | 5,302 (4.3) | 6 (0.4) | 189 (7.3) |
| Histology | |||
| Squamous cell carcinoma | 42,380 (34.0) | 434 (25.9) | 497 (19.3) |
| Adenocarcinoma | 82,408 (66.0) | 1,241 (74.1) | 2,075 (80.7) |
| Grade | |||
| Well-differentiated; Grade I | 9,730 (7.8) | 34 (2.0) | 105 (4.1) |
| Moderately differentiated; Grade II | 32,599 (26.1) | 751 (44.8) | 1,222 (47.5) |
| Poorly differentiated; Grade III | 37,182 (29.8) | 626 (37.4) | 859 (33.4) |
| Undifferentiated; anaplastic; Grade IV | 1,023 (0.8) | 264 (15.8) | 386 (15.0) |
| Unknown | 44,254 (35.5) | – | |
| 8th AJCC/UICC stage | |||
| I | 34,259 (27.5) | 985 (58.8) | 1,644 (63.9) |
| II | 14,720 (11.8) | 304 (18.1) | 360 (14.0) |
| III | 27,190 (21.8) | 372 (22.2) | 484 (18.8) |
| IV | 48,619 (38.9) | 14 (0.9) | 84 (3.3) |
| Modified 8th stage | |||
| I | 33,803 (27.1) | 994 (59.4) | 1,614 (62.8) |
| II | 28,048 (22.5) | 535 (31.9) | 704 (27.4) |
| III | 25,647 (20.6) | 139 (8.3) | 188 (7.3) |
| IV | 37,290 (29.8) | 7 (0.4) | 66 (2.5) |
| Treatment | |||
| Surgery | 44,194 (35.4) | 1,675 (100) | 2,572 (100) |
| No surgery | 80,312 (64.4) | – | – |
| Unknown | 282 (0.2) | – | – |
Figure 1Kaplan-Meier survival curves for the patients and patients with adenocarcinoma or squamous cell carcinoma separately from the SEER cohort (A) using the 8th edition of the TNM staging system (I; II; III; IV), (B) the 8th edition of the TNM staging system (IA; IB; IIA; IIB; IIIA; IIIB; IIIC; IV), (C) the modified edition of the TNM staging system (I; II; III; IV), (D) the modified edition of the TNM staging system (IA; IB; IIA; IIB; IIIA; IIIB; IIIC; IV).
Cox proportional hazards regression model output for the 8th edition of the TNM staging system and modified staging system using the SEER cohort, FDZSH cohort, and PKUPH cohort.
| I to II | 0.4670 | 0.4003 | 0.2685 | 0.2390 | 0.3429 | 0.2929 | ||||||
| II to III | 0.6048 | 0.4920 | 0.5791 | 0.5071 | 0.5424 | 0.4897 | ||||||
| III to IV | 0.4973 | 0.6286 | 1.0300 | 0.5202 | 0.6109 | 0.7504 | ||||||
| IA to IB | 0.5793 | 0.5739 | 0.3458 | 0.2295 | 1.4100 | 0.3941 | ||||||
| IB to IIA | 0.7243 | 0.6434 | 0.8165 | 0.6004 | 1.3060 | 0.5633 | ||||||
| IIA to IIB | 0.9687 | 0.7054 | 0.7300 | 0.8219 | 0.8350 | 0.6196 | ||||||
| IIB to IIIA | 0.7033 | 0.6796 | 0.6928 | 0.5494 | 1.0260 | 0.6808 | ||||||
| IIIA to IIIB | 0.6899 | 0.8246 | 0.5363 | 0.9640 | 1.1120 | 1.0150 | ||||||
| IIIB to IIIC | 0.8931 | 0.8530 | 0.5778 | 1.9250 | 2.9420 | 0.7121 | ||||||
| IIIC to IV | 0.6734 | 0.7461 | 2.4030 | 0.2802 | 0.3422 | 1.0500 | ||||||
TNM, tumor, node, metastasis.
Figure 2Hierarchical clustering model of the modified staging system using Kaplan-Meier curves and log-rank test statistics for recursive partitioning and selection of the principal groupings.
Figure 3Kaplan-Meier survival curves for the patients and patients with adenocarcinoma or squamous cell carcinoma separately from the FDZSH cohort (A) using the 8th edition of the TNM staging system (I; II; III; IV), (B) the 8th edition of the TNM staging system (IA; IB; IIA; IIB; IIIA; IIIB; IIIC; IV), (C) the modified edition of the TNM staging system (I; II; III; IV), (D) the modified edition of the TNM staging system (IA; IB; IIA; IIB; IIIA; IIIB; IIIC; IV).
Figure 4Kaplan-Meier survival curves for the patients and patients with adenocarcinoma or squamous cell carcinoma separately from the PKUPH cohort (A) using the 8th edition of the TNM staging system (I; II; III; IV), (B) the 8th edition of the TNM staging system (IA; IB; IIA; IIB; IIIA; IIIB; IIIC; IV), (C) the modified edition of the TNM staging system (I; II; III; IV), (D) the modified edition of the TNM staging system (IA; IB; IIA; IIB; IIIA; IIIB; IIIC; IV).