| Literature DB >> 29968970 |
Mathew T Hueman1, Huan Wang2, Charles Q Yang3, Li Sheng4, Donald E Henson5, Arnold M Schwartz6,7, Dechang Chen5.
Abstract
Integrating additional prognostic factors into the tumor, lymph node, metastasis staging system improves the relative stratification of cancer patients and enhances the accuracy in planning their treatment options and predicting clinical outcomes. We describe a novel approach to build prognostic systems for cancer patients that can admit any number of prognostic factors. In the approach, an unsupervised learning algorithm was used to create dendrograms and the C-index was used to cut dendrograms to generate prognostic groups. Breast cancer data from the Surveillance, Epidemiology, and End Results Program of the National Cancer Institute were used for demonstration. Two relative prognostic systems were created for breast cancer. One system (7 prognostic groups with C-index = 0.7295) was based on tumor size, regional lymph nodes, and no distant metastasis. The other system (7 prognostic groups with C-index = 0.7458) was based on tumor size, regional lymph nodes, no distant metastasis, grade, estrogen receptor, progesterone receptor, and age. The dendrograms showed a relationship between survival and prognostic factors. The proposed approach is able to create prognostic systems that have a good accuracy in survival prediction and provide a manageable number of prognostic groups. The prognostic systems have the potential to permit a thorough database analysis of all information relevant to decision-making in patient management and prognosis.Entities:
Keywords: C-index; breast cancer; cancer staging; dendrogram; machine learning; survival
Mesh:
Year: 2018 PMID: 29968970 PMCID: PMC6089151 DOI: 10.1002/cam4.1629
Source DB: PubMed Journal: Cancer Med ISSN: 2045-7634 Impact factor: 4.452
Definitions of T, N, M, G, ER, PR, and A for SEER breast cancer data diagnosed 1990‐2003
| Categories | Criteria | |
|---|---|---|
| Tumor size (T) | Tis | Carcinoma in situ |
| T1 | Tumor 2 cm or less in greatest dimension | |
| T2 | Tumor more than 2 cm but not more than 5 cm in greatest dimension | |
| T3 | Tumor more than 5 cm in greatest dimension | |
| T4 |
Tumor of any size with direct extension to (a) chest wall or (b) skin, only as described below | |
| Regional positive lymph nodes (N) | N0 | No regional lymph node metastasis histologically |
| N1 | Metastasis in 1 to 3 axillary lymph nodes, and/or in internal mammary nodes with microscopic disease detected by sentinel lymph node dissection but not clinically apparent | |
| N2 | Metastasis in 4 to 9 axillary lymph nodes, or in clinically apparent internal mammary lymph nodes in the absence of axillary lymph node metastasis | |
| N3 | Metastasis in 10 or more axillary lymph nodes, or in infraclavicular lymph nodes, or in clinically apparent ipsilateral internal mammary lymph nodes in the presence of 1 or more positive axillary lymph nodes; or in more than 3 axillary lymph nodes with clinically negative microscopic metastasis in internal mammary lymph nodes; or in ipsilateral supraclavicular lymph nodes | |
| Distant metastasis (M) | M0 | No distant metastasis |
| M1 | Distant metastasis | |
| Histological grade (G) | G1 | Well differentiated |
| G2 | Moderately differentiated | |
| G3 | Poorly differentiated or undifferentiated | |
| Estrogen receptor expression (ER) | ER+ | Cancer cells can receive signals from estrogen |
| ER− | Cancer cells cannot receive signals from estrogen | |
| Progesterone receptor expression (PR) | PR+ | Cancer cells can receive signals from progesterone |
| PR− | Cancer cells cannot receive signals from progesterone | |
| Age at diagnosis (A) | A1 | Age at diagnosis ranging from 20 to 50 in years |
| A2 | Age at diagnosis equal to or larger than 51 in years |
Figure 1Breast cancer‐specific survival of 17 combinations of SEER breast cancer patients diagnosed 1990‐2003
Figure 2Generation of the prognostic system on the basis of T, N, and M using the SEER breast cancer patients diagnosed 1990‐2003. (A) Dendrogram for 12 combinations according to T, N, and M. A 10‐year survival rate in percentage is given beneath each combination. (B) C‐index curve and n*(=7). (C) Cutting the dendrogram according to n*. (D) Breast cancer‐specific survival of 7 prognostic groups
EACCD and AJCC grouping of the SEER breast cancer patients diagnosed 1990‐2003
| EACCD | AJCC | ||
|---|---|---|---|
| Group 1 | TisN0M0 T1N0M0 | Stage 0 | TisN0M0 |
| Group 2 | T1N1M0 | Stage I | T1N0M0 |
| T2N0M0 | |||
| T3N0M0 | |||
| Group 3 | T1N2M0 | Stage IIA | T1N1M0 |
| T2N1M0 | T2N0M0 | ||
| Group 4 | T2N2M0 | Stage IIB | T2N1M0 |
| T3N1M0 | T3N0M0 | ||
| Group 5 | T1N3M0 | Stage IIIA | T1N2M0 |
| T3N2M0 | T2N2M0 | ||
| T4N0M0 | T3N1M0 | ||
| T3N2M0 | |||
| Group 6 | T2N3M0 | Stage IIIB | T4N0M0 |
| T3N3M0 | T4N1M0 | ||
| T4N1M0 | T4N2M0 | ||
| T4N2M0 | |||
| Group 7 | T4N3M0 | Stage IIIC | T1N3M0 |
| T2N3M0 | |||
| T3N3M0 | |||
| T4N3M0 | |||
Figure 3Breast cancer‐specific survival of 7 AJCC stage groups defined in Table 2
Figure 4Dendrogram (in black color) for 165 combinations of the SEER breast cancer patients diagnosed 1990‐2003. A 10‐year survival rate in percentage is given beneath each combination. Lines in red color show 7 prognostic groups from cutting the dendrogram
Figure 5(A) C‐index curve and n*(=7) using the dendrogram in Figure 4. (B) Breast cancer‐specific survival of 7 prognostic groups from cutting the dendrogram in Figure 4 according to n*
EACCD grouping of the SEER breast cancer patients diagnosed 1990‐2003
| T | N | M | G | ER | PR | A |
|---|---|---|---|---|---|---|
|
| ||||||
| T1 | N0 | M0 | G1 | Any ER | Any PR | 20+ |
| T1 | N0 | M0 | G2 | ER− | PR+ | 20‐50 |
| T1 | N0 | M0 | G2 | ER+ | PR− | 51+ |
| T1 | N0 | M0 | G2 | ER+ | PR+ | 20+ |
| T1 | N1 | M0 | G1 | ER+ | PR− | 51+ |
| T1 | N1 | M0 | G1 | ER+ | PR+ | 20+ |
| T2 | N0 | M0 | G1 | ER+ | PR+ | 20‐50 |
|
| ||||||
| T1 | N0 | M0 | G2 | ER− | PR− | 20+ |
| T1 | N0 | M0 | G2 | ER− | PR+ | 51+ |
| T1 | N0 | M0 | G2 | ER+ | PR− | 20‐50 |
| T1 | N0 | M0 | G3 | Any ER | Any PR | 20+ |
| T1 | N1 | M0 | G2 | ER+ | PR+ | 20+ |
| T1 | N2 | M0 | G1 | ER+ | PR+ | 20‐50 |
| T2 | N0 | M0 | G1 | ER− | PR− | 51+ |
| T2 | N0 | M0 | G1 | ER+ | PR+ | 51+ |
| T2 | N0 | M0 | G2 | ER+ | PR+ | 20‐50 |
| T2 | N1 | M0 | G1 | ER+ | PR+ | 20‐50 |
| T3 | N0 | M0 | G2 | ER+ | PR+ | 20‐50 |
|
| ||||||
| T1 | N1 | M0 | G2 | ER− | PR+ | 20+ |
| T1 | N1 | M0 | G2 | ER+ | PR− | 20+ |
| T1 | N1 | M0 | G3 | ER− | PR− | 20‐50 |
| T1 | N1 | M0 | G3 | ER− | PR+ | 20+ |
| T1 | N1 | M0 | G3 | ER+ | PR− | 20‐50 |
| T1 | N1 | M0 | G3 | ER+ | PR+ | 20+ |
| T1 | N2 | M0 | G1 | ER+ | PR+ | 51+ |
| T2 | N0 | M0 | G1 | ER+ | PR− | 51+ |
| T2 | N0 | M0 | G2 | ER− | PR− | 20‐50 |
| T2 | N0 | M0 | G2 | ER− | PR+ | 51+ |
| T2 | N0 | M0 | G2 | ER+ | PR− | 20+ |
| T2 | N0 | M0 | G2 | ER+ | PR+ | 51+ |
| T2 | N0 | M0 | G3 | ER− | Any PR | 20‐50 |
| T2 | N0 | M0 | G3 | ER+ | PR− | 20‐50 |
| T2 | N0 | M0 | G3 | ER+ | PR+ | 20+ |
| T2 | N1 | M0 | G1 | ER+ | PR+ | 51+ |
| T2 | N1 | M0 | G2 | ER+ | PR+ | 20+ |
| T3 | N0 | M0 | G1 | ER+ | PR+ | 51+ |
| T3 | N0 | M0 | G2 | ER+ | PR+ | 51+ |
| T3 | N0 | M0 | G3 | ER− | PR− | 20‐50 |
|
| ||||||
| T1 | N1 | M0 | G2 | ER− | PR− | 20+ |
| T1 | N1 | M0 | G3 | Any ER | PR− | 51+ |
| T1 | N2 | M0 | G2 | ER+ | PR+ | 20+ |
| T2 | N0 | M0 | G2 | ER− | PR− | 51+ |
| T2 | N0 | M0 | G3 | ER− | Any PR | 51+ |
| T2 | N0 | M0 | G3 | ER+ | PR− | 51+ |
| T2 | N1 | M0 | G1 | ER+ | PR− | 51+ |
| T2 | N1 | M0 | G2 | ER+ | PR− | 20‐50 |
| T2 | N1 | M0 | G3 | ER+ | PR+ | 20‐50 |
| T2 | N2 | M0 | G1 | ER+ | PR+ | 51+ |
| T3 | N0 | M0 | G3 | ER+ | PR+ | 20‐50 |
| T3 | N1 | M0 | G1 | ER+ | PR+ | 51+ |
| T3 | N1 | M0 | G2 | ER+ | PR+ | 20+ |
|
| ||||||
| T1 | N2 | M0 | G2 | ER+ | PR− | 51+ |
| T1 | N2 | M0 | G3 | ER+ | PR+ | 20+ |
| T2 | N1 | M0 | G2 | ER− | PR− | 20‐50 |
| T2 | N1 | M0 | G2 | ER+ | PR− | 51+ |
| T2 | N1 | M0 | G3 | ER− | Any PR | 20‐50 |
| T2 | N1 | M0 | G3 | ER+ | PR− | 20+ |
| T2 | N1 | M0 | G3 | ER+ | PR+ | 51+ |
| T2 | N2 | M0 | G2 | ER+ | PR+ | 20+ |
| T2 | N2 | M0 | G3 | ER+ | PR− | 20‐50 |
| T3 | N0 | M0 | G3 | ER− | PR− | 51+ |
| T3 | N0 | M0 | G3 | ER+ | PR+ | 51+ |
| T3 | N1 | M0 | G3 | ER+ | PR+ | 20‐50 |
| T3 | N2 | M0 | G2 | ER+ | PR+ | 20‐50 |
|
| ||||||
| T1 | N2 | M0 | G2 | ER− | PR− | 51+ |
| T1 | N2 | M0 | G3 | ER− | PR− | 20+ |
| T1 | N2 | M0 | G3 | ER+ | PR− | 51+ |
| T1 | N3 | M0 | G2 | ER+ | PR+ | 20+ |
| T1 | N3 | M0 | G3 | ER− | PR− | 20‐50 |
| T1 | N3 | M0 | G3 | ER+ | PR+ | 20+ |
| T2 | N1 | M0 | G2 | ER− | PR− | 51+ |
| T2 | N1 | M0 | G3 | ER− | Any PR | 51+ |
| T2 | N2 | M0 | G2 | ER− | PR− | 20+ |
| T2 | N2 | M0 | G2 | ER+ | PR− | 51+ |
| T2 | N2 | M0 | G3 | ER− | PR− | 20+ |
| T2 | N2 | M0 | G3 | ER+ | PR− | 51+ |
| T2 | N2 | M0 | G3 | ER+ | PR+ | 20+ |
| T2 | N3 | M0 | G2 | ER+ | PR+ | 20+ |
| T2 | N3 | M0 | G3 | ER+ | PR− | 20‐50 |
| T2 | N3 | M0 | G3 | ER+ | PR+ | 20+ |
| T3 | N1 | M0 | G3 | ER− | PR− | 20+ |
| T3 | N1 | M0 | G3 | ER+ | Any PR | 51+ |
| T3 | N2 | M0 | G2 | ER+ | PR+ | 51+ |
| T3 | N2 | M0 | G3 | ER+ | PR+ | 20+ |
| T3 | N3 | M0 | G2 | ER+ | PR+ | 20+ |
| T3 | N3 | M0 | G3 | ER+ | PR+ | 20‐50 |
| T4 | N0 | M0 | G2 | ER+ | PR+ | 51+ |
| T4 | N0 | M0 | G3 | ER− | PR− | 20+ |
| T4 | N0 | M0 | G3 | ER+ | PR+ | 51+ |
| T4 | N1 | M0 | G2 | ER+ | PR+ | 51+ |
| T4 | N1 | M0 | G3 | ER+ | PR+ | 20‐50 |
| T4 | N2 | M0 | G2 | ER+ | PR+ | 51+ |
|
| ||||||
| T1 | N3 | M0 | G3 | ER− | PR− | 51+ |
| T2 | N3 | M0 | G2 | ER+ | PR− | 51+ |
| T2 | N3 | M0 | G3 | ER− | PR− | 20+ |
| T2 | N3 | M0 | G3 | ER+ | PR− | 51+ |
| T3 | N2 | M0 | G3 | ER− | PR− | 20+ |
| T3 | N3 | M0 | G3 | ER− | PR− | 20+ |
| T3 | N3 | M0 | G3 | ER+ | Any PR | 51+ |
| T4 | N1 | M0 | G3 | ER− | PR− | 20+ |
| T4 | N1 | M0 | G3 | ER+ | Any PR | 51+ |
| T4 | N2 | M0 | G3 | ER− | PR− | 20+ |
| T4 | N2 | M0 | G3 | ER+ | PR+ | 20+ |
| T4 | N3 | M0 | G2 | ER+ | PR+ | 51+ |
| T4 | N3 | M0 | G3 | ER− | PR− | 20+ |
| T4 | N3 | M0 | G3 | ER+ | PR+ | 20+ |