| Literature DB >> 35008417 |
Juan Luis Gomez Marti1,2, Adam Brufsky3,4, Alan Wells1,2,4, Xia Jiang5.
Abstract
BACKGROUND: Risk of metastatic recurrence of breast cancer after initial diagnosis and treatment depends on the presence of a number of risk factors. Although most univariate risk factors have been identified using classical methods, machine-learning methods are also being used to tease out non-obvious contributors to a patient's individual risk of developing late distant metastasis. Bayesian-network algorithms can identify not only risk factors but also interactions among these risks, which consequently may increase the risk of developing metastatic breast cancer. We proposed to apply a previously developed machine-learning method to discern risk factors of 5-, 10- and 15-year metastases.Entities:
Keywords: Markov Blanket and Interactive Risk Factor Learner (MBIL); causal learning; machine learning; metastasis; metastatic breast cancer; risk factors
Year: 2022 PMID: 35008417 PMCID: PMC8750735 DOI: 10.3390/cancers14010253
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.575
Definitions of variables in LSM datasets.
| Variables Included | Description | Values |
|---|---|---|
| Race | Race of patient | White, Black, Asian, American Indian or Alaskan native, native Hawaiian or other Pacific islander |
| Ethnicity | Ethnicity of patient | Not Hispanic, Hispanic |
| Smoking | Smoking history of patient | Ex-smoker, non-smoker, cigarettes, chewing tobacco, cigar |
| Alcohol usage | Alcohol usage of patient | Moderate, no use, use but not otherwise specified former user, heavy user |
| Family history | Family history of cancer | Cancer, no cancer, breast cancer, other cancer, cancer but not otherwise specified |
| Age_at_diagnosis | Age at diagnosis of the disease | 0–49, 50–69, >69 |
| Menopausal_status | Inferred menopausal status | Pre-, post- |
| Side | Side of tumor | Left, right |
| TNEG | Triple negative status in terms of patient being ER-, PR- and HER2-negative | Yes, no |
| ER | Estrogen receptor expression | Neg, pos, low pos |
| ER_percent | Percent of cell stain pos for ER receptors | 0–20, 20–90, 90–100 |
| PR | Progesterone receptor expression | Neg, pos, low pos |
| PR_percent | Percent of cell stain pos for PR receptors | 0–20, 20–90, 90–100 |
| P53 | Whether P53 is mutated | Neg, pos, low pos |
| HER2 | HER2 expression | Neg, pos |
| t_tnm_stage | Prime tumor stage in TNM system | 0, 1, 2, 3, 4, IS, 1 mic, X |
| n_tnm_stage | Number of nearby cancerous lymph nodes | 0, 1, 2, 3, 4, X |
| Stage | Composite of size and number of positive nodes | 0, 1, 2, 3 |
| Lymph_nodes_removed | Number of lymph nodes removed | 0–11, 12–22, >22 |
| Lymph_nodes_positive | Number of positive lymph nodes | 0, 1–8, >8 |
| Lymph_node_status | Whether patient has any positive lymph nodes | Neg, pos |
| Histology | Tumor histology | Lobular, ductal |
| Size | Size of tumor in mm | 0–32, 32–70, >70 |
| Grade | Grade of disease | 1, 2, 3 |
| Invasive | Whether tumor is invasive | Yes, no |
| Histology2 | Tumor histology subtypes | IDC, DCIS, ILC, NC |
| Invasive_tumor_location | Where invasive tumor is located | Mixed duct and lobular, duct, lobular, none |
| DCIS_level | Type of ductal carcinoma in situ | Solid, apocrine, cribriform, dcis, comedo, papillary, micropapillary |
| Re_excision | Removal of an additional margin of tissue | Yes, no |
| Surgical_margins | Whether there are any residual tumors | Residual tumor, no residual tumor, |
| MRIs_60_surgery | MRIs within 60 days of surgery | Yes, no |
Figure 1A BN DAG model illustrating the Markov blanket. The Markov blanket of T consists of nodes X11, X12, X13, X14 and X15. These nodes are the direct risk factors of T and separate T from the influence of the noisy variables X1–X10, X16 and X17 (adapted from [8]).
Figure 2MBIL-generated causal sets of 5-, 10- and 15-year breast cancer metastases.
Figure 3The MBIL generated an output of clinical interactions related to 5-, 10- and 15-year breast cancer metastases. HER2, ER, grade, race/ethnicity, TNEG, smoking/alcohol and surgical margins are represented. Each bar plot indicates the number of counts in which each of these variables was identified as a risk factor of metastasis at different values of alpha.
Variables interacting with an alpha of 1. ER, TNEG, HER2, race/ethnicity and alcohol/smoking are represented with their interacting variables, the number of times they interacted, the years after diagnosis when these interactions were risk factors for metastases, the total number of times the variables interacted and frequency of interaction.
| Alpha 1 | |||||
|---|---|---|---|---|---|
| Variable | Interacts with | n Times | Years after DG | Total | % |
| ER | n-TNM | 2 | 5, 10 | 6 | 33.33% |
| ER | HER2 | 2 | 5, 15 | 6 | 33.33% |
| ER | LN positive | 2 | 15, 15 | 6 | 33.33% |
| TNEG | HER2 | 1 | 5 | 7 | 14.29% |
| TNEG | Age at DG | 1 | 15 | 7 | 14.29% |
| TNEG | LN positive/status | 2 | 15 | 7 | 28.57% |
| TNEG | Ethnicity | 1 | 15 | 7 | 14.29% |
| TNEG | Stage | 1 | 15 | 7 | 14.29% |
| TNEG | Re_excision | 1 | 15 | 7 | 14.29% |
| HER2 | Stage | 6 | 5, 5, 10, 10, 10, 15 | 23 | 26.09% |
| HER2 | MRIs_60_surgery | 1 | 5 | 23 | 4.35% |
| HER2 | ER_percent | 1 | 5 | 23 | 4.35% |
| HER2 | TNEG | 1 | 5 | 23 | 4.35% |
| HER2 | Histology | 3 | 5, 10, 10 | 23 | 13.04% |
| HER2 | Grade | 2 | 5, 5 | 23 | 8.70% |
| HER2 | Invasive tumor location | 2 | 5, 10 | 23 | 8.70% |
| HER2 | ER | 1 | 5, 15 | 23 | 4.35% |
| HER2 | PR | 2 | 5, 10 | 23 | 8.70% |
| HER2 | LN positive/status | 3 | 10, 10, 15 | 23 | 13.04% |
| HER2 | Surgical margins | 1 | 10 | 23 | 4.35% |
| Race/ethnicity | Histology | 1 | 5 | 8 | 12.50% |
| Race/ethnicity | Grade | 1 | 5 | 8 | 12.50% |
| Race/ethnicity | ER_percent | 1 | 10 | 8 | 12.50% |
| Race/ethnicity | n-TNM | 1 | 10 | 8 | 12.50% |
| Race/ethnicity | Side | 1 | 10 | 8 | 12.50% |
| Race/ethnicity | LN positive/status | 1 | 10 | 8 | 12.50% |
| Race/ethnicity | TNEG | 1 | 15 | 8 | 12.50% |
| Race/ethnicity | Stage | 1 | 15 | 8 | 12.50% |
| Alcohol/smoking | LN positive/status | 1 | 15 | 1 | 100.00% |
Variables interacting with an alpha of 120. ER, TNEG, HER2, race/ethnicity and alcohol/smoking are represented with their interacting variables, the number of times they interacted, the years after diagnosis when these interactions were found to be risk factors for metastases, the total number of times the variables interacted and frequency of interaction.
| Alpha 120 | |||||
|---|---|---|---|---|---|
| Variable | Interacts with | n Times | Years after DG | Total | % |
| ER | n-TNM | 9 | 5, 5, 5, 5, 5, 5, 5, 10, 15 | 30 | 30.00% |
| ER | Surgical margins | 4 | 5, 5, 10, 15 | 30 | 13.33% |
| ER | Family history | 2 | 5, 10 | 30 | 6.67% |
| ER | LN positive/status | 4 | 5, 10, 15, 15 | 30 | 13.33% |
| ER | HER2 | 1 | 5 | 30 | 3.33% |
| ER | MRIs_60_surgery | 1 | 5 | 30 | 3.33% |
| ER | Race/ethnicity | 3 | 5, 15, 15 | 30 | 10.00% |
| ER | Histology | 1 | 5 | 30 | 3.33% |
| ER | Invasive tumor location | 1 | 5 | 30 | 3.33% |
| ER | Size | 1 | 5 | 30 | 3.33% |
| ER | Side | 1 | 5 | 30 | 3.33% |
| ER | DCIS_level | 2 | 5, 10 | 30 | 6.67% |
| TNEG | n-TNM | 4 | 5, 5, 10, 15 | 8 | 50.00% |
| TNEG | Surgical margins | 2 | 10, 15 | 8 | 25.00% |
| TNEG | Smoking | 1 | 5 | 8 | 12.50% |
| TNEG | Invasive tumor location | 1 | 5 | 8 | 12.50% |
| HER2 | ER | 1 | 5 | 12 | 8.33% |
| HER2 | n-TNM | 1 | 5 | 12 | 8.33% |
| HER2 | Stage | 5 | 5, 10, 10, 10, 15 | 12 | 41.67% |
| HER2 | Surgical margins | 2 | 5, 10 | 12 | 16.67% |
| HER2 | Histology | 1 | 10 | 12 | 8.33% |
| HER2 | Grade | 1 | 15 | 12 | 8.33% |
| HER2 | PR | 1 | 10 | 12 | 8.33% |
| Race/ethnicity | ER | 3 | 5, 15, 15 | 17 | 17.65% |
| Race/ethnicity | n-TNM | 3 | 5, 10, 15 | 17 | 17.65% |
| Race/ethnicity | Stage | 3 | 5, 15, 15 | 17 | 17.65% |
| Race/ethnicity | Surgical margins | 1 | 5 | 17 | 5.88% |
| Race/ethnicity | ER_percent | 1 | 10 | 17 | 5.88% |
| Race/ethnicity | Grade | 1 | 15 | 17 | 5.88% |
| Race/ethnicity | LN positive/status | 3 | 15 | 17 | 17.65% |
| Race/ethnicity | Re_excision | 1 | 15 | 17 | 5.88% |
| Race/ethnicity | PR_percent | 1 | 15 | 17 | 5.88% |
| Smoking/alcohol | TNEG | 1 | 5 | 4 | 25.00% |
| Smoking/alcohol | n-TNM | 1 | 5 | 4 | 25.00% |
| Smoking/alcohol | Stage | 1 | 10 | 4 | 25.00% |
| Smoking/alcohol | Histology | 1 | 10 | 4 | 25.00% |
Variables interacting with an alpha of 480. ER, TNEG, HER2, race/ethnicity and alcohol/smoking are represented with their interacting variables, the number of times they interacted, the years after diagnosis when these interactions were found to be risk factors for metastases, the total number of times the variables interacted and frequency of interaction.
| Alpha 480 | |||||
|---|---|---|---|---|---|
| Variable | Interacts with | n Times | Years after DG | Total | % |
| ER | n-TNM | 6 | 5, 5, 5, 10, 10, 10 | 19 | 31.58% |
| ER | Surgical margins | 3 | 5, 5, 10 | 19 | 15.79% |
| ER | Race | 3 | 5, 10, 15 | 19 | 15.79% |
| ER | Size | 1 | 5 | 19 | 5.26% |
| ER | Smoking | 1 | 5 | 19 | 5.26% |
| ER | Family history | 1 | 10 | 19 | 5.26% |
| ER | LN positive/status | 1 | 10 | 19 | 5.26% |
| ER | Stage | 1 | 10 | 19 | 5.26% |
| ER | DCIS_level | 1 | 10 | 19 | 5.26% |
| ER | Age at DG | 1 | 10 | 19 | 5.26% |
| TNEG | n-TNM | 1 | 10 | 2 | 50.00% |
| TNEG | Surgical margins | 1 | 10 | 2 | 50.00% |
| HER2 | Stage | 2 | 5, 10 | 7 | 28.57% |
| HER2 | Surgical margins | 4 | 5, 5, 10 | 7 | 57.14% |
| HER2 | t-TNM | 1 | 5 | 7 | 14.29% |
| Race/ethnicity | Stage | 6 | 5, 15, 15, 15, 15, 15 | 22 | 27.27% |
| Race/ethnicity | Surgical margins | 2 | 5, 5 | 22 | 9.09% |
| Race/ethnicity | ER | 3 | 5, 10, 15 | 22 | 13.64% |
| Race/ethnicity | n-TNM | 3 | 5, 10, 10 | 22 | 13.64% |
| Race/ethnicity | Family history | 1 | 10 | 22 | 4.55% |
| Race/ethnicity | LN positive/status | 1 | 10 | 22 | 4.55% |
| Race/ethnicity | ER_percent | 1 | 10 | 22 | 4.55% |
| Race/ethnicity | Grade | 1 | 15 | 22 | 4.55% |
| Race/ethnicity | Invasive tumor location | 1 | 15 | 22 | 4.55% |
| Race/ethnicity | Re-excision | 1 | 15 | 22 | 4.55% |
| Race/ethnicity | Alcohol | 1 | 15 | 22 | 4.55% |
| Race/ethnicity | Histology2 | 1 | 15 | 22 | 4.55% |
| Smoking/alcohol | t-TNM | 2 | 5, 5 | 12 | 16.67% |
| Smoking/alcohol | n-TNM | 3 | 5, 5, 5 | 12 | 25.00% |
| Smoking/alcohol | Stage | 3 | 5, 10, 15 | 12 | 25.00% |
| Smoking/alcohol | Surgical margins | 2 | 5, 10 | 12 | 16.67% |
| Smoking/alcohol | ER | 1 | 5 | 12 | 8.33% |
| Smoking/alcohol | race | 1 | 15 | 12 | 8.33% |