| Literature DB >> 33490863 |
Sangkyu Lee1,2, Joseph O Deasy1, Jung Hun Oh1, Antonio Di Meglio2, Agnes Dumas3, Gwenn Menvielle4, Cecile Charles2, Sandrine Boyault5, Marina Rousseau5, Celine Besse6,7, Emilie Thomas5, Anne Boland6,7, Paul Cottu8, Olivier Tredan5, Christelle Levy9, Anne-Laure Martin10, Sibille Everhard10, Patricia A Ganz11, Ann H Partridge12, Stefan Michiels3, Jean-François Deleuze6,7,13, Fabrice Andre2, Ines Vaz-Luis2.
Abstract
BACKGROUND: We aimed at predicting fatigue after breast cancer treatment using machine learning on clinical covariates and germline genome-wide data.Entities:
Year: 2020 PMID: 33490863 PMCID: PMC7583150 DOI: 10.1093/jncics/pkaa039
Source DB: PubMed Journal: JNCI Cancer Spectr ISSN: 2515-5091
Figure 1.Consolidated Standards of Reporting Trial (CONSORT) diagram of study population. Patients with no fatigue scores available had Overall more missing information in most baseline characteristics and other patient-reported outcomes. In selected characteristics, we recorded statistically significant differences between the 2 groups of patients. Missing fatigue score correlated with education, income, and TNM stage (Supplementary Table 4, available online). CANTO = Cancer Toxicity study; EORTC-QLQ = European Organization for Research and Treatment quality of life; GWAS = genome-wide association studies.
Predictive performance of PRFR in the validation dataset with respect to the Overall, Physical, Emotional, and Cognitive fatigue
| Fatigue category or endpoint | No. of samples (train/test) | Event rate, % | No. of SNPs with | PRFR performance | |||
|---|---|---|---|---|---|---|---|
| SNP only | SNP + clinical | ||||||
| AUC |
| AUC |
| ||||
| Overall (EORTC-QLQ-C30) | 377/161 | 12.5 | 309 | 0.42 | .89 | NA | — |
| Fatigue domains (EORTC-QLQ-12) | |||||||
| Physical | 283/121 | 19.1 | 277 | 0.44 | .78 | NA | — |
| Emotional | 515/220 | 20.8 | 257 | 0.42 | .96 | 0.42 | .96 |
| Cognitive | 820/351 | 17.0 | 299 | 0.59 | .01 | .60 | .005 |
EORTC QLQ = European Organization for Research and Treatment of Cancer Quality of Life; NA = not applicable; PRFR = preconditioned random forest regression; SNP = single-nucleotide polymorphism.
P value was estimated using Mason and Graham’s test and was 2-sided.
Statistical significance of association between clinical covariates and 4 fatigue endpoints
| Variable | Fatigue categories/endpoints | |||||||
|---|---|---|---|---|---|---|---|---|
| Overall (N = 989) | Physical (N = 763) | Emotional (N = 1274) | Cognitive (N = 2128) | |||||
| Odds ratio (95% CI) |
| Odds ratio (95% CI) |
| Odds ratio (95% CI) |
| Odds ratio (95% CI) |
| |
| Sociodemographic | ||||||||
| Age, continuous | 0.99 (0.97 to 1.01) | .16 | 0.99 (0.97 to 1) | .14 | 0.99 (0.98 to 1) | .06 | 0.98 (0.97 to 0.99) | .003 |
| Education | ||||||||
| College or higher (referent) | 1.00 (Ref) | .79 | 1.00 (Ref) | .70 | 1.00 (Ref) | .38 | 1.00 (Ref) | .02 |
| High school | 0.86 (0.55 to 1.35) | 1.15 (0.74 to 1.78) | 1.24 (0.9 to 1.71) | 1.36 (1.04 to 1.77) | ||||
| Primary school | 0.93 (0.52 to 1.63) | 1.24 (0.69 to 2.19) | 1.21 (0.78 to 1.85) | 1.55 (1.07 to 2.23) | ||||
| Monthly household income | ||||||||
| 1500 (Referent) | 1.00 (Ref) | .50 | 1.00 (Ref) | .21 | 1.00 (Ref) | .35 | 1.00 (Ref) | .22 |
| 1500-3000 | 0.7 (0.37 to 1.39) | 0.61 (0.35 to 1.12) | 0.83 (0.53 to 1.33) | 0.77 (0.52 to 1.14) | ||||
| 3000 | 0.72 (0.39 to 1.43) | 0.67 (0.37 to 1.23) | 1.03 (0.66 to 1.64) | 0.92 (0.63 to 1.38) | ||||
| Employment status | ||||||||
| Nonactive (Referent) | 1.00 (Ref) | .03 | 1.00 (Ref) | .05 | 1.00 (Ref) | .03 | 1.00 (Ref) | 6.95x10−4 |
| Active | 1.54 (1.04 to 2.3) | 1.48 (1.01 to 2.17) | 1.37 (1.03 to 1.82) | 1.5 (1.18 to 1.91) | ||||
| Marital status | ||||||||
| Not married (Referent) | 1.00 (Ref) | .80 | 1.00 (Ref) | .51 | 1.00 (Ref) | .82 | 1.00 (Ref) | .65 |
| Married | 0.83 (0.53 to 1.34) | 0.85 (0.54 to 1.34) | 0.95 (0.68 to 1.34) | 1.08 (0.81 to 1.46) | ||||
| Clinical | ||||||||
| Hormonal status | ||||||||
| Premenopause (Referent) | 1.00 (Ref) | .83 | 1.00 (Ref) | .29 | 1.00 (Ref) | .08 | 1.00 (Ref) | .004 |
| Postmenopause | 0.93 (0.61 to 1.47) | 0.79 (0.53 to 1.2) | 0.76 (0.57 to 1.03) | 0.7 (0.55 to 0.89) | ||||
| Smoking status | ||||||||
| Smoker (Referent) | 1.00 (Ref) | .26 | 1.00 (Ref) | .58 | 1.00 (Ref) | .16 | 1.00 (Ref) | .005 |
| Ex-smoker | 0.97 (0.49 to 1.97) | 0.74 (0.37 to 1.48) | 0.82 (0.5 to 1.34) | 0.74 (0.5 to 1.09) | ||||
| Nonsmoker | 0.71 (0.4 to 1.32) | 0.77 (0.44 to 1.39) | 0.69 (0.46 to 1.06) | 0.6 (0.44 to 0.84) | ||||
| Alcohol status | ||||||||
| No (Referent) | 1.00 (Ref) | 1.00 | 1.00 (Ref) | .49 | 1.00 (Ref) | .08 | 1.00 (Ref) | .10 |
| Yes | 1.01 (0.52 to 1.82) | 1.24 (0.71 to 2.11) | 1.43 (0.95 to 2.13) | 0.72 (0.48 to 1.05) | ||||
| Physical activity (GPAQ 16) | ||||||||
| Q1 (Referent) | 1.00 (Ref) | .20 | 1.00 (Ref) | .06 | 1.00 (Ref) | .03 | 1.00 (Ref) | .36 |
| Q2 | 1.47 (0.84 to 2.58) | 1.19 (0.67 to 2.13) | 0.9 (0.61 to 1.32) | 0.77 (0.55 to 1.07) | ||||
| Q3 | 0.96 (0.54 to 1.7) | 0.83 (0.47 to 1.45) | 0.68 (0.45 to 1.01) | 0.85 (0.61 to 1.18) | ||||
| Q4 | 0.87 (0.49 to 1.55) | 0.98 (0.57 to 1.69) | 0.59 (0.39 to 0.89) | 0.79 (0.57 to 1.11) | ||||
| Charlson comorbidity score, continuous | 1.17 (0.98 to 1.4) | .09 | 1 (0.83 to 1.22) | .97 | 0.97 (0.83 to 1.14) | .82 | 0.96 (0.83 to 1.11) | .59 |
| Depression (HADS) | ||||||||
| Absent (Referent) | 1.00 (Ref) | .59 | 1.00 (Ref) | .06 | 1.00 (Ref) | .06 | 1.00 (Ref) | 4.29 x 10−4 |
| Doubtful | 1.4 (0.51 to 3.29) | 0.87 (0.21 to 2.65) | 2.33 (0.94 to 5.45) | 1.87 (1.11 to 3.08) | ||||
| Certain | 1.47 (0.36 to 4.53) | 6.48 (0.74 to 78.18) | NA | 3.07 (1.31 to 6.86) | ||||
| Anxiety (HADS) | ||||||||
| Absent (Referent) | 1.00 (Ref) | .86 | 1.00 (Ref) | .31 | 1.00 (Ref) | 4.34 x 10−6 | 1.00 (Ref) | 1.41 x 10−4 |
| Doubtful | 0.88 (0.53 to 1.43) | 1.39 (0.88 to 2.18) | 1.9 (1.34 to 2.67) | 1.64 (1.24 to 2.17) | ||||
| Certain | 0.93 (0.56 to 1.53) | 1.18 (0.7 to 1.97) | 2.22 (1.45 to 3.35) | 1.62 (1.19 to 2.2) | ||||
| Symptoms and quality of life | ||||||||
| Hot flashes | ||||||||
| No (Referent) | 1.00 (Ref) | .27 | 1.00 (Ref) | .12 | 1.00 (Ref) | .006 | 1.00 (Ref) | .003 |
| Yes | 1.28 (0.84 to 1.94) | 1.4 (0.91 to 2.12) | 1.53 (1.13 to 2.06) | 1.46 (1.14 to 1.88) | ||||
| Pain (EORTC QLQ-C30), | 1.02 (1.01 to 1.04) | .006 | 1.02 (1 to 1.03) | .008 | 1.02 (1.01 to 1.02) | 7.78 x 10−5 | 1.02 (1.01 to 1.02) | 2.98 x 10−8 |
| Insomnia (EORTC QLQ-C30,) | 1.01 (1 to 1.01) | .11 | 1 (1 to 1.01) | .49 | 1 (1 to 1.01) | .07 | 1.01 (1 to 1.01) | 6.71 x 10−4 |
| Tumor characteristics | ||||||||
| Tumor grade | ||||||||
| 1 (Referent) | 1.00 (Ref) | .29 | 1.00 (Ref) | .25 | 1.00 (Ref) | .09 | 1.00 (Ref) | .70 |
| 2 | 1.49 (0.87 to 2.67) | 1.06 (0.64 to 1.79) | 1.47 (0.98 to 2.24) | 1.13 (0.81 to 1.58) | ||||
| 3 | 1.5 (0.82 to 2.84) | 1.45 (0.84 to 2.55) | 1.59 (1.02 to 2.53) | 1.15 (0.8 to 1.67) | ||||
| Tumor subtype | ||||||||
| HR+HER2+(Referent) | 1.00 (Ref) | .71 | 1.00 (Ref) | .02 | 1.00 (Ref) | .31 | 1.00 (Ref) | .29 |
| HR+HER2- | 0.95 (0.49 to 1.98) | 0.43 (0.25 to 0.78) | 0.95 (0.59 to 1.56) | 0.74 (0.52 to 1.07) | ||||
| HR-HER2+ | 1.49 (0.42 to 4.85) | 0.64 (0.19 to 1.95) | 1.61 (0.69 to 3.67) | 0.71 (0.33 to 1.44) | ||||
| HR-HER2- | 0.77 (0.26 to 2.18) | 0.62 (0.26 to 1.44) | 0.75 (0.34 to 1.59) | 0.91 (0.53 to 1.56) | ||||
| Tumor stage, AJCC | ||||||||
| I (Referent) | 1.00 (Ref) | .02 | 1.00 (Ref) | .31 | 1.00 (Ref) | .31 | 1.00 (Ref) | .01 |
| II | 1.77 (1.17 to 2.67) | 1.34 (0.9 to 2) | 1.21 (0.89 to 1.63) | 1.36 (1.06 to 1.75) | ||||
| III | 1.26 (0.55 to 2.64) | 1.21 (0.54 to 2.53) | 1.33 (0.78 to 2.21) | 1.57 (1.04 to 2.33) | ||||
| Treatment | ||||||||
| Chemotherapy | ||||||||
| No (Referent) | 1.00 (Ref) | .22 | 1.00 (Ref) | .04 | 1.00 (Ref) | .002 | 1.00 (Ref) | .002 |
| Yes | 1.28 (0.87 to 1.9) | 1.5 (1.02 to 2.2) | 1.56 (1.18 to 2.08) | 1.45 (1.14 to 1.84) | ||||
| Trastuzumab | ||||||||
| No (Referent) | 1.00 (Ref) | .14 | 1.00 (Ref) | .004 | 1.00 (Ref) | .05 | 1.00 (Ref) | .34 |
| Yes | 1.59 (0.85 to 2.84) | 2.3 (1.26 to 4.07) | 1.55 (0.99 to 2.4) | 1.2 (0.83 to 1.7) | ||||
| Endocrine therapy | ||||||||
| No (Referent) | 1.00 (Ref) | .16 | 1.00 (Ref) | .59 | 1.00 (Ref) | .40 | 1.00 (Ref) | .81 |
| Yes | 1.52 (0.88 to 2.78) | 1.17 (0.72 to 1.96) | 1.2 (0.82 to 1.78) | 1.05 (0.77 to 1.45) | ||||
| Breast surgery | ||||||||
| Breast conservation (Referent) | 1.00 (Ref) | .59 | 1.00 (Ref) | .05 | 1.00 (Ref) | .03 | 1.00 (Ref) | .44 |
| Mastectomy | 1.16 (0.72 to 1.81) | 1.55 (0.99 to 2.41) | 1.46 (1.05 to 2.01) | 1.12 (0.85 to 1.47) | ||||
| Lymphadenectomy | ||||||||
| No (Referent) | 1.00 (Ref) | .01 | 1.00 (Ref) | .04 | 1.00 (Ref) | .40 | 1.00 (Ref) | .006 |
| Axillary | 0.33 (0.05 to 3.78) | 1.91 (0.23 to 88.67) | Inf (0.87 to Inf) | |||||
| Sentinel lymph node biopsy | 0.26 (0.04 to 2.95) | 1.36 (0.16 to 63.1) | Inf (0.62 to Inf) | |||||
| Radiotherapy | ||||||||
| No (Referent) | 1.00 (Ref) | .78 | 1.00 (Ref) | 1.00 | 1.00 (Ref) | .55 | 1.00 (Ref) | .10 |
| Yes | 1.17 (0.58 to 2.6) | 1.04 (0.52 to 2.28) | 0.84 (0.51 to 1.43) | 1.53 (0.94 to 2.61) | ||||
AJCC = American Joint Committee on Cancer; CI = confidence interval; EORTC QLQ = European Organization for Research and Treatment of Cancer Quality of Life; GPAQ 16 = Global Physical Activity Questionnaire 16; HADS = Hospital Anxiety and Depression Scale; HER2 = human epidermal growth factor receptor 2; HR = hormone receptor; Q = quartile; Referent = reference level.
Assessed at baseline.
Statistical significance at Benjamini-Hochberg false discovery rate of 5%.
At least 1 drink per day.
In each subcohort, at least 86% of patients who received chemotherapy were treated with anthracycline and taxane combinations, mainly fluorouracil plus epirubicin plus cyclophosphamide followed by a taxane (docetaxel or paclitaxel) (see Supplementary Table 2, available online). In this setting, most patients received 6 cycles every 3 weeks with standard dose.
Figure 2.Risk calibration curve for the clinico-genomic Cognitive Fatigue prediction model.
Figure 3.Comparison of the area under curve (AUC) in predicting Cognitive Fatigue between the preconditioned random forest regression (PRFR) method and other conventional multivariable regression methods. Confidence intervals on validation AUC were obtained by repeating the training process using randomly selected 80% of the training data. Dotted line = prediction AUC when the first 3 principal components for ancestry were used as the only predictors. LASSO = least absolute shrinkage and selection operator.
The most statistically significantly enriched gene groups from the genes associated with the SNPs with GWAS scan P less than .001 (unfiltered) and the subset of those SNPs with top 50% VIM for the Cognitive Fatigue endpoint, obtained using the bioinformatics tool GeneGO
| Pathway maps | Biological process | ||||||
|---|---|---|---|---|---|---|---|
| Unfiltered | Top 50% VIM | Unfiltered | Top 50% VIM | ||||
| Name | FDR | Name | FDR | Name | FDR | Name | FDR |
| Nociception or pronociceptive action of nociception in spinal cord at low doses | 2.48 × 10−4 | Immune response or lectin-induced complement pathway | 0.01 | Regulation of transport | 1.52 × 10−8 | Regulation of transport | 4.03 × 10−9 |
| O-glycan biosynthesis | 0.002 | Development/ oligodendrocyte differentiation from adult stem cells | 0.01 | Synaptic transmission, glutamatergic | 4.63 × 10−7 | Chemical synaptic transmission | 4.03 × 10−9 |
| Gamma-secretase proteolytic targets | 0.004 | Role of integrins in eosinophil degranulation in asthma | 0.01 | Regulation of localization | 8.25 × 10−7 | Anterograde trans-synaptic signaling | 4.03 × 10−8 |
| Calcium-dependent regulation of normal and asthmatic smooth muscle contraction | 0.02 | Degranulation of lung mast cells | 0.02 | Regulation of metal ion transport | 8.33 × 10−7 | Trans-synaptic signaling | 6.28 × 10−8 |
|
Complement pathway disruption in thrombotic microangiopathy |
0.02 |
Alternative complement cascade disruption in age-related macular degeneration |
0.02 |
Response to alkaloid |
8.33 × 10−7 |
Synaptic transmission glutamatergic |
6.28 × 10−8 |
|
Process networks |
Diseases (by biomarkers) | ||||||
|
Unfiltered |
Top 50% VIM |
Unfiltered |
Top 50% VIM | ||||
|
Name |
FDR |
Name |
FDR |
Name |
FDR |
Name |
FDR |
| Neurophysiological processor transmission of nerve impulse | 0.004 | Inflammation and complement system | 0.03 | Schizophrenia | 2.74 × 10−10 | Huntington disease | 1.26 × 10−12 |
| Development, neurogenesis, or synaptogenesis | 0.004 | — | — | Schizophrenia spectrum and other psychotic disorders | 2.74 × 10−10 | Chorea | 1.41 × 10−12 |
| — | — | — | — | Head and neck neoplasm | 6.11 × 10−10 | Brain ischemia | 1.51 × 10−12 |
| — | — | — | — | Digestive system diseases | 7.80 × 10−9 | Depressive disorders | 1.51 × 10−11 |
| — | — | — | — | Colorectal neoplasms | 1.08 × 10−8 | Cognition disorders | 1.51 × 10−10 |
Only the groups with FDR < 0.05 are shown. FDR = false discovery rate; GWAS = genome-wide association studies; SNP = single nucleotide polymorphism; VIM = variable importance measure.