| Literature DB >> 34566889 |
Na Sun1, Dandan Ma1, Pingping Gao1, Yanling Li1, Zexuan Yan2, Zaihui Peng1, Fei Han3, Yi Zhang1, Xiaowei Qi1.
Abstract
The improvement in the quality of life is accompanied by an accelerated pace of living and increased work-related pressures. Recent decades has seen an increase in the proportion of obese patients, as well as an increase in the prevalence of breast cancer. More and more evidences prove that obesity may be one of a prognostic impact factor in patients with breast cancer. Obesity presents unique diagnostic and therapeutic challenges in the population of breast cancer patients. Therefore, it is essential to have a better understanding of the relationship between obesity and breast cancer. This study aims to construct a prognostic risk prediction model combining obesity and breast cancer. In this study, we obtained a breast cancer sample dataset from the GEO database containing obesity data [determined by the body mass index (BMI)]. A total of 1174 genes that were differentially expressed between breast cancer samples of patients with and without obesity were screened by the rank-sum test. After weighted gene co-expression network analysis (WGCNA), 791 related genes were further screened. Relying on single-factor COX regression analysis to screen the candidate genes to 30, these 30 genes and another set of TCGA data were intersected to obtain 24 common genes. Finally, lasso regression analysis was performed on 24 genes, and a breast cancer prognostic risk prediction model containing 6 related genes was obtained. The model was also found to be related to the infiltration of immune cells. This study provides a new and accurate prognostic model for predicting the survival of breast cancer patients with obesity.Entities:
Keywords: GEO; TCGA; breast cancer; obesity; prognostic model
Mesh:
Year: 2021 PMID: 34566889 PMCID: PMC8458964 DOI: 10.3389/fendo.2021.712513
Source DB: PubMed Journal: Front Endocrinol (Lausanne) ISSN: 1664-2392 Impact factor: 5.555
Sample information in the GSE24185 dataset.
| Characteristic | N=74 (%) |
|---|---|
|
| |
| ≥60 | 11 (14.8%) |
| <60 | 63 (85.1%) |
|
| |
| normal | 36 (48.6%) |
| obese | 38 (51.3%) |
|
| |
| 1/3 | 10 (13.5%) |
| 2/3 | 23 (31.0%) |
| 3/3 | 41 (55.4%) |
|
| |
| PERI | 8 (10.8%) |
| PRE | 35 (47.2%) |
| POST | 31 (41.8%) |
Figure 1Heat map (A) and volcano map (B) of the expression of DEGs.
Figure 2Identification of the functional modules associated with obesity. (A) Cluster dendrogram; (B) Analysis of network topologies for various soft threshold powers; (C) Clustering dendrogram of DEGs with dissimilarity based on topological overlap, together with assigned module colors; (D) Module-trait heatmap. Each row corresponds to a module eigengene, each column corresponds to a trait. Each cell contains the corresponding correlation and P value.
Figure 3GO term enrichment results.
Figure 4Visualization of the protein-protein interaction (PPI) network and the candidate hub genes.
A total of 30 potential prognosis-related genes were screened by univariate analysis.
| Gene | Beta | HR (95% CI for HR) | Wald Test | |
|---|---|---|---|---|
| ANOS1 | 0.36 | 1.4 (1-2) | 4.8 | 0.029 |
| BMERB1 | -0.41 | 0.67 (0.47-0.94) | 5.3 | 0.022 |
| CACNA1D | -0.49 | 0.61 (0.43-0.87) | 7.7 | 0.0056 |
| CDC42EP3 | -0.48 | 0.62 (0.43-0.89) | 6.6 | 0.01 |
| CFDP1 | 0.33 | 1.4 (1-1.9) | 4 | 0.045 |
| CHEK2 | 0.33 | 1.4 (1-1.9) | 4.2 | 0.042 |
| CIR1 | 0.45 | 1.6 (1.1- 2.2) | 7.7 | 0.0055 |
| CTBP2 | -0.34 | 0.71 (0.51- 0.99) | 4.2 | 0.042 |
| DAO | 0.37 | 1.4 (1- 2) | 4.9 | 0.026 |
| DBR1 | 0.34 | 1.4 (1-1.9) | 4.3 | 0.038 |
| DEF6 | 0.34 | 1.4 (1-1.9) | 4.3 | 0.038 |
| EEF1AKNMT | 0.33 | 1.4 (1-1.9) | 4 | 0.045 |
| FASTKD2 | 0.39 | 1.5 (1.1- 2) | 5.9 | 0.015 |
| FCER1A | -0.41 | 0.66 (0.45- 0.97) | 4.5 | 0.034 |
| HRH3 | 0.72 | 2.1 (1.3- 3.3) | 8.6 | 0.0033 |
| KANSL2 | 0.33 | 1.4 (1- 1.9) | 4.2 | 0.042 |
| KHDC4 | 0.37 | 1.5 (1.1- 2) | 5.1 | 0.023 |
| LZTFL1 | -0.35 | 0.7 (0.51- 0.97) | 4.6 | 0.032 |
| PARL | 0.35 | 1.4 (1- 2) | 4.6 | 0.033 |
| PNOC | 0.4 | 1.5 (1- 2.1) | 4.9 | 0.026 |
| PTPRCAP | 0.35 | 1.4 (1- 2) | 4 | 0.047 |
| RBM4B | 0.43 | 1.5 (1.1- 2.1) | 7 | 0.0082 |
| RHOG | 0.39 | 1.5 (1.1- 2) | 5.8 | 0.016 |
| SELENBP1 | 0.35 | 1.4 (1- 2) | 4.7 | 0.03 |
| TCERG1 | 0.34 | 1.4 (1-1.9) | 4.3 | 0.039 |
| TCL1A | 0.45 | 1.6 (1.1-2.3) | 5.2 | 0.023 |
| TP73-AS1 | -0.34 | 0.71 (0.52 - 0.99) | 4.2 | 0.041 |
| TSPYL1 | -0.45 | 0.64 (0.46 - 0.89) | 7 | 0.0083 |
| VNN1 | 0.46 | 1.6 (1-2.5) | 4.4 | 0.037 |
| ZCCHC8 | 0.41 | 1.5 (1.1-2.1) | 6.5 | 0.011 |
Figure 5Prognostic risk assessment model. (A) Tenfold cross‐validation for tuning parameter selection in the LASSO model; (B) Heat map analysis of the gene expression of six pseudogenes in the high- and low-risk groups.
Figure 6Differences in infiltration of different immune cell types between the high- and low-risk groups.
Rank-sum test of the proportion of each immune cell in the two groups.
| Cell | |
|---|---|
| Resting mast cells | 3.66E-18 |
| Eosinophils | 3.01E-12 |
| CD8+ T cells | 5.12E-08 |
| Regulatory T cells (Tregs) | 7.26E-08 |
| Naïve CD4+ T cells | 9.22E-06 |
| Resting dendritic cells | 1.19E-05 |
| Neutrophils | 2.85E-05 |
| Naïve B cells | 0.000205093 |
| M0 macrophages | 0.000572007 |
| Memory activated CD4+ T cells | 0.000847363 |
| Activated dendritic cells | 0.001242318 |
| Activated mast cells | 0.002280044 |
| Resting memory CD4+ T cells | 0.002988401 |
| Memory B cells | 0.032146684 |
| M2 macrophages | 0.063623725 |
| Plasma cells | 0.115015342 |
| Monocytes | 0.191466691 |
| Gamma delta T cells | 0.219624413 |
| Resting NK cells | 0.36141496 |
| Follicular helper T cells | 0.383084286 |
| M1 Macrophages | 0.621128772 |
| Activated NK cells | 0.826045073 |
Figure 7Evaluation of indicators related to breast cancer in the sample, blue is the low-risk group, light red is the high-risk group, each picture is a sample of different indicators.
Univariate COX regression analysis.
| Beta | HR (95% CI for HR) | Wald Test | p value | |
|---|---|---|---|---|
| Risk_class | 0.75 | 2.1 (1 - 4.4) | 4.1 | 0.044 |
| HER2-negative | -0.28 | 0.76 (0.3 - 1.9) | 0.35 | 0.55 |
| ER-negative | -0.55 | 0.57 (0.32 - 1) | 3.3 | 0.068 |
| PR-negative | -0.68 | 0.51 (0.29 - 0.9) | 5.5 | 0.019 |
| Age (≥60 | 0.43 | 1.5 (0.87 - 2.7) | 2.2 | 0.14 |
| T (T1–T2 | 0.6 | 1.8 (0.76 - 4.4) | 1.8 | 0.18 |
| N (N0–N1 | 1.3 | 3.7 (1.6 - 8.8) | 9 | 0.0027 |
| Menopause (M0 | 0.11 | 1.1 (0.63 - 2) | 0.14 | 0.71 |
Multivariate COX regression analysis.
| Beta | HR (95% CI for HR) | p value | |
|---|---|---|---|
| risk_class | 0.867648 | 2.3813 (1.0792 - 5.2544) | 0.0317 |
| HER2 negative | -0.286233 | 0.7511 (0.2892 - 1.9507) | 0.5567 |
| ER negative | 0.004105 | 1.0041 (0.4157 - 2.4256) | 0.9927 |
| PR negative | -0.996517 | 0.3692 (0.1550 - 0.8794) | 0.0244 |
| Age (≥60 | 0.698043 | 2.0098 (0.9642 - 4.1892) | 0.0625 |
| T (T1–T2 | 0.056496 | 1.0581 (0.4221 - 2.6523) | 0.9041 |
| N (N0–N1 | 1.169733 | 3.2211 (1.2833 - 8.0849) | 0.0127 |
| Menopause (M0 | 0.489737 | 1.6319 (0.7923-3.3611) | 0.1840 |