BACKGROUND: Metabolic pathways play an essential role in breast cancer. However, the role of metabolism-related genes in the early diagnosis of breast cancer remains unknown. METHODS: In our study, RNA sequencing (RNA-seq) expression data and clinicopathological information from The Cancer Genome Atlas (TCGA) and GSE20685 were obtained. Univariate cox regression and least absolute shrinkage and selection operator (LASSO) regression analyses were performed on the differentially expressed metabolism-related genes. Then, the formula of the metabolism-related risk model was composed, and the risk score of each patient was calculated. The breast cancer patients were divided into high-risk and low-risk groups with a cutoff of the median expression value of the risk score, and the prognostic analysis was also used to analyze the survival time between these two groups. In the end, we also analyzed the expression, interaction, and correlation among genes in the metabolism-related gene risk model. RESULTS: The results from the prognostic analysis indicated that the survival was significantly poorer in the high-risk group than in the low-risk group in both TCGA and GSE20685 datasets. In addition, after adjusting for different clinicopathological features in multivariate analysis, the metabolism-related risk model remained an independent prognostic indicator in TCGA dataset. CONCLUSIONS: In summary, we systematically developed a potential metabolism-related gene risk model for predicting prognosis in breast cancer patients. 2021 Annals of Translational Medicine. All rights reserved.
BACKGROUND: Metabolic pathways play an essential role in breast cancer. However, the role of metabolism-related genes in the early diagnosis of breast cancer remains unknown. METHODS: In our study, RNA sequencing (RNA-seq) expression data and clinicopathological information from The Cancer Genome Atlas (TCGA) and GSE20685 were obtained. Univariate cox regression and least absolute shrinkage and selection operator (LASSO) regression analyses were performed on the differentially expressed metabolism-related genes. Then, the formula of the metabolism-related risk model was composed, and the risk score of each patient was calculated. The breast cancer patients were divided into high-risk and low-risk groups with a cutoff of the median expression value of the risk score, and the prognostic analysis was also used to analyze the survival time between these two groups. In the end, we also analyzed the expression, interaction, and correlation among genes in the metabolism-related gene risk model. RESULTS: The results from the prognostic analysis indicated that the survival was significantly poorer in the high-risk group than in the low-risk group in both TCGA and GSE20685 datasets. In addition, after adjusting for different clinicopathological features in multivariate analysis, the metabolism-related risk model remained an independent prognostic indicator in TCGA dataset. CONCLUSIONS: In summary, we systematically developed a potential metabolism-related gene risk model for predicting prognosis in breast cancer patients. 2021 Annals of Translational Medicine. All rights reserved.
Entities:
Keywords:
Breast cancer; Gene Expression Omnibus (GEO); The Cancer Genome Atlas (TCGA); metabolism; prognostic signature
Breast cancer is one of the most frequently diagnosed malignant tumors in women worldwide (1). Currently, clinicians use baseline clinical characteristics such as tumor size, lymphatic invasion, or distant metastasis to construct their prognostic models (2-5). However, these models cannot determine the optimal timing for patient treatment, which leads to poor prognosis. Thus, establishing a reliable way of predicting prognosis and optimal treatment in breast cancer patients is urgently needed.Abnormalities of metabolism play a critical role in tumor formation and are a critical factor during oncogenesis (6-8). In recent years, studies of metabolism in cancer have provided new perspectives in cancer research, and the exploration of the relationship between metabolism and cancer has increased (8,9). In addition, recent studies have demonstrated that metabolism might be closely associated with the occurrence and development of breast cancer. For instance, MDM2 is an E3 ubiquitin ligase that targets p53 for degradation and the high expression of MDM2 in breast cancer is associated with poor prognosis (10). Besides, Fong et al. also showed that breast cancer-secreted miR-122 could promote metastasis by modifying glucose utilization (11). In addition, metabolite levels may also serve as a screening strategy to identify women at risk of developing breast cancer. The results from the study by Lécuyer et al. showed that plasma metabolites were associated with the long-term risk of developing breast cancer (12). However, the role of metabolism-related genes in the early diagnosis and treatment of breast cancer patients remains unknown.A gene risk model is a model that uses gene expression data and clinicopathological information to identify patients with a high risk of cancer. With the decreasing cost of sequencing the human genome and the rapid improvement of computer processing, an increasing amount of research has provided insight into building gene risk models from the growing abundance of bioinformatics data (13,14). Indeed, many gene risk models have been created to explore the relationships between cancers and different biological processes (15-17). As for breast cancer, bioinformatics analysis has also been used to build risk models involving long noncoding RNA, autophagy, and tumor mutation burden (18-20). However, the role of metabolism in breast cancer patients based on bioinformatics analysis remains unknown. We thus endeavored to establish the first metabolism-related gene risk model based on the close relationship between metabolism and carcinogenesis.In this study, we acquired the expression levels of metabolism-related genes and clinical characteristics from The Cancer Genome Atlas Breast Carcinoma (TCGA-BRCA) database and the Gene Expression Omnibus (GEO) database. Based on information in TCGA database, we developed a prognostic risk model and analyzed its prognostic value in breast cancer patients. In addition, we validated this risk model in the GSE20685 dataset. We believe that the results from our study may provide new insights into understanding the metabolic mechanisms in breast cancer and can help us to explore the prognostic value of this risk model in breast cancer patients. We present the following article in accordance with the TRIPOD reporting checklist (available at http://dx.doi.org/10.21037/atm-20-7600).
Methods
Data collection
The RNA sequencing (RNA-seq) expression data and clinicopathological information of female breast cancer patients from 1096 breast cancer tissue samples and 112 non-tumor tissue samples were downloaded from the TCGA-BRCA dataset as the training set (https://portal.gdc.cancer.gov/). The inclusion criteria for patients information were the following: (I) primary site: breast; (II) program: TCGA; (III) project: TCGA-BRCA; (IV) fender: female; (V) follow-up: >0 days. Furthermore, gene expression data and clinical information from the GSE20685 data set in the GEO database were used as a validation set (https://www.ncbi.nlm.nih.gov/geo/). In addition, 944 genes in the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway associated with metabolism were also extracted from the “c2.cp.kegg.v7.0.symbols“ gene sets in the Gene Set Enrichment Analysis (GSEA) platform (https://www.gsea-msigdb.org/gsea/downloads.jsp). Overlapping metabolism-related genes were identified from TCGA and GSE20685 gene expression data sets. The detailed flow-process diagram of this study is shown in . The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013).
Figure 1
The flow-process diagram of this study.
The flow-process diagram of this study.
Bioinformatics analysis
To obtain the differentially expressed metabolism-related genes in TCGA data set, the "Limma" R package was used in the R programming language software (version 3.5.3, https://cran.r-project.org/) (21). The following values were used as thresholds: adjusted P<0.05 and |log (fold change) |>0.5. In addition, gene ontology (GO) enrichment analyses were used to find the significant biological processes underlying these differentially expressed genes. The “GOplot” package was also performed in R statistical software to concentrate on the visualization of enrichment terms (22). The correlation analysis between 24 selected genes in the risk model were also performed by the “corrplot” R package. To obtain the potential KEGG pathways in the model genes, GSEA was also used to indicate the enriched terms in TCGA dataset (https://www.gsea-msigdb.org/gsea/index.jsp). Enrichment results with a P<0.05 and a false discovery rate (FDR) value <0.25 were considered statistically significant in GSEA. The interaction of the above 24 genes in the metabolism-related gene risk model was also performed by the GeneMANIA website (http://www.genemania.org/).
Statistical analysis
To establish the metabolism-related risk model, univariate Cox regression analysis was performed on the differentially expressed metabolism-related genes, and 31 survival-related genes were identified. Then, by using the least absolute shrinkage and selection operator (LASSO) regression model, 24 genes were selected to construct the risk model, and their regression coefficients were obtained. Finally, the formula of the risk score was composed as follows, and risk scores were computed:According to the median risk score in TCGA dataset, the breast cancer patients in both TCGA and GSE20685 data sets were divided into high-risk and low-risk groups. A prognostic analysis was used to assess the performance of the risk score in predicting the prognosis of breast cancer patients. In addition, the “survival ROC” R package was used in R statistical software to obtain the time-dependent receiver operating characteristic (ROC) curve and the area under the curve (AUC) values. Furthermore, univariate and multivariate Cox regression analyses were used to determine whether the metabolism-related gene risk model could be an independent predictor. All tests were considered statistically significant with P<0.05.
Results
Expression and enrichment analysis of metabolism-related genes in breast cancer patients
Breast cancer and normal RNA-seq data were acquired from TCGA database (). The thresholds were used as adjusted P<0.05 and |log(fold change)|>0.5 to compare the expression level between breast cancer with that in normal tissue. Subsequently, 316 differentially expressed genes were obtained after the differential expression analysis in these 944 metabolism-related genes. Furthermore, we performed functional enrichment analysis to explore the biological processes underlying these differentially expressed genes. According to the results of enrichment analysis, we found that the metabolism-related enriched GO terms for biological processes were coenzyme metabolic process, organic hydroxy compound metabolic process, and alcohol metabolic process (). The heatmap of the relationship between metabolism-related genes and biological processes is displayed in .
Table 1
Clinicopathological parameters of BRCA patients in TCGA database
Clinical parameters
Variables
Total
Percentages (%)
Age
≤65
760
71.9
>65
297
28.1
Stage
Stage I
181
17.1
Stage II
598
56.6
Stage III
237
22.4
Stage IV
19
1.8
Stage X
11
1.0
Stage (−)
11
1.0
T
T1
278
26.3
T2
607
57.4
T3
132
12.5
T4
37
3.5
TX
3
0.3
N
N0
498
47.1
N1
351
33.2
N2
119
11.3
N3
72
6.8
NX
17
1.6
BRCA, breast carcinoma; TCGA, The Cancer Genome Atlas; T, tumor; N, node (lymph node).
Figure 2
Enrichment analysis of differentially expressed metabolism-related genes in breast cancer patients. (A) The relationship between differentially expressed metabolism-related genes and biological processes. The outer circle shows a scatter plot for each term of the logFC of the assigned genes. (B) The heatmap of the relationship between differentially expressed metabolism-related genes and biological processes. The color of each block depends on the logFC values. GO, gene ontology; FC, fold change.
BRCA, breast carcinoma; TCGA, The Cancer Genome Atlas; T, tumor; N, node (lymph node).Enrichment analysis of differentially expressed metabolism-related genes in breast cancer patients. (A) The relationship between differentially expressed metabolism-related genes and biological processes. The outer circle shows a scatter plot for each term of the logFC of the assigned genes. (B) The heatmap of the relationship between differentially expressed metabolism-related genes and biological processes. The color of each block depends on the logFC values. GO, gene ontology; FC, fold change.
Construction and validation of the prognostic metabolism-related gene risk model in breast cancer patients
After reviewing the differentially expressed metabolism-related genes, 316 metabolic genes were included to train the risk model. To establish the metabolism-related risk model, univariate cox regression analysis was first performed, and 31 survival-related genes were identified for further analysis. Then, by using the LASSO regression model in these 31 survival-related genes, 24 genes were selected to build the risk model, and the risk score of each patient was calculated (Appendix 1).The breast cancer patients were divided into high-risk and low-risk groups with a cutoff of the median expression value of the risk score, and the prognostic analysis was also used to analyze the survival time between these two groups. The results from the prognostic analysis indicated that the survival was significantly poorer in the high-risk group than in the low-risk group (, P=3.464e-14). show the risk score distribution of patients and the survival status of patients in TCGA database. In addition, the GSE20685 data set was used to validate the prognostic value of the metabolism-related risk model. By using the formula of the risk model, the risk score of each patient in the GSE20685 data set was obtained. Consistent with the findings in the training group, the survival was significantly poorer in the high-risk group than in the low-risk group (, P=1.198e-06). show the risk score distribution of patients and the survival status of patients.
Figure 3
Construction and validation of the prognostic metabolism-related gene risk model in breast cancer patients. (A) Prognostic analysis indicated that patients in the high-risk group had a worse predicted prognosis than those in the low-risk group in TCGA dataset. (B,C) The risk score distribution and the survival status of breast cancer patients in TCGA dataset. (D) Prognostic analysis indicated that patients in the high-risk group had a worse predicted prognosis than those in the low-risk group in the GSE20685 dataset. (E,F) The risk score distribution and the survival status of breast cancer patients in the GSE20685 dataset. (G) The ROC curve for the metabolism-related gene risk model. (H) ROC curves at 1 year. (I) ROC curves at 5 years. TCGA, The Cancer Genome Atlas; ROC, receiver operating characteristic; AUC, area under curve; T, tumor; N, node (lymph node); M, metastasis.
Construction and validation of the prognostic metabolism-related gene risk model in breast cancer patients. (A) Prognostic analysis indicated that patients in the high-risk group had a worse predicted prognosis than those in the low-risk group in TCGA dataset. (B,C) The risk score distribution and the survival status of breast cancer patients in TCGA dataset. (D) Prognostic analysis indicated that patients in the high-risk group had a worse predicted prognosis than those in the low-risk group in the GSE20685 dataset. (E,F) The risk score distribution and the survival status of breast cancer patients in the GSE20685 dataset. (G) The ROC curve for the metabolism-related gene risk model. (H) ROC curves at 1 year. (I) ROC curves at 5 years. TCGA, The Cancer Genome Atlas; ROC, receiver operating characteristic; AUC, area under curve; T, tumor; N, node (lymph node); M, metastasis.
The specificity and independent prognostic role of the metabolism-related gene risk model
To determine the specificity and sensitivity of the metabolism-related gene risk model, a time-dependent ROC curve was calculated for TCGA database. The AUC of our prognostic risk model was 0.756 (), which was higher than that of traditional clinical factors, including stage (AUC =0.713), residual tumor size (AUC =0.737), distant metastasis (AUC =0.560), and lymphatic invasion (AUC =0.657). Furthermore, the AUCs for 1- and 5-year survival were 0.792 () and 0.776 (), respectively. In addition, after adjusting for different clinicopathological features in the univariate and multivariate analysis, the metabolism-related gene risk model remained an independent prognostic indicator for breast cancer patients in TCGA database (univariate: HR =4.698, 95% CI: 3.362–6.567, P<0.001; multivariate: HR =4.425, 95% CI: 3.116-6.284, P<0.001, ).
Table 2
Adjusting for different clinicopathological features in univariate and multivariate Cox regression analysis
Variables
Univariate Cox regression
Multivariate Cox regression
HR
Confidence interval
P
HR
Confidence interval
P
Age
1.034
1.020–1.049
<0.001*
1.035
1.020–1.050
<0.001*
Stage
2.114
1.673–2.672
<0.001*
1.372
0.810–2.324
0.240
T
1.574
1.269–1.954
<0.001*
1.008
0.744–1.365
0.960
M
6.005
3.302–10.922
<0.001*
2.126
0.921–4.907
0.077
N
1.671
1.393–2.003
<0.001*
1.214
0.893–1.649
0.216
Risk score
4.698
3.362–6.567
<0.001*
4.425
3.116–6.284
<0.001*
*P<0.05 was considered statistically significant. HR, hazard ratio; T, tumor; N, node (lymph node); M, metastasis.
*P<0.05 was considered statistically significant. HR, hazard ratio; T, tumor; N, node (lymph node); M, metastasis.
The role of the metabolism-related gene risk model in different breast cancer subgroups
To explore the prognostic value of the metabolism-related gene signature in different subgroups, the patients were grouped by residual tumor size, lymphatic invasion, and TNM stage. The results from the prognostic analysis indicated that patients in the high-risk group showed poorer survival than those in the low-risk group in both TNM stages I–II (P=3.476e-08, ) and TNM stages III–IV (P=1.649e-05, ) patients. In addition, patients in the high-risk group also had a worse predicted prognosis than those in the low-risk group in both patients with and without lymphatic invasion (P=4.563e-07, ; P=3.333e-06, ). For tumor size in T1–T2 or T3–T4, patients in the high-risk group had a worse predicted prognosis than those in the low-risk group (P=1.313e-09, ; P=4.954e-06, ). We also conducted prognostic analysis in breast cancer patients with progesterone receptor (PR), estrogen receptor (ER), and human epithelial growth factor receptor 2 (HER2) status. The results from the prognostic analysis indicated that patients in the low-risk group had significantly better survival than those in the high-risk group in PR-negative (P=1.312e-05, ), PR-positive (P=1.236e-07, ), ER-negative (P=6.841e-04, ), ER-positive (P=3.583e-09, ), HER2-negative (P=1.228e-05, ), and HER2-positive status subgroups (P=1.581e-04, ).
Figure 4
The role of a metabolism-related gene risk model in different breast cancer cohorts in TCGA dataset. (A,B) The prognostic analysis in TNM stage I–II or TNM stage III–IV patients. (C,D) The prognostic analysis in patients with or without lymphatic invasion. (E,F) The prognostic analysis in T1–T2 or T3–T4 patients. (G,H) The prognostic analysis in PR-negative or PR-positive patients. (I,J) The prognostic analysis in ER-negative or ER-positive patients. (K,L) The prognostic analysis in HER2-negative or HER2-positive patients. TCGA, The Cancer Genome Atlas; T, tumor; N, node (lymph node); M, metastasis; PR, progesterone receptor; ER, estrogen receptor; HER2, human epithelial growth factor receptor 2.
The role of a metabolism-related gene risk model in different breast cancer cohorts in TCGA dataset. (A,B) The prognostic analysis in TNM stage I–II or TNM stage III–IV patients. (C,D) The prognostic analysis in patients with or without lymphatic invasion. (E,F) The prognostic analysis in T1–T2 or T3–T4 patients. (G,H) The prognostic analysis in PR-negative or PR-positive patients. (I,J) The prognostic analysis in ER-negative or ER-positive patients. (K,L) The prognostic analysis in HER2-negative or HER2-positive patients. TCGA, The Cancer Genome Atlas; T, tumor; N, node (lymph node); M, metastasis; PR, progesterone receptor; ER, estrogen receptor; HER2, human epithelial growth factor receptor 2.
Gene set enrichment analysis
To explore the enriched KEGG pathways of model genes, GSEA was performed and showed that a large number of the enriched pathways, including ether lipid metabolism, arachidonic acid metabolism, glycerophospholipid metabolism, linoleic acid metabolism, fatty acid metabolism, and alpha linolenic acid metabolism, were metabolism related (). Furthermore, we found that the top five enriched KEGG terms in the high-risk group were oocyte meiosis, cell cycle, vibrio cholerae infection, proteasome, and aminoacyl tRNA biosynthesis (). For the low-risk group, the top five enriched KEGG terms were asthma, cytokine-cytokine receptor interaction, ether lipid metabolism, glycerophospholipid metabolism, and JAK/STAT signaling pathway (). Related parameters for these KEGG terms are shown in .
Figure 5
The significantly enriched KEGG pathways in TCGA dataset by GSEA. (A) Enriched metabolism-related KEGG terms in TCGA dataset. (B,C) The top five enriched KEGG terms in high-risk and low-risk patients. KEGG, Kyoto Encyclopedia of Genes and Genomes; TCGA, The Cancer Genome Atlas; GSEA, Gene Set Enrichment Analysis; KEGG, Kyoto Encyclopedia of Genes and Genomes.
Table 3
Related parameters of the enriched KEGG pathways in
The significantly enriched KEGG pathways in TCGA dataset by GSEA. (A) Enriched metabolism-related KEGG terms in TCGA dataset. (B,C) The top five enriched KEGG terms in high-risk and low-risk patients. KEGG, Kyoto Encyclopedia of Genes and Genomes; TCGA, The Cancer Genome Atlas; GSEA, Gene Set Enrichment Analysis; KEGG, Kyoto Encyclopedia of Genes and Genomes.ES, enrichment score; FDR-q, false discovery rate-q value; NES, normalized enrichment score.
The expression, interaction, and correlation among genes in the metabolism-related gene risk model
A box-plot was also used to visualize the expression pattern between breast cancer tissues and normal breast tissues of these 24 genes (). In order to better understand the mutual interaction of these 24 genes in the gene risk model, GeneMANIA was used to set up a network for these genes, and the results are shown in . To further explore the correlation among these 24 genes, we carried out a correlation analysis. The results suggested that part of these metabolism-related genes showed weak-to-moderate correlation (). In these 24 genes, TYMP and IL4I1 were most positively correlated (Cor=0.63). Nevertheless, there were also some negatively correlated genes in these 24 genes, including TYMP and SUCLA2 (Cor=−0.35).
Figure 6
The expression, interaction, and correlation among genes in the metabolism-related gene risk model. (A) Expression of 24 metabolism-related genes between breast cancer tissues and normal breast tissues. (B) Interaction network analysis for these 24 metabolism-related genes. (C) Correlation analysis was used to determine the correlation among these 24 metabolism-related genes. N, normal; T, tumor.
The expression, interaction, and correlation among genes in the metabolism-related gene risk model. (A) Expression of 24 metabolism-related genes between breast cancer tissues and normal breast tissues. (B) Interaction network analysis for these 24 metabolism-related genes. (C) Correlation analysis was used to determine the correlation among these 24 metabolism-related genes. N, normal; T, tumor.
Discussion
Breast cancer is one of the most frequently diagnosed cancers in women, and its yearly incidence is still increasing (1). As both a consequence and cause of cancer formation, tumor cell-intrinsic metabolism plays a vital role in oncogenesis (23-25). For example, cancer cells can generate lactic acids, and such acidic environments favor tumor invasion and can suppress the anticancer immune system (26-28). Some recent studies have also demonstrated the close connection between metabolism and the development of breast cancer. Lécuyer’s group found that the higher fasting plasma metabolites levels of valine, lysine, arginine, glutamine, creatine, creatinine, and glucose; and lower plasma levels of lipoproteins, lipids, glycoproteins, acetone, glycerol-derived compounds, and unsaturated lipids, were associated with the risk of developing breast cancer (12). However, this study only showed the relationship between metabolites and breast cancer, and the underlying mechanism of metabolism and the role of metabolism-related genes in breast cancer remains unknown.In the present study, a significant prognostic risk model based on 24 metabolism-related genes was developed. The results showed that patients in the high-risk group were closely associated with poor prognosis. In addition, results from the prognostic analysis in different molecular subtypes of breast cancer patients also indicated that patients in the high-risk group showed poorer survival than those in the low-risk group. Beyond prognostic value, we also found that our gene risk model remained an independent prognostic factor after adjusting for different clinicopathological features. Furthermore, the GSEA enrichment analysis also showed that many significantly enriched pathways were metabolism related. These enrichment results clarify the underlying molecular mechanisms in our risk model and provide potential applications or research trends for metabolic therapy. In summary, there is excellent prognostic potential for this 24-gene-based risk model, and combining this model with other clinical features could better predict the prognosis of breast cancer patients.Previous studies have shown that the genes in our risk model are involved in different metabolic processes. It has also been noted that some of these 24 genes are closely related to the progression of breast cancer. High expression of CEL has been reported to be an independent prognostic factor related to the poor prognosis of breast cancer (29), while AK3 has been demonstrated to mainly function in the mitochondrial matrix and has been associated with the overall survival of breast cancer patients (30). Aryal et al. reported that glutamyl prolyl tRNA synthetase (EPRS) was selectively carbonylated in tumor breast tissue compared to normal breast tissue (31). MTHFD2, a metabolic enzyme involved in mitochondrial one-carbon folate metabolism and the high expression of MTHFD2 is associated with poor survival in breast cancer patients (32). As for NME3, Flentie et al. suggested that the high expression level of NME3 increased overall survival of patients with breast cancer (33). PAICS, an enzyme for de novo purine biosynthesis and plays an essential role in breast cancer proliferation (34). Additionally, others genes in our risk model were also closely related to the progression of cancer. G6PD was shown to play an essential role in controlling pentose phosphate pathway and has been considered a biomarker and a potential therapeutic target for cancer patients (35). Although the role of NMNAT2 in breast cancer remains unclear, NMNAT2 is still considered to be a metabolic enzyme associated with the progression of colorectal cancer (36). IDO1 is an enzyme that mediates tryptophan metabolism, and some results indicate that IDO1 plays an essential role in tumor progression and immune response (37,38). PLA2G1B is a secreted phospholipase that participates in regulating the digestion of phospholipids (39). Furthermore, some studies have also shown that the low expression level of PLA2G1B could also promote pancreatic cancer (40). Hinsch’s research reported that anti-apoptotic quinolinate phosphoribosyltransferase (QPRT) could be a potential biomarker for follicular thyroid carcinoma (41). In addition, Ullmark et al. also suggested that QPRT may have antiapoptotic properties (42). RDH16 is involved in the process of retinoic acid metabolism, and the expression level of RDH16 has been significantly associated with poor survival of hepatocellular carcinoma patients (43). Meanwhile, TSTA3 was shown to participate in the biosynthesis of GDP-L-fucose, and the expression level of TSTA3 was correlated with poor prognosis for patients with esophageal squamous cell carcinoma (44). TYMP has been reported to be associated with the conversion of capecitabine to 5-FU (45). Previous study has also indicated that the expression level of TYMP was associated with capecitabine response (46). UGDH is an enzyme that can convert UDP-glucose to UDP-glucuronic acid and is a poor prognostic factor in lung adenocarcinoma patients (47). ACADM is a potential biomarker for the diagnosis and treatment of clear cell renal cell carcinoma (48). NT5E is a crucial prognostic factor in different types of cancers and is an important ectonucleotidase in the catabolism of extracellular ATP to adenosine (49). PFKL is a glycolysis rate-limiting enzyme and a potential target for hepatocellular carcinoma therapy (50).The advantage of our research is that we provided and validated a reliable metabolism-related risk model in breast cancer by using large-scale databases. Even though this 24-gene-based risk model is an independent risk model for breast cancer patients, there are still some limitations in our study. First, the present research is a retrospective study based on TCGA and GEO databases. Therefore, prospective clinical research should be performed to validate the application of this model. Second, some clinicopathological features in breast cancer patients that correlated with survival, such as weight, biopsy information, and other valuable items, should be investigated. Third, experimental or clinical studies are still needed to investigate how these genes affect the progression and metabolic microenvironment of breast cancer.
Conclusions
In summary, based on comprehensive analysis of the metabolism-related genes and clinical features in breast cancer patients, we developed a 24-gene-based metabolism-related gene risk model. By using this model, the risk score of each patient was obtained, and the prognostic value of the risk scores in breast cancer patients was also analyzed. Overall, the risk model of our study can contribute to understanding the role of metabolism in breast cancer and may provide more insightful prognostic information for breast cancer patients.The article’s supplementary files as
Authors: Matthew E Ritchie; Belinda Phipson; Di Wu; Yifang Hu; Charity W Law; Wei Shi; Gordon K Smyth Journal: Nucleic Acids Res Date: 2015-01-20 Impact factor: 16.971