Literature DB >> 33708957

A systematic analysis of a potential metabolism-related prognostic signature for breast cancer patients.

Shibo Yu1, Xiaowen Wang2, Lizhe Zhu1, Peiling Xie1, Yudong Zhou1, Siyuan Jiang1, Heyan Chen1, Xiaoqin Liao1, Shengyu Pu1, Zhenzhen Lei1, Bin Wang1, Yu Ren1.   

Abstract

BACKGROUND: Metabolic pathways play an essential role in breast cancer. However, the role of metabolism-related genes in the early diagnosis of breast cancer remains unknown.
METHODS: In our study, RNA sequencing (RNA-seq) expression data and clinicopathological information from The Cancer Genome Atlas (TCGA) and GSE20685 were obtained. Univariate cox regression and least absolute shrinkage and selection operator (LASSO) regression analyses were performed on the differentially expressed metabolism-related genes. Then, the formula of the metabolism-related risk model was composed, and the risk score of each patient was calculated. The breast cancer patients were divided into high-risk and low-risk groups with a cutoff of the median expression value of the risk score, and the prognostic analysis was also used to analyze the survival time between these two groups. In the end, we also analyzed the expression, interaction, and correlation among genes in the metabolism-related gene risk model.
RESULTS: The results from the prognostic analysis indicated that the survival was significantly poorer in the high-risk group than in the low-risk group in both TCGA and GSE20685 datasets. In addition, after adjusting for different clinicopathological features in multivariate analysis, the metabolism-related risk model remained an independent prognostic indicator in TCGA dataset.
CONCLUSIONS: In summary, we systematically developed a potential metabolism-related gene risk model for predicting prognosis in breast cancer patients. 2021 Annals of Translational Medicine. All rights reserved.

Entities:  

Keywords:  Breast cancer; Gene Expression Omnibus (GEO); The Cancer Genome Atlas (TCGA); metabolism; prognostic signature

Year:  2021        PMID: 33708957      PMCID: PMC7944328          DOI: 10.21037/atm-20-7600

Source DB:  PubMed          Journal:  Ann Transl Med        ISSN: 2305-5839


Introduction

Breast cancer is one of the most frequently diagnosed malignant tumors in women worldwide (1). Currently, clinicians use baseline clinical characteristics such as tumor size, lymphatic invasion, or distant metastasis to construct their prognostic models (2-5). However, these models cannot determine the optimal timing for patient treatment, which leads to poor prognosis. Thus, establishing a reliable way of predicting prognosis and optimal treatment in breast cancer patients is urgently needed. Abnormalities of metabolism play a critical role in tumor formation and are a critical factor during oncogenesis (6-8). In recent years, studies of metabolism in cancer have provided new perspectives in cancer research, and the exploration of the relationship between metabolism and cancer has increased (8,9). In addition, recent studies have demonstrated that metabolism might be closely associated with the occurrence and development of breast cancer. For instance, MDM2 is an E3 ubiquitin ligase that targets p53 for degradation and the high expression of MDM2 in breast cancer is associated with poor prognosis (10). Besides, Fong et al. also showed that breast cancer-secreted miR-122 could promote metastasis by modifying glucose utilization (11). In addition, metabolite levels may also serve as a screening strategy to identify women at risk of developing breast cancer. The results from the study by Lécuyer et al. showed that plasma metabolites were associated with the long-term risk of developing breast cancer (12). However, the role of metabolism-related genes in the early diagnosis and treatment of breast cancer patients remains unknown. A gene risk model is a model that uses gene expression data and clinicopathological information to identify patients with a high risk of cancer. With the decreasing cost of sequencing the human genome and the rapid improvement of computer processing, an increasing amount of research has provided insight into building gene risk models from the growing abundance of bioinformatics data (13,14). Indeed, many gene risk models have been created to explore the relationships between cancers and different biological processes (15-17). As for breast cancer, bioinformatics analysis has also been used to build risk models involving long noncoding RNA, autophagy, and tumor mutation burden (18-20). However, the role of metabolism in breast cancer patients based on bioinformatics analysis remains unknown. We thus endeavored to establish the first metabolism-related gene risk model based on the close relationship between metabolism and carcinogenesis. In this study, we acquired the expression levels of metabolism-related genes and clinical characteristics from The Cancer Genome Atlas Breast Carcinoma (TCGA-BRCA) database and the Gene Expression Omnibus (GEO) database. Based on information in TCGA database, we developed a prognostic risk model and analyzed its prognostic value in breast cancer patients. In addition, we validated this risk model in the GSE20685 dataset. We believe that the results from our study may provide new insights into understanding the metabolic mechanisms in breast cancer and can help us to explore the prognostic value of this risk model in breast cancer patients. We present the following article in accordance with the TRIPOD reporting checklist (available at http://dx.doi.org/10.21037/atm-20-7600).

Methods

Data collection

The RNA sequencing (RNA-seq) expression data and clinicopathological information of female breast cancer patients from 1096 breast cancer tissue samples and 112 non-tumor tissue samples were downloaded from the TCGA-BRCA dataset as the training set (https://portal.gdc.cancer.gov/). The inclusion criteria for patients information were the following: (I) primary site: breast; (II) program: TCGA; (III) project: TCGA-BRCA; (IV) fender: female; (V) follow-up: >0 days. Furthermore, gene expression data and clinical information from the GSE20685 data set in the GEO database were used as a validation set (https://www.ncbi.nlm.nih.gov/geo/). In addition, 944 genes in the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway associated with metabolism were also extracted from the “c2.cp.kegg.v7.0.symbols“ gene sets in the Gene Set Enrichment Analysis (GSEA) platform (https://www.gsea-msigdb.org/gsea/downloads.jsp). Overlapping metabolism-related genes were identified from TCGA and GSE20685 gene expression data sets. The detailed flow-process diagram of this study is shown in . The study was conducted in accordance with the Declaration of Helsinki (as revised in 2013).
Figure 1

The flow-process diagram of this study.

The flow-process diagram of this study.

Bioinformatics analysis

To obtain the differentially expressed metabolism-related genes in TCGA data set, the "Limma" R package was used in the R programming language software (version 3.5.3, https://cran.r-project.org/) (21). The following values were used as thresholds: adjusted P<0.05 and |log (fold change) |>0.5. In addition, gene ontology (GO) enrichment analyses were used to find the significant biological processes underlying these differentially expressed genes. The “GOplot” package was also performed in R statistical software to concentrate on the visualization of enrichment terms (22). The correlation analysis between 24 selected genes in the risk model were also performed by the “corrplot” R package. To obtain the potential KEGG pathways in the model genes, GSEA was also used to indicate the enriched terms in TCGA dataset (https://www.gsea-msigdb.org/gsea/index.jsp). Enrichment results with a P<0.05 and a false discovery rate (FDR) value <0.25 were considered statistically significant in GSEA. The interaction of the above 24 genes in the metabolism-related gene risk model was also performed by the GeneMANIA website (http://www.genemania.org/).

Statistical analysis

To establish the metabolism-related risk model, univariate Cox regression analysis was performed on the differentially expressed metabolism-related genes, and 31 survival-related genes were identified. Then, by using the least absolute shrinkage and selection operator (LASSO) regression model, 24 genes were selected to construct the risk model, and their regression coefficients were obtained. Finally, the formula of the risk score was composed as follows, and risk scores were computed: According to the median risk score in TCGA dataset, the breast cancer patients in both TCGA and GSE20685 data sets were divided into high-risk and low-risk groups. A prognostic analysis was used to assess the performance of the risk score in predicting the prognosis of breast cancer patients. In addition, the “survival ROC” R package was used in R statistical software to obtain the time-dependent receiver operating characteristic (ROC) curve and the area under the curve (AUC) values. Furthermore, univariate and multivariate Cox regression analyses were used to determine whether the metabolism-related gene risk model could be an independent predictor. All tests were considered statistically significant with P<0.05.

Results

Expression and enrichment analysis of metabolism-related genes in breast cancer patients

Breast cancer and normal RNA-seq data were acquired from TCGA database (). The thresholds were used as adjusted P<0.05 and |log(fold change)|>0.5 to compare the expression level between breast cancer with that in normal tissue. Subsequently, 316 differentially expressed genes were obtained after the differential expression analysis in these 944 metabolism-related genes. Furthermore, we performed functional enrichment analysis to explore the biological processes underlying these differentially expressed genes. According to the results of enrichment analysis, we found that the metabolism-related enriched GO terms for biological processes were coenzyme metabolic process, organic hydroxy compound metabolic process, and alcohol metabolic process (). The heatmap of the relationship between metabolism-related genes and biological processes is displayed in .
Table 1

Clinicopathological parameters of BRCA patients in TCGA database

Clinical parametersVariablesTotalPercentages (%)
Age≤6576071.9
>6529728.1
StageStage I18117.1
Stage II59856.6
Stage III23722.4
Stage IV191.8
Stage X111.0
Stage (−)111.0
TT127826.3
T260757.4
T313212.5
T4373.5
TX30.3
NN049847.1
N135133.2
N211911.3
N3726.8
NX171.6

BRCA, breast carcinoma; TCGA, The Cancer Genome Atlas; T, tumor; N, node (lymph node).

Figure 2

Enrichment analysis of differentially expressed metabolism-related genes in breast cancer patients. (A) The relationship between differentially expressed metabolism-related genes and biological processes. The outer circle shows a scatter plot for each term of the logFC of the assigned genes. (B) The heatmap of the relationship between differentially expressed metabolism-related genes and biological processes. The color of each block depends on the logFC values. GO, gene ontology; FC, fold change.

BRCA, breast carcinoma; TCGA, The Cancer Genome Atlas; T, tumor; N, node (lymph node). Enrichment analysis of differentially expressed metabolism-related genes in breast cancer patients. (A) The relationship between differentially expressed metabolism-related genes and biological processes. The outer circle shows a scatter plot for each term of the logFC of the assigned genes. (B) The heatmap of the relationship between differentially expressed metabolism-related genes and biological processes. The color of each block depends on the logFC values. GO, gene ontology; FC, fold change.

Construction and validation of the prognostic metabolism-related gene risk model in breast cancer patients

After reviewing the differentially expressed metabolism-related genes, 316 metabolic genes were included to train the risk model. To establish the metabolism-related risk model, univariate cox regression analysis was first performed, and 31 survival-related genes were identified for further analysis. Then, by using the LASSO regression model in these 31 survival-related genes, 24 genes were selected to build the risk model, and the risk score of each patient was calculated (Appendix 1). The breast cancer patients were divided into high-risk and low-risk groups with a cutoff of the median expression value of the risk score, and the prognostic analysis was also used to analyze the survival time between these two groups. The results from the prognostic analysis indicated that the survival was significantly poorer in the high-risk group than in the low-risk group (, P=3.464e-14). show the risk score distribution of patients and the survival status of patients in TCGA database. In addition, the GSE20685 data set was used to validate the prognostic value of the metabolism-related risk model. By using the formula of the risk model, the risk score of each patient in the GSE20685 data set was obtained. Consistent with the findings in the training group, the survival was significantly poorer in the high-risk group than in the low-risk group (, P=1.198e-06). show the risk score distribution of patients and the survival status of patients.
Figure 3

Construction and validation of the prognostic metabolism-related gene risk model in breast cancer patients. (A) Prognostic analysis indicated that patients in the high-risk group had a worse predicted prognosis than those in the low-risk group in TCGA dataset. (B,C) The risk score distribution and the survival status of breast cancer patients in TCGA dataset. (D) Prognostic analysis indicated that patients in the high-risk group had a worse predicted prognosis than those in the low-risk group in the GSE20685 dataset. (E,F) The risk score distribution and the survival status of breast cancer patients in the GSE20685 dataset. (G) The ROC curve for the metabolism-related gene risk model. (H) ROC curves at 1 year. (I) ROC curves at 5 years. TCGA, The Cancer Genome Atlas; ROC, receiver operating characteristic; AUC, area under curve; T, tumor; N, node (lymph node); M, metastasis.

Construction and validation of the prognostic metabolism-related gene risk model in breast cancer patients. (A) Prognostic analysis indicated that patients in the high-risk group had a worse predicted prognosis than those in the low-risk group in TCGA dataset. (B,C) The risk score distribution and the survival status of breast cancer patients in TCGA dataset. (D) Prognostic analysis indicated that patients in the high-risk group had a worse predicted prognosis than those in the low-risk group in the GSE20685 dataset. (E,F) The risk score distribution and the survival status of breast cancer patients in the GSE20685 dataset. (G) The ROC curve for the metabolism-related gene risk model. (H) ROC curves at 1 year. (I) ROC curves at 5 years. TCGA, The Cancer Genome Atlas; ROC, receiver operating characteristic; AUC, area under curve; T, tumor; N, node (lymph node); M, metastasis.

The specificity and independent prognostic role of the metabolism-related gene risk model

To determine the specificity and sensitivity of the metabolism-related gene risk model, a time-dependent ROC curve was calculated for TCGA database. The AUC of our prognostic risk model was 0.756 (), which was higher than that of traditional clinical factors, including stage (AUC =0.713), residual tumor size (AUC =0.737), distant metastasis (AUC =0.560), and lymphatic invasion (AUC =0.657). Furthermore, the AUCs for 1- and 5-year survival were 0.792 () and 0.776 (), respectively. In addition, after adjusting for different clinicopathological features in the univariate and multivariate analysis, the metabolism-related gene risk model remained an independent prognostic indicator for breast cancer patients in TCGA database (univariate: HR =4.698, 95% CI: 3.362–6.567, P<0.001; multivariate: HR =4.425, 95% CI: 3.116-6.284, P<0.001, ).
Table 2

Adjusting for different clinicopathological features in univariate and multivariate Cox regression analysis

VariablesUnivariate Cox regressionMultivariate Cox regression
HRConfidence intervalPHRConfidence intervalP
Age1.0341.020–1.049<0.001*1.0351.020–1.050<0.001*
Stage2.1141.673–2.672<0.001*1.3720.810–2.3240.240
T1.5741.269–1.954<0.001*1.0080.744–1.3650.960
M6.0053.302–10.922<0.001*2.1260.921–4.9070.077
N1.6711.393–2.003<0.001*1.2140.893–1.6490.216
Risk score4.6983.362–6.567<0.001*4.4253.116–6.284<0.001*

*P<0.05 was considered statistically significant. HR, hazard ratio; T, tumor; N, node (lymph node); M, metastasis.

*P<0.05 was considered statistically significant. HR, hazard ratio; T, tumor; N, node (lymph node); M, metastasis.

The role of the metabolism-related gene risk model in different breast cancer subgroups

To explore the prognostic value of the metabolism-related gene signature in different subgroups, the patients were grouped by residual tumor size, lymphatic invasion, and TNM stage. The results from the prognostic analysis indicated that patients in the high-risk group showed poorer survival than those in the low-risk group in both TNM stages I–II (P=3.476e-08, ) and TNM stages III–IV (P=1.649e-05, ) patients. In addition, patients in the high-risk group also had a worse predicted prognosis than those in the low-risk group in both patients with and without lymphatic invasion (P=4.563e-07, ; P=3.333e-06, ). For tumor size in T1–T2 or T3–T4, patients in the high-risk group had a worse predicted prognosis than those in the low-risk group (P=1.313e-09, ; P=4.954e-06, ). We also conducted prognostic analysis in breast cancer patients with progesterone receptor (PR), estrogen receptor (ER), and human epithelial growth factor receptor 2 (HER2) status. The results from the prognostic analysis indicated that patients in the low-risk group had significantly better survival than those in the high-risk group in PR-negative (P=1.312e-05, ), PR-positive (P=1.236e-07, ), ER-negative (P=6.841e-04, ), ER-positive (P=3.583e-09, ), HER2-negative (P=1.228e-05, ), and HER2-positive status subgroups (P=1.581e-04, ).
Figure 4

The role of a metabolism-related gene risk model in different breast cancer cohorts in TCGA dataset. (A,B) The prognostic analysis in TNM stage I–II or TNM stage III–IV patients. (C,D) The prognostic analysis in patients with or without lymphatic invasion. (E,F) The prognostic analysis in T1–T2 or T3–T4 patients. (G,H) The prognostic analysis in PR-negative or PR-positive patients. (I,J) The prognostic analysis in ER-negative or ER-positive patients. (K,L) The prognostic analysis in HER2-negative or HER2-positive patients. TCGA, The Cancer Genome Atlas; T, tumor; N, node (lymph node); M, metastasis; PR, progesterone receptor; ER, estrogen receptor; HER2, human epithelial growth factor receptor 2.

The role of a metabolism-related gene risk model in different breast cancer cohorts in TCGA dataset. (A,B) The prognostic analysis in TNM stage I–II or TNM stage III–IV patients. (C,D) The prognostic analysis in patients with or without lymphatic invasion. (E,F) The prognostic analysis in T1–T2 or T3–T4 patients. (G,H) The prognostic analysis in PR-negative or PR-positive patients. (I,J) The prognostic analysis in ER-negative or ER-positive patients. (K,L) The prognostic analysis in HER2-negative or HER2-positive patients. TCGA, The Cancer Genome Atlas; T, tumor; N, node (lymph node); M, metastasis; PR, progesterone receptor; ER, estrogen receptor; HER2, human epithelial growth factor receptor 2.

Gene set enrichment analysis

To explore the enriched KEGG pathways of model genes, GSEA was performed and showed that a large number of the enriched pathways, including ether lipid metabolism, arachidonic acid metabolism, glycerophospholipid metabolism, linoleic acid metabolism, fatty acid metabolism, and alpha linolenic acid metabolism, were metabolism related (). Furthermore, we found that the top five enriched KEGG terms in the high-risk group were oocyte meiosis, cell cycle, vibrio cholerae infection, proteasome, and aminoacyl tRNA biosynthesis (). For the low-risk group, the top five enriched KEGG terms were asthma, cytokine-cytokine receptor interaction, ether lipid metabolism, glycerophospholipid metabolism, and JAK/STAT signaling pathway (). Related parameters for these KEGG terms are shown in .
Figure 5

The significantly enriched KEGG pathways in TCGA dataset by GSEA. (A) Enriched metabolism-related KEGG terms in TCGA dataset. (B,C) The top five enriched KEGG terms in high-risk and low-risk patients. KEGG, Kyoto Encyclopedia of Genes and Genomes; TCGA, The Cancer Genome Atlas; GSEA, Gene Set Enrichment Analysis; KEGG, Kyoto Encyclopedia of Genes and Genomes.

Table 3

Related parameters of the enriched KEGG pathways in

PathwaysSizeESNESPFDR-q
High risk
   KEGG_AMINOACYL_TRNA_BIOSYNTHESIS410.671.940.0100.044
   KEGG_CELL_CYCLE1240.642.180.0020.007
   KEGG_PROTEASOME460.701.880.0060.063
   KEGG_OOCYTE_MEIOSIS1120.562.170.0000.004
   KEGG_VIBRIO_CHOLERAE_INFECTION540.592.080.0000.011
Low risk
   KEGG_ASTHMA28–0.82–2.120.0000.004
   KEGG_CYTOKINE_CYTOKINE_RECEPTOR_INTERACTION264–0.58–2.120.0000.005
   KEGG_ETHER_LIPID_METABOLISM33–0.64–2.250.0000.001
   KEGG_GLYCEROPHOSPHOLIPID_METABOLISM77–0.51–2.130.0000.006
   KEGG_JAK_STAT_SIGNALING_PATHWAY155–0.57–2.190.0000.003
Metabolism-related pathways
   KEGG_ETHER_LIPID_METABOLISM33–0.64–2.250.0000.001
   KEGG_GLYCEROPHOSPHOLIPID_METABOLISM77–0.51–2.130.0000.006
   KEGG_ARACHIDONIC_ACID_METABOLISM58–0.58–2.100.0000.004
   KEGG_ALPHA_LINOLENIC_ACID_METABOLISM19–0.61–1.920.0000.022
   KEGG_FATTY_ACID_METABOLISM42–0.55–1.880.0120.027
   KEGG_LINOLEIC_ACID_METABOLISM29–0.55–1.870.0020.027

ES, enrichment score; FDR-q, false discovery rate-q value; NES, normalized enrichment score.

The significantly enriched KEGG pathways in TCGA dataset by GSEA. (A) Enriched metabolism-related KEGG terms in TCGA dataset. (B,C) The top five enriched KEGG terms in high-risk and low-risk patients. KEGG, Kyoto Encyclopedia of Genes and Genomes; TCGA, The Cancer Genome Atlas; GSEA, Gene Set Enrichment Analysis; KEGG, Kyoto Encyclopedia of Genes and Genomes. ES, enrichment score; FDR-q, false discovery rate-q value; NES, normalized enrichment score.

The expression, interaction, and correlation among genes in the metabolism-related gene risk model

A box-plot was also used to visualize the expression pattern between breast cancer tissues and normal breast tissues of these 24 genes (). In order to better understand the mutual interaction of these 24 genes in the gene risk model, GeneMANIA was used to set up a network for these genes, and the results are shown in . To further explore the correlation among these 24 genes, we carried out a correlation analysis. The results suggested that part of these metabolism-related genes showed weak-to-moderate correlation (). In these 24 genes, TYMP and IL4I1 were most positively correlated (Cor=0.63). Nevertheless, there were also some negatively correlated genes in these 24 genes, including TYMP and SUCLA2 (Cor=−0.35).
Figure 6

The expression, interaction, and correlation among genes in the metabolism-related gene risk model. (A) Expression of 24 metabolism-related genes between breast cancer tissues and normal breast tissues. (B) Interaction network analysis for these 24 metabolism-related genes. (C) Correlation analysis was used to determine the correlation among these 24 metabolism-related genes. N, normal; T, tumor.

The expression, interaction, and correlation among genes in the metabolism-related gene risk model. (A) Expression of 24 metabolism-related genes between breast cancer tissues and normal breast tissues. (B) Interaction network analysis for these 24 metabolism-related genes. (C) Correlation analysis was used to determine the correlation among these 24 metabolism-related genes. N, normal; T, tumor.

Discussion

Breast cancer is one of the most frequently diagnosed cancers in women, and its yearly incidence is still increasing (1). As both a consequence and cause of cancer formation, tumor cell­-intrinsic metabolism plays a vital role in oncogenesis (23-25). For example, cancer cells can generate lactic acids, and such acidic environments favor tumor invasion and can suppress the anticancer immune system (26-28). Some recent studies have also demonstrated the close connection between metabolism and the development of breast cancer. Lécuyer’s group found that the higher fasting plasma metabolites levels of valine, lysine, arginine, glutamine, creatine, creatinine, and glucose; and lower plasma levels of lipoproteins, lipids, glycoproteins, acetone, glycerol-derived compounds, and unsaturated lipids, were associated with the risk of developing breast cancer (12). However, this study only showed the relationship between metabolites and breast cancer, and the underlying mechanism of metabolism and the role of metabolism-related genes in breast cancer remains unknown. In the present study, a significant prognostic risk model based on 24 metabolism-related genes was developed. The results showed that patients in the high-risk group were closely associated with poor prognosis. In addition, results from the prognostic analysis in different molecular subtypes of breast cancer patients also indicated that patients in the high-risk group showed poorer survival than those in the low-risk group. Beyond prognostic value, we also found that our gene risk model remained an independent prognostic factor after adjusting for different clinicopathological features. Furthermore, the GSEA enrichment analysis also showed that many significantly enriched pathways were metabolism related. These enrichment results clarify the underlying molecular mechanisms in our risk model and provide potential applications or research trends for metabolic therapy. In summary, there is excellent prognostic potential for this 24-gene-based risk model, and combining this model with other clinical features could better predict the prognosis of breast cancer patients. Previous studies have shown that the genes in our risk model are involved in different metabolic processes. It has also been noted that some of these 24 genes are closely related to the progression of breast cancer. High expression of CEL has been reported to be an independent prognostic factor related to the poor prognosis of breast cancer (29), while AK3 has been demonstrated to mainly function in the mitochondrial matrix and has been associated with the overall survival of breast cancer patients (30). Aryal et al. reported that glutamyl prolyl tRNA synthetase (EPRS) was selectively carbonylated in tumor breast tissue compared to normal breast tissue (31). MTHFD2, a metabolic enzyme involved in mitochondrial one-carbon folate metabolism and the high expression of MTHFD2 is associated with poor survival in breast cancer patients (32). As for NME3, Flentie et al. suggested that the high expression level of NME3 increased overall survival of patients with breast cancer (33). PAICS, an enzyme for de novo purine biosynthesis and plays an essential role in breast cancer proliferation (34). Additionally, others genes in our risk model were also closely related to the progression of cancer. G6PD was shown to play an essential role in controlling pentose phosphate pathway and has been considered a biomarker and a potential therapeutic target for cancer patients (35). Although the role of NMNAT2 in breast cancer remains unclear, NMNAT2 is still considered to be a metabolic enzyme associated with the progression of colorectal cancer (36). IDO1 is an enzyme that mediates tryptophan metabolism, and some results indicate that IDO1 plays an essential role in tumor progression and immune response (37,38). PLA2G1B is a secreted phospholipase that participates in regulating the digestion of phospholipids (39). Furthermore, some studies have also shown that the low expression level of PLA2G1B could also promote pancreatic cancer (40). Hinsch’s research reported that anti-apoptotic quinolinate phosphoribosyltransferase (QPRT) could be a potential biomarker for follicular thyroid carcinoma (41). In addition, Ullmark et al. also suggested that QPRT may have antiapoptotic properties (42). RDH16 is involved in the process of retinoic acid metabolism, and the expression level of RDH16 has been significantly associated with poor survival of hepatocellular carcinoma patients (43). Meanwhile, TSTA3 was shown to participate in the biosynthesis of GDP-L-fucose, and the expression level of TSTA3 was correlated with poor prognosis for patients with esophageal squamous cell carcinoma (44). TYMP has been reported to be associated with the conversion of capecitabine to 5-FU (45). Previous study has also indicated that the expression level of TYMP was associated with capecitabine response (46). UGDH is an enzyme that can convert UDP-glucose to UDP-glucuronic acid and is a poor prognostic factor in lung adenocarcinoma patients (47). ACADM is a potential biomarker for the diagnosis and treatment of clear cell renal cell carcinoma (48). NT5E is a crucial prognostic factor in different types of cancers and is an important ectonucleotidase in the catabolism of extracellular ATP to adenosine (49). PFKL is a glycolysis rate-limiting enzyme and a potential target for hepatocellular carcinoma therapy (50). The advantage of our research is that we provided and validated a reliable metabolism-related risk model in breast cancer by using large-scale databases. Even though this 24-gene-based risk model is an independent risk model for breast cancer patients, there are still some limitations in our study. First, the present research is a retrospective study based on TCGA and GEO databases. Therefore, prospective clinical research should be performed to validate the application of this model. Second, some clinicopathological features in breast cancer patients that correlated with survival, such as weight, biopsy information, and other valuable items, should be investigated. Third, experimental or clinical studies are still needed to investigate how these genes affect the progression and metabolic microenvironment of breast cancer.

Conclusions

In summary, based on comprehensive analysis of the metabolism-related genes and clinical features in breast cancer patients, we developed a 24-gene-based metabolism-related gene risk model. By using this model, the risk score of each patient was obtained, and the prognostic value of the risk scores in breast cancer patients was also analyzed. Overall, the risk model of our study can contribute to understanding the role of metabolism in breast cancer and may provide more insightful prognostic information for breast cancer patients. The article’s supplementary files as
  50 in total

Review 1.  Coming of age: ten years of next-generation sequencing technologies.

Authors:  Sara Goodwin; John D McPherson; W Richard McCombie
Journal:  Nat Rev Genet       Date:  2016-05-17       Impact factor: 53.242

Review 2.  Lymphovascular invasion: assessment and prognostic impact in melanoma and breast cancer.

Authors:  Nicola Thompson; Sarah Storr; Siwei Zhang; Stewart Martin
Journal:  Histol Histopathol       Date:  2015-04-09       Impact factor: 2.303

3.  Nucleoside Diphosphate Kinase-3 (NME3) Enhances TLR5-Induced NFκB Activation.

Authors:  Kelly Flentie; Caleb Gonzalez; Brandon Kocher; Yue Wang; Hongtu Zhu; Jayne Marasa; David Piwnica-Worms
Journal:  Mol Cancer Res       Date:  2018-03-09       Impact factor: 5.852

4.  limma powers differential expression analyses for RNA-sequencing and microarray studies.

Authors:  Matthew E Ritchie; Belinda Phipson; Di Wu; Yifang Hu; Charity W Law; Wei Shi; Gordon K Smyth
Journal:  Nucleic Acids Res       Date:  2015-01-20       Impact factor: 16.971

5.  Predictors of early distant metastasis in women with breast cancer.

Authors:  Erick Sta Rosa Mendoza; Emmanuel Moreno; Priscilla Baetiong Caguioa
Journal:  J Cancer Res Clin Oncol       Date:  2013-01-03       Impact factor: 4.553

6.  Specific protein carbonylation in human breast cancer tissue compared to adjacent healthy epithelial tissue.

Authors:  Baikuntha Aryal; V Ashutosh Rao
Journal:  PLoS One       Date:  2018-03-29       Impact factor: 3.240

7.  Knockdown of PAICS inhibits malignant proliferation of human breast cancer cell lines.

Authors:  Minjun Meng; Yanling Chen; Jianbo Jia; Lianghui Li; Sumei Yang
Journal:  Biol Res       Date:  2018-08-10       Impact factor: 5.612

8.  Identification of a six-gene signature associated with tumor mutation burden for predicting prognosis in patients with invasive breast carcinoma.

Authors:  Feiran Wang; Chong Tang; Xuesong Gao; Junfei Xu
Journal:  Ann Transl Med       Date:  2020-04

Review 9.  More Than a Metabolic Enzyme: MTHFD2 as a Novel Target for Anticancer Therapy?

Authors:  Zhiyuan Zhu; Gilberto Ka Kit Leung
Journal:  Front Oncol       Date:  2020-04-28       Impact factor: 6.244

10.  Development of prognostic index based on autophagy-related genes analysis in breast cancer.

Authors:  Qing-Guang Lin; Wei Liu; Yu-Zhen Mo; Jing Han; Zhi-Xing Guo; Wei Zheng; Jian-Wei Wang; Xue-Bin Zou; An-Hua Li; Feng Han
Journal:  Aging (Albany NY)       Date:  2020-01-22       Impact factor: 5.682

View more
  2 in total

1.  Increased Expression of QPRT in Breast Cancer Infers a Poor Prognosis and Is Correlated to Immunocytes Infiltration.

Authors:  Qun Zhang; Xinwang Ding; Hongzhao Lu
Journal:  J Healthc Eng       Date:  2022-03-19       Impact factor: 2.682

2.  Signature based on metabolic-related gene pairs can predict overall survival of osteosarcoma patients.

Authors:  Long-Qing Li; Liang-Hao Zhang; Yao-Bo Yuan; Xin-Chang Lu; Yi Zhang; Yong-Kui Liu; Jia Wen; Manhas Abdul Khader; Tao Liu; Jia-Zhen Li; Yan Zhang
Journal:  Cancer Med       Date:  2021-05-28       Impact factor: 4.452

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.