Literature DB >> 31198978

Screening of differentially expressed genes and identification of NUF2 as a prognostic marker in breast cancer.

Wenjie Xu1, Yizhen Wang1, Yanan Wang1, Shanmei Lv1, Xiuping Xu1, Xuejun Dong1.   

Abstract

The aims of the present study were to screen differentially expressed genes (DEGs) in breast cancer (BC) and investigate NDC80 kinetochore complex component (NUF2) as a prognostic marker of BC in detail. A total of four BC microarray datasets, downloaded from the Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA) databases, were used to screen DEGs. A total of 190 DEGs with the same expression trends were identified in the 4 datasets, including 65 upregulated and 125 downregulated DEGs. Functional and pathway enrichment analyses were performed using the Database for Annotation, Visualization and Integrated Discovery. The upregulated DEGs were enriched for 10 Gene Ontology (GO) terms and 7 pathways, and the downregulated DEGs were enriched for 10 GO terms and 10 pathways. A protein‑protein interaction network containing 149 nodes and 930 edges was constructed using the Search Tool for the Retrieval of Interacting Genes, and 2 functional modules were identified using the MCODE plugin of Cytoscape. Based on an in‑depth analysis of module 1 and literature mining, NUF2 was selected for further research. Oncomine database analysis and reverse transcription‑quantitative PCR showed that NUF2 is significantly upregulated in BC tissues. In analyses of correlations between NUF2 and clinical pathological characteristics, NUF2 was significantly associated with the malignant features of BC. Using 5 additional datasets from GEO, it was demonstrated that NUF2 has a significant prognostic role in both ER‑positive and ER‑negative BC. A Gene Set Enrichment Analysis indicated that NUF2 may regulate breast carcinogenesis and progression via cell cycle‑related pathways. The results of the present study demonstrated that NUF2 is overexpressed in BC and is significantly associated with its multiple pathological features and prognosis.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31198978      PMCID: PMC6605639          DOI: 10.3892/ijmm.2019.4239

Source DB:  PubMed          Journal:  Int J Mol Med        ISSN: 1107-3756            Impact factor:   4.101


Introduction

Breast cancer (BC) is the most common female malignancy and the second leading cause of mortality in women worldwide (1). According to the World Health Organization in 2012, one-third of Asian women develop BC (2,3). Currently, BC treatment includes partial excision with or without radiotherapy and systemic therapies such as endocrine therapy, chemotherapy, molecular targeted therapy, and a combination of them (4). Although advanced therapeutic techniques based on surgery have considerably improved the survival of patients with BC and the five-year survival has increased from 75% in 1976 to 91% in 2017, high rates of metastasis and recurrence remain (1,5,6). Recently, molecular targeted therapy has been shown to play an important role in individualized treatment of BC. For instance, a monoclonal antibody against HER2, trastuzumab, has been demonstrated to improve survival of patients with BC; however, the prognosis remains poor (7,8). Therefore, to improve BC prognosis, effective therapeutic targets and prognostic biomarkers are needed. NDC80 kinetochore complex component (NUF2), also known as CDCA1, is a centromere-related protein (9). It regulates the binding of centromeres to spindle microtubules, participates in cell cycle regulation and has important roles in cell proliferation and apoptosis (10). NUF2 is overexpressed in a number of cancers, including lung cancer, cholangiocarcinoma, renal cell carcinoma and bladder cancer (11). Although the expression and prognostic significance of NUF2 in BC have been suggested (12,13), its precise role and underlying molecular mechanisms of action remain to be investigated. In the present study, 4 mRNA microarray datasets were analyzed from the Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA) databases to identify differentially expressed genes (DEGs) between BC tissues and normal breast tissues. The bioinformatics analysis and literature mining suggested that NUF2 is a key gene in the progression of BC. The expression of NUF2 in BC samples and its correlation with clinical pathological characteristics were then analyzed. In addition, the prognostic value of NUF2 was analyzed using individual and pooled methods. In a gene set enrichment analysis (GSEA), it was demonstrated that NUF2 might be involved in cell-cycle related pathways. The results of the present study suggest that NUF2 is a prognostic indicator of BC.

Materials and methods

Microarray data

GSE42568 (14), GSE45827 (15), GSE65194 (16) and TCGA BC microarray datasets, downloaded from GEO (17) and TCGA (18), were used to screen DEGs in BC. The TCGA dataset was used for analyses of clinical pathological characteristics associated with NUF2 in patients with BC. The following 5 additional BC microarray datasets were selected for prognostic analyses: GSE1456 (19), GSE22220 (20), NKI (21), GSE4299 (22) and GSE20685 (23). To normalize mRNA levels, patients for each dataset were reclassified into four subsets (X1, X2, X3 and X4) based on the quartile for expression values. The datasets were then reclassified into a new dataset for a pooled analysis.

DEG identification

BC-related microarray data downloaded from the GEO and TCGA databases were processed using R software (version 3.4.3; https://cran.r-project.org/). DEGs between BC tissues and normal breast tissues were identified using the limma package in R. Fold-change (FC) values were calculated and the DEGs were further selected based on the following cutoff criteria: P<0.01 and log |FC|>2. Overlapping DEGs among the four datasets were identified using Funrich (version 3.1.3; http://www.funrich.org).

Functional and pathway enrichment analyses of DEGs

Gene Ontology (GO) is used to identify enriched functions of genes in three independent categories: Biological process (BP), molecular function (MF) and cellular component (CC) (24). Kyoto Encyclopedia of Genes and Genomes (KEGG) was used to identify relevant pathways for the genes (25). GO BP and KEGG signaling pathway analyses of the DEGs were performed using the Database for Annotation Visualization and Integrated Discovery (DAVID) online tool (https://david.ncifcrf.gov/) (26) with P<0.05 as the threshold for significance.

Protein-protein interaction (PPI) network analysis

The Search Tool for the Retrieval of Interacting Genes (STRING; https://string-db.org/) was used to develop a PPI network. Using the STRING database, DEGs with a combined score ≥0.4 were chosen to construct the network, which was visualized using Cytoscape (version 3.6.1) (27). Molecular Complex Detection (MCODE), a plugin for Cytoscape, was used to construct functional modules in the PPI network.

Gene set enrichment analysis (GSEA)

A GSEA was conducted based on protocols obtained from the website (http://software.broadinstitute.org/gsea/index.jsp) and a previous study (28). GSEA (version 3.0) was run for the KEGG gene sets (c2.cp.kegg.v.6.0.symbols.gmt). The number of permutations was set to 1,000 and the phenotype labels were NUF2-high and NUF2-low. FDR <0.25 and NOM P<0.05 indicated statistical significance.

Oncomine analysis

Oncomine (https://www.oncomine.org/) is an online cancer microarray database, aiming to facilitate the discovery of novel biomarkers from genome-wide expression analyses. In the present study, the mRNA expression differences of NUF2 between BC and normal breast tissues were explored using the Oncomine database.

Patients and samples

BC and matched adjacent tissues were collected from the Pathology Department of Shaoxing People's Hospital (Shaoxing, China). Samples were obtained from 42 patients at initial diagnosis and were immediately frozen in liquid nitrogen. The present study was authorized by the Hospital Ethics Committee and informed consent was obtained from all patients.

Reverse transcription (RT)-quantitative (q)PCR

Total RNA was isolated from the BC and matched adjacent tissues using TRIzol (Invitrogen; Thermo Fisher Scientific, Inc., Waltham, MA, USA). The Nanodrop 2000 (Thermo Fisher Scientific, Inc.) was used to detect the purity and concentration of the total RNA. According to the manufacturer's protocol, RT-qPCR was performed using the LightCycler® 480 PCR apparatus (Roche Diagnostics, Basel, Switzerland) and the One Step SYBR® PrimeScript™ RT-PCR kit II (Takara Bio, Inc., Otsu, Japan). Amplification was performed under the following conditions: 42°C for 5 min, 95°C for 10 sec; 40 cycles of 95°C for 5 sec and 60°C for 20 sec; and 65°C for 15 sec. The primers used were as follows: NUF2 forward primer 5′-TACCATTCAGCAATTTAGTTACT-3′ and reverse primer 5′-TAGAATATCAGCAGTCTCAAAG-3′; and β-actin forward primer 5′-CATGTACGTTGCTATCCAGGC-3′ and reverse primer 5′-CTCCTTAATGTCACGCACGAT-3′. The relative levels of NUF2 expression were evaluated by the 2−ΔΔCq (29) method using β-actin as the control.

Statistical analyses

All statistical analyses were performed using SPSS 20.0 (IBM Corps., Armonk, NY, USA). An independent t-test was used for analyzing the continuous data. The χ2 test and χ2 test with continuity correction were performed to analyze the association of NUF2 with clinical pathological characteristics. Bonferroni's post hoc test was used to analyze the clinical pathological characteristics between more than 2 groups. Survival curves were generated by the Kaplan-Meier method and significance was determined using the log-rank test. Bonferroni's post hoc test was used for pairwise comparisons. Multivariable survival analysis was performed using the Cox proportional hazards regression model and significance was determined using the likelihood ratio test. P<0.05 was considered to indicate statistically significant differences, while for Bonferroni's test, P<0.05/N was considered to indicate statistically significant differences, where N=the number of pairwise comparisons.

Results

Identification of DEGs in BC

DEGs between the BC and normal breast tissues were screened using the GEO and TCGA databases. As shown in Fig. 1A, 1,702, 461, 600 and 337 DEGs were upregulated in the GSE45827, GSE42568, GSE65194, and TCGA datasets, and 613, 715, 264, and 872 DEGs were downregulated, respectively (Fig. 1B). In total, 190 DEGs exhibited the same expression trends in all datasets, including 65 upregulated and 125 downregulated genes.
Figure 1

Identification of DEGs in breast cancer microarray datasets. In total, (A) 65 DEGs were upregulated and (B) 125 DEGs were downregulated in the intersection of the GSE45827, GSE42568, GSE65194, and TCGA datasets. DEGs, differentially expressed genes; TCGA, the Cancer Genome Atlas.

Functional and pathway enrichment for the DEGs

GO BP and KEGG signaling pathway analyses of the DEGs were performed using DAVID. The upregulated DEGs were mainly enriched for the BP terms cell division, mitotic nuclear division and G2/M transition of mitotic cell cycle (Fig. 2A), while downregulated DEGs were significantly associated with lipid metabolic process, cholesterol homeostasis, and glucose metabolic process (Fig. 2B). Additionally, seven KEGG pathways were identified for the upregulated genes, including the p53 signaling pathway, cell cycle and extracellular matrix (ECM)-receptor interaction (Fig. 2C). The peroxisome proliferator-activated receptor (PPAR) signaling pathway, AMP-activated protein kinase (AMPK) signaling pathway and proximal tubule bicarbonate reclamation were associated with the downregulated DEGs (Fig. 2D). The detailed results are presented in Table I.
Figure 2

GO BP analysis and KEGG signaling pathway analysis of DEGs in breast cancer. GO BP analysis of (A) upregulated and (B) downregulated DEGs. KEGG signaling pathway enrichment analysis of (C) upregulated and (D) downregulated DEGs. DEGs, differentially expressed genes; GO, Gene Ontology; KEGG, Kyoto encyclopedia of genes and genomes; ECM, extracellular matrix.

Table I

Significantly enriched GO biological process terms and KEGG pathways.

A, Upregulated
TermsDescriptionNumber of genesP-value
GO Terms
 GO:0051301Cell division181.02×10−14
 GO:0007067Mitotic nuclear division161.90×10−14
 GO:0000086G2/M transition of mitotic cell cycle94.82×10−08
 GO:0031145Anaphase-promoting complex-dependent catabolic process75.40×10−07
 GO:0042787Protein ubiquitination involved in Ubiquitin-dependent protein catabolic process81.83×10−06
 GO:0007062Sister chromatid cohesion72.58×10−06
 GO:0030574Collagen catabolic process64.40×10−06
 GO:0008283Cell proliferation97.20×10−05
 GO:0051439Regulation of ubiquitin-protein ligase activity involved in mitotic cell cycle48.86×10−05
 GO:0035987Endodermal cell differentiation41.45×10−04
KEGG pathways
 hsa04110Cell cycle81.17×10−06
 hsa04512ECM-receptor interaction72.33×10−06
 hsa04115p53 signaling pathway50.000237
 hsa04114Oocyte meiosis50.001499
 hsa04510Focal adhesion50.014347
 hsa04151PI3K-Akt signaling pathway60.019962
 hsa05200Pathways in cancer60.032832

B, Downregulated
TermsDescriptionNumber of genesP-value

GO Terms
 GO:0006629Lipid metabolic process91.32×10−05
 GO:0019915Lipid storage52.06×10−05
 GO:0048662Negative regulation of smooth muscle cell proliferation54.49×10−05
 GO:0042632Cholesterol homeostasis67.95×10−05
 GO:0008217Regulation of blood pressure68.56×10−05
 GO:0006006Glucose metabolic process69.90×10−05
 GO:0019433Triglyceride catabolic process46.61×10−04
 GO:0042593Glucose homeostasis66.74×10−04
 GO:0019432Triglyceride biosynthetic process47.44×10−04
 GO:0001523Retinoid metabolic process58.33×10−04
KEGG pathways
 hsa03320PPAR signaling pathway91.14×10−07
 hsa04152AMPK signaling pathway77.34×10−04
 hsa04964Proximal tubule bicarbonate reclamation40.001072
 hsa04923Regulation of lipolysis in adipocytes50.001523
 hsa00982Drug metabolism-cytochrome P45050.003116
 hsa04920Adipocytokine signaling pathway50.003462
 hsa00350Tyrosine metabolism40.00367
 hsa00561Glycerolipid metabolism40.014956
 hsa00980Metabolism of xenobiotics by cytochrome P45040.028412
 hsa05205Proteoglycans in cancer60.033006

KEGG, Kyoto Encyclopedia of genes and genomes; GO, Gene Ontology.

PPI network analysis and the selection of NUF2

Protein interactions often play important roles in cancer progression. A PPI network analysis was performed using the STRING database and Cytoscape. The PPI network was constructed using 149 DEGs (57 upregulated and 92 downregulated DEGs) with combined scores ≥0.4, and contained 149 nodes and 930 edges (Fig. 3A). A total of two functional modules were identified using the MCODE plugin. Module 1 consisted of 35 nodes and 573 edges including NUF2, TOP2A, ASPM, and CCNB1 (Fig. 3B). Module 2 included 21 nodes and 104 edges including COL1A1, MMP1, MMP9, and LPL (Fig. 3C). Based on the degree of importance, module 1 was chosen for further analysis.
Figure 3

PPI network analysis of DEGs. (A) PPI network containing 149 nodes and 930 edges. (B) Module 1 consisted of 35 nodes and 573 edges, as identified by the MCODE plugin in Cytoscape. (C) Module 2 consisted of 21 nodes and 104 edges, as identified by the MCODE plugin in Cytoscape. Blue nodes represent downregulated genes in BC tissues; red nodes represent upregulated genes in BC tissues. PPI, protein-protein interaction; DEG, differentially expressed genes; BC, breast cancer; MCODE, Molecular Complex Detection.

The 35 genes in module 1 were ranked based on log |FC| values in the TCGA database and selected the top 10 hub genes for further analysis. The expression levels of the 10 hub genes in the BC tissues were >10-fold (log |FC|≥3.42) increased compared with those in the normal breast tissues. Through literature mining, it was identified that UBE2C, ASPM, BIRC5, TOP2A, KIF20A, CEP55, TPX2, NEK2 and ANLN, but not NUF2, have been reported extensively in BC-related studies. Therefore, NUF2 was selected as the focus of subsequent analyses.

NUF2 expression in BC

The expression of NUF2 mRNA in the BC tissues was evaluated using Oncomine (https://www.oncomine.org/) (30). The results indicated that the NUF2 expression level is significantly increased in the BC tissues compared with in the normal breast tissues (P<0.01; Fig. 4A-C). To further verify these results, 42 pairs of BC tissues and adjacent tissues were analyzed by RT-qPCR. Consistent with the results of the database analysis, the expression of NUF2 mRNA in the BC tissues was significantly increased (P<0.001) compared with in the adjacent tissues (Fig. 4D).
Figure 4

Expression of NUF2 in BC. (A) NUF2 mRNA was overexpressed in human BC tissues compared with normal breast tissues in the TCGA, (B) Curtis and (C) Richardson microarray databases, as evaluated by Oncomine analysis. (D) NUF2 mRNA expression was increased in most BC lesions compared with para-carcinoma tissues, as determined by reverse transcription-quantitative PCR. ***P<0.001. BC, breast cancer; NUF2, NDC80 kinetochore complex component.

Association of NUF2 with clinical pathological characteristics and survival of patients with BC

To further validate the clinical value of NUF2, the association between its expression and the clinical pathological characteristics of the 42 patients with BC recruited from Shaoxing People's Hospital were assessed. The expression of NUF2 was only significantly associated with age (P<0.05). Using the data in the TCGA database (contains data on 1,090 patients with BC), NUF2 expression was found to be significantly associated with age (P<0.001), estrogen receptor (ER) status (P<0.001), progesterone receptor (PR) status (P<0.001), histological type (P<0.001), TNM stage (P<0.05), and molecular subtype (P<0.001). The results are shown in Tables II and III. Furthermore, the clinical pathological characteristics that have multiple groups (>2) need a post hoc test to determine exactly what groups exhibit a difference. Therefore, Bonferroni's post hoc test was used for pairwise comparison. The results showed that NUF2 expression is statistically different between TNM stage 1 and 2 (P<0.01), and tumor stage T1 and T2 (P<0.001). In terms of molecular subtype, NUF2 expression was also significantly different in all pairwise comparisons (P<0.001) except between luminal A and normal-like, and luminal B and basal-like. The detailed results are shown in Tables IV and V.
Table II

Association of NUF2 with clinical pathological characteristics of breast cancer patients from Shaoxing People's Hospital.

Pathological characteristicsNumber of patients (%)NUF2 (%)
P-valuea
LowHigh
Age0.031
 <6021 (50.0)14 (66.7)7 (33.3)
 ≥6021 (50.0)7 (33.3)14 (66.7)
HER2 status0.05
 Positive14 (33.3)10 (71.4)4 (29.6)
 Negative28 (66.7)11 (39.3)17 (60.7)
ER status0.679b
 Positive35 (83.3)17 (48.6)18 (51.4)
 Negative7 (16.7)4 (57.1)3 (42.9)
PR status0.107
 Positive27 (64.3)11 (40.7)16 (59.3)
 Negative15 (35.7)10 (66.7)5 (33.3)
TNM stage0.751
 112 (28.6)5 (41.7)7 (58.3)
 225 (59.5)13 (52.0)12 (48.0)
 35 (11.9)3 (60.0)2 (40.0)
Tumor stage0.333
 T119 (45.2)8 (42.1)11 (57.9)
 T222 (52.4)13 (59.1)9 (40.9)
 T31 (2.4)0 (0.0)1 (100.0)
Lymph node stage0.946
 N027 (64.3)13 (48.1)14 (51.9)
 N110 (23.8)5 (50.0)5 (50.0)
 N23 (7.1)2 (66.7)1 (33.3)
 N32 (4.3)1 (50.0)1 (50.0)
Node metastasis0.533
 Yes18 (42.9)10 (55.6)8 (44.4)
 No24 (57.1)11 (45.8)13 (54.2)

Unless otherwise noted, χ2 tests were used for comparisons between groups.

χ2 test with continuity correction was used. ER, estrogen receptor; PR, progesterone receptor; TNM, tumor node metastasis; NUF2, NDC80 kinetochore complex component.

Table III

Association of NUF2 with clinical pathological characteristics of breast cancer patients derived from TCGA database.

Pathological characteristicsNumber of patients (%)NUF2 (%)
P-valuea
LowHigh
Age<0.001
 <60579 (53.2)255 (44.0)324 (66.0)
 ≥60510 (46.8)289 (56.7)221 (43.3)
HER2 status0.296
 Positive90 (21.4)39 (43.3)51 (56.7)
 Negative331 (78.6)164 (49.5)167 (50.5)
ER status<0.001
 Positive803 (77.2)467 (58.2)336 (41.8)
 Negative237 (22.8)60 (25.3)177 (74.7)
PR status<0.001
 Positive694 (66.9)415 (60.0)279 (40.0)
 Negative343 (33.1)110 (32.1)233 (67.9)
Histology type<0.001
 IDC779 (79.3)326 (41.8)453 (58.2)
 ILC203 (20.7)156 (76.8)47 (23.2)
TNM stage0.032
 1181 (17.0)108 (59.7)73 (40.3)
 2619 (58.0)293 (47.3)326 (52.7)
 3247 (23.1)119 (48.2)128 (51.8)
 420 (1.9)10 (50.0)10 (50.0)
Tumor stage<0.001
 T1279 (25.7)171 (61.3)108 (38.7)
 T2631 (58.0)283 (44.8)348 (55.2)
 T3137 (12.6)71 (51.8)66 (48.2)
 T440 (3.7)19 (47.5)21 (52.5)
Lymph node stage0.095
 N0514 (48.0)255 (49.6)259 (50.4)
 N1360 (33.6)185 (51.4)175 (48.6)
 N2120 (11.2)48 (40.0)72 (60.0)
 N376 (7.2)43 (56.6)33 (43.4)
Metastasis stage0.82
 M0906 (97.6)434 (47.9)472 (52.1)
 M122 (2.4)10 (45.5)12 (54.5)
Molecular subtype<0.001
 Luminal A419 (0.5)305 (72.8)114 (27.2)
 Luminal B190 (0.23)26 (13.7)164 (86.3)
 HER2+67 (0.08)26 (38.8)41 (61.2)
 Basal-like139 (0.16)14 (10.1)125 (89.9)
 Normal-like23 (0.03)21 (91.3)2 (8.7)

χ2 tests were used for comparisons between groups. TNM, tumor node metastasis; ER, estrogen receptor; PR, progesterone receptor; IDC, infiltrating ductal carcinoma; ILC, infiltrating lobular carcinoma; NUF2, NDC80 kinetochore complex component.

Table IV

Comparison of clinical pathological characteristics of breast cancer patients from Shaoxing People's Hospital among multiple groups.

Pathological characteristicsPairwise comparisons (P-values)
TNM stage123
 1N/A0.5560.620
 20.556N/A1.000
 30.6201.000N/A
Tumor stageT1T2T3
 T1N/A0.2781.000
 T20.278N/A0.435
 T31.0000.435N/A
Lymph node stageN0N1N2N3
 N0N/A1.0001.0001.000
 N11.000N/A1.0001.000
 N21.0001.000N/A1.000
 N31.0001.0001.000N/A

P<0.05/N was considered statistically significant, where N was the number of pairwise comparisons. TNM, tumor node metastasis.

Table V

Comparison of clinical pathological characteristics of breast cancer patients in The Cancer Genome Atlas database among multiple groups.

Pathological characteristicsPairwise comparisons (P-values)
TNM stage1234
 1N/A0.004a0.0190.405
 20.004aN/A0.8220.814
 30.0190.822N/A0.875
 40.4050.8140.875N/A
Tumor stageT1T2T3T4
 T1N/A0.000a0.0660.097
 T20.000aN/A0.1380.744
 T30.0660.138N/A0.630
 T40.0970.7440.630N/A
Lymph node stageN0N1N2N3
 N0N/A0.6050.0580.257
 N10.605N/A0.0310.410
 N20.0580.031N/A0.023
 N30.2570.4100.023N/A
Molecular subtypeLuminal ALuminal BHER2+Basal-likeNormal-like
 Luminal AN/A0.000a0.000a0.000a0.049
 Luminal B0.000aN/A0.000a0.3220.000a
 HER2+0.000a0.000aN/A0.000a0.000a
 Basal-like0.000a0.3220.000aN/A0.000a
 Normal-like0.0490.000a0.000a0.000aN/A

P<0.05/N was considered statistically significant, where N was the number of pairwise comparisons.

P<0.05/N. TNM, tumor, node and metastasis.

Furthermore, to elucidate the correlation between the expression of NUF2 and patient survival, 5 GEO datasets were used. Samples from each dataset were reclassified into four subsets (X1, X2, X3 and X4) according to the quartile of NUF2 expression. The X1 subset was set with the lowest expression as the reference to calculate the hazard ratio (HR). Each dataset was analyzed by Kaplan-Meier analysis and Cox proportional hazard analysis. High NUF2 expression was associated with shorter overall survival (OS) and progression-free survival (PFS) compared with low NUF2 expression in the GSE1456 dataset (Fig. 5A and B) and NKI dataset (Fig. 5C and D). Similar results were obtained in the GEO pooled analysis, as shown in Fig. 5E and F. Based on a further GEO pooled analysis, it was demonstrated that NUF2 expression levels are significantly associated with poor OS and PFS in both ER-positive (P<0.01; Fig. 6A and B) and ER-negative (P<0.01; Fig. 6C and D) BC, and the association is more obvious in ER-positive BC. The results of Cox proportional hazards analysis are shown in Table VII. In addition, the results of the Bonferroni's post hoc tests used to compare the Kaplan-Meier survival curves corresponding to >2 groups are shown in Table VI. These results indicated that NUF2 might be a prognostic factor for BC.
Figure 5

Role of NUF2 in the prognosis of BC patients. (A) NUF2 expression was significantly associated with OS of BC patients in the GSE1456 dataset, (C) NKI dataset, and (E) GEO pooled dataset. NUF2 expression was significantly associated with PFS of BC patients in the (B) GSE1456 dataset, (D) NKI dataset and (F) GEO pooled dataset. GEO, gene expression omnibus; BC, breast cancer; OS, overall survival; PFS, progression free survival; ER, estrogen receptor; PR, progesterone receptor; NUF2, NDC80 kinetochore complex component.

Figure 6

Prognostic value of NUF2 in ER-positive and ER-negative BC. Association of NUF2 expression with (A) OS and (B) PFS in ER-positive BC in the GEO pooled dataset. Association of NUF2 expression with (C) OS and (D) PFS in ER-negative BC in the GEO pooled dataset. GEO, gene expression omnibus; BC, breast cancer; OS, overall survival; PFS, progression free survival; ER, estrogen receptor; PR, progesterone receptor; NUF2, NDC80 kinetochore complex component.

Table VII

Univariate and multivariate analyses of NUF2 and survival in GEO datasets.

DatasetOverall survival
Progression-free survival
HR (95% CI)Adjusted HR (95% CI)HR (95% CI)Adjusted HR (95% CI)
GSE1456
 X1ReferenceReferenceReferenceReference
 X23.94 (0.82-18.95)3.66 (0.75-17.75)2.19 (0.55-8.74)1.99 (0.49-8.05)
 X36.18 (1.37-27.90)a4.6 (0.91-23.28)5.7 (1.64-19.83)b3.94 (1.00-15.45)a
 X46.69 (1.50-29.91)a5.11 (1.03-25.24)a5.21 (1.48-18.29)b3.76 (0.97-14.57)
GSE22220
 X1N/AN/AReferenceReference
 X2N/AN/A2.05 (0.95-4.43)1.69 (0.77-3.69)
 X3N/AN/A2.31 (1.07-4.96)a1.79 (0.82-3.92)
 X4N/AN/A3.34 (1.60-6.95)b2.30 (1.05-5.02)a
NKI dataset
 X1ReferenceReferenceReferenceReference
 X22.39 (1.04-5.51)a2.01 (0.87-4.68)1.35 (0.72-2.54)1.29 (0.68-2.46)
 X33.97 (1.80-8.77)b2.88 (1.28-6.48)a2.39 (1.33-4.30)b1.97 (1.07-3.61)a
 X44.48 (2.03-9.89)c2.63 (1.13-6.12)a2.31 (1.27-4.21)b1.84 (0.96-3.52)
GSE4299
 X1N/AN/AReferenceReference
 X2N/AN/A1.95 (0.99-3.85)1.93 (0.97-3.81)
 X3N/AN/A2.16 (1.09-4.27)a2.13 (1.07-4.24)a
 X4N/AN/A2.11 (1.07-4.17)a2.09 (1.04-4.19)a
GSE20685
 X1ReferenceReferenceReferenceReference
 X21.11 (0.57-2.14)1.1 (0.57-2.14)1.17 (0.67-2.05)1.16 (0.66-2.03)
 X31.58 (0.85-2.95)1.58 (0.85-2.95)1.12 (0.64-1.98)1.13 (0.64-2.00)
 X41.59 (0.85-2.96)1.58 (0.84-2.97)1.46 (0.84-2.52)1.42 (0.82-2.48)
GEO pooled
 X1ReferenceReferenceReferenceReference
 X22.69 (1.29-5.60)b2.18 (1.04-4.57)a1.67 (1.05-2.65)a1.5 (0.95-2.39)
 X34.43 (2.20-8.91)c3 (1.46-6.15)b2.72 (1.77-4.19)c2.13 (1.36-3.32)b
 X44.94 (2.46-9.92)c2.87 (1.38-5.94)b2.98 (1.94-4.59)c2.1 (1.33-3.32)b
GEO pooled ER(+)
 X1ReferenceReferenceReferenceReference
 X23.84 (1.41-10.48)b3.23 (1.18-8.88)a1.57 (0.79-3.13)1.39 (0.70-2.79)
 X35.48 (2.08-14.49)b3.93 (1.46-10.61)b3.03 (1.62-5.69)b2.32 (1.22-4.44)a
 X45.60 (2.06-15.23)b3.74 (1.34-10.46)a2.83 (1.45-5.52)b2.09 (1.04-4.16)a
GEO pooled ER(−)
 X1ReferenceReferenceReferenceReference
 X21.40 (0.46-4.20)1.19 (0.39-3.61)1.28 (0.46-3.62)1.09 (0.38-3.11)
 X32.96 (1.07-8.20)a1.88 (0.65-5.45)2.60 (1.01-6.72)a1.58 (0.58-4.34)
 X43.23 (1.22-8.54)a1.96 (0.70-5.52)2.44 (0.97-6.10)1.46 (0.54-3.91)

For multivariate analysis, HR was adjusted by ER status and Elston grade in GSE1456. In GSE22220, HR was adjusted by age and Elston grade. In NKI and GSE4299, HR was adjusted by age, Elston grade, and ER status. For GSE20685, HR was adjusted by age. HR was adjusted by ER status and Elston grade in the pooled analysis. For GEO pooled ER(+) and GEO pooled ER(−), HR was adjusted by Elston grade.

P<0.05,

P<0.01 and

P<0.001 vs. the X1 group. HR, hazard ratio; ER, estrogen receptor; GEO, gene expression omnibus; CI, confidence interval.

Table VI

Comparison of Kaplan-Meier curves among multiple groups.

DatasetsPairwise comparisons (P-values)
GSE1456 OSSubsetsX1X2X3X4
X1N/A0.0880.008a0.003a
X20.088N/A0.3560.253
X30.008a0.356N/A0.851
X40.003a0.2530.851N/A
GSE1456 PFSSubsetsX1X2X3X4
X1N/A0.2590.002a0.004a
X20.259N/A0.0440.068
X30.002a0.044N/A0.800
X40.004a0.0680.800N/A
NKI OSSubsetsX1X2X3X4
X1N/A0.0360.000a0.000a
X20.036N/A0.1130.041
X30.000a0.113N/A0.663
X40.000a0.0410.663N/A
NKI PFSSubsetsX1X2X3X4
X1N/A0.3610.002a0.003a
X20.361N/A0.0340.057
X30.002a0.034N/A0.840
X40.003a0.0570.840N/A
GEO pooled OSSubsetsX1X2X3X4
X1N/A0.007a0.000a0.000a
X20.007aN/A0.0630.017
X30.000a0.063N/A0.630
X40.000a0.0170.630N/A
GEO pooled PFSSubsetsX1X2X3X4
X1N/A0.0270.000a0.000a
X20.027N/A0.0100.002a
X30.000a0.010N/A0.639
X40.000a0.002a0.639N/A
GEO pooled ER(+) OSSubsetsX1X2X3X4
X1N/A0.006a0.000a0.000a
X20.006aN/A0.3070.314
X30.000a0.307N/A0.925
X40.000a0.3140.925N/A
GEO pooled ER(+) PFSSubsetsX1X2X3X4
X1N/A0.1970.000a0.001a
X20.197N/A0.0220.065
X30.000a0.022N/A0.730
X40.001a0.0650.730N/A
GEO pooled ER(−) OSSubsetsX1X2X3X4
X1N/A0.4950.0310.016
X20.495N/A0.0790.028
X30.0310.079N/A0.762
X40.0160.0280.762N/A
GEO pooled ER(−) PFSSubsetsX1X2X3X4
X1N/A0.6790.0290.055
X20.679N/A0.0950.106
X30.0290.095N/A0.834
X40.0550.1060.834N/A

P<0.05/N was considered statistically significant, where N was the number of pairwise comparisons.

P<0.05/N. OS, overall survival; PFS, progression-free survival; ER, estrogen receptor.

Signaling pathways associated with NUF2

Single-gene differential expression analyses in the study of biological processes are limited (28). To effectively reveal the biological significance of microarray datasets, a GSEA was performed to predict gene sets and signaling pathways associated with NUF2 using the data obtained from the TCGA database. As shown in Fig. 7A-C, NUF2 may function in cell cycle-related pathways, including the cell cycle, DNA replication, and p53 signaling pathway.
Figure 7

Signaling pathways associated with NDC80 kinetochore complex component predicted by gene set enrichment analysis. (A) Cell cycle (P=0.000; FDR=0.000; ES=0.72). (B) DNA replication (P<0.001; FDR<0.001; ES=0.84). (C) p53 signaling pathway (P=0.000; FDR=0.004; ES=0.51). FDR, false discovery rate; KEGG, Kyoto Encyclopedia of genes and genomes; ES, enrichment score.

Discussion

BC is the most common malignant tumor in women, accounting for 25% of female tumors (3). Despite advanced treatment techniques, BC remains the leading cause of death among women (31). Therefore, there is a pressing need for more effective molecular biomarkers to prevent, diagnose, and treat BC. With the development of microarray technology, hundreds of molecules have been found to be abnormally expressed during breast carcinogenesis and progression. The TCGA and GEO databases provide a large number of publicly available microarray datasets for biomarker identification. In this study, 190 DEGs with the same expression trends were identified in four datasets, and a GO BP analysis showed that the upregulated DEGs were mainly enriched in the biological processes of cell division, mitotic nuclear division, and G2/M transition of mitotic cell cycle, while the downregulated DEGs were related to lipid metabolic process, cholesterol homeostasis, and glucose metabolic process. Cell division and cell cycle are the basic processes in cell proliferation, and their abnormalities contribute to carcinogenesis and tumor progression (32). Furthermore, the activation of key regulators of lipid and cholesterol metabolism drives the estrogen-independent growth of invasive lobular breast carcinoma cells (33). A KEGG signaling pathway analysis of the DEGs in this study revealed the importance of the cell cycle, ECM-receptor interaction, PPAR signaling pathway, and AMPK signaling pathway in BC. Previous studies have reported that ECM could regulate tissue homeostasis, and its dysregulation could promote tumor progression by affecting adhesion, migration, differentiation, proliferation, and apoptosis of tumor cells (34,35). In addition, an increase in the rigidity of the ECM due to the local accumulation of crosslinked collagen matrix is associated with cancer progression (36). Yao et al (37) found that the PPAR signaling pathway is involved in breast carcinogenesis. Song et al (38) suggested that activation of the AMPK signaling pathway may be beneficial for the promotion of tumor necrosis factor-related apoptosis-inducing ligand-mediated anti-BC treatment. a PPI network was constructed in this study and two functional modules were identified. According to the MCODE scores, which represent importance, module 1 was found to play a major role in the PPI network. By combining the log |FC| values of the DEGs in the TCGA database and literature mining, NUF2 was selected for further research as a key gene in BC. Although Xiang et al (12) found that NUF2 is upregulated in BC, using cDNA microarray data of BC patients, further experiments have not been performed to verify this finding. In this study, by Oncomine analysis and RT-qPCR assay, it was verified that NUF2 is overexpressed in BC tissues, further confirming the results obtained by data mining. Shiraishi et al (39) found that NUF2 expression is significantly associated with prostate cancer recurrence, and patients with high NUF2 expression have significantly shorter survival times without biochemical recurrence. Hu et al (10) showed that the overexpression of NUF2 could be related to poor prognosis in pancreatic cancer. Zhang et al (13) found that NUF2 expression has prognostic values for BC patients, by simply using the BC-GenExMiner online analysis tool. However, further analysis has not been conducted. To this end, the present study analyzed the precise roles and underlying molecular mechanisms of NUF2 in BC. By stratified analysis and pooled analysis of five GEO datasets, it was found that patients with BC and high NUF2 expression had worse prognosis than patients with low NUF2 expression in both ER-positive and ER-negative BC. Using clinical data for 42 patients, it was demonstrated that NUF2 expression was only associated with age. Small sample size, erroneous tissue sampling, RNA degradation, and fluctuating efficiency of RT-qPCR may affect the results of the analysis. This hypothesis can be tested through the following methods: Increasing sample size, determining the type of tissue with pathological examination, detecting RNA degradation by RNA electrophoresis, and verifying amplification efficiency of RT-qPCR by the standard curve method. The lack of additional experiments to test these possibilities is a limitation to the present study. Therefore, the relationship was further analyzed using clinical data for patients with BC in the TCGA database. The expression of NUF2 was positively correlated with tumor stage and negatively correlated with ER status, consistent with the results from a number of studies showing that advanced tumors and ER-negative tumors are probably related to poor survival (40-42), suggesting that NUF2 plays an important role in tumor progression and prognosis. To elucidate the molecular mechanisms by which NUF2 mediates breast carcinogenesis and progression, GSEA was performed in this study. The results revealed that NUF2 is involved in cell cycle-related pathways. In conclusion, the present data analysis and RT-qPCR validation indicated that NUF2 is highly expressed in BC and is associated with its multiple pathological features and prognosis. NUF2 is therefore a potential therapeutic target and prognostic indicator of BC. However, this study had several limitations. First, only mRNA data were obtained from the databases and RT-qPCR, and this single-gene-level analysis cannot fully capture the expression of NUF2 in BC. Second, experimental validation of the results was not performed. Therefore, further research is required to determine the roles of NUF2 in BC. NUF2 is overexpressed in BC and is associated with its multiple pathological features. More importantly, NUF2 may play an important role in predicting the clinical outcomes of different BC subgroups.
  42 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

3.  Systematic identification of novel protein domain families associated with nuclear functions.

Authors:  Tobias Doerks; Richard R Copley; Jörg Schultz; Chris P Ponting; Peer Bork
Journal:  Genome Res       Date:  2002-01       Impact factor: 9.043

4.  Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method.

Authors:  K J Livak; T D Schmittgen
Journal:  Methods       Date:  2001-12       Impact factor: 3.608

5.  DAVID: Database for Annotation, Visualization, and Integrated Discovery.

Authors:  Glynn Dennis; Brad T Sherman; Douglas A Hosack; Jun Yang; Wei Gao; H Clifford Lane; Richard A Lempicki
Journal:  Genome Biol       Date:  2003-04-03       Impact factor: 13.583

6.  A conserved protein, Nuf2, is implicated in connecting the centromere to the spindle during chromosome segregation: a link between the kinetochore function and the spindle checkpoint.

Authors:  A Nabetani; T Koujin; C Tsutsumi; T Haraguchi; Y Hiraoka
Journal:  Chromosoma       Date:  2001-09       Impact factor: 4.316

7.  Resveratrol-induced gene expression profiles in human prostate cancer cells.

Authors:  Sunita B Jones; Samuel E DePrimo; Michael L Whitfield; James D Brooks
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2005-03       Impact factor: 4.254

8.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

9.  A gene-expression signature as a predictor of survival in breast cancer.

Authors:  Marc J van de Vijver; Yudong D He; Laura J van't Veer; Hongyue Dai; Augustinus A M Hart; Dorien W Voskuil; George J Schreiber; Johannes L Peterse; Chris Roberts; Matthew J Marton; Mark Parrish; Douwe Atsma; Anke Witteveen; Annuska Glas; Leonie Delahaye; Tony van der Velde; Harry Bartelink; Sjoerd Rodenhuis; Emiel T Rutgers; Stephen H Friend; René Bernards
Journal:  N Engl J Med       Date:  2002-12-19       Impact factor: 91.245

10.  Gene expression profiling spares early breast cancer patients from adjuvant therapy: derived and validated in two population-based cohorts.

Authors:  Yudi Pawitan; Judith Bjöhle; Lukas Amler; Anna-Lena Borg; Suzanne Egyhazi; Per Hall; Xia Han; Lars Holmberg; Fei Huang; Sigrid Klaar; Edison T Liu; Lance Miller; Hans Nordgren; Alexander Ploner; Kerstin Sandelin; Peter M Shaw; Johanna Smeds; Lambert Skoog; Sara Wedrén; Jonas Bergh
Journal:  Breast Cancer Res       Date:  2005-10-03       Impact factor: 6.466

View more
  13 in total

1.  NUF2 Is a Potential Immunological and Prognostic Marker for Non-Small-Cell Lung Cancer.

Authors:  Xia Li; Lianlian Zhang; Zhongquan Yi; Jing Zhou; Wenchun Song; Panwen Zhao; Jixiang Wu; Jianxiang Song; Qinggan Ni
Journal:  J Immunol Res       Date:  2022-05-12       Impact factor: 4.493

2.  NUF2 Expression Promotes Lung Adenocarcinoma Progression and Is Associated With Poor Prognosis.

Authors:  Feng Jiang; Xiaolu Huang; Xiang Yang; Huixin Zhou; Yumin Wang
Journal:  Front Oncol       Date:  2022-06-23       Impact factor: 5.738

3.  Identification of NUF2 and FAM83D as potential biomarkers in triple-negative breast cancer.

Authors:  Xiuming Zhai; Zhaowei Yang; Xiji Liu; Zihe Dong; Dandan Zhou
Journal:  PeerJ       Date:  2020-09-21       Impact factor: 2.984

4.  Poly-SUMO-2/3 chain modification of Nuf2 facilitates CENP-E kinetochore localization and chromosome congression during mitosis.

Authors:  Divya Subramonian; Te-An Chen; Nicholas Paolini; Xiang-Dong David Zhang
Journal:  Cell Cycle       Date:  2021-04-28       Impact factor: 4.534

5.  Highly heterogeneous-related genes of triple-negative breast cancer: potential diagnostic and prognostic biomarkers.

Authors:  Yiduo Liu; Linxin Teng; Shiyi Fu; Guiyang Wang; Zhengjun Li; Chao Ding; Haodi Wang; Lei Bi
Journal:  BMC Cancer       Date:  2021-05-31       Impact factor: 4.430

6.  Identification of novel transcription factor-microRNA-mRNA co-regulatory networks in pulmonary large-cell neuroendocrine carcinoma.

Authors:  Cunliang Cai; Qianli Zeng; Guiliang Zhou; Xiangdong Mu
Journal:  Ann Transl Med       Date:  2021-01

7.  Identification of potential key genes in gastric cancer using bioinformatics analysis.

Authors:  Wei Wang; Ying He; Qi Zhao; Xiaodong Zhao; Zhihong Li
Journal:  Biomed Rep       Date:  2020-02-20

8.  Development of a Novel Prognostic Signature Based on Antigen Processing and Presentation in Patients with Breast Cancer.

Authors:  Aoshuang Qi; Mingyi Ju; Yinfeng Liu; Jia Bi; Qian Wei; Miao He; Minjie Wei; Lin Zhao
Journal:  Pathol Oncol Res       Date:  2021-04-01       Impact factor: 3.201

9.  Integrative Multi-Omics Analysis of Identified NUF2 as a Candidate Oncogene Correlates With Poor Prognosis and Immune Infiltration in Non-Small Cell Lung Cancer.

Authors:  Mengqing Chen; Shangkun Li; Yuling Liang; Yue Zhang; Dan Luo; Wenjun Wang
Journal:  Front Oncol       Date:  2021-06-10       Impact factor: 6.244

10.  An interactive nomogram based on clinical and molecular signatures to predict prognosis in multiple myeloma patients.

Authors:  Linxin Liu; Jian Qu; Yuxin Dai; Tingting Qi; Xinqi Teng; Guohua Li; Qiang Qu
Journal:  Aging (Albany NY)       Date:  2021-07-14       Impact factor: 5.682

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.