Literature DB >> 29328377

Feature genes in metastatic breast cancer identified by MetaDE and SVM classifier methods.

Youlin Tuo1, Ning An2, Ming Zhang2.   

Abstract

The aim of the present study was to investigate the feature genes in metastatic breast cancer samples. A total of 5 expression profiles of metastatic breast cancer samples were downloaded from the Gene Expression Omnibus database, which were then analyzed using the MetaQC and MetaDE packages in R language. The feature genes between metastasis and non‑metastasis samples were screened under the threshold of P<0.05. Based on the protein‑protein interactions (PPIs) in the Biological General Repository for Interaction Datasets, Human Protein Reference Database and Biomolecular Interaction Network Database, the PPI network of the feature genes was constructed. The feature genes identified by topological characteristics were then used for support vector machine (SVM) classifier training and verification. The accuracy of the SVM classifier was then evaluated using another independent dataset from The Cancer Genome Atlas database. Finally, function and pathway enrichment analyses for genes in the SVM classifier were performed. A total of 541 feature genes were identified between metastatic and non‑metastatic samples. The top 10 genes with the highest betweenness centrality values in the PPI network of feature genes were Nuclear RNA Export Factor 1, cyclin‑dependent kinase 2 (CDK2), myelocytomatosis proto‑oncogene protein (MYC), Cullin 5, SHC Adaptor Protein 1, Clathrin heavy chain, Nucleolin, WD repeat domain 1, proteasome 26S subunit non‑ATPase 2 and telomeric repeat binding factor 2. The cyclin‑dependent kinase inhibitor 1A (CDKN1A), E2F transcription factor 1 (E2F1), and MYC interacted with CDK2. The SVM classifier constructed by the top 30 feature genes was able to distinguish metastatic samples from non‑metastatic samples [correct rate, specificity, positive predictive value and negative predictive value >0.89; sensitivity >0.84; area under the receiver operating characteristic curve (AUROC) >0.96]. The verification of the SVM classifier in an independent dataset (35 metastatic samples and 143 non‑metastatic samples) revealed an accuracy of 94.38% and AUROC of 0.958. Cell cycle associated functions and pathways were the most significant terms of the 30 feature genes. A SVM classifier was constructed to assess the possibility of breast cancer metastasis, which presented high accuracy in several independent datasets. CDK2, CDKN1A, E2F1 and MYC were indicated as the potential feature genes in metastatic breast cancer.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29328377      PMCID: PMC5802200          DOI: 10.3892/mmr.2018.8398

Source DB:  PubMed          Journal:  Mol Med Rep        ISSN: 1791-2997            Impact factor:   2.952


Introduction

Breast cancer is one of the most commonly diagnosed types of cancer, accounting for one-third of cancer cases in the USA (1). The survival rate of breast cancer has improved steadily with the development of early diagnosis and adjuvant therapy; however, the overall survival of patients with metastatic disease still remains poor (2). It has been estimated that >90% of breast cancer mortalities are associated with tumor metastasis (3,4). Metastasis is associated with poor patient prognosis and an acceleration of the carcinoma progress (5). Brain, bone, lungs and liver are the most frequently targeted organs for breast cancer metastasis, and the tumor microenvironment is considered to be a critical regulator for the metastatic process (6). Comprehensive understanding of metastasis progression is very important for identifying novel therapeutic strategies to prevent metastatic disease. The MetaOmics software in R language is comprised of the MetaDE, MetaQC and MetaPath packages. The MetaDE package primarily contains 12 state-of-the-art genomic meta-analysis methods to detect differentially expressed genes (7). The MetaQC package is the quantitative and objective tool for the determination of the inclusion/exclusion criteria for meta-analysis (8). The MetaDE and MetaQC packages have been intensively utilized for data digging from microarray profiles. Fc fragment of immunoglobulin G binding protein, for example, has been reported as a candidate metastasis-associated gene using the integrated method of MetaDE and survival analysis (9). As an effective classifier for identification, the support vector machine (SVM) classifier is well suited for signature modeling (10). Guyon et al (11) applied the SVM classifier to select feature genes from DNA microarrays, and the selected genes were proved to exhibit a greater classification performance. Fan et al (10) demonstrated that the SVM classifier for feature gene selection was able to speed up the classification process and the generalization performance. In the present study, several microarray profiles of breast cancer samples (including metastatic and non-metastatic samples) were downloaded to investigate the feature genes in metastatic samples. A SVM classifier was constructed to identify feature genes, which was validated by another independent gene expression dataset from The Cancer Genome Atlas (TCGA) database.

Materials and methods

Processing of microarray data

Expression profiles matching the search terms of ‘breast cancer’, ‘homo sapiens’ and ‘metastasis’ in the Gene Expression Omnibus (GEO; www.ncbi.nlm.nih.gov/geo/) database were screened on 22nd April 2016. The profiles were selected using the following filtering criteria: i) The data was gene expression microarray data; ii) data was collected from cancerous tissue samples or cancerous-metastasis samples; iii) and the metastatic statuses of the samples were clearly recorded. A total of 5 microarray profiles were retrieved from the GEO database (Table I). The GSE46928, GSE43837, GSE46826, GSE39494 and GSE29431 profiles had a total of 52, 38, 27, 10 and 31 samples, respectively; these in turn included 11, 19, 21, 5 and 13 metastatic samples, respectively.
Table I.

Basic information of downloaded microarray data.

GEO accessionChipProbe numberTotal sample numberNon-metastasis samplesMetastasis samples
GSE46928HG-U133A22,283524111
GSE43837U133_X3P61,360381919
GSE46826Agilent-02192462,97728622
GSE39494Agilent-01485041,0001055
GSE29431HG-U133_Plus_254,675311813

GEO, Gene Expression Omnibus.

For GSE46928, GSE43837 and GSE29431 datasets based on the Affymetrix platform (Affymetrix; Thermo Fisher Scientific, Inc., Waltham, MA, USA), the raw data were used to perform background correction via Affymetrix microarray software Affy version 1.42.3 (https://bioconductor.org/packages/release/bioc/html/affy.html) in R version 3.1.0, and normalization via the quantiles method (12). For GSE46826 and GSE39494 datasets based on the Agilent platform (Agilent Technologies, Inc., Santa Clara, CA, USA), the gene names in the microarray data were identified according to Agilent platform. Then, the average values were used as the expression levels of genes corresponding to multiple probes. The Limma package 3.22.1 (13) (https://bioconductor.org/packages/release/bioc/html/limma.html) was used for the normalization of these data.

Screening of feature genes

All of the selected datasets were merged to form a novel dataset for the screening of feature genes using MetaDE.ES in the MetaDE package 1.0.5 (14). Firstly, principal component analysis and standardized mean rank methods in the MetaQC package (8) were applied to ensure quality control (QC) within the novel datasets from the different profiles. In this process, the following parameters were used: Internal QC, external QC, accuracy QC (AQCg), precision of AQCg, consistency QC (CQCg) and precision of CQCg. Tests for heterogeneity were then performed to determinate the gene expression differentiations among the different datasets; Qpval >0.05 and tau2=0 were used as the criteria for homogenous genes. Finally, the differentially expressed genes (DEGs) between metastatic samples and non-metastatic samples in the dataset were identified under the threshold of P<0.05, which were considered as feature genes in the following analysis.

Construction of the protein-protein interaction (PPI) network

The interactions between human genes in the Biological General Repository for Interaction Datasets (thebiogrid.org/, BioGRID Version 3.4.154 Released) (15), Human Protein Reference Database (www.hprd.org/, HPRD Release 9) (16) and Biomolecular Interaction Network Database (BIND 2.0) (17) were downloaded. The screened feature genes were then subjected to the downloaded interactions to obtain the PPI network, which was visualized using Cytoscape 3.6.0 software (18). The degree (the connection with other genes) and the betweenness centrality (BC) value of feature genes in the network were calculated. The following formula was used for calculating BC: Where σ is the shortest path between s and t, and σ is the node numbers in the path of σ. A high BC value indicates a high degree of feature genes in the network.

Establishment of the SVM classifier

Feature genes were ranked according to their BC values, and those that were present in the most qualified samples were collected as the training dataset for the establishment of the SVM classifier. The remaining feature genes were used as the verification datasets for the classifier. The feature genes in the SVM classifier were used to perform the two-way clustering of samples and expression levels. The clustering results were visualized using a heatmap (19). The aim of the constructed SVM classifier was to distinguish whether the cancer had metastasized by analyzing the primary cancer samples. A set of microarray data from breast cancer samples (https://cancergenome.nih.gov/) was downloaded from TCGA (tcga-data.nci.nih.gov/docs/publications/tcga/) for further clarification. In total, 597 samples were included in the dataset, among which 178 samples had clinical information regarding metastasis status, follow-up time and the clinical outcomes. There were 35 metastatic samples and 143 non-metastatic samples.

Function and pathway enrichment

Fisher's test was utilized with the ‘runHyperKEGG’ and ‘runHyperGO’ functions of the Easy Microarray Data Analysis package 1.4.4 (20) for the function and pathway enrichment of feature genes. P<0.05 was set as the cut-off criterion.

Results

Feature gene selection

The QC results of all 5 microarray profiles are displayed in Fig. 1 and Table II; the results indicated there was good quality within all datasets. Next, using the MetaDE package, 541 feature genes were identified and the top 10 were ranked by their P-values; these included, non-SMC condensing I complex subunit H, small nuclear ribonucleoprotein U11/U12 subunit 25, cellular retinoic acid binding protein 2, guanosine triphosphate binding protein 2, homer scaffolding protein 2, family with sequence similarity 64 member A, WD repeat domain (WDR) 45, dual specificity tyrosine phosphorylation regulated kinase 4, chromosome 12 open reading frame 10 and H2A histone family member Z (Table III).
Figure 1.

Quality control results of the merged datasets from 5 microarray profiles (marked as 1–5) obtained via MetaQC analysis. The first principal component is presented on the x-axis, while the second principal component is shown on the y-axis. QC, quality control; IQC, internal QC; EQC, external QC; AQCg, accuracy QC; AQCp, precision of AQCg; CQCg, consistency QC; CQCp, precision of CQCg.

Table II.

Results of quality control parameters and standardized mean rank.

Microarray profileIQCEQCCQCgCQCpAQCgAQCpSMR
GSE469284.914.7893.87148.67153.8356.442.42
GSE438375.125.0052.41101.36184.0639.301.57
GSE468264.564.2268.15146.58106.1929.434.83
GSE394942.162.9221.5864.1446.6133.907.17
GSE294313.194.1643.6689.52113.2431.163.36

QC, quality control; IQC, internal QC; EQC, external QC; AQCg, accuracy QC; AQCp, precision of AQCg; CQCg, consistency QC; CQCp, precision of CQCg; SMR, standardized mean rank.

Table III.

Top 10 feature genes selected using the MetaDE package.

GeneP-valueQQptau2Exp
NCAPH4.17×10−51.49190.828101
SNRNP251.20×10−43.86870.424101
CRABP21.55×10−40.50880.972601
GTPBP23.51×10−40.42450.980401
HOMER23.74×10−43.40710.492101
FAM64A3.93×10−42.51960.641101
WDR454.34×10−42.52870.639501
DYRK44.61×10−41.40360.843601
C12orf104.92×10−42.78850.593801
H2AFZ5.19×10−43.01970.554501

NCAPH, non-SMC condensing I complex subunit H; SNRNP35, small nuclear ribonucleoprotein U11/U12 subunit 25; CRABP2, cellular retinoic acid binding protein 2; GTPBP2, guanosine triphosphate binding protein 2; HOMER2, homer scaffolding protein 2; FAM64A, family with sequence similarity 64 member A; WDR45, WD repeat domain 45; DYRK4, dual specificity tyrosine phosphorylation regulated kinase 4; C12orf10, chromosome 12 open reading frame 10; H2AFZ, H2A histone family member Z.

PPI network of feature genes

The PPI network of feature genes was comprised of 307 nodes (feature genes) and 586 lines (interactions; Fig. 2). There were 220 nodes (shown in green) that exhibited higher expression levels in metastatic samples, as well as 87 nodes (shown in purple) that exhibited lower expression levels in metastatic samples when compared to non-metastatic samples. As shown in Fig. 3, 168 genes exhibited a log (degree) of 0–1 and only 5 genes exhibited a log (degree) of >3 in the network. In addition, the top 30 genes with the highest BC values were listed in Table IV. The top 10 feature genes were Nuclear RNA Export Factor 1 (NXF1), cyclin-dependent kinase 2 (CDK2), myelocytomatosis proto-oncogene protein (MYC), Cullin 5 (CUL5), SHC Adaptor Protein 1 (SHC1), Clathrin heavy chain (CLTC), Nucleollin (NCL), WDR1, proteasome 26S subunit, non-ATPase 2 (PSMD2), telomeric repeat binding factor 2 (TERF2; Table IV). Among these feature genes the CDK inhibitor 1A (CDKN1A), E2F transcription factor 1 (E2F1) and MYC interacted with CDK2.
Figure 2.

Protein-protein interaction network of feature genes. Green nodes are the genes that exhibited higher expression in metastatic samples, while the purple nodes are those that exhibited lower expression in metastatic samples when compared with non-metastatic samples.

Figure 3.

Distribution of node degrees in the protein-protein interaction network of feature genes. The x-axis is the log (degree) value and the y-axis is the corresponding node numbers to the degree.

Table IV.

Top 30 feature genes with the highest betweeness centrality in the protein-protein interaction network.

GeneBCEXPDegreeP-valueQQptau2
NXF10.38641663.43×10−23.71630.44580
CDK20.20470443.33×10−22.28820.68290
MYC0.13821274.91×10−23.48270.48050
CUL50.10061212.86×10−23.00800.55650
SHC10.09741161.60×10−21.15180.88600
CLTC0.07830202.66×10−22.81540.58920
NCL0.05681159.12×10−41.31210.85930
WDR10.0532188.49×10−32.57220.63180
PSMD20.04761138.31×10−43.40610.49230
TERF20.04600111.65×10−20.31610.98880
RUVBL10.04501132.51×10−20.89040.92590
PRDX10.03941104.09×10−22.00570.73470
PTEN0.03340121.99×10−33.50560.47700
HDGF0.03131103.93×10−23.44750.48590
RUNX1T10.0291042.88×10−20.29560.99010
IQCB10.02831121.20×10−30.79950.93850
AKT10.02731153.26×10−32.03180.72990
APEX10.0268161.09×10−21.85430.76250
TSR10.0263072.06×10−22.26610.68700
TUBB2A0.0258191.18×10−23.49220.47910
ETS10.0257054.11×10−33.25200.51660
PSMC50.02491111.85×10−22.78030.59520
RUNX10.0248044.45×10−22.32570.67610
SMAD90.0242063.52×10−21.35180.85250
STAU10.02391141.33×10−21.77060.77790
DBN10.02351132.31×10−32.15470.70730
SNCA0.02290102.51×10−22.90880.57320
CDKN1A0.02260121.48×10−23.77750.43690
SLC25A10.0223122.22×10−21.14380.88730
NOS20.0222094.71×10−21.05600.90120

EXP is the expression value ratio of genes between metastastic samples and non-metastastic samples, while values of 1 represent high expression in metastastic samples and values of 0 represent high expression in non-metastastic samples. BC, betweeness centrality.

SVM classifier

Feature genes ranked with BC values were picked at 10 intervals from the top 10 to the top 50, for the construction of the SVM classifier. The dataset GSE46928 with the largest sample size was used as the training dataset. As shown in Fig. 4A, the accuracy of the SVM classifier improved with the increasing number of genes and the accuracy stabilized at 100% once the top 30 genes were selected. The SVM classifier constructed by the top 30 feature genes was able to distinguish metastatic samples from the non-metastatic samples with high accuracy (Fig. 4B). The selected 30 genes were considered to be the critical biomarkers for metastatic breast cancer, and included protein kinase B serine/threonine kinase 1 (AKT1), CDKN1A, ETS proto-oncogene 1 transcription factor (ETS1), runt related transcription factor 1 (RUNX1), RUNX1 translocation partner 1 (RUNX1T1), nitric oxide synthase 2 (NOS2), MYC, phosphatase and tensin homolog (PTEN) and CDK2. Clustering analysis of these 30 feature genes and the samples in GSE46928 demonstrated that these genes have significantly different expression levels between the metastatic and non-metastatic samples (Fig. 5).
Figure 4.

Accuracy and efficacy of the support vector machine classifier. (A) The accuracy and error ratio of the classifier at different gene numbers (top 10 to top 50). (B) The classification efficacy of the classifier constructed using the top 30 genes for samples in the GSE46928 dataset. Non-metastatic samples are marked in black and the metastatic samples are marked in red.

Figure 5.

Clustering heatmap of the top 30 genes and samples in the training dataset. The color gradient from red to green represents the changes in expression level from high to low. The bars represent the samples (orange refers to metastatic samples; purple refers to non-metastatic samples). Met, metastatic samples; Non, non-metastatic samples.

The classification efficacy of the constructed classifier was also tested on the other 4 microarray datasets (Fig. 6). All samples in GSE39494 (Fig. 6B) and GSE46826 (Fig. 6D) were correctly distinguished, and only 3 samples in GSE29431 (Fig. 6A) and 4 samples in GSE43837 (Fig. 6C) were misclassified. Overall, the SVM classifier displayed good performance in terms of distinguishing between metastatic and non-metastatic samples. The correct rate, specificity, positive predictive value (PPV) and negative predictive value (NPV) were >0.89, sensitivity was >0.84 and the area under the receiver operating characteristic curve (AUROC) was >0.96 (Table V).
Figure 6.

Classification results on other microarray profiles, including (A) GSE29431, (B) GSE39494, (C) GSE43837 and (D) GSE46826. Non-metastatic samples are marked in black and metastatic samples are marked in red. The receiver operating characteristic curves of the classifier are displayed on the right-hand side. AUC, area under the curve.

Table V.

Classification effect evaluation of the support vector machine classifier.

DatasetNumber of samplesCorrect rateSensitivitySpecificityPPVNPVAUROC
GSE2943131111111
GSE39494100.9030.8460.9440.9170.8950.975
GSE4383738111111
GSE46826280.8950.8950.8950.8950.8950.965

PPV, positive predictive value; NPV, negative predictive value; AUROC, area under the receiver operating characteristic curve.

An independent dataset of breast cancer samples was downloaded from the TCGA database to test the classification effect of the constructed classifier (Fig. 7). The results revealed an accuracy of 94.38% (168/178) in 35 metastatic samples and 143 non-metastatic samples, with an AUROC of 0.958 (Fig. 7B). Based on the 30 feature genes, the survival time of patients with metastatic breast cancer was significantly shorter than the patients with non-metastatic breast cancer, and the survival status was worse (Fig. 7C).
Figure 7.

Classification effect of the support vector machine classifier on an independent sample from The Cancer Genome Atlas database. (A) The spot graph of the different samples (non-metastatic samples are marked in black and metastatic samples are marked in red). (B) The receiver operating characteristic curve and (C) the survival curve. AUC, area under the curve.

The 30 feature genes in the SVM classifier were utilized for function and pathway enrichment. The results indicated that cell cycle associated functions and pathways were the most significant terms (Fig. 8; Table VI).
Figure 8.

Enriched functions of the 30 feature genes. Gene numbers are displayed on the x-axis. The color represents the -log (P-value) and the changes from red to blue represents high -log (P-value) to low -log (P-value).

Table VI.

Enriched pathways of the 30 feature genes.

PathwayP-valueGenes
hsa05200: Pathways in cancer1.11×10−5AKT1, CDKN1A, ETS1, RUNX1T1, NOS2, RUNX1, MYC, PTEN, CDK2
hsa04012: ErbB signaling pathway3.85×10−3AKT1, CDKN1A, SHC1, MYC
hsa04115: p53 signaling pathway2.60×10−2CDKN1A, PTEN, CDK2
hsa04110: Cell cycle2.81×10−5CDKN1A, MYC, CDK2

AKT1, protein kinase B serine/threonine kinase 1; CDKN1A, cyclin-dependent kinase inhibitor 1A; ETS1, ETS proto-oncogene 1 transcription factor; RUNX1, runt related transcription factor 1; RUNX1T1, RUNX1 translocation partner 1; NOS2, nitric oxide synthase 2; MYC, myelocytomatosis proto-oncogene protein; PTEN, phosphatase and tensin homolog; ErbB, Erb-B2 receptor tyrosine kinase 2; SHC1, SHC Adaptor Protein 1.

Discussion

As breast cancer metastasis accounts for the majority of breast cancer mortalities, there have been a number of reports analyzing DEGs associated with metastasis in breast cancer. Some previous studies have identified the markers associated with metastasis using the protein-network based approach (21–23). Walsh et al (24) identified tripartite motif containing 25 as a key determinant of breast cancer metastasis using an integrated transcriptional interaction network. In the present study, MetaQC package was firstly applied to conduct QC tests for the different profiles as the MetaQC package is the quantitative and objective tool in the determination of the inclusion/exclusion criteria for meta-analysis (8). The DEGs between metastatic and non-metastatic samples in the dataset were identified using the MetaDE package, which contains 12 state of the art genomic meta-analysis methods that detect DEGs (7). In the present study, a total of 541 feature genes were identified between metastatic and non-metastatic samples. The PPI network of DEGs was constructed and was comprised of 307 feature genes and 586 interactions, among which 220 nodes exhibited higher expression levels in metastatic samples and 87 nodes exhibited lower expression levels in metastatic samples when compared with non-metastatic samples. Feature genes were ranked according to their BC that quantifies the importance of a vertex within a graph (25,26). The top 10 genes with the highest BC values included NXF1, CDK2, MYC, CUL5, SHC1, CLTC, NCL, WDR1, PSMD2 and TERF2. CDKN1A, E2F1 and MYC were the genes that interacted with CDK2. Then, the SVM classifier of screened feature genes was constructed to evaluate the classification performance. The SVM classifier constructed by the top 30 feature genes (which included AKT1, CDKN1A, ETS1, RUNX1T1, NOS2, RUNX1, MYC, PTEN and CDK2, for example) was able to distinguish metastatic samples from the non-metastatic samples; this was proved by the clustering analysis. Overall, the classifier displayed good performance with a correct rate, specificity, PPV and NPV of >0.89, sensitivity >0.84 and an AUROC of >0.96. The verification on an independent dataset exhibited an accuracy of 94.38% and an AUROC of 0.958 for the 35 metastatic samples and 143 non-metastatic samples. The survival time of the metastatic samples was revealed to be shorter than the non-metastatic samples, based on the analysis of these 30 feature genes. Cell cycle associated functions and pathways were the most significant terms of the 30 feature factors. CDK2 is reported to exert important roles in cell cycle regulation and is associated with tumor aggressiveness and poor prognosis (27,28). Kim et al (29) demonstrated that the specific activity of CDK2 could be used as a prognostic indicator for early breast cancer. Roesley et al (30) also identified that CDK2 phosphorylates breast cancer metastasis suppressor 1 (BRMS1) on Serine 237 and the mutation can prevent BRMS1 from suppressing cell migration. In addition, sirtuin 2 (SIRT2)-mediated inhibition of the migration of fibroblasts can be antagonized by the CDK2-induced SIRT2 phosphorylation (31). CDKN1A (also known as p21), one of the CDK inhibitor genes, contributes to cell cycle progression (32). Variant genotypes of CDKN1A were observed to be associated with an increased risk of breast cancer in the Chinese female population (33). When mammalian cells are exposed to DNA damaging agents, CDKN1A will inhibit cyclin/CDK2 complexes and participate in mediating growth arrest (34). The CDK2/CDKN1A ratio is considered to be a predictive factor of major clinical events in patients with oral squamous cell carcinoma (35). E2F1 is a target of cellular (c)-Myc that promotes cell cycle progression (36). The E2F1 mRNA levels are a strong determinant of clinical outcome in primary breast cancer (37). The CDK2-E2F1 signaling pathway exerts a pivotal role in regulating the G1 to S phase transition in the cell cycle (38). The interactions between CDK2/CDKN1A and CDK2/E2F1 identified in the present study indicated that they may influence the metastasis of breast cancer via their effect on the cell cycle. The proto-oncogene c-MYC encodes a transcription factor that regulates cell growth, proliferation and apoptosis. c-MYC is commonly amplified in breast cancer and promotes the phenotypic transformation of mammary cells by synergistically interacting with transforming growth factor α (39). MYC gene amplification is often acquired in lethal distant breast cancer metastases of unamplified primary tumors (40), and the overexpression of MYC significantly decreased the metastasis of breast cancer cells to lung (41). In conclusion, in the present study a SVM classifier was constructed to assess the possibility of breast cancer metastasis, which exhibited high accuracy in several independent datasets. The CDK2, CDKN1A, E2F1 and MYC genes were highlighted as the potential feature genes for metastatic breast cancer, which may interact synergistically by influencing the cell cycle. The results provided some potential markers for breast cancer metastasis, which may also be prospective precise treatment targets for metastatic breast cancer. In the group's future studies, the expression levels of the potential feature genes will be validated in clinical samples by reverse transcription-quantitative polymerase chain reaction or immunohistochemical staining.
  35 in total

1.  BIND: the Biomolecular Interaction Network Database.

Authors:  Gary D Bader; Doron Betel; Christopher W V Hogue
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

Review 2.  E2F1 and c-Myc in cell growth and death.

Authors:  Itaru Matsumura; Hirokazu Tanaka; Yuzuru Kanakura
Journal:  Cell Cycle       Date:  2003 Jul-Aug       Impact factor: 4.534

3.  Water extract of Hedyotis Diffusa Willd suppresses proliferation of human HepG2 cells and potentiates the anticancer efficacy of low-dose 5-fluorouracil by inhibiting the CDK2-E2F1 pathway.

Authors:  Xu-Zheng Chen; Zhi-Yun Cao; Tuan-Sheng Chen; You-Quan Zhang; Zhi-Zhen Liu; Yin-Tao Su; Lian-Ming Liao; Jian Du
Journal:  Oncol Rep       Date:  2012-05-25       Impact factor: 3.906

4.  Cancer statistics, 2010.

Authors:  Ahmedin Jemal; Rebecca Siegel; Jiaquan Xu; Elizabeth Ward
Journal:  CA Cancer J Clin       Date:  2010-07-07       Impact factor: 508.702

5.  Determination of the specific activity of CDK1 and CDK2 as a novel prognostic indicator for early breast cancer.

Authors:  S J Kim; S Nakayama; Y Miyoshi; T Taguchi; Y Tamaki; T Matsushima; Y Torikoshi; S Tanaka; T Yoshida; H Ishihara; S Noguchi
Journal:  Ann Oncol       Date:  2007-10-22       Impact factor: 32.976

Review 6.  Microenvironmental regulation of metastasis.

Authors:  Johanna A Joyce; Jeffrey W Pollard
Journal:  Nat Rev Cancer       Date:  2008-03-12       Impact factor: 60.716

7.  Expression of the cell cycle regulatory proteins p34cdc2, p21waf1, and p53 in node negative invasive ductal breast carcinoma.

Authors:  H P Kourea; A K Koutras; C D Scopa; M N Marangos; E Tzoracoeleftherakis; D Koukouras; H P Kalofonos
Journal:  Mol Pathol       Date:  2003-12

8.  Cytoscape 2.8: new features for data integration and network visualization.

Authors:  Michael E Smoot; Keiichiro Ono; Johannes Ruscheinski; Peng-Liang Wang; Trey Ideker
Journal:  Bioinformatics       Date:  2010-12-12       Impact factor: 6.937

9.  Cluster analysis of networks generated through homology: automatic identification of important protein communities involved in cancer metastasis.

Authors:  Pall F Jonsson; Tamara Cavanna; Daniel Zicha; Paul A Bates
Journal:  BMC Bioinformatics       Date:  2006-01-06       Impact factor: 3.169

10.  Low E2F1 transcript levels are a strong determinant of favorable breast cancer outcome.

Authors:  Vincent Vuaroqueaux; Patrick Urban; Martin Labuhn; Mauro Delorenzi; Pratyaksha Wirapati; Christopher C Benz; Renata Flury; Holger Dieterich; Frédérique Spyratos; Urs Eppenberger; Serenella Eppenberger-Castori
Journal:  Breast Cancer Res       Date:  2007       Impact factor: 6.466

View more
  8 in total

1.  Development of Personalized Therapeutic Strategies by Targeting Actionable Vulnerabilities in Metastatic and Chemotherapy-Resistant Breast Cancer PDXs.

Authors:  Simona Punzi; Marine Meliksetian; Laura Riva; Federica Marocchi; Giancarlo Pruneri; Carmen Criscitiello; Franco Orsi; Lorenzo Spaggiari; Monica Casiraghi; Paolo Della Vigna; Lucilla Luzi; Giuseppe Curigliano; Pier Giuseppe Pelicci; Luisa Lanfrancone
Journal:  Cells       Date:  2019-06-18       Impact factor: 6.600

2.  Characteristic genes in THP‑1 derived macrophages infected with Mycobacterium tuberculosis H37Rv strain identified by integrating bioinformatics methods.

Authors:  Yu-Wei Zhang; Yan Lin; Hui-Yuan Yu; Ruo-Nan Tian; Fan Li
Journal:  Int J Mol Med       Date:  2019-07-30       Impact factor: 4.101

3.  A predictive model for assessing prognostic risks in gastric cancer patients using gene expression and methylation data.

Authors:  Dan Luo; QingLing Yang; HaiBo Wang; Mao Tan; YanLei Zou; Jian Liu
Journal:  BMC Med Genomics       Date:  2021-01-06       Impact factor: 3.063

4.  Stroma-derived extracellular vesicle mRNA signatures inform histological nature of prostate cancer.

Authors:  Alex P Shephard; Peter Giles; Mariama Mbengue; Amr Alraies; Lisa K Spary; Howard Kynaston; Mark J Gurney; Juan M Falcón-Pérez; Félix Royo; Zsuzsanna Tabi; Dimitris Parthimos; Rachel J Errington; Aled Clayton; Jason P Webber
Journal:  J Extracell Vesicles       Date:  2021-10

5.  Identification of key miRNAs and mRNAs related to coronary artery disease by meta-analysis.

Authors:  Long Liu; Jingze Zhang; Mei Wu; Haiming Xu
Journal:  BMC Cardiovasc Disord       Date:  2021-09-16       Impact factor: 2.298

Review 6.  The genomic architecture of metastasis in breast cancer: focus on mechanistic aspects, signalling pathways and therapeutic strategies.

Authors:  Yogita Chhichholiya; Prabhat Suman; Sandeep Singh; Anjana Munshi
Journal:  Med Oncol       Date:  2021-07-16       Impact factor: 3.064

7.  Signature microRNAs and long noncoding RNAs in laryngeal cancer recurrence identified using a competing endogenous RNA network.

Authors:  Zhengyi Tang; Ganguan Wei; Longcheng Zhang; Zhiwen Xu
Journal:  Mol Med Rep       Date:  2019-04-10       Impact factor: 2.952

8.  Gene expression signatures predict response to therapy with growth hormone.

Authors:  Adam Stevens; Philip Murray; Chiara De Leonibus; Terence Garner; Ekaterina Koledova; Geoffrey Ambler; Klaus Kapelari; Gerhard Binder; Mohamad Maghnie; Stefano Zucchini; Elena Bashnina; Julia Skorodok; Diego Yeste; Alicia Belgorosky; Juan-Pedro Lopez Siguero; Regis Coutant; Eirik Vangsøy-Hansen; Lars Hagenäs; Jovanna Dahlgren; Cheri Deal; Pierre Chatelain; Peter Clayton
Journal:  Pharmacogenomics J       Date:  2021-05-27       Impact factor: 3.550

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.