Literature DB >> 34151730

Identification of significant genes and therapeutic agents for breast cancer by integrated genomics.

Xiao Sun1, Zhenzhen Luo1, Liuyun Gong1, Xinyue Tan1, Jie Chen1, Xin Liang1, Mengjiao Cai1.   

Abstract

Breast cancer is the most commonly diagnosed malignancy in women; thus, more cancer prevention research is urgently needed. The aim of this study was to predict potential therapeutic agents for breast cancer and determine their molecular mechanisms using integrated bioinformatics. Summary data from a large genome-wide association study of breast cancer was derived from the UK Biobank. The gene expression profile of breast cancer was from the Oncomine database. We performed a network-wide association study and gene set enrichment analysis to identify the significant genes in breast cancer. Then, we performed Gene Ontology analysis using the STRING database and conducted Kyoto Encyclopedia of Genes and Genomes pathway analysis using Cytoscape software. We verified our results using the Gene Expression Profile Interactive Analysis, PROgeneV2, and Human Protein Atlas databases. Connectivity map analysis was used to identify small-molecule compounds that are potential therapeutic agents for breast cancer. We identified 10 significant genes in breast cancer based on the gene expression profile and genome-wide association study. A total of 65 small-molecule compounds were found to be potential therapeutic agents for breast cancer.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 34151730      PMCID: PMC8806825          DOI: 10.1080/21655979.2021.1931642

Source DB:  PubMed          Journal:  Bioengineered        ISSN: 2165-5979            Impact factor:   3.269


Introduction

Breast cancer is a frequently diagnosed cancer in women with a family history [1]. Breast cancer is a heterogeneous disease with different molecular subtypes and biological behaviors. Gene microarray technology and immunohistochemical techniques have classified breast cancers into different types [2]. The estrogen receptor (ER) is the most important prognostic and predictive immunohistochemical marker in breast cancer. ER-negative tumors tend to be of higher histological grade, are more sensitive to chemotherapy, and are more likely to metastasize to visceral organs [3,4]. Breast cancer does not have a poor prognosis, and there is no lack of therapeutic targets. ER positive tumors represent about 70% of all breast cancers and there are a lot of therapeutic targets, as well as for HER2 positive breast cancer (about 20% of all BC). The only subtype lacking for target therapies is the triple negative subtype [5,6]. There is an urgent need to find available drugs and clarify their molecular mechanisms in breast cancer treatment. Most previous studies have focused on identifying novel prognostic markers and drug targets for breast cancer [7-9]. Sulaiman et al. [10] reported that a synthetic azaspirane targets the Janus kinase/signal transducer and activation of transcription 3 pathway in breast cancer. Huang et al. [11] demonstrated that the Gαh-PLCδ1 signaling axis drives metastatic progression in breast cancer. However, due to toxicity, cost, the chemical effects of novel prognostic markers and drug targets for breast cancer that need further research [12], not all previous findings contribute to breast cancer treatment; breast cancer still lacks therapeutic targets and with poorer prognosis. And there is still an urgent need to identify additional therapeutic and prognostic targets in breast cancer [13]. Genome-wide association studies (GWAS) are widely used to characterize the genetic mechanisms that underlie complex diseases. Integrative analyses of GWAS data are rapidly becoming a standard approach to explore the genetic basis of disease susceptibility [14]. Network-wide association studies (NetWAS) can identify relevant disease-gene associations by integrating tissue-specific networks and GWAS results [15,16]. Prior studies have shown that the network-associated analysis of GWAS data is highly efficient when used to identify novel causal genes of complex diseases [17,18]. In this study, to better understand the molecular mechanisms and find therapeutic agents for breast cancer, we identified novel candidate therapeutic agents for breast cancer treatment by integrating genomic data with drug database analysis. In total, 65 small-molecule compounds were identified, including trichostatin A, LY-294,002, econazole, prestwick-1082, and vorinostat. Our study demonstrates the usefulness of this approach for evaluating the relationship among genes, diseases, and drugs. These findings will pave the way for the discovery of potential therapeutic targets for breast cancer.

Methods

Summary of GWAS datasets in breast cancer

The UK Biobank is a large, population-based prospective UK study, which was established to identify genetic and nongenetic determinants of various diseases. It comprises approximately 500,000 individuals with extensively detailed phenotypes. Their genotypes were determined using an array that included 847,441 genetic polymorphisms, enabling the identification of novel genetic variants in a uniformly genotyped and phenotyped cohort of unprecedented size [19]. Using data from the UK Biobank, samples from the participants were genotyped on the UK Biobank Axiom array and UK BiLEVE custom array. Genotype imputation was conducted with IMPUTE software against the UK10K haplotype panel and the 1000 Genomes Project phase 3 panel. GWAS analysis was performed by SNPTEST using a logistic regression model. A genome-wide gene-association study was performed using the MAGMA gene analysis tool, and multiple genes and genetic variants were identified. The Icelandic GWAS dataset from the deCODE Genetics genealogical database was based on whole-genome sequencing using Illumina technology. Finally, meta-analysis of small nucleotide polymorphisms (SNPs) in the UK Biobank and deCODE sample was performed using the METAL analysis tool [20]. The atlas of genetic associations in the UK Biobank (GeneATLAS, http://geneatlas.roslin.ed.ac.uk) helps researchers effectively analyze UK Biobank results without high computational costs. It also allows users to query genome-wide association results for 9,113,133 genetic variants and download GWAS summary statistics for more than 30 million imputed genetic variants (>23 billion phenotype–genotype pairs) [21]. We downloaded large-scale GWAS breast cancer summary data from the atlas of genetic associations. Detailed descriptions of sample characteristics, experimental designs, statistical analyses, and quality control can be found in previous studies.

Gene expression datasets

Oncomine (https://www.oncomine.org) is a cancer microarray database and web-based data mining platform for facilitating discovery. In this study, differentially expressed genes (DEGs) in breast cancer were identified by comparing cancer samples to respective normal samples using the Oncomine database. The heatmap of significant DEGs in breast cancer was driven from the Oncomine.

Identification of significant genes in breast cancer

NetWAS (https://hb.flatironinstitute.org/netwas/) integrates tissue-specific networks and significant GWAS association results, and identifies relevant disease-gene associations based on genomics. Briefly, SNP-level association statistics were converted into gene-level statistics (gene-based P values), which then were integrated with tissue-specific networks to predict the causal genes [18]. Greene et al. [13] demonstrated that NetWAS is more accurate than GWAS alone. In this study, we identified the most relevant genes in breast cancer using NetWAS.

Kyoto Encyclopedia of Genes and Genomes pathway and Gene Ontology analyses

Cytoscape is one of the most successful network biology analysis and visualization tools. It exposes more than 270 core functions and 34 applications as REST-callable functions with standardized JSON interfaces supported by Swagger documentation [22]. CluePedia, a plug-in in Cytoscape, can search for certain Kyoto Encyclopedia of Genes and Genomes (KEGG) signaling pathways of certain genes by calculating linear and nonlinear statistical dependencies from experimental data [23]. KEGG signaling pathways were identified by CluePedia. Search Tool for the Retrieval of Interacting Genes (STRING) (https://string-db.org/cgi/input.pl) is an online tool that for Gene ontology (GO) analysis in gene sets [24,25]. GO is a commonly used bioinformatics tool that provides comprehensive information on the gene function of individual genomic products based on defined features consisting of three domains: biological process (BP), cellular component (CC), and molecular function (MF) [26]. We conducted GO analysis using the STRING database.

Analysis of the correlation between significant genes and breast cancer

Gene Expression Profiling Interactive Analysis (GEPIA, http://gepia.cancer-pku.cn) is a web server for analyzing RNA-sequencing expression data of 9,736 tumors and 8,587 normal samples from The Cancer Genome Atlas and Genotype-Tissue Expression projects, using a standard processing pipeline [27]. The Human Protein Atlas (HPA, www.proteinatlas.org) is an immunohistochemistry-based map of protein expression profiles in normal tissues, cancer tissues, and cell lines, and provides a resource for pathology-based biomedical research, including protein biomarker discovery [28-30]. Correlations between significant genes and breast cancers were analyzed with GEPIA and HPA.

Analysis of the correlation between significant gene expression and overall survival

PROGgeneV2 (http://www.compbio.iupui.edu/proggene), a tool that can be used to predict the prognostic implication of genes in cancers, is written in PHP5 with a MySQL database backend, which stores gene expression data, covariates data, and metadata for cataloged studies in the form of relational database tables. Survival analysis in PROGgeneV2 is done using the backend R script; users can input multiple genes and use combined analysis to create survival plots for different genes of interest [31]. We used PROGgeneV2 to analyze the relationship between overall survival and genes that were overexpressed and underexpressed in breast cancer.

Drug prediction analysis

CMap (https://portals.broadinstitute.org/cmap/) is a collection of genome-wide transcriptional expression data from cultured human cells treated with bioactive small molecules and simple pattern-matching algorithms that together enable the discovery of functional connections among drugs, genes, and diseases through the transitory feature of common gene expression changes [32-34]. We used CMap to identify small-molecule compounds as potential therapeutic agents to target the significant genes in breast cancer.

Results

Identification of significant DEGs in breast cancer

To identify the significant DEGs in breast cancer, we retrieved GWAS summary data (C50-C50) of breast cancer from the UK Biobank, and microarray expression profiles of breast cancer from the Oncomine database. C50-C50 contained 10,478 malignant neoplasm of breast cases and 235,016 controls for the analyses, and the data were consolidated and normalized (Figure 1(a,b)).
Figure 1.

A) Q-Q Plot of C50-C50 (β = 1.05, λmean = 1.07, λmedian = 1.06) Containing 9,113,133 imputed variants that passed quality control (QC), with a P different than 0; b) Manhattan PLOT (IMPUTED) CONTAINING all (QC and non-QC) 30,798,054 imputed variants; c) Venn diagram of significant genes (CLDN7,MLLT10,RBM33,SH3RF1,SSBP4, UBE2Z,BMPER,FGF7,MSRB3,TNRC6B) in breast cancer (BC)

A) Q-Q Plot of C50-C50 (β = 1.05, λmean = 1.07, λmedian = 1.06) Containing 9,113,133 imputed variants that passed quality control (QC), with a P different than 0; b) Manhattan PLOT (IMPUTED) CONTAINING all (QC and non-QC) 30,798,054 imputed variants; c) Venn diagram of significant genes (CLDN7,MLLT10,RBM33,SH3RF1,SSBP4, UBE2Z,BMPER,FGF7,MSRB3,TNRC6B) in breast cancer (BC) From NetWAS of GWAS summary data, we converted SNP-level association statistics into gene-level statistics (gene-based P values) and identified the 127 most relevant genes in breast cancer (Table 1). A total of 1019 overexpressed genes (Supplementary Figure 1) and 1019 underexpressed genes (Supplementary Figure 2) were identified by Oncomine. The top 20 DEGs in breast cancer compared to the normal controls are shown in a heatmap (Figure 2).
Table 1.

Significant genes in breast cancer identified using NetWAS

Gene symbolTraining labelNetWAS Score
DVL210.235936
SNF810.174439
ADSL10.16157
OR2AE110.160822
ACADVL10.160651
OR2A510.149804
TRIM410.148613
GJC310.146056
TMEM161A10.143088
ITGA510.128071
SPDYE310.120586
LRRD110.116357
PHF2310.111639
ATG9B10.106706
DIDO110.105009
KRIT110.103645
CCND110.100449
SAP30BP10.100171
GRID2IP10.099812
MAFK10.096834
SLC25A1710.094211
TMEM184A10.093676
TSC22D410.093096
SCAMP210.093005
ZCWPW110.092518
LRP110.088344
PEG1010.085924
RBM3310.084988
FNIP210.082357
TNRC6B10.080609
MUC1710.080398
MUC1210.070727
C7orf6110.069049
CSK10.065686
AVIL10.064848
FGFR210.062894
OR12D210.062179
PILRB10.061788
PDLIM410.060831
AZGP110.060558
WNT210.058427
MMD10.057349
CARS10.05583
ATP6V0A410.055112
BMPER10.054279
MPV17L210.053732
RBM4810.053696
P4HA210.050581
PHLDA210.050163
NAP1L410.049917
NR2F610.048975
UBE2Z10.048287
ISYNA110.048024
CYP51A110.047283
CYP2S110.047158
MEPCE10.046038
GIP10.045434
CCS10.043749
GDF1510.041477
KCNN410.041465
NYAP110.041088
SLC2A410.040597
RBM3910.040581
SH3RF110.039898
NPLOC410.039647
PIDD110.038797
MIEF110.038407
TFAP2D10.037515
PARD6B10.034994
POTEJ10.0348
FNDC110.034535
DLX210.033885
NEUROG310.033203
SPEG10.031537
RASSF310.03124
CBX710.029876
ELOF110.028148
WNT310.027907
IGFBP210.026449
SPTBN210.02607
DLL410.024507
FZD710.024165
PLCE110.023692
KLRC410.023357
PSTK10.023215
ASIC410.022743
NEUROD410.021315
ZDHHC2410.018987
RSPO310.018473
SLC35D310.016828
ADCY310.016417
ABCG210.016354
DNAH910.016198
SMIM510.015532
CTDNEP110.015522
C12orf8010.014825
FAM71E210.014777
GPRIN210.014572
SSBP410.014453
DLG410.01316
USHBP110.013133
CLDN710.012219
ZNF1810.010693
TTLL610.00984
KCNK1710.009442
PHF2010.009367
TNP110.009195
ZBTB210.008477
RBM4310.007998
PNLIP10.007244
CASC1010.007176
ELL10.006518
SKIDA110.00598
MLLT1010.00583
ODF3L110.005735
SPAG410.00503
TAAR810.003711
UBALD210.003033
LMAN1L10.002579
LYPD510.002363
FGF710.001942
MSRB310.001837
SLC22A510.001425
ZNF42010.000842
BAZ2A10.000831
RNF17510.000786
BBS110.000378
Figure 2.

Heatmap of significant different expression genes in BC. a)Top 20 Over-expressed genes; b) Top 20 Under-expressed genes

Heatmap of significant different expression genes in BC. a)Top 20 Over-expressed genes; b) Top 20 Under-expressed genes Significant genes in breast cancer identified using NetWAS After overlapping the 127 most relevant genes with the 2038 DEGs in breast cancer, we identified 10 significant genes (CLDN7, MLLT10, RBM33, SH3RF1, SSBP4, UBE2Z, BMPER, FGF7, MSRB3, and TNRC6B) in breast cancer (Figure 1c). Among them, CLDN7, MLLT10, RBM33, SH3RF1, SSBP4, and UBE2Z were overexpressed; and BMPER, FGF7, MSRB3, and TNRC6B were underexpressed.

GO and KEGG enrichment analyses of significant DEGs in breast cancer

To explore the roles of the significant DEGs in breast cancer, we played GO and KEGG enrichment analyses. BP analysis revealed that the significant genes in breast cancer were mainly enriched in the Wnt signaling pathway, calcium-modulating pathway, protein repair, gene silencing by microRNA (miRNA), mRNA cleavage involved in gene silencing by miRNA, and positive regulation of epithelial cell proliferation involved in lung morphogenesis (Table 2). MF analysis showed that significant genes were enriched in functions related to oxidoreductase activity, acting on a sulfur group of donors and disulfide as acceptor, and phosphoinositide 3-kinase (PI3K) and PIK3CA activities (Table 2). CC analysis showed that significant genes were enriched in P-bodies. KEGG analysis revealed that significant genes in breast cancer were mainly involved in pathways in cancer, breast cancer, gastric cancer, melanoma, the PI3K/Akt signaling pathway, mitogen-activated protein kinase (MAPK) signaling pathway, Ras signaling pathway, tight junctions, and ubiquitin-mediated proteolysis (Figure 3).
Table 2.

Gene ontology (GO) enrichment result of significant genes in breast cancer

 Biological processes
term IDterm descriptionfalse discovery rate
GO:0007223Wnt signaling pathway, calcium modulating pathway0.0024
GO:0030091protein repair0.0024
GO:0035195gene silencing by miRNA0.0024
GO:0035279mRNA cleavage involved in gene silencing by miRNA0.0024
GO:0060501positive regulation of epithelial cell proliferation involved in lung morphogenesis0.0024
GO:0035278miRNA mediated inhibition of translation0.003
GO:0060213positive regulation of nuclear-transcribed mRNA poly(A) tail shortening0.003
GO:0010463mesenchymal cell proliferation0.0039
GO:0060445branching involved in salivary gland morphogenesis0.0039
GO:0031069hair follicle morphogenesis0.0078
GO:2,000,026regulation of multicellular organismal development0.0082
GO:0051173positive regulation of nitrogen compound metabolic process0.0168
GO:0036092phosphatidylinositol-3-phosphate biosynthetic process0.019
GO:0010604positive regulation of macromolecule metabolic process0.0194
GO:0031325positive regulation of cellular metabolic process0.0194
GO:0048522positive regulation of cellular process0.0194
GO:0051254positive regulation of RNA metabolic process0.0194
GO:0060688regulation of morphogenesis of a branching structure0.0194
GO:0007267cell-cell signaling0.0199
GO:1,903,313positive regulation of mRNA metabolic process0.0242
Molecular functions 
GO:0016671oxidoreductase activity, acting on a sulfur group of donors, disulfide as acceptor0.0083
GO:00163031-phosphatidylinositol-3-kinase activity0.0406
GO:0046934phosphatidylinositol-4,5-bisphosphate 3-kinase activity0.0406
Cellular components 
GO:0000932P-body0.0029
Figure 3

KEGG pathway analysis of significant genes (CLDN7, MLLT10, RBM33, SH3RF1, SSBP4, UBE2Z, BMPER, FGF7, MSRB3 and TNRC6B) in BC

KEGG pathway analysis of significant genes (CLDN7, MLLT10, RBM33, SH3RF1, SSBP4, UBE2Z, BMPER, FGF7, MSRB3 and TNRC6B) in BC Gene ontology (GO) enrichment result of significant genes in breast cancer

Correlation between significant DEGs and breast cancer

To verify the significant DEGs of breast cancer, we further explore the DEGs. Consistent with the identification of significant genes, protein profiling in breast cancer samples from the HPA using immunohistochemistry showed that the gene expression of CLDN7, RBM33, SH3RF1, and UBE2Z was significantly enriched in breast cancer, whereas there was no significant enrichment of FGF7 and TNRC6B (Figure 4).
Figure 4.

The immunohistochemistry of significant genes(CLDN7, RBM33, SH3RF1, UBE2Z, FGF7 and TNRC6B) in BC

The immunohistochemistry of significant genes(CLDN7, RBM33, SH3RF1, UBE2Z, FGF7 and TNRC6B) in BC The significant DEGs (CLDN7, BMPER, FGF7, MSRB3) in breast cancer samples compared to normal samples also showed coincident results of significant gene identification (Figure 5).
Figure 5.

The different expression of significant genes genes(CLDN7, BMPER, FGF7 and MSRB3) in breast cancer samples to normal samples, the red box mean in breast cancer samples and the black box mean in normal samples. *:p < 0.05

The different expression of significant genes genes(CLDN7, BMPER, FGF7 and MSRB3) in breast cancer samples to normal samples, the red box mean in breast cancer samples and the black box mean in normal samples. *:p < 0.05

Correlation between overall survival and significant DEGs in breast cancer

In the analysis of the correlation between overall survival and significant DEG expression (CLDN7, MLLT10, RBM33, SH3RF1, SSBP4, and UBE2Z) in breast cancer, we found a shorter survival time based on GSE5881 (Figure 6a) and GSE42568 (Figure 6b) (P< 0.05). The significantly underexpressed genes (BMPER, FGF7, MSRB3, and TNRC6B) in breast cancer were correlated with a longer survival time based on GSE42568 (Figure 6c) and GSE37751 (Figure 6d) (P> 0.05), this correlation cannot be demonstrated. The overall survival analysis combined significantly underexpressed genes (BMPER, MSRB3, and TNRC6B) in breast cancer based on GSE1456_U133B (Figure 6e) and GSE3494_U133B (Figure 6f); combined FGF7 and TNRC6B based on GSE3494_U133A (Figure 6g); and combined single FGF7 based on GSE9893 (Figure 6h) were also correlated with longer survival in breast cancer (P< 0.05).
Figure 6.

Overall survival analysis combined multiple genes expression. a)Combined significant Over-expressed genes(CLDN7, MLLT10, RBM33, SH3RF1, SSBP4 and UBE2Z) in BC based on GSE58812; b)Combined significant Over-expressed genes(CLDN7, MLLT10, RBM33, SH3RF1, SSBP4 and UBE2Z) in BC based on GSE42568; c)Combined significant under-expressed genes(BMPER, FGF7, MSRB3 and TNRC6B) in BC based on GSE42568; d)Combined BMPER,FGF7,MSRB3 and TNRC6B in BC based on GSE37751; e)Combined BMPER,MSRB3 and TNRC6B in BC based on GSE1456_U133B; f)Combined BMPER,MSRB3 and TNRC6B in BC based on GSE3494_U133B; g)Combined FGF7 and TNRC6B in BC based on GSE3494_U133A; h)Combined FGF7 in BC based on GSE9893

Overall survival analysis combined multiple genes expression. a)Combined significant Over-expressed genes(CLDN7, MLLT10, RBM33, SH3RF1, SSBP4 and UBE2Z) in BC based on GSE58812; b)Combined significant Over-expressed genes(CLDN7, MLLT10, RBM33, SH3RF1, SSBP4 and UBE2Z) in BC based on GSE42568; c)Combined significant under-expressed genes(BMPER, FGF7, MSRB3 and TNRC6B) in BC based on GSE42568; d)Combined BMPER,FGF7,MSRB3 and TNRC6B in BC based on GSE37751; e)Combined BMPER,MSRB3 and TNRC6B in BC based on GSE1456_U133B; f)Combined BMPER,MSRB3 and TNRC6B in BC based on GSE3494_U133B; g)Combined FGF7 and TNRC6B in BC based on GSE3494_U133A; h)Combined FGF7 in BC based on GSE9893 To identify potential small-molecule compounds with therapeutic effects on breast cancer, drug prediction analysis was performed using CMap. A total of 65 drugs were predicted, and the 10 most significant were trichostatin A, LY-294,002, econazole, Prestwick-1082, vorinostat, lomefloxacin, clorsulon, amantadine, thiostrepton, and orciprenaline (Table 3).
Table 3.

The most significant drugs provided by cmap to reverse core genes of breast cancers

cmap namemeanenrichmentpspecificity
trichostatin A−0.447−0.42200.3069
LY-294,002−0.389−0.40200.1534
econazole0.7160.910.000060.0154
Prestwick-10820.7490.9490.000120
vorinostat−0.508−0.5680.000380.2832
lomefloxacin−0.607−0.7350.00070
clorsulon0.7140.850.000720
amantadine0.6590.8380.001070.0063
thiostrepton−0.738−0.8310.001470.037
orciprenaline0.5890.8010.002980.0058
thiamphenicol0.4280.7250.003520.1173
khellin0.4440.7130.004590.0181
thiethylperazine0.5760.7670.005670.011
felbinac0.4760.7630.005990.0468
Chicago Sky Blue 6B0.4720.7620.006050.0061
vinburnine0.410.7560.00680.0351
scriptaid−0.688−0.8490.006830.0833
Prestwick-11030.3380.7470.007860.0435
naringenin0.4440.7450.007960.0585
adiphenine0.3850.6830.008110.2819
terazosin0.5610.740.008750.0112
oxolamine0.3390.7390.008970.0385
monobenzone−0.557−0.7360.009610.0203
chenodeoxycholic acid0.3260.7310.010340.0495
rifabutin−0.623−0.8230.011040.125
levonorgestrel−0.38−0.610.011580.0874
cinnarizine0.4130.720.012330.0146
oxybuprocaine−0.563−0.7150.013390.0225
metformin0.3350.4750.013520.0311
memantine0.3740.70.016770
acetohexamide0.2920.6980.017190.0059
proxyphylline−0.515−0.6940.018460.0792
R-atenolol0.2730.6880.019690
quinostatin−0.756−0.90.020.1832
vinblastine0.410.7810.021290.0774
colecalciferol−0.407−0.6840.021580.0217
dexibuprofen−0.501−0.6810.022480.0173
BCB0000400.6160.6790.023080.0057
levopropoxyphene0.280.6760.024010.0063
sulconazole−0.396−0.6760.024550.0855
nadolol0.5180.6730.025140.2857
hycanthone0.5140.670.026120.1814
CP-863,1870.6180.6680.027070.0704
karakoline0.3170.5570.028540.0267
etamivan0.5230.6630.02920.0268
methylergometrine−0.407−0.6620.029520.0357
homochlorcyclizine0.3210.660.030690.1087
fluorometholone−0.401−0.6560.032230.057
probucol−0.382−0.5490.032660.0061
phthalylsulfathiazole−0.406−0.5910.033080.2864
flecainide0.350.5470.033290.0058
tetraethylenepentamine−0.384−0.5420.036550.0229
josamycin0.4690.5880.036790.1235
torasemide−0.418−0.6450.037960.0545
nadide0.5110.6370.042250.0643
fosfosal0.4590.6370.042650.0142
rescinnamine−0.592−0.7230.043630.0248
diethylcarbamazine0.3960.6340.044160.0252
cyclopentolate−0.499−0.6320.04440.05
harmol0.2750.6340.044440.0853
metixene0.2880.6320.045340.1593
idoxuridine0.4740.5720.04580.0838
LM-1685−0.571−0.7140.047950.0238
iohexol0.2530.6280.048080.1657
fenbendazole0.4970.6260.04890.11
The most significant drugs provided by cmap to reverse core genes of breast cancers

Discussion

Breast cancer is the most commonly diagnosed malignancy in women worldwide and is the main cause of cancer-related death in women [35-37]. Although there are a lot of effective therapeutic agents for breast cancer, breast cancer remains a major health problem and is a top biomedical research priority [38-40], as there is an urgent need for effective breast cancer treatments. In this study, we identified 10 significant genes (CLDN7, MLLT10, RBM33, SH3RF1, SSBP4, UBE2Z, BMPER, FGF7, MSRB3, and TNRC6B) in breast cancer using combined GWAS data and profiling of DEGs. Protein profiling in breast cancer samples from the HPA using immunohistochemistry and analysis of significant DEGs in breast cancer samples compared to normal samples from GEPIA further verified the results. Significantly overexpressed genes (CLDN7, MLLT10, RBM33, SH3RF1, SSBP4, and UBE2Z) were correlated with shorter survival, whereas underexpressed genes (BMPER, FGF7, MSRB3, and TNRC6B) were correlated with longer survival in breast cancer. Consistent with our findings, previous studies have shown that some of these genes play important roles in the development of breast cancer. For example, Bernardi et al. [41] showed that CLDN7 is associated with a shorter time to recurrence, suggesting its contribution to the aggressiveness of breast cancer. In a GWAS, Guo et al. [42] identified common genetic loci for breast cancer risk including SSBP4. Whole transcriptome analysis by Bauer et al. [43] demonstrated that BMPER plays a possible therapeutic role in breast cancer. Fu et al. [44] demonstrated that acetylation, expression and recruitment of FGF7 promoters induce cancer growth and progression. Zhu et al. [45] found that targeting FGF7 can exert oncogenic functions in breast cancer. A previous study showed that the ZEB1-MSRB3 axis is related to breast cancer genome stability [46]. Interestingly, other DEGs in breast cancer identified in this study, including MLLT10, RBM33, SH3RF1, UBE2Z, and TNRC6B, have not been proven in previous studies. We believe that these are potentially novel key genes in breast cancer. BP analysis in GO annotation indicated that the 10 significant genes are mainly enriched in the Wnt signaling pathway, which plays an important role in the occurrence and development of many cancers. Inhibiting this pathway can suppress breast cancer growth and metastasis [47-49]. MF analysis of GO suggested that the DEGs were most significantly enriched in functions related to oxidoreductase activity. The redox reaction is accompanied by tumor development. CC analysis of GO annotation showed that the 10 DEGs were enriched in P-bodies. A previous study suggested that P-body disassembly correlates with breast cancer progression [50]. KEGG analysis of the 10 DEGs showed their enrichment in breast cancer, gastric cancer, melanoma, the PI3K/Akt signaling pathway, MAPK signaling pathway, Ras signaling pathway, tight junctions, and ubiquitin-mediated proteolysis. Some of these pathways contribute to the development of breast cancer. For example, the PI3K pathway is found in many types of cancer and plays an important role in breast cancer cell proliferation [51]. Ras signaling is a key determinant of poor survival in breast cancer patients [52]. Abnormal MAPK signaling plays a core role in the regulation of growth and survival, and the development of drug resistance in triple-negative breast cancer [53]. The aim of this work was to identify significant genes and potential therapeutic agents for breast cancer based on genomics. We found 65 potentially small-molecule compounds to reverse significant genes in breast cancer. The 10 most significant drugs were trichostatin A, LY-294,002, econazole, Prestwick-1082, vorinostat, lomefloxacin, clorsulon, amantadine, thiostrepton, and orciprenaline. Consistent, with our study, it has been reported that trichostatin A, a histone deacetylase inhibitor, has therapeutic potential in breast cancer [54]. Jiang et al. [55] showed that trichostatin A sensitizes ER-negative breast cancer cells to tamoxifen. LY294002, a specific inhibitor of the PI3K pathway, can decrease the rate of cell growth and increase the therapeutic sensitivity in MCF7 cells expressing wild-type p53, which may be useful for the treatment of breast cancer [56]. Econazole is a novel PI3K/AKT signaling pathway inhibitor, which can be used to overcome adriamycin resistance and improve chemotherapy sensitivity in breast cancer [57]. A preclinical study showed that vorinostat can prevent the formation of brain metastases in breast cancer [58]. Yang et al. [59] suggested that thiostrepton is a promising agent for triple-negative breast cancer. Kwok et al. [60] showed that thiostrepton selectively targets breast cancer cells through inhibition of Forkhead box M1 expression. However, some of the predicted drugs, such as Prestwick-1082, lomefloxacin, clorsulon, amantadine, and orciprenaline, have not been shown to directly play a role in breast cancer. Thus, future studies are needed to confirm our findings. Compared to previous studies [61-63], we conducted an analysis combining genomic data with drug database analysis to identify novel candidate therapeutic agents for breast cancer treatment. Our study demonstrates the usefulness of this approach for evaluating the relationship among genes, diseases, and drugs. These findings will pave the way for the discovery of potential therapeutic targets for breast cancer.

Conclusion:

Combined analyses of network-wide association studies, gene expression profiles, and drug databases are helpful for identifying potential therapeutic agents for diseases. This method is a new paradigm that can guide future research directions. Click here for additional data file.
  63 in total

Review 1.  A review of connectivity map and computational approaches in pharmacogenomics.

Authors:  Aliyu Musa; Laleh Soltan Ghoraie; Shu-Dong Zhang; Galina Glazko; Olli Yli-Harja; Matthias Dehmer; Benjamin Haibe-Kains; Frank Emmert-Streib
Journal:  Brief Bioinform       Date:  2018-05-01       Impact factor: 11.622

Review 2.  Women and lung cancer: what is new?

Authors:  Crystal M North; David C Christiani
Journal:  Semin Thorac Cardiovasc Surg       Date:  2013

3.  The histone deacetylase inhibitor trichostatin A sensitizes estrogen receptor alpha-negative breast cancer cells to tamoxifen.

Authors:  Eun Ryoung Jang; Soo-Jeong Lim; Eun Sook Lee; Gajin Jeong; Tae-You Kim; Yung-Jue Bang; Jong-Soo Lee
Journal:  Oncogene       Date:  2004-03-04       Impact factor: 9.867

4.  Benzo[b]furan derivatives induces apoptosis by targeting the PI3K/Akt/mTOR signaling pathway in human breast cancer cells.

Authors:  Ahmed Kamal; V Lakshma Nayak; Narayana Nagesh; M V P S Vishnuvardhan; N V Subba Reddy
Journal:  Bioorg Chem       Date:  2016-04-26       Impact factor: 5.275

5.  A subcellular map of the human proteome.

Authors:  Peter J Thul; Lovisa Åkesson; Mikaela Wiking; Diana Mahdessian; Aikaterini Geladaki; Hammou Ait Blal; Tove Alm; Anna Asplund; Lars Björk; Lisa M Breckels; Anna Bäckström; Frida Danielsson; Linn Fagerberg; Jenny Fall; Laurent Gatto; Christian Gnann; Sophia Hober; Martin Hjelmare; Fredric Johansson; Sunjae Lee; Cecilia Lindskog; Jan Mulder; Claire M Mulvey; Peter Nilsson; Per Oksvold; Johan Rockberg; Rutger Schutten; Jochen M Schwenk; Åsa Sivertsson; Evelina Sjöstedt; Marie Skogs; Charlotte Stadler; Devin P Sullivan; Hanna Tegel; Casper Winsnes; Cheng Zhang; Martin Zwahlen; Adil Mardinoglu; Fredrik Pontén; Kalle von Feilitzen; Kathryn S Lilley; Mathias Uhlén; Emma Lundberg
Journal:  Science       Date:  2017-05-11       Impact factor: 47.728

6.  STRING v10: protein-protein interaction networks, integrated over the tree of life.

Authors:  Damian Szklarczyk; Andrea Franceschini; Stefan Wyder; Kristoffer Forslund; Davide Heller; Jaime Huerta-Cepas; Milan Simonovic; Alexander Roth; Alberto Santos; Kalliopi P Tsafou; Michael Kuhn; Peer Bork; Lars J Jensen; Christian von Mering
Journal:  Nucleic Acids Res       Date:  2014-10-28       Impact factor: 16.971

7.  PROGgeneV2: enhancements on the existing database.

Authors:  Chirayu Pankaj Goswami; Harikrishna Nakshatri
Journal:  BMC Cancer       Date:  2014-12-17       Impact factor: 4.430

8.  Identification of biomarkers, pathways and potential therapeutic agents for white adipocyte insulin resistance using bioinformatics analysis.

Authors:  Yemin Zhang; Yuyang Zheng; Yalin Fu; Changhua Wang
Journal:  Adipocyte       Date:  2019-12       Impact factor: 4.534

Review 9.  Targeting Cell Cycle in Breast Cancer: CDK4/6 Inhibitors.

Authors:  Michela Piezzo; Stefania Cocco; Roberta Caputo; Daniela Cianniello; Germira Di Gioia; Vincenzo Di Lauro; Giuseppina Fusco; Claudia Martinelli; Francesco Nuzzo; Matilde Pensabene; Michelino De Laurentiis
Journal:  Int J Mol Sci       Date:  2020-09-04       Impact factor: 5.923

10.  Pleiotropic tumor suppressor functions of WWOX antagonize metastasis.

Authors:  Giovanni Nigita; Rosario Distefano; Saleh Khawaled; Sara Oster; Sung-Suk Suh; Yoav Smith; Abed Khalaileh; Yong Peng; Carlo M Croce; Tamar Geiger; Victoria L Seewaldt; Rami I Aqeilan
Journal:  Signal Transduct Target Ther       Date:  2020-04-17
View more
  1 in total

1.  Long non-coding RNA MRPS30 divergent transcript can be detected in the cytoplasm of triple-negative breast cancer cells and is targeted by microRNA-130b.

Authors:  Dongtao Wang; Qiang Song; Tianyong Zhao; Fang Wang; Yang Yu; Jing Qi; Pengfei Lyu; Xiangyang Duan
Journal:  Bioengineered       Date:  2022-03       Impact factor: 3.269

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.