Literature DB >> 28127549

A Systematic Framework for Drug Repositioning from Integrated Omics and Drug Phenotype Profiles Using Pathway-Drug Network.

Erkhembayar Jadamba1, Miyoung Shin2.   

Abstract

Drug repositioning offers new clinical indications for old drugs. Recently, many computational approaches have been developed to repurpose marketed drugs in human diseases by mining various of biological data including disease expression profiles, pathways, drug phenotype expression profiles, and chemical structure data. However, despite encouraging results, a comprehensive and efficient computational drug repositioning approach is needed that includes the high-level integration of available resources. In this study, we propose a systematic framework employing experimental genomic knowledge and pharmaceutical knowledge to reposition drugs for a specific disease. Specifically, we first obtain experimental genomic knowledge from disease gene expression profiles and pharmaceutical knowledge from drug phenotype expression profiles and construct a pathway-drug network representing a priori known associations between drugs and pathways. To discover promising candidates for drug repositioning, we initialize node labels for the pathway-drug network using identified disease pathways and known drugs associated with the phenotype of interest and perform network propagation in a semisupervised manner. To evaluate our method, we conducted some experiments to reposition 1309 drugs based on four different breast cancer datasets and verified the results of promising candidate drugs for breast cancer by a two-step validation procedure. Consequently, our experimental results showed that the proposed framework is quite useful approach to discover promising candidates for breast cancer treatment.

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 28127549      PMCID: PMC5233404          DOI: 10.1155/2016/7147039

Source DB:  PubMed          Journal:  Biomed Res Int            Impact factor:   3.411


1. Introduction

Developing and discovering a new drug is a very costly and time consuming process, which can take 10–17 years with a cost of 1.3 billion dollars. Despite large investments in research and development each year, there are still only a small number of new drugs approved successfully by the Food and Drug Administration (FDA) each year. Increasing failure rates, high costs, and the lengthy testing process for drug development have led to a process called drug repositioning [1], which refers to identifying and developing new uses for existing drugs to reduce the risk and cost. Traditional drug repositioning methods primarily use information on chemical structure, side effects, and drug phenotypes and explore similar drugs based on the assumption that structurally similar drugs tend to share common indications [2-4]. In other words, the key idea behind these approaches is that molecularly similar drug structures often affect proteins and biological systems in similar ways [4]. For example, Swamidass [5] used chemical structure data to identify unexpected connections between a known drug and a disease and explored the hypothesis that if a drug has the same target as a known drug, then this new drug would also have activity against the disease. As another approach, Keiser et al. used 3665 US FDA-approved and investigational drugs that together had hundreds of targets, defining each target by its ligands. The chemical similarities between the drugs and ligand sets predicted thousands of unanticipated associations, which have been used to develop new indications for many drugs. Alternatively, some approaches use a drug phenotype, which is the expression profile of patients undergoing treatment with a drug. For example, the Connectivity Map (CMap) [6, 7] project is exploring the effects of a large number of FDA-approved chemicals (1309 drugs) on gene expression, and these effects are measured in four different cell lines, allowing researchers to analyze the different expression patterns of drug's target genes. Many computational approaches have been introduced to reposition drugs using CMap by analyzing drug-associated expression signatures to match a repositioned drug's effect with a shared perturbed gene expression profile for another disease, under the assumption that drugs that share similar CMap expression signatures have similar therapeutic applications. Using the CMap data, Iorio et al. [8] developed a drug repositioning method by constructing a drug-drug similarity network using gene set enrichment analysis (GSEA) [9] that could compute the similarity between pairs of drugs. Several different studies [3, 10–13] showed that using CMap expression profiles with a combination of various data sources such as drug target databases, drug chemical structures, and drug side effects was an improvement over the current drug target identification methods. Moreover, the rapid developments in genomics and high-throughput technologies have produced a large volume of disease gene expression profiles, protein-protein interactions, and pathways. The high-level integration of these resources using network-based approaches is reported to have great potential for discovering novel drug indications for existing drugs [14]. For example, Chen et al. [15] introduced two different inference methods for predicting drug-disease associations based on basic network topology using a bipartite graph constructed from DrugBank [16] and Online Mendelian Inheritance in Man (OMIM) [17]. Emig et al. [18] integrated gene expression profiles, drug targets, disease information, and interactions for drug repositioning. Hu and Agarwal [19] created a disease-drug network using disease microarray datasets and predicted new indications for existing drugs using their disease-drug network. Although many of the above methods have shown encouraging results for finding new indications for old drugs, there are still some limitations. For example, Yildirim et al. [20] concluded that most drugs with distinct chemical structures target the same proteins, and Keiser et al. [21] reported that structurally similar drugs may also target proteins with dissimilar functions, stating that using chemical structure alone is insufficient for successful drug repositioning [22]. In addition, care should be taken when using only the drug phenotype (drug treated) expression profile (such as CMAP) for drug repositioning because some portion of the genes or pathways that show statistically significant expression differences in cell lines treated with the drug may be expressed only because of the drug's side effects or toxicity. Furthermore, the genes expressed in the drug treated profiles for specific disease cell line or tissue only represent a small subset of the biological pathways, whereas the cooperation of genes plays an important role in complex diseases such as cancer. Pathway-based drug repositioning may be a better alternative for drug repositioning for specific diseases such as cancer. To overcome the above limitations, the current drug repositioning methods require a comprehensive and efficient computational drug repositioning approach that incorporates powerful machine learning approaches using the high-level integration of available data such as disease gene expression profiles (disease profile), drug treated expression profiles (drug phenotype profile), and drug databases (STITCH [23], DrugBank [16], therapeutic target database (TTD) [24]) to discover new drugs for a human diseases. In this study, we aim to develop a systematic computation framework that repositions drugs by employing disease profile and drug phenotype profiles on the drug network along with integrated omics data.

2. Materials and Methods

In the framework as shown in Figure 1, we firstly identify disease-specific pathways by using an integrative analysis of multiple disease gene expression profiles and construct a pathway-drug network structure using pathway-drug associations derived from the CMap drug phenotype profile. Then to discover promising candidates, for drug repositioning, we initialize node labels for the pathway-drug network using identified disease pathways and known drugs associated with breast cancer and perform network propagation in a semisupervised manner.
Figure 1

The proposed framework for drug repositioning. The proposed framework consists of several steps. First, disease-specific pathways are identified by disease pathway enrichment of multiple expression profiles for the disease of interest. Second, the drug pathway network is constructed from the drug pathway associations obtained from the drug phenotype profiles. Once the network is constructed, initial labels are assigned using disease-specific pathways and known drugs associated with the given disease. Finally, pathway-based drug repositioning is performed using semisupervised network propagation. The identified drugs are evaluated, and the final results are obtained.

In the following, the detailed explanations of our proposed framework for repositioning and evaluation method are described.

2.1. Finding Disease-Specific Pathways from Multiple Disease Expression Profiles

To identify disease pathways related to a specific disease, conventional approaches have usually focused on identifying enriched pathways between cases and controls using data from a single experiment. Specifically, when using real experimental data such as microarray gene expression data, it is possible for different studies to report different results for disease-specific pathways. That is, the results are often not reproducible or not robust even to the mildest data perturbation, so the integrated analysis of multiple existing studies can increase the reliability and generalizability of results [25]. To address these issues, our approach identifies a disease-specific pathway based on disease pathway enrichment using multiple gene expression profiles for a given phenotype, in which the disease pathway enrichment results are integrated. Each disease expression profile is preprocessed, and the pathways that show significant differences between case and control samples are identified by GSEA [9], which returns the enrichment score (ES) and nominal p value for each pathway. These scores are used for comparison analysis across pathways to detect significant pathways. Here, we considered that the integration of pathways significantly enriched for each expression profile could better represent “disease-specific pathways” for the phenotype of interest. To integrate, the pathways with a nominal p value less than 0.01 (p < 0.01) are selected as significant pathways for each expression profile, and their union is defined as “disease-specific pathways.” Figure 2 presents an illustration of the integration process.
Figure 2

Disease pathway enrichment. Disease-specific pathways are identified from multiple gene expression profiles for the same disease. For each profile, enriched pathways with p < 0.01 are selected and integrated by taking their union. The resulting pathways are considered disease-specific pathways for the given disease.

2.2. Deriving Pathway-Drug Associations from CMap Drug Phenotype Profiles

To define a pathway-drug association, pathway-drug enrichment is established from the drug phenotype expression profile (CMap: Connectivity Map) [6, 7], which contains the gene expression profiles obtained from five different cancer cell lines treated with 1309 (v2) small drug molecules, most of which are FDA-approved drugs, for a total 6100 data points representing gene expression results with control vehicle samples. The CMap data are preprocessed, batch effects are removed, and pathway enrichments are estimated by GSEA as in previous studies [11, 26, 27]. As a result, each pathway (1077) has an ES for each drug molecule (1309). The strength of the ES indicates the association degree of a pathway with a drug. As shown in Figure 3, the pathway-drug association can be represented as a 1077 × 1309 matrix, where the columns list the drugs and the rows list the pathways.
Figure 3

Drug pathway association and pathway-drug network. Associations between a drug and pathways are defined by drug pathway enrichment from drug phenotype expression profiles. The strength of ES represents the enrichment of pathway when treated with drug.

2.3. Pathway-Drug Network Construction

A pathway-drug network was established from the drug pathway association profile. By using the pathway-drug enrichment matrix (Figure 3), the pathway-drug bipartite graph structure G = (U, V, E, w) was constructed, whose vertices can be divided into two disjoint sets: U (pathways) and V (drugs) such that every edge e ∈ E with weight w represents the enrichment of pathway u ∈ U by drug v ∈ V. In other words, each node in the network corresponds to a drug or pathway, and each edge corresponds to the association between them. It can be observed that drugs tend to bind with disease-specific pathways. All nodes were initially unlabeled as 0. Semisupervised learning on a network requires a small amount of labeled data with a large amount of unlabeled data. To use the constructed bipartite graph for drug repositioning, we made following assumption as in [4]: If pharmacologically different drugs induce the same phenotype of interest, then most of molecular pathways they target must be shared. In other words, drugs used to treat the same disease (phenotype) target similar pathways. For example, if we have some prior knowledge on certain drugs that are used to treat a specific disease, then most of the molecular pathway they target should be similar. In Figure 4, the blue drugs (breast cancer treatment drugs) target pathway “B,” and the green drugs (prostate cancer treatment drugs) target pathway “D.” From this information, it is can be concluded that drug “K” can likely be used to treat prostate cancer, when the weight (ES) is high enough. This is main assumption that we make in our proposed framework for pathway-based drug repositioning. Defining the initial knowledge (or initial labels for nodes) is also one of the key steps in this work.
Figure 4

Similar drugs used for the same disease share most of the molecular pathway they target.

2.4. Label Initialization on a Pathway-Drug Network

To initialize the pathway-drug labels for the U (pathways) and V (drugs) disjoint sets, we used disease-specific pathways inferred from the multiple gene expression profiles and known treatment drugs for the given phenotype (breast cancer) were obtained from three different public resources: the Maya Clinic, Cancer Organization, and TTD. The identified disease-specific pathways were mapped to the U (pathways) set and labeled as 1, and the remaining pathways were labeled as 0. For the V (drugs) set, a more accurate prediction is possible if we can set the labels for the drug set in the pathway-drug network using previously known information about the disease-related drugs prior to using network propagation to predict drugs associated with the disease. Therefore, we first verified known drugs used for the treatment of the disease of interest using public drug-related sources, including the Maya Clinic database, Cancer Organization database, and TTD, and then determined the labels for the drug set in the pathway-drug network. These drugs were mapped to the V (drugs) set and labeled as 1, and the remaining drugs were labeled as 0.

2.5. Drug Repositioning by Semisupervised Learning

Once the initial labeling of the pathway-drug network was completed, we predicted the repositioned drugs by learning the drug nodes and pathway nodes with the network propagation algorithm. The bipartite graph can be defined as G = (V, U, E, w), where V and U are the node sets that are the disjoint node, in which the nodes of each node set are expressed as v and u, respectively. E is the set of edges between V and U, and w represents the weights of these edges. The weight of a specific edge is expressed as w(v, u). The function for the sum of all weight values for a node can be defined as d(v) = ∑( w(v, u). Now, let us examine the network propagation algorithm based on the definition of the previously defined bipartite graph. First, the network propagation algorithm normalizes the weights of the bipartite graph using the following formula:Here, W is a matrix containing the weights of the bipartite graph, D and D are the diagonal matrices with the values of D = d(v) and D = d(u), respectively, and B is the matrix of the normalized weights. Second, network propagation is performed for the bipartite graph using formulae (2) and (3), iterating over the objective function of the graph-based semisupervised learning algorithm. For each v ∈ V, For each u ∈ U,Here, t is the number of iterations and y is the initial label of the corresponding node. The parameter α has a value between 0 and 1 and acts to regulate the relative weight of the initial label and the learned label. y(v) and y(u) are the initial labels for the drugs and pathway, respectively, whereas f(v) and f(u) are the final label scores. Finally, network propagation is completed when the values of f(v) and f(u) converge. If the network propagation algorithm is executed over the pathway-drug bipartite according to the above method, the learned drugs label scores can be obtained. As the label score of a drug increases, the drug can be considered a more promising candidate for drug repositioning for the given phenotype. Therefore, we define the values of the final drug label scores as the drug repositioning scores and use them to predict disease-associated drugs from the pathway-drug network. In addition, all obtained label scores are normalized by the Z-score using the following equation:where l is the label score vector for all drugs and l drug is the final label score for drug. For each drug, the corresponding p value was estimated based on the Z-score for Gaussian distribution. For more conservative results, we chose drugs with p < 0.001 as promising drug candidates for drug repositioning for the given disease. The selected promising drug candidates are evaluated by our validation methods and chosen for further investigation.

3. Results and Discussion

We tested our proposed framework to reposition 1309 drugs for breast cancer.

3.1. Finding Disease-Specific Pathways in Breast Cancer

To obtain breast cancer-specific pathways, we used publicly available breast cancer expression profiles (GSE15852 [28], GSE20437 [29], GSE2043 [30], and GSE2990 [31]) from the Gene Expression Omnibus (GEO) [32]. Table 1 shows the detailed characteristics of the expression profiles used in our study. Each dataset was preprocessed using RMA techniques [33] and implemented in R using the BioConductor package, which includes a large number of metadata packages appropriate for different types of microarrays. Supplementary Figure  1, in Supplementary Material available online at http://dx.doi.org/10.1155/2016/7147039, shows the results of preprocessing. For each dataset, the corresponding annotation databases were downloaded separately, and each probe was mapped to a HUGO [34] gene symbol; a probe was discarded if it did not match any symbol. In addition, if a gene had multiple probes (many-to-one), the gene expression values were averaged over the probes.
Table 1

Breast cancer gene expression datasets.

#Dataset idSamples (control/case)ProbesPlatformReferences
1GSE1585286 (43/43)22283GPL96 (HG-U133A) Pau Ni et al. [28]
2GSE2043842 (18/26)22283GPL96 (HG-U133A)Graham et al. [29]
3GSE2043286 (180/106)22283GPL96 (HG-U133A)Wang et al. [30]
4GSE2929193 (64/129)22283GPL96 (HG-U133A)Sotiriou et al. [31]

The gene expression datasets were downloaded from the NCBI Gene Expression Omnibus (GEO).

The human metabolic and signaling pathways were obtained from the Molecular Signature Database (MSigDB) [35]. As shown in Table 2, we chose the canonical pathways in the curated gene sets that contain 1077 pathways collected from KEGG [36], Reactome [37], and BioCarta (http://www.biocarta.com/).
Table 2

Pathway data.

DatabasePathways# of gene setsURL
MSigDB(c2-canonical pathways)KEGG186http://www.genome.jp/kegg/
Reactome674http://www.reactome.org/
BioCarta217http://www.biocarta.com/

Total1077
For each dataset, a pathway was defined as breast cancer enriched by GSEA when p < 0.01. To integrate, the enriched pathways with nominal p values less than 0.01 (p < 0.01) were selected as significant pathways for each expression profile, and their union was defined as the “disease-specific pathways.” Table 3 shows the number of enriched pathways for each dataset and the integrated pathways obtained by taking their union. Table 4 shows an example of enriched pathways in breast cancer by using experiment dataset (GSE2990). In the Supplementary Material, Tables 1 –4 provide the GSEA analysis results for each cancer expression profile and list the identified disease-specific pathways that were used for label initialization on the pathway-drug network.
Table 3

Breast cancer disease-specific pathways for each dataset.

GSE15852GSE20438GSE2043GSE2929Integratedpathways
# of pathways p < 0.0110971721 143

Enriched pathways were identified by GSEA. For integration, the pathways with (p value < 0.01) were selected as significant pathways for each expression profile and their union was defined as “disease-specific pathways.”

Table 4

Breast cancer pathways from GSE2990 (p < 0.01).

#NameESNES p value
1REACTOME_DEFENSINS−0.767−1.5990.008230452
2REACTOME_ORGANIC_CATION_ANION_ZWITTERION_TRANSPORT0.7711.6680.004016064
3REACTOME_G0_AND_EARLY_G1−0.771−1.6010.008213553
4REACTOME_CELL_SURFACE_INTERACTIONS_AT_THE_VASCULAR_WALL−0.601−1.6700
5REACTOME_SYNTHESIS_OF_BILE_ACIDS_AND_BILE_SALTS_VIA_7ALPHA_HYDROXYCHOLESTEROL0.7561.6440.00617284
6REACTOME_PECAM1_INTERACTIONS−0.855−1.6850.001972387
7REACTOME_GABA_SYNTHESIS_RELEASE_REUPTAKE_AND_DEGRADATION0.7731.5580.003891051
8REACTOME_ACTIVATION_OF_THE_PRE_REPLICATIVE_COMPLEX−0.773−1.6130.008438818
9BIOCARTA_GATA3_PATHWAY0.7461.6880.002004008
10REACTOME_NEUROTRANSMITTER_RELEASE_CYCLE0.7091.7470
11BIOCARTA_G2_PATHWAY−0.705−1.7040.00625
12REACTOME_PASSIVE_TRANSPORT_BY_AQUAPORINS−0.757−1.7180.001865672
13REACTOME_PYRIMIDINE_METABOLISM−0.702−1.7260.001964637
14BIOCARTA_ACTINY_PATHWAY−0.734−1.7490.001945525
15REACTOME_SYNTHESIS_OF_GLYCOSYLPHOSPHATIDYLINOSITOL_GPI0.7031.6990.001976285
16REACTOME_ENDOGENOUS_STEROLS−0.762−1.6940.005703422
17REACTOME_GLYCOSPHINGOLIPID_METABOLISM0.5901.7630.006048387

Supplementary files 2, 3, 4, and 5 provide the full pathway enrichment analysis results for the breast cancer expression profiles.

3.2. Breast Cancer Drug Repositioning Using the Proposed Approach

From the four different breast cancer expression profiles, 143 pathways were identified as significantly enriched. On the pathway-drug network, these pathways were mapped to the U (pathways) set and initially labeled as 1, and the remaining 934 pathways were labeled as 0. In addition, known drugs used for the treatment for breast cancer were obtained from three different public resources, the Maya Clinic, Cancer Organization, and TTD. Sixty-one drugs approved to treat breast cancer were obtained from the Maya Clinic, 49 drugs were obtained from the Cancer Organization, and 11 drugs were obtained from TTD. Next, after mapping these drugs to the drug pathway network only 10 drugs were successfully mapped. Moreover, the 10 mapped drugs (tamoxifen, letrozole, doxorubicin, vinblastine, exemestane, aminoglutethimide, methotrexate, paclitaxel, megestrol, and fulvestrant) were labeled as 1 on V (drugs), whereas all remaining drugs (1299) were labeled as 0. Once the initial labels of the pathway-drug network were chosen, we predicted promising candidates related to breast cancer using semisupervised network propagation, as shown in Figure 5. As a result, we considered 17 drugs with p < 0.001, as shown in Table 5, and found that 10 of them are already known drugs. The remaining seven drugs were considered as promising drug candidates for breast cancer and used for further validation to examine their association with breast cancer.
Figure 5

Breast cancer drug repositioning. Ten known drugs approved to treat breast cancer were obtained from the Maya Clinic, Cancer Org, and TTD. A total of 143 breast cancer-specific pathways were identified from multiple breast cancer expression profiles. Successfully mapped pathways and drugs were labeled as 1. Once labels were initialized on the pathway-drug network, we repositioned drugs for breast cancer using semisupervised learning. Predicted drugs with p < 0.001 were considered promising candidate drugs, and their associations with breast cancer were investigated using two different validation methods.

Table 5

Predicted drugs after pathway-based drug repositioning.

Drug nameRanking score Z-score p valueDescription
Doxorubicin 0.9998.9350It works by intercalating DNA, with the most serious adverse effect being life threatening heart damage.
Exemestane 0.8036.8802.99E − 12Tyrosine kinase inhibitor which selectively inhibits HER2
Methotrexate 0.7085.8921.91E − 09blocks the production of steroids derived from cholesterol and is clinically used in the treatment of Cushing's syndrome and metastatic breast cancer.
Megestrol 0.6685.4642.33E − 08It binds to and inhibits the enzyme dihydrofolate reductase, resulting in inhibition of purine nucleotide and thymidylate synthesis and, subsequently, inhibition of DNA and RNA syntheses.
Paclitaxel 0.6465.2358.26E − 08It binds to and stabilizes microtubules, preventing their depolymerization and so inhibiting cellular motility, mitosis, and replication.
Aminoglutethimide 0.6375.1481.32E − 07It blocks the production of steroids derived from cholesterol and is clinically used in the treatment of Cushing's syndrome and metastatic breast cancer.
Tamoxifen 0.6345.1131.59E − 07It is an antagonist of the estrogen receptor in breast tissue via its active metabolite, hydroxytamoxifen. In other tissues such as the endometrium, it behaves as an agonist and thus may be characterized as a mixed agonist/antagonist.
Vinblastine 0.6255.0202.59E − 07It is an antimicrotubule drug used to treat certain kinds of cancer, including Hodgkin's lymphoma, non-small cell lung cancer, breast cancer, head and neck cancer, and testicular cancer.
Fulvestrant 0.6044.8027.86E − 07It is drug treatment of hormone receptor-positive metastatic breast cancer in postmenopausal women with disease progression following antiestrogen therapy. It is an estrogen receptor antagonist with no agonist effects, which works by downregulating the estrogen receptor.
Letrozole 0.5794.5392.83E − 06It is an oral nonsteroidal aromatase inhibitor for the treatment of hormonally responsive breast cancer.
MS-275∗∗ 0.5304.0232.87E − 05Entinostat, also known as SNDX-275 and MS-275, is a benzamide histone deacetylase inhibitor undergoing clinical trials for treatment of various cancers.
GW-8510∗∗ 0.4773.4670.000263332Cyclin-dependent kinase 5 inhibitors: inhibition of dopamine transporter activity.
Camptothecin∗∗ 0.4753.4520.000278495It is an alkaloid isolated from the stem wood of the Chinese tree, Camptotheca acuminata. This compound selectively inhibits the nuclear enzyme DNA topoisomerase.
Phenoxybenzamine∗∗ 0.4613.3030.000478379It is an alpha-adrenergic antagonist with long duration of action. It has been used to treat hypertension and as a peripheral vasodilator.
Tyrphostin_AG-825∗∗ 0.4473.1590.000792504It is tyrosine kinase inhibitor, which selectively inhibits HER2.
Alsterpaullone∗∗ 0.4473.1500.000815292CDC2 protein kinase, antiangiogenic potential of small molecular inhibitors of cyclin-dependent kinases in vitro.
Celastrol∗∗ 0.4423.1000.000966191Celastrol is a remedial ingredient isolated from the root extracts of “Tripterygium wilfordii” (Thunder of God vine) and “Celastrus regelii.” In “in vitro” and “in vivo” animal experiments, celastrol exhibits antioxidant, anti-inflammatory, anticancer, and insecticidal activities.

Known breast cancer drug. Potential drug candidate for repositioning.

3.3. Validation of Promising Candidate Drugs

To validate the predicted drugs, we recommend the use of two different methods. Drugs that have been successfully validated by both methods are considered to be confirmed for repositioning for breast cancer.

3.3.1. Biological Validation

Biological validation was performed by manually checking the evidence in the biological literature on promising drug candidates. We manually searched for any possible indication of the repositioned drugs for breast cancer. As shown in Table 6, for each promising drug candidate, several different lines of evidence in the literature were found indicating its possible use for breast cancer. Based on these results, we concluded that six drugs of seven drugs were confirmed by biological validation for their new usage in breast cancer treatment, with phenoxybenzamine not being confirmed.
Table 6

Literature evidences for the promising drug candidates for breast cancer.

Promising drugsBiological validationRelated literature for possible usage for breast cancer
MS-275∗∗ o (i) [45]: the potential anticancer activity of MS-275 in combination with pentoxifylline in panel of cell lines and human breast cancer xenograft model in vitro and in vivo.(ii) [46]: MS-275 sensitizes TRAIL-resistant breast cancer cells, inhibits angiogenesis and metastasis, and reverses epithelial-mesenchymal transition in vivo.(iii) [47]: HDAC inhibitors (MS-275) enhance the apoptosis-inducing potential of TRAIL in breast carcinoma.

GW-8510∗∗ (i) [48]: repositioning of a cyclin-dependent kinase inhibitor GW-8510 as a ribonucleotide reductase M2 inhibitor to treat human colorectal cancer. In addition, GW-8510 induced autophagic cell death.(ii) [49]: in cell viability tests, four candidate drugs, GW-8510, etacrynic acid, ginkgolide A, and 6-azathymine, are identified as having high inhibitory activities against cancer cells.

Camptothecin∗∗ o (i) [50]: the camptothecin targets WRN protein: mechanism and relevance in clinical breast cancer.(ii) [51]: CRLX101, an investigational camptothecin-containing nanoparticle-drug conjugate, targets cancer stem cells and impedes resistance to antiangiogenic therapy in mouse models of breast cancer.(iii) [52]: STI571 sensitizes breast cancer cells to 5-fluorouracil, cisplatin, and camptothecin in a cell type-specific manner.(iv) [53]: acquired camptothecin resistance of human breast cancer MCF-7/C4 cells with normal topoisomerase I and elevated DNA repair.(v) known as “Happy Tree” in Chinese traditional cancer treatment.

Phenoxybenzamine∗∗ x (i) Not enough evidence.

Tyrphostin_AG-825∗∗ o (i) [54]: C-Src activation by ErbB2 leads to attachment-independent growth of human breast epithelial cells.(ii) [55]: using in vivo mouse models of breast cancer: using gefitinib, ERBB1 inhibition rapidly inhibits tumor cell motility and invasion but not intravasation, whereas ERBB2 inhibition by AG825 rapidly blocks intravasation.(iii) [56]: tyrphostin AG 825 has been used in combination with hypericin-mediated photodynamic therapy (HY-PDT) for evaluating its therapeutic effects in HER2 overexpressing human breast cancer cells.

Alsterpaullone∗∗ o (i) [57]: the antitumor effects of ALP through induction of apoptosis in breast cancer and leukemia cells. Identification of alsterpaullone as a novel small molecule inhibitor to target group 3 medulloblastoma.(ii) [58]: baicalein blocked survivin expression in lung and breast cancer cells. Alsterpaullone is a CDC2 kinase inhibitor (43). Both CDC25 phosphatase and CDC2 kinase inhibitors enhanced the baicalein-induced cancer cell death.

Celastrol∗∗ o (i) [59]: anticancer effect of celastrol on human triple negative breast cancer: possible involvement of oxidative stress, mitochondrial dysfunction, apoptosis, and PI3K/Akt pathways.(ii) [60]: celastrol induces that apoptosis of breast cancer cells and inhibits invasion via downregulation of MMP-9.

Potential drug candidate for repositioning.

3.3.2. Computational Evaluation on the Validation Network

In drug repositioning, it is difficult to compare and evaluate the performances of computational methods. To address this issue, several recent studies have focused on curating a comprehensive and public catalog of existing drug indications using a manual process [4]. Therefore, to develop a better evaluation method using computational methods, a validation network was constructed using information on three different relationships, drug-drug, drug-gene, and gene-gene, from the STITCH and STRING databases [38]. The drug-drug relationship information was obtained from the STITCH (v4) [39] database, which contains data on the interactions between small molecules and the edges between two chemicals that are expressed using a score between 0 and 900 defined from the chemical similarity between drugs. The drug-gene network was constructed from STITCH (for human) protein-chemical interactions with the help of the STRING database which provides 4,523,609 relationships for humans with the correlations between proteins and chemicals recorded as scores using information obtained from experimental results, text-mining, or predicted correlations. The gene-gene network was constructed from the STRING database, where A PPI network can be described as a complex system of proteins linked by interactions. Two proteins or genes that physically interact are represented as adjacent nodes connected by an edge. Each protein id (unipro id) is converted to the corresponding gene symbols using annotation databases provided in the STRING protein-protein interaction database. For computational evaluation, we have selected a maximum of 40 neighbors of drugs (17 drugs) with a weight criterion of r > 0.4 from the validation network derived from STITCH. The constructed validation network is illustrated in Figure 6.
Figure 6

Known drugs and promising drug candidates on the validation network. The validation network for 16 drugs was constructed from STITCH. Each node is a drug or a gene. The green edges represent drug-gene interactions, and the red edges indicate drug-drug interactions; the blue edges represent gene-gene relationships obtained from STRING. Wider edges reflect stronger relationships between nodes. For easier implementation and visualization, a maximum of 40 neighbors of drugs (17 nodes) with a weight criterion of r > 0.4 were selected. As indicated in the figure, some drugs have significant topological features on the validation network.

To investigate the node properties in a network, network topology measurements (degree centrality and betweenness) and linkage analysis (PageRank) are often used. Degree centrality represents the number of interactions/edges/connections for a node. Biological networks are mostly scale-free networks, in which most nodes have few edges and a small number of nodes (hub) have a very high degree centrality. Betweenness is measured by the shortest paths between all nodes in the network and nodes that have the “shortest path” going through them are called bottlenecks. These hub and bottleneck nodes are topologically important and are usually functionally essential nodes (genes and drugs that have significant biological roles). Nodes connected to the hub and bottleneck node directly can also be functionally important. In addition, link analysis is a technique used to evaluate relationships (connection weights). The PageRank is a popular link analysis algorithm based on idea that a node should be significant if other significant nodes contain links to it. By answering the following biological questions for the promising drug candidates, we identified the most promising drugs among them.For this purpose, we checked the network properties of promising drug candidates on the validation network using degree centrality, betweenness, and PageRank. Among them, the network topology measurements (degree centrality and betweenness) are designed to produce a ranking which allows indication of the most important vertices and not designed to measure the influence of neighbor nodes in general. Therefore, for better validation of promising candidates on validation network, PageRank algorithm seems to be more preferable which evaluates the nodes by considering their connection weights to the influential neighbors nodes. Which candidate drug has an interesting/important relationship (connections) with known drugs? Which candidate drug has the hub/bottleneck property on the validation network? Which candidate drugs are connected to known breast cancer target genes? From the results shown in Table 7, the popular breast cancer drug “tamoxifen” was identified as the most important hub node with degree centrality of 0.661 on the validation network. Among the promising drug candidates, camptothecin showed the hub node property with the highest degree centrality (0.232) among the other five (MS-275, GW-8510, phenoxybenzamine, tyrphostin_AG-825, and alsterpaullone). Table 8 shows the neighbor nodes of the camptothecin on the validation network where it has a strong chemical similarity with the known drugs doxorubicin, paclitaxel, vinblastine, and methotrexate. A close look at this relationship is shown in Figure 7(a), and this evidence seems to point to the possibility of using the camptothecin for breast cancer treatment because structurally similar drugs usually bind the same disease targets. In addition, from Table 8 and Figures 7(a) and 7(b), it can be seen that camptothecin has a strong target relation with the genes that play active role in breast cancer including TOP1, ABCB1, TOP2A, CASP3, and TP53 (neighbors) and EGFR (second-degree neighbor). TOP1 and TOP2A were reported to inhibit the breast cancer resistant proteins [40]. ABCB1 is known as prognostic factor in breast cancer patients [41]. CASP3 expression loss represents an important cell survival mechanism in breast cancer patients [42] and it inhibits the growth of breast cancer cells. EGFR was one of the first identified important targets in breast cancer, and half of breast cancer cases overexpress EGFR.
Table 7

Degree centrality of promising drug candidates on the validation network.

RankDrug nameDegree centrality
1Tamoxifen 0.661
2Doxorubicin 0.554
3Paclitaxel 0.536
4Fulvestrant 0.268
5Methotrexate 0.268
6Camptothecin∗∗ 0.232
7Letrozole 0.214
8Vinblastine 0.196
9Exemestane 0.179
10Megestrol 0.125
11Aminoglutethimide 0.107
12MS-275∗∗ 0.089
13Alsterpaullone∗∗ 0.071
14GW-8510∗∗ 0.036
15Phenoxybenzamine∗∗ 0.036
16Celastrol∗∗ 0.036
17Tyrphostin_AG-825∗∗ 0.018

Known breast cancer drug. Potential drug candidate for repositioning.

Table 8

The neighbors of candidate drug “camptothecin” on the validation network.

NodesDescriptionWeight
TOP1Topoisomerase (DNA) I: the reaction catalyzed by topoisomerases leads to the conversion of one topological isomer of DNA to another.0.999
CASP3Caspase 3: apoptosis-related cysteine peptidase: it is involved in the activation cascade of caspases responsible for apoptosis execution.0.965
TP53Tumor protein p53: it acts as a tumor suppressor in many tumor types and induces growth arrest or apoptosis depending on the physiological circumstances and cell type.0.965
Doxorubicin It is a drug used in cancer chemotherapy; it works by intercalating DNA, with the most serious adverse effect being life threatening heart damage.0.890
ABCG2ATP-binding cassette, subfamily G (WHITE), member 2; xenobiotic transporter that may play an important role in the exclusion of xenobiotics from the brain. It may be involved in brain-to-blood efflux. It appears to play a major role in the multidrug resistance phenotype of several cancer cell lines.0.873
CDK1Cyclin-dependent kinase 1: it plays a key role in the control of the eukaryotic cell cycle. It is required in higher cells for entry into S phase and mitosis. p34 is a component of the kinase complex that phosphorylates the repetitive C-terminus of RNA polymerase II.0.846
ABCB1ATP-binding cassette, subfamily B (MDR/TAP), member 1; energy-dependent efflux pump responsible for decreased drug accumulation in multidrug-resistant cells.0.843
BCL2B-cell CLL/lymphoma 2: it suppresses apoptosis in a variety of cell systems including factor-dependent lymphohematopoietic and neural cells. It regulates cell death by controlling the mitochondrial membrane permeability. It appears to function in a feedback loop system with caspases. It inhibits caspase activity either by preventing the release of cytochrome c from the mitochondria and/or by binding to the apoptosis-activating factor (APAF-1).0.820
Paclitaxel It binds to and inhibits the enzyme dihydrofolate reductase, resulting in inhibition of purine nucleotide and thymidylate synthesis and, subsequently, inhibition of DNA and RNA syntheses.0.812
CDK2Cyclin-dependent kinase 2; involved in the control of the cell cycle; interacting with cyclins A, B1, B3, D, or E. Activity of CDK2 is maximal during S phase and G2.0.754
Vinblastine An antimicrotubule drug used to treat certain kinds of cancer, including Hodgkin's lymphoma, non-small cell lung cancer, breast cancer, head and neck cancer, and testicular cancer.0.560
Methotrexate It blocks the production of steroids derived from cholesterol and is clinically used in the treatment of Cushing's syndrome and metastatic breast cancer.0.554
TOP2ATopoisomerase (DNA) II alpha 170 kDa; control of topological states of DNA by transient breakage and subsequent rejoining of DNA strands.0.431

Known breast cancer drug.

Figure 7

The candidate drug camptothecin on the validation network. (a) Camptothecin has a strong relationship (chemical similarity) with known breast cancer drugs: doxorubin, paclitaxel, vinblastine, and methotrexate. (b) Camptothecin has direct target relationship with the genes playing active roles in breast cancer including TOP1, ABCB1, TOP2A, CASP3, and TP53 (neighbors). Moreover, it has an indirect relationship with the breast cancer target gene EGFR.

The candidate drugs MS-257 and alsterpaullone showed relatively higher degree centrality values among the remaining drugs. Table 9 and Figure 8 show the neighbor nodes relationship of MS-257 on the validation network, where it has strong target relationships with the genes HDAC1, TP53, CASP3, CCND1, and CYP3A4. Overexpression of HDCA1 represents clinicopathological indicators of disease progression in human breast cancer [43]. CCDN1 was reported to be a therapeutic target in breast cancer [44], and it has an indirect relationship with breast cancer susceptibility gene BRCA1. The betweenness results are summarized in Table 10. Among promising drug candidates only camptothecin and MS-275 showed some bottleneck node properties. Tamoxifen was defined as the most important bottleneck drug for breast cancer. Finally, we evaluated the connection weights of candidate drugs on the validation network using PageRank algorithm. We chose the alpha parameter as 0.85, which is the most commonly used value for this parameter with original Google PageRank algorithm. As shown in Table 11, camptothecin (0.257), alsterpaullone (0.102), and MS-275 (0,088) exhibited higher ranking scores than the other promising candidate drugs.
Table 9

The neighbors of candidate drug “MS-257” on the validation network.

GenesDescriptionWeight
HDAC1Histone deacetylase 1; responsible for the deacetylation of lysine residues on the N-terminal part of the core histones (H2A, H2B, H3, and H4). Histone deacetylation gives a tag for epigenetic repression and plays an important role in transcriptional regulation, cell cycle progression, and developmental events.0.987
TP53Tumor protein p53: it acts as a tumor suppressor in many tumor types and induces growth arrest or apoptosis depending on the physiological circumstances and cell type.0.831
CASP3Caspase 3, apoptosis-related cysteine peptidase; involved in the activation cascade of caspases responsible for apoptosis execution.0.827
CCND1cyclin D1; essential for the control of the cell cycle at the G1/S (start) transition.0.822
CYP3A4Cytochrome P450, family 3, subfamily A, polypeptide 4; cytochromes P450 are a group of heme-thiolate monooxygenases.0.433
Figure 8

The candidate drug MS-275 on the validation network. MS-275 has a strong target relationship with the breast cancer genes HDAC1, TP53, CASP3, CCND1, and CYP3A4. Furthermore, it has an indirect relationship with the well-known breast cancer gene BRCA1.

Table 10

Betweenness of promising drug candidates on the validation network.

RankDrug nameBetweenness
1Tamoxifen 172
2Paclitaxel 37
3Doxorubicin 32
4Camptothecin∗∗ 13
5Exemestane 13
6Fulvestrant 12
7Methotrexate 10
8Vinblastine 6
9Megestrol 6
10Aminoglutethimide 2
11Letrozole 2
12MS-275∗∗ 2
13GW-8510∗∗ 0
14Phenoxybenzamine∗∗ 0
15Tyrphostin_AG-825∗∗ 0
16Alsterpaullone∗∗ 0
17Celastrol∗∗ 0

Known breast cancer drug. Potential drug candidate for repositioning.

Table 11

PageRank of promising drug candidates on the validation network (α = 0.85).

RankDrug nameRanking score
1Tamoxifen 0.990
2Doxorubicin 0.692
3Paclitaxel 0.663
4Methotrexate 0.373
5Fulvestrant 0.316
6Camptothecin∗∗ 0.257
7Letrozole 0.252
8Vinblastine 0.235
9Exemestane 0.176
10Aminoglutethimide 0.108
11Alsterpaullone∗∗ 0.102
12Megestrol 0.101
13MS-275∗∗ 0.088
14Phenoxybenzamine∗∗ 0.080
15GW-8510∗∗ 0.026
16Celastrol∗∗ 0.023
17Tyrphostin_AG-825∗∗ 0.010

Known breast cancer drug. Potential drug candidate for repositioning.

From the evidences shown above, we concluded that camptothecin, MS-257, and alsterpaullone exhibited the strongest network property evidences for breast cancer on the validation network. In general, all of the promising candidates successfully passed the computational evaluation on the network. After performing biological and computational evaluations of the promising candidate drugs, we selected camptothecin as the most promising candidate because it was the most successful in both evaluation processes. For MS-278, GW-85, AG825, alsterpaullone, and celastrol, there was strong literature evidence with a reasonable network property. Thus, as shown in Figure 9, camptothecin, MS-278, alsterpaullone, GW-85, and AG825 and were validated as repositioned drugs and indicated for further investigation in breast cancer treatment.
Figure 9

Validated drugs. Candidate drugs with successful results for both the biological validation and computational evaluation are considered repositioned drugs for breast cancer.

4. Summary

We introduced a new systematic framework for disease-specific drug repositioning from integrated gene expression profiles on a pathway-drug network constructed from drug phenotype expression profiles (CMap) using semisupervised learning. The proposed pathway-based drug repositioning process showed encouraging results when using four different disease expression profiles to predict candidate drugs for disease-specific repositioning. Two different methods were employed to evaluate the repositioned drugs. The drugs that passed both evaluation methods successfully were considered the most promising drugs to target breast cancer. As a result, several drugs, including camptothecin, MS-275, alsterpaullone, GW-8510, AG 825, and celastrol were identified as possible drugs to be repositioned to treat breast cancer, and these results are supported by multiple lines of evidence in the public literature. Specifically, camptothecin was the most promising drug candidate because it showed a high network property on the validation network and was supported by evidence in the literature. Despite the interesting results, our method for drug repositioning was developed and validated in only using integrated mRNA gene expression profiles. However, the strategy can be easily improved to include other experimental data types, such as RNA-seq, miRNA, DNA-methylation, and single nucleotide polymorphism (SNP) information. Finally, the increasing number of genomic and pharmaceutical databases necessitates the further development of the method to identify new drugs and targets for rare cancer subtypes, develop personalized medicine, and design targeted cancer therapies. Supplementary file 1 contains Figures (1 and 2) of the preprocessing results and enrichment heatmaps for each datasets. Supplementary files (2, 3, 4, and 5) contain enrichment analysis results of each datasets in tab-limited format.
  59 in total

1.  Drug-target network.

Authors:  Muhammed A Yildirim; Kwang-Il Goh; Michael E Cusick; Albert-László Barabási; Marc Vidal
Journal:  Nat Biotechnol       Date:  2007-10       Impact factor: 54.908

2.  Down-regulation of caspase 3 in breast cancer: a possible mechanism for chemoresistance.

Authors:  Eswaran Devarajan; Aysegul A Sahin; Jack S Chen; Raghu R Krishnamurthy; Neeraj Aggarwal; Anne-Marie Brun; Anna Sapino; Fan Zhang; Dhawal Sharma; Xiao-He Yang; Ann D Tora; Kapil Mehta
Journal:  Oncogene       Date:  2002-12-12       Impact factor: 9.867

Review 3.  A survey of current trends in computational drug repositioning.

Authors:  Jiao Li; Si Zheng; Bin Chen; Atul J Butte; S Joshua Swamidass; Zhiyong Lu
Journal:  Brief Bioinform       Date:  2015-03-31       Impact factor: 11.622

4.  Acquired camptothecin resistance of human breast cancer MCF-7/C4 cells with normal topoisomerase I and elevated DNA repair.

Authors:  A Fujimori; M Gupta; Y Hoki; Y Pommier
Journal:  Mol Pharmacol       Date:  1996-12       Impact factor: 4.436

5.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

6.  HDAC inhibitors enhance the apoptosis-inducing potential of TRAIL in breast carcinoma.

Authors:  Thiyam Ramsing Singh; Sharmila Shankar; Rakesh K Srivastava
Journal:  Oncogene       Date:  2005-07-07       Impact factor: 9.867

7.  STRING v10: protein-protein interaction networks, integrated over the tree of life.

Authors:  Damian Szklarczyk; Andrea Franceschini; Stefan Wyder; Kristoffer Forslund; Davide Heller; Jaime Huerta-Cepas; Milan Simonovic; Alexander Roth; Alberto Santos; Kalliopi P Tsafou; Michael Kuhn; Peer Bork; Lars J Jensen; Christian von Mering
Journal:  Nucleic Acids Res       Date:  2014-10-28       Impact factor: 16.971

8.  Reactome knowledgebase of human biological pathways and processes.

Authors:  Lisa Matthews; Gopal Gopinath; Marc Gillespie; Michael Caudy; David Croft; Bernard de Bono; Phani Garapati; Jill Hemish; Henning Hermjakob; Bijay Jassal; Alex Kanapin; Suzanna Lewis; Shahana Mahajan; Bruce May; Esther Schmidt; Imre Vastrik; Guanming Wu; Ewan Birney; Lincoln Stein; Peter D'Eustachio
Journal:  Nucleic Acids Res       Date:  2008-11-03       Impact factor: 16.971

9.  STITCH: interaction networks of chemicals and proteins.

Authors:  Michael Kuhn; Christian von Mering; Monica Campillos; Lars Juhl Jensen; Peer Bork
Journal:  Nucleic Acids Res       Date:  2007-12-15       Impact factor: 16.971

10.  Functional Module Connectivity Map (FMCM): a framework for searching repurposed drug compounds for systems treatment of cancer and an application to colorectal adenocarcinoma.

Authors:  Feng-Hsiang Chung; Yun-Ru Chiang; Ai-Lun Tseng; Yung-Chuan Sung; Jean Lu; Min-Chang Huang; Nianhan Ma; Hoong-Chien Lee
Journal:  PLoS One       Date:  2014-01-27       Impact factor: 3.240

View more
  13 in total

1.  Characterization of Altered Molecular Pathways in the Entorhinal Cortex of Alzheimer's Disease Patients and In Silico Prediction of Potential Repurposable Drugs.

Authors:  Paolo Fagone; Katia Mangano; Gabriella Martino; Maria Catena Quattropani; Manuela Pennisi; Rita Bella; Francesco Fisicaro; Ferdinando Nicoletti; Maria Cristina Petralia
Journal:  Genes (Basel)       Date:  2022-04-15       Impact factor: 4.141

2.  Topological network measures for drug repositioning.

Authors:  Apurva Badkas; Sébastien De Landtsheer; Thomas Sauter
Journal:  Brief Bioinform       Date:  2021-07-20       Impact factor: 11.622

Review 3.  Changing Trends in Computational Drug Repositioning.

Authors:  Jaswanth K Yella; Suryanarayana Yaddanapudi; Yunguan Wang; Anil G Jegga
Journal:  Pharmaceuticals (Basel)       Date:  2018-06-05

Review 4.  Drug repurposing for breast cancer therapy: Old weapon for new battle.

Authors:  Sadhna Aggarwal; Sumit Singh Verma; Sumit Aggarwal; Subash Chandra Gupta
Journal:  Semin Cancer Biol       Date:  2019-09-21       Impact factor: 15.707

5.  Transcriptome-Guided Drug Repositioning.

Authors:  Arsen Arakelyan; Lilit Nersisyan; Maria Nikoghosyan; Siras Hakobyan; Arman Simonyan; Lydia Hopp; Henry Loeffler-Wirth; Hans Binder
Journal:  Pharmaceutics       Date:  2019-12-12       Impact factor: 6.321

6.  A Discovery of Clinically Approved Formula FBRP for Repositioning to Treat HCC by Inhibiting PI3K/AKT/NF-κB Activation.

Authors:  Yanqiong Zhang; Xia Mao; Wenjia Chen; Xiaodong Guo; Liangxiang Yu; Funeng Jiang; Xiaoyue Wang; Weijie Li; Qiuyan Guo; Taixian Li; Na Lin
Journal:  Mol Ther Nucleic Acids       Date:  2020-01-10       Impact factor: 8.886

7.  Therapeutic Targeting of Repurposed Anticancer Drugs in Alzheimer's Disease: Using the Multiomics Approach.

Authors:  Dia Advani; Pravir Kumar
Journal:  ACS Omega       Date:  2021-05-19

8.  Master Regulators Connectivity Map: A Transcription Factors-Centered Approach to Drug Repositioning.

Authors:  Marco A De Bastiani; Bianca Pfaffenseller; Fabio Klamt
Journal:  Front Pharmacol       Date:  2018-07-02       Impact factor: 5.810

9.  Exploratory Analysis of iPSCS-Derived Neuronal Cells as Predictors of Diagnosis and Treatment of Alzheimer Disease.

Authors:  Eugenio Cavalli; Giuseppe Battaglia; Maria Sofia Basile; Valeria Bruno; Maria Cristina Petralia; Salvo Danilo Lombardo; Manuela Pennisi; Reni Kalfin; Lyubka Tancheva; Paolo Fagone; Ferdinando Nicoletti; Katia Mangano
Journal:  Brain Sci       Date:  2020-03-13

10.  Integrated transcriptomics reveals master regulators of lung adenocarcinoma and novel repositioning of drug candidates.

Authors:  Marco Antônio De Bastiani; Fábio Klamt
Journal:  Cancer Med       Date:  2019-09-10       Impact factor: 4.452

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.