Literature DB >> 26100720

A weighted and integrated drug-target interactome: drug repurposing for schizophrenia as a use case.

Liang-Chin Huang, Ergin Soysal, W Zheng, Zhongming Zhao, Hua Xu, Jingchun Sun.   

Abstract

BACKGROUND: Computational pharmacology can uniquely address some issues in the process of drug development by providing a macroscopic view and a deeper understanding of drug action. Specifically, network-assisted approach is promising for the inference of drug repurposing. However, the drug-target associations coming from different sources and various assays have much noise, leading to an inflation of the inference errors. To reduce the inference errors, it is necessary and critical to create a comprehensive and weighted data set of drug-target associations.
RESULTS: In this study, we created a weighted and integrated drug-target interactome (WinDTome) to provide a comprehensive resource of drug-target associations for computational pharmacology. We first collected drug-target interactions from six commonly used drug-target centered data sources including DrugBank, KEGG, TTD, MATADOR, PDSP K(i) Database, and BindingDB. Then, we employed the record linkage method to normalize drugs and targets to the unique identifiers by utilizing the public data sources including PubChem, Entrez Gene, and UniProt. To assess the reliability of the drug-target associations, we assigned two scores (Score_S and Score_R) to each drug-target association based on their data sources and publication references. Consequently, the WinDTome contains 546,196 drug-target associations among 303,018 compounds and 4,113 genes. To assess the application of the WinDTome, we designed a network-based approach for drug repurposing using mental disorder schizophrenia (SCZ) as a case. Starting from 41 known SCZ drugs and their targets, we inferred a total of 264 potential SCZ drugs through the associations of drug-target with Score_S higher than two in WinDTome and human protein-protein interactions. Among the 264 SCZ-related drugs, 39 drugs have been investigated in clinical trials for SCZ treatment and 74 drugs for the treatment of other mental disorders, respectively. Compared with the results using other Score_S cutoff values, single data source, or the data from STITCH, the inference of 264 SCZ-related drugs had the highest performance.
CONCLUSIONS: The WinDTome generated in this study contains comprehensive drug-target associations with confidence scores. Its application to the SCZ drug repurposing demonstrated that the WinDTome is promising to serve as a useful resource for drug repurposing.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 26100720      PMCID: PMC4474536          DOI: 10.1186/1752-0509-9-S4-S2

Source DB:  PubMed          Journal:  BMC Syst Biol        ISSN: 1752-0509


Background

Computational pharmacology plays an important role in the drug development, given that the traditional approaches have the low success rate and the increasing cost [1,2]. It provides a macroscopic view and a deeper understanding of the molecular mechanisms of drug action by integrating multiple data sets through a variety of informatics approaches [3]. Among these approaches, network-assisted method provides a unique platform by interrogating the relationships among drugs, proteins, and diseases to predict drug repurposing [3,4]. The drug repurposing (also called repositioning or re-profiling) is a process to identify novel indication for already existing drugs [5]. It is an essential strategy for drug development due to its capability of identifying novel indications of an approved drug, which in turn accelerates the drug development process [5,6]. Therefore, some studies have applied the computational approaches to drug repurposing based on drug-disease/side-effect association [7], genome-wide association studies (GWAS) [8], or gene expression profiles [9]. However, inferences of new drug indication heavily rely on the drug-target association (also called interactome). Therefore, reliable drug-target data are fundamental and crucial for supporting drug action inferences. During the last decade, many effects involving a multitude of methods has been made to the determination of the interactions between drugs and their targets. A number of databases have been created to systematically store the drug-target interactions such as SuperTarget [10], Psychoactive Drug Screening Program (PDSP) [11], Drug Gene Interaction Database (DGIdb) [12], and STITCH ('search tool for interactions of chemicals') [13]. According to the nature of data generation, these existing drug-target databases can be categorized into three groups: expert-curated [10-12,14-20], predicted [12,17,21,22], and integrative [10,12,13,21-29]. The expert-curated drug-target associations are manually extracted from the literature, including the small-scale studies and high-throughput screening (HTS). The small-scale studies possess the highest reliability while the HTS studies have issues about systematic errors on quality control and hit selection [30]. Predicted drug-target associations are inferred from known drug targets or other drug actions. Although the reliability of this type of data is not comparable to that of the expert-curated associations, the predicted associations provide researchers potential drug targets or off-targets for further investigation [21]. The integrative databases consist of the expert-curated and predicted associations such as STITCH, whose quality relies on the original data sources and comprehensive evaluation of the data. Additionally, it is not easy to trace back to the original data and assess their reliability if the integrated database did not provide the original data sources. Moreover, addressing the issues of data redundancy and both drug and target identifiers' heterogeneities is another major problem during the data integration. In this study, we integrated drug-target associations from six commonly used drug-target centered data sources and assigned a reliability assessment for each drug-target association to create one weighted and integrative drug-target interactome (WinDTome). In the process of building the WinDTome, we addressed the intrinsic issues of redundancy and heterogeneous identifiers by using the public drug-centered databases and gene-centered databases. The WinDTome will serve as a fundamental with assessments for the network-assisted computational pharmacology for drug repurposing. To illustrate its application, we used schizophrenia (SCZ) as an example. We started with the known medicines used to treat SCZ and SCZ candidate genes to predict the non-SCZ drugs that might have potentials for SCZ treatments based on the higher confident drug-target associations in WinDTome and protein-protein interactions. We inferred 264 drugs that might have potential to treat the SCZ. Among them, 39 drugs and 74 drugs have been investigated to treat SCZ and other mental disorders based on the clinical trial records in the database "clinical trials.gov", respectively. We further compared the performance of this process with the results based on other score cutoff values, single data sources, and the high confident data from STITCH. The comparative results showed that the WinDTome is promising for providing the fundamental of drug-target interactions for drug repurposing.

Materials and methods

Sources of drug-target interactome

We integrated drug-target interactions that were extracted from six public data sources. They were DrugBank [14], Kyoto Encyclopedia of Genes and Genomes (KEGG) Drug [24], Therapeutic Target Database (TTD) [23], Manually Annotated Targets and Drugs Online Resource (MATADOR) [10], NIMH Psychoactive Drug Screening Program (PDSP) Ki Database [11], and BindingDB [15]. DrugBank is a freely available web-enable database that contains detailed drug data with comprehensive drug target and drug action information [14]. We downloaded the file "drugbank.xml" from the DrugBank website on February 2014. From the file, we obtained the drug information such as drug's status, descriptive indication, target's UniProt accession numbers (ACs), references, and external links to ChEBI [31], PubChem Compound (CID) and Substance (SID). KEGG Drug is a comprehensive information resource for approved drugs in Japan, USA, and Europe [24]. We accessed the data of KEGG Drug through KEGG FTP server on May 2014, and extracted drugs, targets, and external links to DrugBank and PubChem Substance and the drug targets and corresponding genes' Entrez Gene IDs. TTD is a database that contains the known and explored therapeutic protein and nucleic acid targets, the disease, pathway information and the corresponding drugs. The approved drugs, experimental drugs, and clinical trial drugs, and their primary targets were obtained from a comprehensive search of literatures and FDA labels [23]. The data set of TTD was downloaded on October 2013. Drug-target associations along with drug's status and structuralized indication are available in the file "TTD_download.txt". The drug's external links to ChEBI, PubChem Compound, and PubChem Substance were obtained from the file "TTD_crossmatching.txt". The target's UniProt ACs are available in the "TTD_uniprot_all.txt" file. MATADOR is a resource for direct and indirect protein-chemical interactions that was assembled by automated text mining followed by manual curation. Each interaction contains links to PubMed abstracts or OMIM entries that were used to deduce the interaction [10]. The data set of MATADOR was on March 2007. Drug-target associations, including both direct and indirect interactions, are available in the "matador.tsv" file. PubChem Compound CID and UniProt AC were utilized to represent a drug's identifier and a target's identifier, respectively. The PDSP Ki database is a repository for numerous drugs' and drug candidates' affinity (i.e., Ki values) information [11]. The data was downloaded on January 2011. We obtained the drug-target associations along with their references, drug's PubChem CID, and gene symbols from the "kidb110121.txt" file. The BindingDB a public, web-accessible database of measured binding affinities, focusing chiefly on the interactions of protein considered to be drug targets with small, drug-like molecules [15]. We downloaded the data on June 2014. We extracted drug-target associations and references from the file "BindingDB_All.tsv", in which UniProt AC was used to represent the target's identifier. We also extracted the drugs' external links to DrugBank, ChEBI, PubChem Compound and Substance from the TSV file.

Drug and target name normalization

We created the WinDTome that consists of the drug-target associations from six databases. To normalize drug and target names from the six sources, we utilized the drug information from PubChem and gene/protein information from Entrez Gene and UniProt. PubChem is an open and comprehensive database with information on the biological activities of small molecules collected from 276 data sources. It contains three linked databases: Substances, Compounds, and Bioassay [32]. In this study, we utilized the Compound CIDs and their synonyms as a major part of drug normalization. We first downloaded the "CID-Synonym-filtered" file from PubChem on April 2014. If external links for PubChem Compound in the original data sources are not available, we utilized the string exact matching approach to link the drug's names to PubChem CIDs by the synonyms. For those cannot match to PubChem CID, we kept the original names from their corresponding data sources. We assigned a unified identifier (UID) to each drug and also kept the five identifiers including drug name, DrugBank ID, PubChem CID, PubChem Substance ID (SID), ChEBI ID. In this study, we utilized the gene symbols to represent the drug targets by matching the gene ID and symbol to UniProt ACs. The human gene information and annotation are available in the "gene_info" file, which was downloaded from NCBI Gene on May 2014 [33]. The file provides gene's ID, name, symbol, and taxonomy information. We utilized the gene ID, name, and symbol to present the drug target and gene taxonomy to filter out the targets that not belong to human. The UniProt is built as a depository of protein knowledge including sequence and functional information [34]. The data set of UniProt is available at http://www.uniprot.org/downloads (accessed May 2014). The mapping file "idmapping.dat" was served as a linkage between UniProt AC and Entrez Gene ID, and it helps to normalize heterogeneous target name or identifier in different drug target databases to Entrez Gene ID.

Drug and target classification

For further exploring the drug and target classification, we explored the Anatomical Therapeutic Chemical (ATC) code and Protein Analysis Through Evolutionary Relationships (PANTHER) protein class tool to classify the normalized drugs and targets, respectively. The ATC Classification System, controlled by the World Health Organization Collaborating Centre for Drug Statistics Methodology (WHOCC), is used for the classification of drugs. This pharmaceutical coding system hierarchically categorizes drugs into different groups based on the drugs' therapeutic and chemical properties. In this study, the drug's ATC code is obtained from both DrugBank and KEGG. PANTHER is a comprehensive, curated database of protein families [35]. The analysis was performed on January 2014. The UniProt ACs used in PANTHER as protein IDs were converted to Entrez Gene IDs for being compatible with the unified target ID in WinDTome. In order to simplify the drug targets' protein classes, we assigned the "protein class" as level 1 class. In the study, we utilized level 2 classes.

Scoring system

We designed two scoring systems to assess the reliability of the drug-target associations in the WinDTome: Score_S and Score_R. For each drug-target pair, the Score_S is its frequency of the presence in the six sources. If a drug-target association exists in all the six sources, its Score_S will be 6, and it was considered a highest reliable association. As discussed in the introduction section, some drug-target associations might originated from HTS studies, which have the limitation of the specificity and accuracy in drug-target associations. To distinguish the HTS studies from small-scale studies needs a large amount of manual checking, which requires extensive time for domain experts to accomplish it. In order to address this issue, we developed a Score_R to assess a drug-target association's reliability based on the number of references and the reference's specificity. For each drug-target association (x), Score_R was calculated by the following equation: n is the number of PubMed references supporting the association x; rrepresents the ith reference supporting the association x; fis the number of drug-target associations supported by the reference r; dand trepresent the numbers of drugs and targets reported in the reference r, respectively. An HTS study tends to have either high d and low t or low d and high t. Thus, given that two references have the same frequency in entire the data set, the reference of HTS study would have a lower score than the specific drug-target study does. Moreover, Score_R was also constructed of an intuitive assumption: the more references, the higher score. Thus, Score_R was a score of an accumulation of all the specificity scores of the references related to a drug-target association. The higher Score_R of a drug-target association represents the higher reliability.

Network-based inference of potential drugs to treat SCZ

To demonstrate the application of WinDTome for drug repurposing, we inferred the drugs that have not yet been approved for SCZ treatment (non-SCZ drugs) as potential SCZ drugs based on the drugs used to treat SCZ (SCZ drugs) and their known targets. The rationale of this inference was based on the assumption that drugs having same targets with SCZ drugs, or drugs whose targets have direct interactions with SCZ drug target, might be potential SCZ drugs. In our previous study, we collected 32 SCZ drugs [36]. In this study, we updated the drug list by manually checking the drugs' indication field in the DrugBank. For each drug, if the field contains any keyword of "schizophrenia," "schizophrenic," "schizotypy," or "schizotypal," it may be a known SCZ drug. After manually check, we obtained 41 SCZ drugs. Then we obtained their targets from the DrugBank as known SCZ drug targets. Starting the known SCZ drug targets and drug-target associations from WinDTome, we first obtained a group of non-SCZ drugs having associations with SCZ drugs' targets. Secondly, from drug-target associations from WinDTome, we obtained another group of non-SCZ drugs by requiring their target proteins are encoded by SCZ candidate genes [37] and these target proteins have interactions with known SCZ drug target proteins. The protein-protein interactions used here were obtained from the Protein Interaction Network Analysis (PINA) [38] (accessed in December 2012). During the two processes, we further required the drug-target associations with Score_S ≥ 3 in WinDTome.

SCZ-relevant clinical trials

To evaluate the potential of the drugs that we inferred to treat SCZ, we exploited the SCZ-relevant clinical trials collected in the ClinicalTrials.gov (https://clinicaltrials.gov/). The ClinicalTrials.gov is a web-based resource that provides the information of publically and privately supported clinical studies on a wide range of diseases and conditions. To identify evidences of existing SCZ-relevant clinical trials to support the predictive potential SCZ drugs, we searched the drug names against ClinicalTrials.gov by setting the "Conditions" as "schizophrenia" and the "Interventions" as the name of each potential SCZ drug.

Results

Overview of the normalized drug-target interactome

After extracting drug-target interactome from the six databases, we normalized drug names and target names for uniting identifiers and Entrez Gene IDs, respectively (Figure 1). In this process, heterogeneous identifiers of drugs and targets were mapped to the unified identifier systems (see Materials and Methods) to reduce the redundant drug-target associations among these sources. To further purify the data of WinDTome, we removed the drug-target associations in which 1) the length of the drug name is less than three, 2) the drug name is a numerical digit, and 3) the target does not belong to the human. Thus, we obtained in a total of 546,196 drug-target associations composing of 303,018 drugs and 4,113 human targets (Supplementary Table S1, Additional file 1). We further summarized the number of drugs, targets, and interactions for each database and their overlap proportion between any two databases (Table 1). We defined the proportion as the number of intersections divided by the smaller number of the two sources to represent the coverage of the overlap in the smaller data source. Among the 15 values of overlap proportion among the six databases, only three of the drug and target overlaps were much than 50%, and no values of the interactions were much than 50%. The results showed that, though some duplicated drug-target associations existed among the six drug-target databases, most of them had the lower overlap proportions. The observation indicated that these six sources have diverse types of drug-target association, which may be complementary to each other. Among these drugs, 1,572 were approved drugs, according to DrugBank and TTD. They had 3,041 targets and formed 22,059 drug-target associations.
Figure 1

The generation of the weighted and integrated drug target interactome (WinDTome). The processes of building WinDTome included three steps: data extraction and pre-processing (left); drug and target name normalization; and integration (right). The name normalization is the process to convert heterogeneous identifiers into one unified identifier. In this study, we utilized external data sources to convert heterogeneous drug and target identifiers into unified drug and target identifiers, respectively.

Table 1

Summary of the drugs, targets, and their interactions in six data sources and their overlap

Overlap (numbera/proportionb)

SourceDrugBankKEGGTTDMATADORPDSPBindingDB
DrugDrugBank4,3160.6790.3690.5900.0450.269
KEGG7611,1210.4670.5500.1350.231
TTD1,59452414,0730.4050.0690.425
MATADOR4474173077580.1990.170
PDSP1951513401514,9470.004
BindingDB1,1622595,98412922290,874

TargetDrugBank2,1020.6220.5100.1080.3240.182
KEGG2954740.4180.4090.2830.348
TTD4791989400.1130.4340.526
MATADOR1981941061,8400.2620.040
PDSP474163381450.152
BindingDB37216549473222,048

Drug-targetDrugBank9,8860.4400.2000.0920.0390.149
KEGG1,5533,5320.1660.2540.0670.117
TTD1,97658622,7910.0320.0420.386
MATADOR90689834110,6870.0230.017
PDSP38423749024711,7620.007
BindingDB1,4714158,80417981503,378

aThe value on the diagonal line of each matrix represents the numbers of drug/target/drug-target pair in each source. The value under the diagonal line of each matrix shows the number of overlap between two sources.

bThe value above the diagonal line of each matrix shows the proportions of overlap between two sources. The proportion is defined as the number of intersection dividing by the smaller number of the two sources.

The generation of the weighted and integrated drug target interactome (WinDTome). The processes of building WinDTome included three steps: data extraction and pre-processing (left); drug and target name normalization; and integration (right). The name normalization is the process to convert heterogeneous identifiers into one unified identifier. In this study, we utilized external data sources to convert heterogeneous drug and target identifiers into unified drug and target identifiers, respectively. Summary of the drugs, targets, and their interactions in six data sources and their overlap aThe value on the diagonal line of each matrix represents the numbers of drug/target/drug-target pair in each source. The value under the diagonal line of each matrix shows the number of overlap between two sources. bThe value above the diagonal line of each matrix shows the proportions of overlap between two sources. The proportion is defined as the number of intersection dividing by the smaller number of the two sources. After the normalization of drug-target interactome extracted from different sources, we further classified the drugs and the proteins by ATC drug classification system and PANTHER protein class tool, respectively. Supplementary Table S2, Additional file 2 shows the number of drugs in each drug class. Based on the drug's ATC annotation from DrugBank and KEGG, among the 303,018 drugs in WinDTome, 1,856 had ATC codes, of which 1,502 drugs had one ATC code while 354 drugs had multiple ATC codes. Supplementary Table S3, Additional file 2 summarizes the number of protein targets in each protein class in WinDTome. Among the 4,113 protein targets in WinDTome, 2,888 were found in PANTHER. Among them, 2,240 had been annotated to be one single protein class while the rest 648 targets have been annotated to be multiple protein classes. The top 5 protein classes of drug's targets in WinDTome were receptor, transporter, oxidoreductase, kinase, and enzyme modulator.

Drug-target network and its scoring system

To measure the reliability of each drug-target association in WinDTome, we designed two scores (Score_S and Score_R) were designed to. Score_S and Score_R were calculated based on diverse supports: the former was from the number of the concordance of the six sources while the latter was from the summation of the specificities of reference papers. Table 2 shows the numbers of drugs, targets, and drug-target interactions and the average Score_R in each group of Score_S of WinDTome and those in the subset with drug having ATC code. Figure 2A and Figure 2B show the distributions of Score_S and Score_R, respectively. To examine the consistency between the two scores, we performed Pearson correlation analysis. Figure 2C shows the Score_S and Score_R had a positive correlation (r = 0.423). Additionally, the correlation between Score_S and Score_R was 0.546 if we only considered the drug-target associations in which the drugs have ATC codes. Based on the correlation between Score_S and Score_R, we utilized the Score_S ≥ 3 as a threshold for further analysis.
Table 2

The distribution of the drugs, targets, their associations, and their Score_R and Score_S scores

All drugsDrugs having ATC codes

Score_Sa#Drugs#Targets#Drug-targetsScore_Rb#Drugs#Targets#Drug-targetsScore_Rb
≥ 1303,0184,113546,1960.0591,8562,87224,3810.358
≥ 28,26994613,7280.5341,2025893,2841.644
≥ 31,0343581,5882.4807532911,2562.905
≥ 43671224344.2173671224344.217
≥ 58727884.5698727884.569
≥ 62124.7252124.725

aThe score was calculated based on the occurrence of the drug-target interaction in the six databases.

bThe score was calculated according to the references of drug-target associations.

Figure 2

Distributions and comparison of Score_S and Score_R. A) The y-axis represents the frequency of drug-target association's Score_S. The label above a bar shows the exact number of the frequency. B) The y-axis represents the frequency of drug-target association's Score_R. The label above a bar shows the exact number of the frequency. C) This bar chart shows the average Score_Rs of all drug-target associations and the associations involving the drugs with ATC codes (represented as "Drug_ATC") in each group of Score_S, respectively. The label above a bar represents the average value of Score_R. The error bar is shown and defined as the standard error of the mean (SEM) of the Score_Rs in each group of Score_S. The r value is the Pearson correlation coefficient between Score_S and Score_R.

The distribution of the drugs, targets, their associations, and their Score_R and Score_S scores aThe score was calculated based on the occurrence of the drug-target interaction in the six databases. bThe score was calculated according to the references of drug-target associations. Distributions and comparison of Score_S and Score_R. A) The y-axis represents the frequency of drug-target association's Score_S. The label above a bar shows the exact number of the frequency. B) The y-axis represents the frequency of drug-target association's Score_R. The label above a bar shows the exact number of the frequency. C) This bar chart shows the average Score_Rs of all drug-target associations and the associations involving the drugs with ATC codes (represented as "Drug_ATC") in each group of Score_S, respectively. The label above a bar represents the average value of Score_R. The error bar is shown and defined as the standard error of the mean (SEM) of the Score_Rs in each group of Score_S. The r value is the Pearson correlation coefficient between Score_S and Score_R.

Application of WinDTome for drug repurposing using SCZ as an example

To illustrate the application of WinDTome for drug repurposing, we utilized the SCZ as an example. We collected 41 known SCZ drugs and their 41 target proteins. Among the 41 proteins, 34 existed in the drug-target interactions in the WinDTome with highly confident associations (Score_3 ≥ 3) and were targeted by 224 non-SCZ drugs. These non-SCZ drugs were defined as the first set of potential SCZ drugs. From the 51 proteins, we obtained 563 proteins having interactions with them. Among them, 24 were encoded by SCZ-related genes, from which we obtained 46 non-SCZ drugs that targeted the 7 SCZ-related proteins from WinDTome with highly confident associations (Score_S ≥ 3). We defined them as the second set of potential SCZ drugs. After putting them together, we obtained the 264 unique non-SCZ drugs and 41 unique proteins (34 SCZ drug targets and 7 SCZ-related proteins). To assess if these non-SCZ drugs have potential to treat SCZ, we utilized the clinical trial studies to examine how many drugs have been investigated in schizophrenia or other mental diseases. After querying the 264 non-SCZ drugs against ClinicalTrials.gov, we found that 39 non-SCZ drugs have been investigated in the 82 SCZ-relevant clinical trials and more 74 non-SCZ drugs have been investigated in the 1,831 mental disorders-relevant clinical trials, respectively. There were 512 drugs in the 82 SCZ-relevant clinical trials. Figure 3 summarized that the overlap between the 264 non-SCZ drugs and the drugs in SCZ-relevant clinical trials, listed the 39 drugs and their identification methods (directly from SCZ drug targets, or from target interactors), and also summarized the number of non-SCZ drugs that have been investigated in the other mental diseases in the ClinicalTrials.gov.
Figure 3

The 39 potential SCZ drugs and their comparison with other drugs with clinical trials. Target type means the category of the potential SCZ drug's target. It indicated that inference methods for the potential SCZ drugs. SCZ: schizophrenia; MTL: mental disorders.

The 39 potential SCZ drugs and their comparison with other drugs with clinical trials. Target type means the category of the potential SCZ drug's target. It indicated that inference methods for the potential SCZ drugs. SCZ: schizophrenia; MTL: mental disorders. Table 3 shows the detail number of the clinical trials and the ATC categories of these 39 drugs. The 39 drugs had 24 targets, and Table 4 shows their protein classes. We observed that most of them were either receptors or transporters. Figure 4 represents the expanding SCZ drug-target network plus potential SCZ drugs. To make the figure simple, we did not include the interactions among SCZ drug targets or the interactions among SCZ-related proteins. Additionally, if a potential SCZ drug had associations with multiple SCZ genes/SCZ drug targets, or a SCZ drug target had associations with multiple SCZ drugs, we utilized the associations with the highest Score_S and Score_R to represent the existence of the potential SCZ drugs. The network mainly contained three subnetworks including Olanzapine-centered subnetwork, Clozapine-centered subnetwork, and a subnetwork containing Asenapine, Imipramine, and Risperdone.
Table 3

Potential drugs for schizophrenia treatment

Potential SCZ drugIndicationaATC codesNumber of SCZ clinical trials
CitalopramDepressionN06AB04; N06AB1013
AtomoxetineADHDbN06BA098
MemantineParkinson's diseaseN06DX016
ParoxetineDepressionN06AB056
SertralineDepressionN06AB065
FluvoxamineDepressionN06AB084
LorazepamAnxietyN05BA064
OndansetronNausea and vomitingA04AA014
DexmethylphenidateADHDN06BA113
FluoxetineDepressionN06AB033
MethylphenidateADHDN06BA043
BenztropineParkinson's disease2
BetahistineObesityN07CA012
ClonidineADHDC02AC01; N02CX02; S01EA04; S01EA032
FamotidinePeptic ulcer diseaseA02BA032
GuanfacineHypertensionC02AC022
MirtazapineDepressionN06AX112
PergolideParkinson's diseaseN04BC022
ReboxetineDepressionN06AX182
ZolpidemInsomniaN05CF022
AgomelatineDepressionN06AX221
BromocriptineParkinson's diseaseG02CB01; N04BC011
BuspironeAnxietyN05BE011
CinnarizineNausea and vomitingN07CA021
CyproheptadineAllergiesR06AX021
DesipramineDepressionN06AA011
DexmedetomidineAnxietyN05CM181
DiazepamAnxietyN05BA01; N05BA171
DopamineParkinson's diseaseC01CA041
LevodopaParkinson's diseaseN04BA01; N04BA041
MethamphetamineADHDN06BA031
NaratriptanMigraine headachesN02CC021
NitrazepamInsomniaN05CD021
NizatidinePeptic ulcer diseaseA02BA041
PramipexoleParkinson's diseaseN04BC051
PromethazineAllergiesD04AA10; R06AD02; R06AD051
SibutramineObesityA08AA101
TrazodoneDepressionN06AX051
ZotepineN05AX111
Total82

aThe information of drug indication was obtained from DrugBank and TTD.

bADHD: attention deficit hyperactivity disorder.

Table 4

Protein classification of the potential SCZ drug targets

Protein classProteinPotential SCZ drugs withclinical trials
ReceptorADRA1ATrazodone

CHRM1BenztropineDesipramine

DRD1PergolideDopamine

DRD2BromocriptinePergolide
PramipexoleDopamine
BuspironeLevodopa

DRD3BromocriptinePergolide
PramipexoleDopamine

DRD4PramipexoleLevodopa
Dopamine

DRD5Dopamine

HRH1BetahistineCinnarizine
CyproheptadineDesipramine
PromethazineMirtazapine

HRH2FamotidineNizatidine

HTR1ANaratriptanBuspirone

HTR1BNaratriptan

HTR1DNaratriptan

HTR2AMirtazapineZotepine
Trazodone

HTR2CAgomelatineMirtazapine

TransporterSLC6A2AtomoxetineDesipramine
MethamphetamineReboxetine
MethylphenidateSibutramine

SLC6A3DexmethylphenidateMethylphenidate

SLC6A4CitalopramFluoxetine
FluvoxamineParoxetine
MethamphetamineSertraline
SibutramineTrazodone

Transporter;GABRA1LorazepamDiazepam
ReceptorNitrazepamZolpidem

GABRG2LorazepamDiazepam

GRIN2BMemantine

HTR3AOndansetronMirtazapine

(Not available)ADRA2ADexmedetomidineClonidine
MirtazapineGuanfacine

ADRA2BClonidine

ADRA2CDexmedetomidineClonidine
Figure 4

Expanding drug-target network for predicting potential SCZ drug. The red rectangular represents the SCZ drug; the blue rectangular represents the potential SCZ drug; the blue rectangular with red border represents the potential SCZ drug related to SCZ-relevant clinical trial; the green ellipse represents the SCZ drug target; the orange ellipse represents the SCZ-related proteins; the solid line between drug and gene represents the highly confident drug-target association; and the dotted line represents the protein-protein interaction.

Potential drugs for schizophrenia treatment aThe information of drug indication was obtained from DrugBank and TTD. bADHD: attention deficit hyperactivity disorder. Protein classification of the potential SCZ drug targets Expanding drug-target network for predicting potential SCZ drug. The red rectangular represents the SCZ drug; the blue rectangular represents the potential SCZ drug; the blue rectangular with red border represents the potential SCZ drug related to SCZ-relevant clinical trial; the green ellipse represents the SCZ drug target; the orange ellipse represents the SCZ-related proteins; the solid line between drug and gene represents the highly confident drug-target association; and the dotted line represents the protein-protein interaction.

Evaluation of prediction performance

Based on the data from WinDTome (Score_S ≥ 3), we inferred 264 potential SCZ drugs. According to the clinical trial records in the ClinicalTrials.gov, among them, 39 drugs have been investigated in the schizophrenia treatment and 74 drugs have been investigated in the intervention of other mental diseases. To further evaluate the performance of the prediction, we designed three comparative strategies, including utilizing different score thresholds, single data source, and the data from external integrative database STITCH. Table 5 summarizes the comparative results.
Table 5

Potential SCZ drugs identified by using different data sources and threshold

Data sourceThreshold#Potential SCZ drugs#Drugs in SCZ-CTa#Drugs in MTL-CTb
WinDTomeScore_S ≥ 326439 (14.8%)74 (28.0%)
Score_S ≥ 21,35476 (5.6%)102 (7.5%)
Score_S ≥ 137,680191 (0.5%)292 (0.8%)

DrugBank49045 (9.2%)87 (17.8%)
KEGG37530 (8.0%)70 (18.7%)
TTD2,38827 (1.1%)62 (2.6%)
MATADOR37448 (12.8%)82 (21.9%)
PDSP2,00139 (1.9%)59 (2.9%)
BindingDB34,74841 (0.1%)98 (0.3%)

STITCHCombined score ≥ 0.732,282100 (0.3%)241 (0.7%)

aSCZ-CT represents the schizophrenia-relevant clinical trials.

bMTL-CT represents the mental disorders-relevant clinical trials.

Potential SCZ drugs identified by using different data sources and threshold aSCZ-CT represents the schizophrenia-relevant clinical trials. bMTL-CT represents the mental disorders-relevant clinical trials. Using the data in WinDTome whose Score_S no less than 2 or 1, we performed the inference process as the data with Score_S were no less than 3, respectively, and calculated the proportion of the potential SCZ drugs that have been investigated to treat SCZ in the clinical trials. We observed that the proportion of the potential SCZ drugs found in SCZ-relevant clinical trials based on Score_S ≥ 3 as a threshold (14.8%) was higher than that of the threshold Score_S ≥ 2 (5.6%) and the threshold Score_S ≥ 1 (0.5%), respectively. Similarly, the proportion of the potential SCZ drugs found in other mental disorders, according to clinical trials using the threshold of Score_S ≥ 3 was higher than those based on the other two thresholds (28.0% vs. 7.5% and 0.8%). We repeated the inference process by using the data from each data source for the WinDTome creation. Some performance based on single data sources were higher than those based WinDTome with Score_S ≥ 2 or Score_S ≥ 1; however, none of them is higher than those of using WinDTome with Score_S ≥ 3. It implies that the scoring system we used is promising in conducting network pharmacology studies, and the recommending cutoff of Score_S is reasonable. Finally, we performed the prediction using the data from the STITCH for potential SCZ drugs. We extracted 455,430 high confident (combined score ≥ 0.7) drug-target associations among 199,133 drugs and 9,379 unique human proteins from STITCH. We obtained 29,405 non-SCZ drugs having associations with known SCZ drug targets and 5,447 non-SCZ drugs targeting the 51 SCZ-related proteins. In total, we obtained 32,282 unique non-SCZ drugs. In the ClinicalTrials.gov, we found 100 out of the 32,282 potential SCZ drugs in SCZ-relevant clinical trials and 241 potential SCZ drugs in other mental disorders-relevant clinical trials. In summary, the comparative results indicated that the scoring strategy designed in this study provides an assessment of the reliability of the drug-target associations, and WinDTome provides weighted drug-target associations.

Discussion

In this study, we built a weighted and integrated drug-target interactome (WinDTome) to provide a fundamental for computational drug repurposing. In WinDTome, we assigned two scores for each drug-target association, which provides an assessment of the reliability of the drug-target associations. Starting from the drug-target associations in WinDTome and SCZ known drugs and their targets, we inferred the potential drugs for SCZ treatment. The illustration indicates that weighted WinDTome provides one promising source for drug repurposing. We inferred the 264 potential candidate drugs that could be used to treat SCZ. Among them, 39 potential drugs have been investigated in schizophrenia according to the clinical trial records from the clinicaltrials.gov. We further checked the indications of these drugs in DrugBank and TTD and then categorized them into six indication groups: depression, anxiety, Parkinson's disease, insomnia, attention deficit hyperactivity disorder (ADHD), and others (Table 3). Most of them are related to mental disorders. Among the 264 potential SCZ drugs, 113 have been investigated in the mental disorders. Among them, the drugs treating depression have the most number of clinical trials. It is not surprising that the four potential SCZ drugs - citalopram, paroxetine, sertraline, and fluvoxamine - might have relevant mechanisms and resembling biological pathways with existing SCZ drugs because depression is one of the negative symptoms of schizophrenia. In addition to the positive and negative symptoms, SCZ patients are highly inclined to have comorbidities such as anxiety disorders, depression, and diabetes [39,40]. Interestingly, in the study "NCT01401491" (an identifier in the ClinicalTrials.gov) [41], Fluvoxamine was combined with clozapine to treat schizophrenia patients, and the clinical improvement was observed in the investigators' patients. The investigators found that fluvoxamine could reduce the clozapine-induced side effects of metabolic disturbance and obesity. The use of fluvoxamine as a co-administration for the treatment of cognitive impairments in patients with schizophrenia has been reported [42,43], and the potential of Fluvoxamine for treating schizophrenia was also proposed [44]. In the 39 potential SCZ drugs with clinical evidences, we also found that betahistine and sibutramine are used to treat obesity. Betahistine, a nervous system drug, and Sibutramine, a centrally acting antiobesity product, were proposed to be co-administrations for treating schizophrenia patients, respectively [45,46]. These findings were consistent to the a previous study mentioning that prescribed co-administrations of antipsychotics drugs and atypical substances that have been increasing for treating SCZ patients along with various comorbidities [47]. Moreover, both Parkinson's disease and schizophrenia are related to the dopaminergic system [48]. Thus, the four Parkinson's disease drugs - benztropine, pergolide, levodopa, and pramipexole - may also have potential for schizophrenia treatment [49-51]. The inference strategy for drug repurposing started with the known drugs that have been used to treat SCZ and their targets. The underlying assumption is that the potential drugs for a given disease should have similar molecular mechanisms or actions to the known drugs. Therefore, this strategy provides a methodology for any diseases with effective medicines. To examine the robustness of the network-based drug repurposing methodology in this study, we further performed the similar inference of drug repurposing using colon cancer as another example. We manually collected 12 approved colon cancer drugs from DrugBank (aflibercept, bevacizumab, capecitabine, cetuximab, fluorouracil, irinotecan, leucovorin, oxaliplatin, panitumumab, raltitrexed, regorafenib, and trimetrexate) and 38 drug targets (human genes), and the colon cancer associated genes from Cancer Gene Census [52]. By implementing the same processes of network-based inference, we obtained 26 potential colon cancer drugs with high confident associations with known colon cancer drug targets. Among them, 16 were found in clinical trials whose condition was "colon cancer." The numbers of these 16 drugs' colon cancer relevant trials, as well as the drugs' indications and ATC codes, are provided in Supplementary Table S4, Additional file 2. In addition, the protein classes of these 16 potential colon cancer drug targets are provided in Supplementary Table S5, Additional file 2. In summary, this preliminary result further suggested that the network-based inference for drug repurposing by using the drug-target associations in WinDTome is promising. The inference of drug repurposing mainly relies on the drug-target interactome. In this study, based on the analysis of Score_S and Score_R distributions, we mainly employed highly confident drug-target associations (Score_S ≥ 3) to identify the candidates. However, some potential candidates will be missed compared to the inference based on the concept of off-target and drug target similarity. Thus, integrating the other data, such as drug structure similarity, off-target information, and target similarity, might improve further the performance. Therefore, it still is necessary to develop a more comprehensive scoring system for the drug-target associations. Additionally, in this study, we mainly employed the direct protein-protein interactions (PPIs) of the drug targets and disease genes, which is restricted to the network environment information. In the future, we will consider the effects of drug targets on the disease genes in the context of PPI networks.

Conclusion

In this study, we generated a weighted and integrated drug-target interactome (WinDTome), which provides a reliable source of drug-target interactome. We utilized the WinDTome along with a network pharmacology approach for drug repurposing for SCZ as an example. Eventually, we successfully obtained 264 potential SCZ drugs, including 39 drugs that have been found in existing SCZ-relevant clinical trials and 74 drugs that have been investigated in other mental disorders. We further compared the results with the inference using different thresholds, single data source, and the data from the commonly used database STITCH. The comparative result further demonstrated that the WinDTome is promising for further systems pharmacology study.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

LH and JS collected data for the study. LH and JS performed data analysis. JS and HX conceived and designed the study. LH and JS wrote the manuscript. ES, WZ, JS, ZZ revised the manuscript.
  48 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 2.  Drug repositioning: identifying and developing new uses for existing drugs.

Authors:  Ted T Ashburn; Karl B Thor
Journal:  Nat Rev Drug Discov       Date:  2004-08       Impact factor: 84.694

Review 3.  Can the pharmaceutical industry reduce attrition rates?

Authors:  Ismail Kola; John Landis
Journal:  Nat Rev Drug Discov       Date:  2004-08       Impact factor: 84.694

Review 4.  Dopamine transporter ligands: recent developments and therapeutic potential.

Authors:  Scott P Runyon; F Ivy Carroll
Journal:  Curr Top Med Chem       Date:  2006       Impact factor: 3.295

5.  Drug-target network.

Authors:  Muhammed A Yildirim; Kwang-Il Goh; Michael E Cusick; Albert-László Barabási; Marc Vidal
Journal:  Nat Biotechnol       Date:  2007-10       Impact factor: 54.908

Review 6.  Medical morbidity and mortality in schizophrenia: guidelines for psychiatrists.

Authors:  Donald C Goff; Corinne Cather; A Eden Evins; David C Henderson; Oliver Freudenreich; Paul M Copeland; Michael Bierer; Kenneth Duckworth; Frank M Sacks
Journal:  J Clin Psychiatry       Date:  2005-02       Impact factor: 4.384

7.  Effects of dopaminergic modulation on automatic semantic priming: a double-blind study.

Authors:  Christina Andreou; Kristina Veith; Vasilis P Bozikas; Tania M Lincoln; Steffen Moritz
Journal:  J Psychiatry Neurosci       Date:  2014-03       Impact factor: 6.186

8.  Comorbidity of medical illnesses among adults with serious mental illness who are receiving community psychiatric services.

Authors:  Joseph Sokal; Erick Messias; Faith B Dickerson; Julie Kreyenbuhl; Clayton H Brown; Richard W Goldberg; Lisa B Dixon
Journal:  J Nerv Ment Dis       Date:  2004-06       Impact factor: 2.254

9.  BindingDB: a web-accessible database of experimentally determined protein-ligand binding affinities.

Authors:  Tiqing Liu; Yuhmei Lin; Xin Wen; Robert N Jorissen; Michael K Gilson
Journal:  Nucleic Acids Res       Date:  2006-12-01       Impact factor: 16.971

10.  ChEBI: a database and ontology for chemical entities of biological interest.

Authors:  Kirill Degtyarenko; Paula de Matos; Marcus Ennis; Janna Hastings; Martin Zbinden; Alan McNaught; Rafael Alcántara; Michael Darsow; Mickaël Guedj; Michael Ashburner
Journal:  Nucleic Acids Res       Date:  2007-10-11       Impact factor: 16.971

View more
  5 in total

1.  Identify potential drugs for cardiovascular diseases caused by stress-induced genes in vascular smooth muscle cells.

Authors:  Chien-Hung Huang; Jin-Shuei Ciou; Shun-Tsung Chen; Victor C Kok; Yi Chung; Jeffrey J P Tsai; Nilubon Kurubanjerdjit; Chi-Ying F Huang; Ka-Lok Ng
Journal:  PeerJ       Date:  2016-09-28       Impact factor: 2.984

2.  Improving language model of human genome for DNA-protein binding prediction based on task-specific pre-training.

Authors:  Hanyu Luo; Wenyu Shan; Cheng Chen; Pingjian Ding; Lingyun Luo
Journal:  Interdiscip Sci       Date:  2022-09-22       Impact factor: 3.492

3.  COMPUTING THERAPY FOR PRECISION MEDICINE: COLLABORATIVE FILTERING INTEGRATES AND PREDICTS MULTI-ENTITY INTERACTIONS.

Authors:  Sam Regenbogen; Angela D Wilkins; Olivier Lichtarge
Journal:  Pac Symp Biocomput       Date:  2016

4.  Education, collaboration, and innovation: intelligent biology and medicine in the era of big data.

Authors:  Jianhua Ruan; Victor Jin; Yufei Huang; Hua Xu; Jeremy S Edwards; Yidong Chen; Zhongming Zhao
Journal:  BMC Genomics       Date:  2015-06-11       Impact factor: 3.969

Review 5.  Repurposing Drugs via Network Analysis: Opportunities for Psychiatric Disorders.

Authors:  Trang T T Truong; Bruna Panizzutti; Jee Hyun Kim; Ken Walder
Journal:  Pharmaceutics       Date:  2022-07-14       Impact factor: 6.525

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.