| Literature DB >> 25739475 |
Hyojung Paik1, Ah-Young Chung2, Hae-Chul Park2, Rae Woong Park3, Kyoungho Suk4, Jihyun Kim3, Hyosil Kim3, KiYoung Lee3, Atul J Butte5.
Abstract
Prediction of new disease indications for approved drugs by computational methods has been based largely on the genomics signatures of drugs and diseases. We propose a method for drug repositioning that uses the clinical signatures extracted from over 13 years of electronic medical records from a tertiary hospital, including >9.4 M laboratory tests from >530,000 patients, in addition to diverse genomics signatures. Cross-validation using over 17,000 known drug-disease associations shows this approach outperforms various predictive models based on genomics signatures and a well-known "guilt-by-association" method. Interestingly, the prediction suggests that terbutaline sulfate, which is widely used for asthma, is a promising candidate for amyotrophic lateral sclerosis for which there are few therapeutic options. In vivo tests using zebrafish models found that terbutaline sulfate prevents defects in axons and neuromuscular junction degeneration in a dose-dependent manner. A therapeutic potential of terbutaline sulfate was also observed when axonal and neuromuscular junction degeneration have already occurred in zebrafish model. Cotreatment with a β2-adrenergic receptor antagonist, butoxamine, suggests that the effect of terbutaline is mediated by activation of β2-adrenergic receptors.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25739475 PMCID: PMC4894399 DOI: 10.1038/srep08580
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of the data used
| Omics class | Features | Total no. |
|---|---|---|
| Clinical data | Total no. of cases | 1,011,055 |
| Total diagnosis code types (KCD6) | 10,874 | |
| Total diagnosis types (ICD10) with OMIM disease ids | 425 | |
| Total no. of diagnosis records | 2,788,135 | |
| Total drug medication types (drug order/ATC codes) | 5,350/1,003 | |
| No. of drug medication records | 8,693,995 | |
| Total laboratory test types | 246 | |
| Total no. of laboratory test records | 9,494,169 | |
| Total no. of cases with laboratory test records | 313,347 | |
| Mean laboratory tests per case | 30.2 | |
| Genomic data | Total no. of diseases (OMIM disease ids) | 17,986 |
| No. of disease-related genes | 11,804 | |
| No. of diseases that matched diagnosis codes (ICD10) | 1,096 | |
| Total drug types (ATC codes) | 1,615 (691 | |
| No. of drug-related genes | 14,466 | |
| Total no. of protein–protein interactions | 123,726 | |
| Selected no. of PPIs (physical interactions | 112,988 | |
| Total no. of human genes | 42,130 | |
| No. of GO terms (related genes) | 12,015 (17,919) | |
| No. of GO terms with evidence codes | 6,868 (8,671) |
aOne case was an admission–discharge event for a patient.
bThe medication order was determined by the drug order code. The ATC code denotes the chemical compound name of a drug.
cData resources: DrugBank (download date: 2011.10.23), CTD (download date: 2011.10.23), and STITCH (download date: 2011.10.23).
dData resources: HPRD (download date: 2012.01.05), BioGrid (download date: 2011.12.27), IntAct (download date: 2011.11.20), MINT (download date: 2012.01.05), and DIP (download date: 2011.10.30).
ePhysical interactions were determined by the PSI-MI codes: physical interaction (MI:0218), direct interaction (MI:0407), and physical association (MI:0915).
fGO evidence codes (EXP, IDA, IPI, IGI, and IEP).
*Total number of diagnosis codes used for further analysis of disease–disease similarities.
**Total number of drugs used for further analysis of drug–drug similarities.
Figure 1Overview of ClinDR.
(A) Construction of a drug–disease network. Known associations between drugs (circle nodes) and target diseases (square nodes) are represented as a bipartite network (black lines). We utilized existing drug prescription records in our EMRs and public drug indication resources to generated standard known drug-disease associations. (B) Calculation of drug–drug and disease–disease similarities using clinical signatures, such as distribution or pattern of laboratory test results under drugs or diseases related conditions. For disease pair similarity ClinDR uses the absolute values of individual types of laboratory test performed before any drug treatment. For drug pair similarity, ClinDR uses the changing pattern of laboratory test results during the corresponding drug medication. Then, ClinDR finds the maximum similarity scores across diverse types of laboratory test (C). (D–E) Calculation of drug–drug and disease–disease similarities using genomic signatures. (F) Prediction of final score (f(e) > θ, true) between the query indication (i.e. between drug α and disease a) using the combined clinic and genomic similarity matrixes from (C) and (E). The similarities between drug pairs or disease pairs are represented as edge widths. P and P: the maximum score of a query indication (e) using clinical (P) and genomic (P) data, respectively. βi: a similar drug to α. bi: a similar disease to a.
Summary of the similarity analysis of disease pairs and drug pairs
| Type of similarity | Features | Frequency | |
|---|---|---|---|
| Clinical signatures | Disease–disease | Total laboratory test types | 246 |
| Selected laboratory test types | 11 | ||
| No. of laboratory tests at diagnosis points | 2,703,258 (408,722 | ||
| Drug–drug | Total laboratory test types | 246 | |
| Selected laboratory test types | 9 | ||
| No. of laboratory tests after medication | 9,494,169 (28,234 | ||
| Genomic signatures | Disease–disease | Total no. of GO terms | 6,868 |
| Total no. of genes used in the network analysis | 11,804 | ||
| Drug–drug | Total no. of GO terms | 6,868 | |
| Total no. of genes used in the network analysis | 14,466 |
aThe laboratory test type was determined by the target protein or molecule detected by the serum/urine analysis, such as the total serum cholesterol concentration.
bWe analyzed selected laboratory test results from 246 test types based on the total patient coverage (≥40%). Eleven laboratory tests were selected: erythrocyte sedimentation rate (EST), platelet count, activated partial thromboplastin time (aPTT), AC glucose value, and the GOT, GPT, alkaline phosphatase, total cholesterol, sodium, chloride, and total CO2 concentrations.
cThe laboratory test results were prepared before the administration of drugs.
dFinal number of results used. About 95% of the test results were filtered out because of the absence of matched OMIM disease IDs for diagnosis codes and a lack of patient coverage.
eThe selection criteria were: 1) coverage of total drugs ≥30%; and 2) total observed cases >1,000. Nine laboratory test measures were selected: platelet count, AC glucose value, and the GOT, GPT, alkaline phosphatase, total cholesterol, sodium, chloride and total CO2 concentrations.
fThe laboratory test results were prepared for drug-free and drug-treated patients.
gThe laboratory results were selected based on these criteria: 1) assigned drug order codes with ATC (Anatomical Therapeutic Chemical classification system) codes and assigned PubChem IDs; 2) ≤5 drug treatments; and 3) the laboratory test points were prepared before and after drug administration events.
hIn the present study, we determined the disease–disease and drug–drug similarities based on the distances between GO terms. The detailed methods used to calculate the GO-based similarity measures are described in the Methods section.
iIn the present study, we determined the disease–disease and drug–drug similarities using a network-based membership scoring function. The detailed methods used to calculate this similarity measure are described in the Methods section.
Figure 2Clustering of drug- or disease-pair similarities of clinical data and performance evaluations.
(A) Hierarchical clustering of Wilcoxon rank sum test for disease-disease and drug-drug pairs by distinct laboratory test results. (B) Bar chart for the 10-fold cross-validation of ClinDR with/without clinical physiome signatures and the GBA method. The GBA method presents deterministic results, without AUC. (C) The enrichment test of novel ClinDR repositionings with clinical trials in ClinicalTrials.gov.
Figure 3Schematic view for the repurpose prediction of terbutaline sulfate for ALS.
ClinDR predict terbutaline sulfate (TS) as a promising candidate for ALS by drug-drug and disease-disease similarity analysis. Presented scores in between TS and Ursodeoxycholic acid (UDCA), and ALS and Kawasaki syndrome were analyzed similarity values using clinical signatures from EMRs (0.995 for the similarity between TS-UDCA pair and 0.99 for the disease pair similarity between ALS and Kawasaki syndrome). By integration of clinical (P) and genomic signature based predictions (P), TS was determined as a repositioning candidate for ALS therapy.
Figure 4Experimental validation of terbutaline sulfate repurposing for ALS.
(A, C, F) All panels show lateral views of Tg(olig2:dsred2) spinal cords of zebrafishes, with anterior to the left and dorsal to the top. (A) Terbutaline sulfate (TS) prevent motor axon and neuromuscular junction degeneration of ALS model (d–f). In normal conditions, treatment with TS (c) had nonlethal effects compared with the untreated condition (a). Mt TDP indicates mutant TDP-43 mRNA-injected model and WT means wild type (i.e. normal). (B) Statistical analysis of panel A. Axonal defects indicate fragmentation and reduced lengths of axons. Data were obtained from 4 myotome segments from each of 10 control and 10 TS-treated models. (C) TS rescues the ALS phenotype. Mt TDPs had abnormal motor axon phenotypes at 36 h postfertilization (hpf) (b) and 48 hpf (f) compared with WTs (a, c). These models had clear motor axon and neuromuscular junction (NMJ) defects at 72 hpf (e, g). Mt TDP with 1 mM TS at 36 hpf (c) and 48 hpf (g), respectively, rescued motor axon and NMJ defects at 72 hpf (d, h). (D) Statistical analysis of panel C. (E) Inhibition of therapeutic effect of TS by beta2-adrenergic receptor antagonist, Butoxamine (BTX). In normal conditions, treatment with BTX had no effects compared with the untreated condition (a, c). Co-treatment with TS and BTX inhibits therapeutic effect of TS on ALS phenotype of Mt TDP model (b, d–f). (F) Statistical analysis of panel E. Data was obtained from 8 control and 8 terbutaline sulfate and/or BTX-treated models.