| Literature DB >> 24950817 |
Ruifeng Liu1, Narender Singh, Gregory J Tawa, Anders Wallqvist, Jaques Reifman.
Abstract
BACKGROUND: Despite increased investment in pharmaceutical research and development, fewer and fewer new drugs are entering the marketplace. This has prompted studies in repurposing existing drugs for use against diseases with unmet medical needs. A popular approach is to develop a classification model based on drugs with and without a desired therapeutic effect. For this approach to be statistically sound, it requires a large number of drugs in both classes. However, given few or no approved drugs for the diseases of highest medical urgency and interest, different strategies need to be investigated.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24950817 PMCID: PMC4079911 DOI: 10.1186/1471-2105-15-210
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Schematic bit-string representation of a drug-human protein interaction profile. Each protein is represented by 3 bits to encode drug binding, drug activation, and drug inhibition of the protein, respectively. When a drug has been reported or predicted to bind, activate, or inhibit a protein, the bit representing the specific drug-protein interaction is turned on (assigned a value of 1). Otherwise, the bit is off (assigned a value of 0). M denotes the number of drugs with an approved indication (positive class), N denotes the total number of drugs, f represents on-bit or off-bit of the i-th bit feature, A denotes the number of on-bits of the i-th bit feature in the positive class, and B denotes the number of on-bits of the i-th bit feature in all drugs.
Figure 2Three machine learning approaches for developing data-driven drug repurposing models. The total number of drugs and drug development candidates with drug-protein interaction profiles is 4,902. m denotes the number of drugs with a desirable therapeutic effect (positive class), n represents a subset of m used as the positive class of the training set for model development, and k denotes the number of drugs that do not have a desired therapeutic effect but can be used as false positives (FP) for the purpose of model development. TP: true positive.
Figure 3Performance comparison between type I and type II models. Comparison of enrichment efficiencies of type I (A-C) and type II (D-F) models for high blood pressure (HBP), HIV, and antimalarial drugs. The models were built with one, two, and three drugs in the positive class of the training set. Bar heights denote the fraction of FDA-approved HBP, HIV, and antimalarial drugs in the testing set (type I models) or baseline class (type II models) that scored in the highest 1%, 5%, and 10% of the compounds, respectively. Error bars represent 1 standard deviation from full cross-validation calculations.
Figure 4Impact of false positive on the performance of type II models. The models were built with one to three true positives and either zero or one false positive. Error bars represent 1 standard deviation from full cross-validation calculations. The random bar heights represent the expected fractions of positive drugs in 1%, 5%, and 10% randomly picked baseline compounds. Models constructed with no false positives correspond to type II models (Figure 3,D-F). A: High blood pressure (HBP) model. B: Human immunodeficiency virus (HIV) model. C: antimalarial model. TP: true positive. FP: false positive.
Impact of SEA-predicted drug-protein interactions on Type II model performance
| | | ||||
|---|---|---|---|---|---|
| 1 | 1 | 0.14 | 0.06 | 0.15 | 0.07 |
| 5 | 1 | 0.36 | 0.11 | 0.36 | 0.11 |
| 10 | 1 | 0.50 | 0.14 | 0.48 | 0.14 |
| 1 | 2 | 0.14 | 0.06 | 0.18 | 0.07 |
| 5 | 2 | 0.48 | 0.09 | 0.45 | 0.10 |
| 10 | 2 | 0.64 | 0.09 | 0.60 | 0.11 |
| 1 | 3 | 0.19 | 0.06 | 0.20 | 0.07 |
| 5 | 3 | 0.53 | 0.08 | 0.52 | 0.10 |
| 10 | 3 | 0.70 | 0.08 | 0.67 | 0.10 |
| 1 | 1 | 0.16 | 0.09 | 0.20 | 0.19 |
| 5 | 1 | 0.28 | 0.14 | 0.30 | 0.18 |
| 10 | 1 | 0.37 | 0.17 | 0.39 | 0.21 |
| 1 | 2 | 0.17 | 0.07 | 0.18 | 0.07 |
| 5 | 2 | 0.32 | 0.12 | 0.33 | 0.10 |
| 10 | 2 | 0.44 | 0.13 | 0.43 | 0.15 |
| 1 | 3 | 0.19 | 0.07 | 0.21 | 0.07 |
| 5 | 3 | 0.34 | 0.11 | 0.37 | 0.09 |
| 10 | 3 | 0.48 | 0.10 | 0.50 | 0.12 |
| 1 | 1 | 0.14 | 0.07 | 0.20 | 0.24 |
| 5 | 1 | 0.34 | 0.14 | 0.54 | 0.25 |
| 10 | 1 | 0.70 | 0.22 | 0.79 | 0.22 |
| 1 | 2 | 0.20 | 0.13 | 0.20 | 0.13 |
| 5 | 2 | 0.43 | 0.20 | 0.58 | 0.16 |
| 10 | 2 | 0.73 | 0.15 | 0.89 | 0.10 |
| 1 | 3 | 0.21 | 0.09 | 0.21 | 0.10 |
| 5 | 3 | 0.49 | 0.15 | 0.66 | 0.13 |
| 10 | 3 | 0.77 | 0.13 | 0.91 | 0.07 |
aThe type II models were built with one, two, and three positive drugs in the positive class of the training set. The fraction of positive drugs in the baseline class that scored in the highest 1%, 5%, and 10% of the compounds are recorded for models using the STITCH 3.1 database with and without SEA-predicted drug-protein interactions. Fraction, fraction of known drugs retrieved; σ, standard deviation; SEA, similarity ensemble approach.
Therapeutic information of the drugs assigned to the baseline class that were scored highest by the hypertension model
| Nitrendipine | 103.9 | For mild to moderate | |
| Nimodipine | 93.9 | For use as an adjunct to improve neurologic outcome following subarachnoid hemorrhage. | A 1998 clinical study
[ |
| Alprenolol | 69.7 | For | |
| Nilvadipine | 69.2 | For vasospastic angina, chronic stable angina, and | |
| Oxprenolol | 69.1 | For the treatment of | |
| Norepinephrine | 53.8 | For patients in vasodilatory shock states, also used as a vasopressor medication for patients with critical | |
| Spirapril | 52.1 | An ACE inhibitor for | |
| Lercanidipine | 46.5 | For | |
| Yohimbine | 44.7 | Used as a mydriatic and for the treatment of | |
| Epinephrine | 43.9 | Used in asthma and cardiac failure and to delay absorption of local anesthetics. | A systematic review in 2002
[ |
aThe model was developed with all 55 drugs in the National Institutes of Health hypertension drug list in the positive class; the remaining drugs and drug development candidates were potential repurposing candidates in the baseline class.
bRef. [26].
ACE: angiotensin-converting enzyme.
Therapeutic information of the drugs assigned to the baseline class that were scored highest by the HIV model
| Verapamil | 23.5 | For hypertension, angina, and cluster headache prophylaxis. | A 1991 study
[ |
| Clotrimazole | 22.6 | For oropharyngeal candidiasis, vaginal yeast infections, and fungal infections. | |
| Ketoconazole | 21.5 | For fungal infections. | |
| Dexamethasone | 21.0 | An anti-inflammatory 9-fluoro-glucocorticoid. | A 2001 study
[ |
| Amprenavir | 19.6 | For treatment of | |
| Atorvastatin | 19.3 | For hypercholesterolemia. | A 2004 study
[ |
| Clarithromycin | 18.5 | Antibiotic. | |
| Lovastatin | 18.2 | For hypercholesterolemia. | A 2004 study
[ |
| Quinidine barbiturate | 18.1 | For the treatment of ventricular pre-excitation and cardiac dysrhythmias. | |
| Cimetidine | 17.9 | For acid-reflux disorders (GERD), peptic ulcer disease, heartburn, and acid indigestion. |
aThe model was developed with all 20 HIV drugs in the FDA’s HIV drug list in the positive class; the remaining compounds were potential repurposing candidates in the baseline class.
bRef. [26].
Therapeutic information of the drugs assigned to the baseline class that were scored highest by the malaria model
| Dexamethasone | 22.1 | An anti-inflammatory 9-fluoro-glucocorticoid. | Dexamethasone was reported to have a dramatic life-saving effect on people with cerebral malaria
[ |
| Verapamil | 21.5 | A calcium channel blocker for hypertension, angina, and cluster headache prophylaxis. | A 1995 study
[ |
| Quercetin | 18.9 | A flavonol found in plants, antioxidant. | A 2012 study
[ |
| Miconazole | 16.5 | An imidazole antifungal agent. | Many studies
[ |
| Clotrimazole | 15.9 | An imidazole derivative with a broad spectrum of antimycotic activity. | Many studies
[ |
| Cimetidine | 15.5 | For acid-reflux disorders (GERD), peptic ulcer disease, heartburn, and acid indigestion. | A 1997 study
[ |
| Ketoconazole | 15.2 | For systemic fungal infections. | Many studies
[ |
| Nifedipine | 14.9 | A calcium channel blocker for angina, hypertension, and Raynaud's phenomenon. | |
| Tamoxifen | 14.4 | For breast cancer. | |
| Clobetasol | 14.4 | For corticosteroid-responsive dermatoses of the scalp. |
aThe model was developed with all 11 antimalarial drugs in the positive class; the remaining compounds were potential repurposing candidates in the baseline class.
bRef. [26].
Structures and therapeutic information of tranexamic acid and the highest-scored drugs by the tranexamic acid model
| Tranexamic Acid | | For preventing hemorrhage in trauma and for excessive bleeding during and following tooth extraction, surgery, and menstruation. | The only drug used to build the DPIR prediction model. | |
| Aminocaproic acid | 1.39 | For the treatment of excessive postoperative bleeding. | | |
| Amiloride | 1.38 | For use as adjunctive treatment with thiazide diuretics or other kaliuretic-diuretic agents in congestive heart failure or hypertension. | Amiloride was evaluated as a treatment for ameliorating trauma-hemorrhagic shock-induced lung injury in rats
[ | |
| … 26 experimental drugs with scores between 1.38 and 0.69 …. | | |||
| Diethylstilbestrol | 0.69 | For treatment of prostate cancer and prevention of miscarriage or premature delivery in pregnant women prone to miscarriage or premature delivery. | Diethylstilbestrol was found to have particular clinical value in the treatment of certain functional gynecic aberrations. One of these is excessive or prolonged functional uterine bleeding
[ | |
aThe model was developed with tranexamic acid as the only member of the positive class; all other compounds were in the baseline class.
bRef. [26].
Comparison of DPIR and chemical fingerprint (similarity) search-based drug repurposing approaches
| | | ||||
|---|---|---|---|---|---|
| 1 | 1 | 0.06 | 0.09 | 0.07 | |
| 5 | 1 | 0.11 | 0.16 | 0.07 | |
| 10 | 1 | 0.14 | 0.24 | 0.10 | |
| 1 | 2 | 0.06 | 0.14 | 0.08 | |
| 5 | 2 | 0.09 | 0.22 | 0.09 | |
| 10 | 2 | 0.09 | 0.29 | 0.10 | |
| 1 | 3 | 0.06 | 0.18 | 0.08 | |
| 5 | 3 | 0.08 | 0.27 | 0.10 | |
| 10 | 3 | 0.08 | 0.33 | 0.10 | |
| 1 | 1 | 0.09 | 0.08 | 0.07 | |
| 5 | 1 | 0.14 | 0.20 | 0.09 | |
| 10 | 1 | 0.17 | 0.25 | 0.10 | |
| 1 | 2 | 0.07 | 0.11 | 0.07 | |
| 5 | 2 | 0.12 | 0.28 | 0.12 | |
| 10 | 2 | 0.13 | 0.36 | 0.12 | |
| 1 | 3 | 0.07 | 0.13 | 0.07 | |
| 5 | 3 | 0.11 | 0.32 | 0.13 | |
| 10 | 3 | 0.10 | 0.44 | 0.14 | |
| 1 | 1 | 0.14 | 0.07 | 0.18 | |
| 5 | 1 | 0.34 | 0.14 | 0.21 | |
| 10 | 1 | 0.22 | 0.55 | 0.23 | |
| 1 | 2 | 0.20 | 0.13 | 0.16 | |
| 5 | 2 | 0.43 | 0.20 | 0.16 | |
| 10 | 2 | 0.15 | 0.72 | 0.16 | |
| 1 | 3 | 0.21 | 0.09 | 0.14 | |
| 5 | 3 | 0.49 | 0.15 | 0.14 | |
| 10 | 3 | 0.77 | 0.13 | 0.12 | |
aThe DPIR (type II) models were developed with one, two, or three positive drugs in the positive class of the training set. The chemical fingerprint search used Tanimoto similarity (TS) calculated with the Accelrys ECFP_4 fingerprint. The highest TS coefficient between a baseline compound and a positive compound was used to rank order the baseline compound. DPIR, drug-protein interaction-based repurposing; fraction, fraction of known drugs retrieved; σ, standard deviation; bold font, highest fraction retrieved.