| Literature DB >> 23800010 |
Francesco Napolitano1, Yan Zhao, Vânia M Moreira, Roberto Tagliaferri, Juha Kere, Mauro D'Amato, Dario Greco.
Abstract
: Existing computational methods for drug repositioning either rely only on the gene expression response of cell lines after treatment, or on drug-to-disease relationships, merging several information levels. However, the noisy nature of the gene expression and the scarcity of genomic data for many diseases are important limitations to such approaches. Here we focused on a drug-centered approach by predicting the therapeutic class of FDA-approved compounds, not considering data concerning the diseases. We propose a novel computational approach to predict drug repositioning based on state-of-the-art machine-learning algorithms. We have integrated multiple layers of information: i) on the distances of the drugs based on how similar are their chemical structures, ii) on how close are their targets within the protein-protein interaction network, and iii) on how correlated are the gene expression patterns after treatment. Our classifier reaches high accuracy levels (78%), allowing us to re-interpret the top misclassifications as re-classifications, after rigorous statistical evaluation. Efficient drug repurposing has the potential to significantly impact the whole field of drug development. The results presented here can significantly accelerate the translation into the clinics of known compounds for novel therapeutic uses.Entities:
Year: 2013 PMID: 23800010 PMCID: PMC3704944 DOI: 10.1186/1758-2946-5-30
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Figure 1Flowchart of the analysis. Green boxes indicate data, red boxes indicate processes.
Figure 2Classification performance and data integration. Receiving Operator Curve for the three separated kernels (GEX: gene expression; CHEM: chemical structure; TAR: molecular targets) and the final joint kernel in respect with ATC classes.
Figure 3Repositioning overview. Direction of the arrows represent direction of repositioning from one ATC class to another. Thickness and opacity of the edges represent a score-weighted sum of the reclassification events.
Top drug repositioning predictions
| Carbamazepine | Antiepileptics (N03) | Cardiac therapy (C01) |
| Chlorphenamine | Antihistamines for systemic use (R06) | Psychoanaleptics (N06) |
| Dobutamine | Cardiac therapy (C01) | Beta blocking agents (C07) |
| Gefitinib | Antineoplastic agents (L01) | Antibacterials for systemic use (J01) |
| Hydroxyzine | Psycholeptics (N05) | Antihistamines for systemic use (R06) |
| Ivermectin | Anthelmintics (P02) | Antibacterials for systemic use (J01) |
| Levobunolol | Ophthalmologicals (S01) | Beta blocking agents (C07) |
| Niclosamide | Anthelmintics (P02) | Antineoplastic agents (L01) |
| Oxamniquine | Anthelmintics (P02) | Antineoplastic agents (L01) |
| Spironolactone | Diuretics (C03) | Sex hormones and modulators of the genital system (G03) |
| Sulfacetamide | Ophthalmologicals (S01) | Antibacterials for systemic use (J01) |
| Thiethylperazine | Antihistamines for systemic use (R06) | Psycholeptics (N05) |
The top 12 repositioned drugs (classification score =1) are shown in rows. The drug name, the original and the predicted therapeutic class are reported. The level 2 ATC codes are also reported in brackets.
Figure 4Flowchart of the CMap gene expression data analysis. Green boxes indicate data, red boxes indicate processes.
Figure 5Kernel noise reduction through MDS projections. Values on the x axis correspond to SVM classifiers trained with kernel K (see text) as projected into the subspace spanned by its first x Principal Components. The y axis reports the corresponding 6-fold cross validation error. The green dot indicates minimum error.