| Literature DB >> 25103881 |
Halil Bisgin, Zhichao Liu, Hong Fang, Reagan Kelly, Xiaowei Xu1, Weida Tong.
Abstract
BACKGROUND: The phenome represents a distinct set of information in the human population. It has been explored particularly in its relationship with the genome to identify correlations for diseases. The phenome has been also explored for drug repositioning with efforts focusing on the search space for the most similar candidate drugs. For a comprehensive analysis of the phenome, we assumed that all phenotypes (indications and side effects) were inter-connected with a probabilistic distribution and this characteristic may offer an opportunity to identify new therapeutic indications for a given drug. Correspondingly, we employed Latent Dirichlet Allocation (LDA), which introduces latent variables (topics) to govern the phenome distribution.Entities:
Mesh:
Year: 2014 PMID: 25103881 PMCID: PMC4137076 DOI: 10.1186/1471-2105-15-267
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Overview of the study. A) Integrating side-effects and indications to complete phenome; B) Determining the number of topics by using information loss; C) Hiding known indications one by one to see the recovery potential for decision criteria; D) Developing the decision criteria considering recovered indications with significance; E-F) LDA is applied to the drug-phenome matrix. Observing the decision criteria, real indications were recovered for drugs without indication information and new indications suggested for remaining drugs.
Figure 2Binned probabilities for known indications in 11,183 cases. p(i|d) stands for the probability of the indication in a drug-indication pair. Each bar denotes the number of known pairs falling into the probability intervals. Blue bars show the cases over random chance (q = 0.005), and we have 5,516 drug-indication pairs satisfying this condition.
Indications retrieved by the model
| Findings supported by drug labels | |||
|---|---|---|---|
| Drug | Indication | Rank | Reference |
| Thioridazine | Schizophrenia | 1 | DB00679 |
| Mesoridazine | Schizophrenia | 1 | DB00933 |
| Bromazepam | Insomnia | 4 | DB01558 |
| Nitrazepam | Insomnia | 3 | DB01595 |
| Pipotiazine | Schizophrenia | 1 | DB01621 |
| Nilotinib | Leukemia | 6 | DB04868 |
|
| |||
|
|
|
|
|
| Apomorphine | Anxiety | 6 | [ |
| Metyrapone | Cancer | 6 | [ |
| Prilocaine | Analgesia | 3 | [ |
| Sevoflurane | Sedation | 1 | [ |
| Remifentanil | Sedation | 1 | [ |
| Methohexital | Muscle relaxation | 2 | [ |
| Isoflurane | Sedation | 3 | [ |
| Disulfiram | Ulcer | 1 | [ |
Figure 3Binned 5,586 suggested pairs based on IS along with probabilities. X-axis denotes the number of indications and the bars represent the number of cases (Right-Y-axis) including drugs with that many indications. Box plots summarize the probability distribution (Left-Y-axis) of suggested drug-indication pairs.
Verification of suggested uses through literature
| Drug | IS | Indication (source) | Rank of indication (p(i|d)) |
|---|---|---|---|
| Atazanavir | 6 | HIV infection [ | 3 (0.045) |
| Aripiprazole | 15 | Dementia [ | 11 (0.033) |
| Amantadine | 18 | Epilepsy [ | 7 (0.018) |
| Itraconazole | 15 | Meningitis [ | 11 (0.012) |
| SMS 201-995 | 36 | Migraine [ | 13 (0.015) |
| Celecoxib | 26 | Migraine [ | 9 (0.013) |
| Mefenamic acid | 3 | Rheumatoid arthritis [ | 3 (0.076) |
Figure 4Relations between variables in LDA model. (A) Latent variables (topics) are used to construct paths from drugs to phenotypes. P(t|d) is the probability that drug d is associated with topic t while P(ph|t) defines the probability of phenotype ph associated with topic t. (B) Graphical representation of LDA for phenome. In this framework, M stands for the number of drugs, and N is the number of phenotypes.