| Literature DB >> 30200333 |
Ruolan Chen1, Xiangrong Liu2, Shuting Jin3, Jiawei Lin4, Juan Liu5.
Abstract
Identifying drug-target interactions will greatly narrow down the scope of search of candidate medications, and thus can serve as the vital first step in drug discovery. Considering that in vitro experiments are extremely costly and time-consuming, high efficiency computational prediction methods could serve as promising strategies for drug-target interaction (DTI) prediction. In this review, our goal is to focus on machine learning approaches and provide a comprehensive overview. First, we summarize a brief list of databases frequently used in drug discovery. Next, we adopt a hierarchical classification scheme and introduce several representative methods of each category, especially the recent state-of-the-art methods. In addition, we compare the advantages and limitations of methods in each category. Lastly, we discuss the remaining challenges and future outlook of machine learning in DTI prediction. This article may provide a reference and tutorial insights on machine learning-based DTI prediction for future researchers.Entities:
Keywords: drug discovery; drug-target interaction prediction; machine learning
Mesh:
Year: 2018 PMID: 30200333 PMCID: PMC6225477 DOI: 10.3390/molecules23092208
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Branch diagram of recent computational methods for DTI prediction.
Databases supporting drug discovery methods.
| Database and URL | Brief Descriptions |
|---|---|
| KEGG [ | An encyclopedia of genes and genomes for both functional interpretation and practical application of genomic information. |
| BRENDA [ | The main enzyme and enzyme-ligand information system. |
| PubChem [ | A database for information on chemical substances and their biological activities involving three inter-linked databases, i.e., Substance, Compound and BioAssay. |
| TTD [ | Therapeutic Target Database providing comprehensive information about the drug resistance mutations, gene expressions and target combinations data. |
| DrugBank [ | Consisting of two parts information involving detailed drug data (i.e., chemical, pharmacological and pharmaceutical) and drug target information (i.e., sequence, structure, and pathway) respectively. |
| SuperTarget [ | A database integrating drug-related information with more than 330,000 compound-target protein relations. |
| ChEMBL [ | Data resource for molecule structures and molecule-protein interactions collected from the primary published literature on a regular basis. |
| STITCH [ | Repository of known and predicted chemical-protein interactions. |
| MATADOR [ | A database of protein-chemical interactions including as many direct and indirect interactions as possible. |
| BindingDB [ | A public database of protein-ligand binding affinities. |
| TDR targets [ | A chemogenomics resource for neglected tropical diseases. |
| SIDER [ | Serving information on marketed medicines and their recorded adverse drug reactions. |
| ChemBank [ | Collections of available data derived from small molecules and small-molecule screens and resources for studying their properties. |
| DCDB [ | The Drug Combination Database for collecting and organizing known examples of drug combinations. |
| CancerDR [ | Cancer Drug Resistance Database of 148 anticancer drugs and their effectiveness against around 1000 cancer cell lines. |
| ASDCD [ | The first Antifungal Synergistic Drug Combination Database including published synergistic antifungal drug combinations, targets, indications, and other pertinent data. |
| SuperPred [ | Resource of compound-target interactions. |
The statistics of the number of compounds, targets and compound-target interactions in the databases covered in the review.
| Databases | The Number of Compounds | The Number of Targets | The Number of Compound-Target Interactions |
|---|---|---|---|
| KEGG | 18,380 | 26,885,475 | |
| BRENDA | 7341 | ||
| PubChem | 96,479,316 | 68,868 | |
| TTD | 34,019 | 3101 | |
| DrugBank | 11,682 | 26,889 | 131,724 |
| SuperTarget | 195,770 | 6219 | 332,828 |
| ChEMBL | 2,275,906 | 12,091 | |
| STITCH | 500,000 | 9,600,000 | 1,600,000,000 |
| MATADOR | 775 | ||
| BindingDB | 652,068 | 7082 | 1,454,892 |
| TDR targets | 2,000,000 | 5300 | |
| SIDER | 5868 | 1430 | 139,756 |
| ChemBank | 1,700,000 | ||
| DCDB | 904 | 805 | |
| CancerDR | 148 | 116 | |
| ASDCD | 105 | 1225 | 210 |
| SuperPred | 341,000 | 1800 | 665,000 |