| Literature DB >> 24838565 |
Yoshihiro Yamanishi1, Masaaki Kotera2, Yuki Moriya3, Ryusuke Sawada4, Minoru Kanehisa3, Susumu Goto5.
Abstract
DINIES (drug-target interaction network inference engine based on supervised analysis) is a web server for predicting unknown drug-target interaction networks from various types of biological data (e.g. chemical structures, drug side effects, amino acid sequences and protein domains) in the framework of supervised network inference. The originality of DINIES lies in prediction with state-of-the-art machine learning methods, in the integration of heterogeneous biological data and in compatibility with the KEGG database. The DINIES server accepts any 'profiles' or precalculated similarity matrices (or 'kernels') of drugs and target proteins in tab-delimited file format. When a training data set is submitted to learn a predictive model, users can select either known interaction information in the KEGG DRUG database or their own interaction data. The user can also select an algorithm for supervised network inference, select various parameters in the method and specify weights for heterogeneous data integration. The server can provide integrative analyses with useful components in KEGG, such as biological pathways, functional hierarchy and human diseases. DINIES (http://www.genome.jp/tools/dinies/) is publicly available as one of the genome analysis tools in GenomeNet.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24838565 PMCID: PMC4086078 DOI: 10.1093/nar/gku337
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Overview of DINIES. DINIES accepts as inputs tab-delimited text files that must be in the forms of either a ‘profile’ or a ‘kernel matrix’ (a kernel, in short). We use ‘profile’ to denote an asymmetric matrix in which rows correspond to the objects of interest (drugs or proteins) and columns to the properties of the objects (such as chemical substructures and domains). We use ‘kernel’ to denote a symmetric matrix where rows and columns both correspond to objects (drugs or proteins).
Figure 2.Output example of DINIES. (1) Inferred list classifies the drug–target interaction network into training–new, prediction–prediction and new–training, where ‘training’ and ‘new’ denote drugs/targets found and not found in the training data (KEGG DRUG in the default settings), respectively. Text files with the same format can be downloaded using the Download option. (2) The BRITE mapping option enables the user to find drugs or proteins of interest from the functional hierarchy defined in the KEGG BRITE database. (3) The Pathway mapping option shows the predicted drug–target interactions grouped on the basis of KEGG PATHWAY maps.
Figure 3.BRITE mapping of the predicted drug–target interactions. Left and right panels show BRITE functional hierarchies for drugs and human proteins, respectively. Solid and dotted gray lines represent known and predicted drug–protein interactions, respectively.
AUC scores and AUPR scores in the 3-fold cross-validation experiments
| Cross-validation | Drug data | Target data | AUC ± S.D. | AUPR ± S.D. |
|---|---|---|---|---|
| Pair-wise | Chemical | Sequence | 0.936 ± 0.010 | 0.579 ± 0.079 |
| Pair-wise | Chemical | Domain | 0.925 ± 0.095 | 0.355 ± 0.069 |
| Pair-wise | Side effect | Sequence | 0.922 ± 0.010 | 0.481 ± 0.087 |
| Pair-wise | Side effect | Domain | 0.903 ± 0.086 | 0.336 ± 0.072 |
| Pair-wise | Integration | Integration | 0.952 ± 0.006 | 0.593 ± 0.087 |
| Block-wise | Chemical | Sequence | 0.870 ± 0.004 | 0.485 ± 0.006 |
| Block-wise | Chemical | Domain | 0.843 ± 0.005 | 0.282 ± 0.001 |
| Block-wise | Side effect | Sequence | 0.847 ± 0.005 | 0.364 ± 0.016 |
| Block-wise | Side effect | Domain | 0.829 ± 0.001 | 0.237 ± 0.007 |
| Block-wise | Integration | Integration | 0.892 ± 0.001 | 0.507 ± 0.008 |