| Literature DB >> 35379165 |
E Amiri Souri1, R Laddach1,2, S N Karagiannis2,3, L G Papageorgiou4, S Tsoka5.
Abstract
BACKGROUND: As many interactions between the chemical and genomic space remain undiscovered, computational methods able to identify potential drug-target interactions (DTIs) are employed to accelerate drug discovery and reduce the required cost. Predicting new DTIs can leverage drug repurposing by identifying new targets for approved drugs. However, developing an accurate computational framework that can efficiently incorporate chemical and genomic spaces remains extremely demanding. A key issue is that most DTI predictions suffer from the lack of experimentally validated negative interactions or limited availability of target 3D structures.Entities:
Keywords: Drug repurposing; Graph-embedding; Link prediction; Machine learning
Mesh:
Substances:
Year: 2022 PMID: 35379165 PMCID: PMC8978405 DOI: 10.1186/s12859-022-04650-w
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1DT2Vec pipeline. a1, a2 Drug-drug (DDS) and protein–protein (PPS) similarity graphs based on similarity used as input of embedding method to generate vectors. a3 Graph of DTIs. b Graph-embedding developed by node2vec to map nodes (in DDS and PPS) to vectors (in this figure drugs and target mapped to 2D-vector, x and y). c Known DTIs (positive and negative) were divided into 10% independent dataset (external testset) and 90% internal test and train (tenfold cross-validation). d Drug and protein vectors were concatenated and labeled as positive (1) or negative (0) and an XGBoost model was trained on the cross-validation datasets. The best model over the tenfold cross-validation on the internal testset was selected and applied on the external testset. The XGBoost model in c, d was repeated 5 times and the average performance of internal and external testsets was reported
Dataset details (a) Golden-standard dataset (b) ChEMBL-based dataset
| (a) Golden_standarad_dataset | # Drug | # Target | # Negative/unknown DTI | # Positive interaction | Class ratio |
|---|---|---|---|---|---|
| Enzyme (E) | 445 | 664 | 292,554 | 2926 | 0.01 |
| Ion channel (IC) | 210 | 204 | 41,364 | 1476 | 0.04 |
| G-protein-coupled receptors (GPCR) | 223 | 95 | 20,550 | 635 | 0.03 |
| Nuclear receptor (NR) | 54 | 26 | 1314 | 90 | 0.07 |
| Total | 791 | 989 | 777,172 | 5127 | 0.01 |
*pChEMBL value ≥ 5.5, **pChEMBL value > 0, Development-dataset, Experimental-dataset
Fig. 2Heatmap to show the mapping of known DTI interactions (dark blue, dark red) to predicted interactions (light blue, pink), labeled based on protein subgroups (columns) and drug network clusters (rows)
Fig. 3Illustration of key interactions between drugs (left) and protein targets (right). a Predicted highly-ranked positive interactions for phase-4 drugs with interactions colored based on protein type. b Phase-4 drugs repurposed for protein without any approved drugs based on the selected dataset. Colours show the type of protein target
Fig. 4Drug-target docking by SwissDock for new DTIs, showing binding with the lowest deltaG