| Literature DB >> 26154857 |
Thanh-Phuong Nguyen1, Corrado Priami2, Laura Caberlotto3.
Abstract
Dementia is a neurodegenerative condition of the brain in which there is a progressive and permanent loss of cognitive and mental performance. Despite the fact that the number of people with dementia worldwide is steadily increasing and regardless of the advances in the molecular characterization of the disease, current medical treatments for dementia are purely symptomatic and hardly effective. We present a novel multi-relational association mining method that integrates the huge amount of scientific data accumulated in recent years to predict potential novel targets for innovative therapeutic treatment of dementia. Owing to the ability of processing large volumes of heterogeneous data, our method achieves a high performance and predicts numerous drug targets including several serine threonine kinase and a G-protein coupled receptor. The predicted drug targets are mainly functionally related to metabolism, cell surface receptor signaling pathways, immune response, apoptosis, and long-term memory. Among the highly represented kinase family and among the G-protein coupled receptors, DLG4 (PSD-95), and the bradikynin receptor 2 are highlighted also for their proposed role in memory and cognition, as described in previous studies. These novel putative targets hold promises for the development of novel therapeutic approaches for the treatment of dementia.Entities:
Mesh:
Year: 2015 PMID: 26154857 PMCID: PMC4495601 DOI: 10.1038/srep11104
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The systematic workflow of our methodological approach.
Drug targets (DTs) were obtained by collecting information from different pharmaceutical company websites in the different phases of the drug discovery process (in red, yellow and orange). The interaction network of DTs was then constructed by extracting the direct 1-step neighbors of the DT based on the i2d database (the blue nodes in the network). Following the integration of multiple and heterogeneous data types by using the MRAM method, the rules were induced to predict the potential DTs. Finally, We characterized the functionality of the potential DTs by testing over-represented Gene Ontology biological process terms and pathways.
Reference databases used for data retrieval during the investigation.
| Clinical drug data | a database of publicly and privately supported clinical studies of human participants conducted around the world | drug target | ||
| DrugBank | a bioinformatics and cheminformatics resource that combines detailed drug data with comprehensive drug target (i.e. sequence, structure, and pathway) information | 4,092 unique drug targets | drug target | |
| i2d | an on-line database of known and predicted mammalian and eukaryotic protein-protein interactions | 183,524 curated interactions for human | protein interaction | |
| Reactome | a curated resource of core pathways and reactions in human biology. | 1,597 for human | pathway | |
| InterPro | an integrated database of predictive protein "signatures" used for the classification and automatic annotation of proteins and genome | 7,497 protein domains | protein domain | |
| Gene Ontology | a relational database comprised of the GO terms as well as the annotations of genes and gene products to terms in the those ontologies | GO term |
Figure 2Interaction network of drug targets including drug targets and their first neighbors as extracted from the i2d database.
The DTs are highlighted in red.
Figure 3Example of a MRD in table form (left) and in graph form (right).
The entity types ‘pathway’, ‘protein’, and ‘degree’ correspond to different blocks in the graph and the entities of each type correspond to different nodes. The table ‘Reactome_Pathway’ defines the pathway description. The join table ‘Of_Pathway’ defines a many-to-many relationship between the entity types ‘pathway’ and ‘protein’ and the table ‘Centrality’ defines an one-to-many relationship between entities ‘protein’ and ‘degree’. Two entities are linked with an edge if they co-occur in a same tuple.
Figure 4Computational performance of the multi-relational association mining (MRAM) method compared to the other methods.
Fig 4A–D shows the lift charts of the Decision Tree method (D-Tree), the Naïve Bayes (NB) and, the Neural Network (NN), and (MRAM, respectively. The x-axis of the chart represents the percentage of the test dataset that is used to compare the predictions. The y-axis now represents the percentage of predictions that are correct. The blue lines show the performance of the ideal model and the red lines show the performance of D-Tree, the NB, NN, and MRAM models correspondingly.
Computational measures calculated for the four methods with the three sets of negative examples with different sizes n , n , n .
| MRAM | ||||
| Decision Tree | 0.837 | −0.405 ± 0.063 | 0.287 ± 0.063 | 0.213 ± 0.020 |
| Bayesian Network | 0.822 | −0.540 ± 0.175 | 0.152 ± 0.175 | 0.284 ± 0.029 |
| Neural Network | 0.783 | −0.416 ± 0.084 | 0.276 ± 0.084 | 0.224 ± 0.026 |
| MRAM | ||||
| Decision Tree | 0.827 | −0.294 ± 0.102 | 0.314 ± 0.102 | 0.115 ± 0.009 |
| Bayesian Network | 0.814 | −0.348 ± 0.073 | 0.276 ± 0.073 | 0.116 ± 0.031 |
| Neural Network | 0.783 | −0.501 ± 0.108 | 0.114 ± 0.108 | 0.237 ± 0.018 |
| MRAM | ||||
| Decision Tree | 0.866 | −0.265 ± 0.039 | 0.202 ± 0.040 | 0.166 ± 0.001 |
| Bayesian Network | 0.808 | −0.263 ± 0.048 | 0.218 ± 0.047 | 0.130 ± 0.025 |
| Neural Network | 0.804 | −0.293 ± 0.060 | 0.198 ± 0.057 | 0.174 ± 0.019 |
The best results obtained are labeled in bold.
Performance of MRAM in term of likelihood lift with different subsets of data features and the three sets of negative examples with different sizes n , n , n .
| Exp1: All data features excluding the topological data features | 0.391 | 0.412 | 0.211 |
| Exp3: All data features excluding the GO data feature | 0.398 | 0.443 | 0.220 |
| Exp4: All data features excluding the Reactome data feature | 0.408 | 0.436 | 0.215 |
| Exp5: All data features excluding the InterPro data feature | 0.411 | 0.449 | 0.237 |
| Exp6: All of investigated data feature |
The best results obtained are labeled in bold.
Figure 5Representation of obtained association rules.
Three columns: Probability, Importance, and Rule. The probability describes how likely the result of a rule is to occur. The importance measures the significance of a rule.
Figure 6Summary of statistically significant Gene Ontology biological processes functional annotation corresponding to the putative DT list as obtained from REVIGO.
Nodes are GO terms and edges represent the strongest GO terms pairwise similarity. Colors represent the p-values (low values in green, high in red). Only significant GO terms are shown (P < 0.001).
Figure 7Flow diagram representing the molecular interactions in the MAPK signaling pathway (from KEGG database hsa04010)59.
The pathway is enriched with predicted drug targets proteins, labeled in pink. In blue are labeled the drug targets for dementia.
List of predicted drug targets with UniProt ID, Official gene symbol, Gene name and Degree centrality.
| P78352 | DLG4 | discs, large homolog 4 (Drosophila) | 18 |
| P17612 | PRKACA | protein kinase, cAMP-dependent, catalytic, alpha | 17 |
| P28482 | MAPK1 | mitogen-activated protein kinase 1 | 16 |
| P05771 | PRKCB | protein kinase C, beta | 14 |
| Q05655 | PRKCD | protein kinase C, delta | 13 |
| P31749 | AKT1 | v-akt murine thymoma viral oncogene homolog 1 | 12 |
| P68400 | CSNK2A1 | casein kinase 2, alpha 1 polypeptide pseudogene; casein kinase 2, alpha 1 polypeptide | 11 |
| P68400 | CSNK2A1P | casein kinase 2, alpha 1 polypeptide pseudogene; casein kinase 2, alpha 1 polypeptide | 11 |
| Q96RR4 | CAMK2A | calcium/calmodulin-dependent protein kinase kinase 2, beta | 10 |
| P27361 | MAPK3 | hypothetical LOC100271831; mitogen-activated protein kinase 3 | 9 |
| Q8TD19 | NEK9 | NIMA (never in mitosis gene a)- related kinase 9 | 9 |
| Q05513 | PRKCZ | protein kinase C, zeta | 7 |
| Q9UQM7 | CAMK2a | calcium/calmodulin-dependent protein kinase II alpha | 7 |
| P25098 | ADRBK1 | adrenergic, beta, receptor kinase 1 | 6 |
| Q02156 | PRKCE | protein kinase C, epsilon | 6 |
| O00141 | SGK1 | serum/glucocorticoid regulated kinase 1 | 5 |
| P45983 | MAPK8 | mitogen-activated protein kinase 8 | 5 |
| P51812 | RPS6KA3 | ribosomal protein S6 kinase, 90kDa, polypeptide 3 | 5 |
| Q15831 | STK11 | serine/threonine kinase 11 | 5 |
| Q96L34 | MARK4 | MAP/microtubule affinity-regulating kinase 4 | 5 |
| Q9Y6E0 | STK24 | serine/threonine kinase 24 (STE20 homolog, yeast) | 5 |
| P19784 | csnk2a2 | casein kinase 2, alpha prime polypeptide | 4 |
| P23443 | RPS6KB1 | ribosomal protein S6 kinase, 70kDa, polypeptide 1 | 4 |
| P34947 | GRK5 | G protein-coupled receptor kinase 5 | 4 |
| O94985 | CLSTN1 | calsyntenin 1 | 3 |
| P30411 | BDKRB2 | bradykinin receptor B2 | 3 |
| P43250 | GRK6 | G protein-coupled receptor kinase 6 | 3 |
| Q15418 | RPS6KA1 | ribosomal protein S6 kinase, 90kDa, polypeptide 1 | 3 |
| Q16512 | PKN1 | protein kinase N1 | 3 |
| Q16659 | MAPK6 | mitogen-activated protein kinase 6 | 3 |
| O75582 | RPS6KA5 | ribosomal protein S6 kinase, 90kDa, polypeptide 5 | 2 |
| Q99683 | MAP3K5 | mitogen-activated protein kinase kinase kinase 5 | 2 |
| Q9NSB8 | HOMER2 | homer homolog 2 (Drosophila) | 2 |
| Q9NSC5 | HOMER3 | homer homolog 3 (Drosophila) | 2 |