| Literature DB >> 32338900 |
Maurizio Recanatini1, Chiara Cabrelle1.
Abstract
Network theory provides one of the most potent analysis tools for the study of complex systems. In this paper, we illustrate the network-based perspective in drug research and how it is coherent with the new paradigm of drug discovery. We first present data sources from which networks are built, then show some examples of how the networks can be used to investigate drug-related systems. A section is devoted to network-based inference applications, i.e., prediction methods based on interactomes, that can be used to identify putative drug-target interactions without resorting to 3D modeling. Finally, we present some aspects of Boolean networks dynamics, anticipating that it might become a very potent modeling framework to develop in silico screening protocols able to simulate phenotypic screening experiments. We conclude that network applications integrated with machine learning and 3D modeling methods will become an indispensable tool for computational drug discovery in the next years.Entities:
Mesh:
Year: 2020 PMID: 32338900 PMCID: PMC8007104 DOI: 10.1021/acs.jmedchem.9b01989
Source DB: PubMed Journal: J Med Chem ISSN: 0022-2623 Impact factor: 7.446
Chemical Databases
| database name | description | url | ref. |
|---|---|---|---|
| ChemBL | Collection of bioactive drug-like small molecules with 2D structures, calculated chemical properties and bioactivities. | ( | |
| PubChem | Open chemistry database of mostly small molecules that collects information on chemical structures, chemical and physical properties, and biological activities. It is structured into three linked databases: substance, compound, and bioassay. | ( | |
| DrugBank | Freely accessible data on small molecules and biotechnological drugs (with chemical, pharmaceutical, and pharmacological profiles), and drug targets (sequence and functions of target/enzyme/transporter/carrier) intended as drugs encyclopedia. | ( | |
| ChemSpider | Free chemical structures database collecting structures and related information, such as physicochemical properties and interactive spectra, made accessible through a fast search engine allowing search by name, structure, or advanced options. | ( |
Biological Databases
| database name | description | url | ref |
|---|---|---|---|
| SMART | Simple modular architecture research tool is a Web tool for identification of protein domains and exploration of domain architectures. | ( | |
| UniProtKB | UniProtKB consists of two databases: SwissProt (manually annotated) and TrEMBL (automatically annotated and unreviewed). UniProtKB collects protein sequences providing rich annotations. | ( | |
| GPCRdb | GPCRdb stores reference data on GPCRs sequence, structures, mutations, and ligand interactions. GPCRdb also provides a suite of Web tools and interactive diagrams for illustrative purposes. | ( | |
| Kinomer | Protein kinases database contains annotated classification of 43 eukaryotic genomes. Kinomer is accessible through a Web interface, which enables the retrieval of sequences and the classification of arbitrary sequences. | ( | |
| PDB | Protein Data Bank is a repository of experimentally determined 3D structures of proteins, nucleic acids, and complexes with metal ions, drugs, and small molecules. | ( | |
| STRING | Search tool for retrieval of interacting genes/proteins (STRING) database of known and predicted protein–protein interactions, including physical and functional associations for a large number of organisms. STRING allows users to visualize interactions network and to make analysis. | ( | |
| Reactome | Reactome is a relational pathway database in which signaling and metabolic molecules are organized into biological pathways and processes. | ( | |
| DisGeNET | DisGeNET is a platform that integrates and standardizes data about disease associated genes and variants (628 685 gene-disease associations, GDAs, and 210 498 variant-disease associations, VDAs). | ( | |
| ConnectivityMap | CMap database contains gene-expression profiles from cultured human cells treated with perturbagens (bioactive small molecules). | ( | |
| LINCS | Library of integrated network-based cellular signatures catalogues perturbation-response signatures employing a various set of perturbations, model systems, and assay types. | ( |
Network Building and Visualization Systems
| database name | description | url | ref |
|---|---|---|---|
| Cytoscape | Cytoscape is a platform for visualization, analysis, and integration of networks via basic functionalities or through apps; conceived mainly for biological research. | ( | |
| Gephi | Gephi allows the visualization and exploration of all types of large graphs in real-time through a 3D render engine. | ( | |
| Pajek | Pajek enables analysis and visualization of large networks having some thousands or millions of vertices. | ( | |
| NetworkX | NetworkX is a Phyton package designed for the creation and analysis of structure, dynamics, and functions of networks. | ( | |
| Apache Hadoop | Open source framework for storing and processing large data sets across clusters of computers in a distributed environment through simple programming models. | ( | |
| Apache Spark | A fast cluster computing system for large-scale data processing powering different libraries (SQL, MLlib, GraphX), and easy to use interactively from the Scala, Python, R, and SQL shells. | ( |
Figure 1Exemplary CSN of PARP inhibitors. PARPs 1, 2, and 3 family inhibitors with a measured EC50 were retrieved from CHEMBL.[16] The pairwise chemical similarities between compounds were assessed by means of Tanimoto coefficient (Tc) values calculated for the ECFP4 fingerprints[49] of the molecules generated by Canvas[50] (Schrödinger, LLC, New York, NY, 2019). Pairs of inhibitors were connected only if their calculated Tc value exceeded the threshold value of 0.55. The chemical structures of the inhibitors are shown inside the nodes that are colored according to pEC50 values ranging from red (lowest potency) to green (highest potency) and sized based on node degree from small (low degree) to large (high degree). Edges are weighted by Tc values from thin (Tc = 0.55) to thick (Tc = 1) width. The network was generated by means of Cytoscape[38] version 3.7.2.
Figure 2Drug–target network. The DTN was built from DrugBank[18] version 5.1.5 retrieving the drug–target interactions between approved small molecule drugs and human protein targets. Drugs are represented as circle-shaped nodes, and protein targets are represented as diamond-shaped nodes. As shown in the inset, drugs are color-coded according to the first level anatomical therapeutic chemical (ATC) codes as reported in DrugBank. The nodes size accounts for the node degree from small (low degree) to large (high degree). Edges connect only drugs and targets nodes. The network was generated by means of Cytoscape[38] version 3.7.2.
Figure 3Hetionet version 1.0. (a) Metagraph showing the types of nodes used to build the network and the types of links defined to connect the nodes. A detailed description of the meaning of each link type as well as the sources of information used to collect the nodes and to draw the edges is reported in ref (85). (b) Visualization of the whole heterogeneous network. Nodes of the same type are grouped within circles, and links are colored by type. This Figure 3 is reproduced from Figure 1 of (ref (85)), published under the Creative Commons Attribution 4.0 International Public License (CC BY 4.0; https://creativecommons.org/licenses/by/4.0/).
Figure 4Boolean network dynamics. (a) Nodes are colored red or gray based on their “on” or “off” state, respectively. The stepwise evolution of the interactions between nodes determines some sequential steps that are calculated based on a set of rules. The stable state of the network at time step t = S represents an attractor. (b) Attractor landscape. Gray circles represent network states, colored circles represent attractor states. The landscape contains all the possible network states. The sets of states that converge toward an attractor form the basin of that attractor (colored areas). (c) The basins of attractors can be associated with cell phenotypes, and the gene states of the attractors determine the nature of the phenotype.