| Literature DB >> 30182834 |
Abstract
Since the official release as a KNIME Community Contribution in June 2013, the Vernalis KNIME nodes have increased from a single node (the 'PDB Connector' node) to around 126 nodes (November 2017; Version 1.12.0); furthermore, a number of nodes have been adopted into the core KNIME product. In this review, we provide a brief timeline of the development of the current public release and an overview of the current nodes. We will focus in more detail on three particular areas: nodes accessing publicly available information via web services, nodes providing cheminformatics functionality without recourse to a cheminformatics toolkit, and nodes using one of the cheminformatics toolkits present in KNIME. We will conclude with a number of case studies demonstrating the use of KNIME at Vernalis. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.net.Entities:
Keywords: Cheminformatics; Fingerprints; KNIME Community Contribution; MMPA); Matched Molecular Pairs (MMP; Principal Moments of Inertia (PMI); Protein Data Bank (PDB); SMILES; Sequences
Mesh:
Year: 2020 PMID: 30182834 PMCID: PMC8206202 DOI: 10.2174/0929867325666180904113616
Source DB: PubMed Journal: Curr Med Chem ISSN: 0929-8673 Impact factor: 4.530
Fig. (9)Simple example workflow, showing building and running a query, downloading the hit PDB files and saving them locally. (A higher resolution / colour version of this figure is available in the electronic copy of the article).
Fig. (10)PMI plot of bound conformers of 250 randomly selected PDB ligands, produced with nPMI values calculated using the Vernalis ‘PMI Calculation’ node, and an unreleased ‘PMI Triangle Scatter Plot’ node.
Fig. (11)Matched Molecular Pairs. A. Schematic representation of Hussain’s algorithm [101]; B. ‘Real-world’ example from ChEMBL [7, 8], aligned to schematic example in Fig. 11A; C. Example output from the MMP Molecule Fragmentation nodes, showing the input structure, key and value.
Summary of current PDB Connector nodes and functionality.
|
|
|
|
|
|---|---|---|---|
| PDB Connector | ✓ | ✓ | ✓ |
| PDB Connector (XML Query) | - | ✓ | ✓ |
| PDB Connector XML Query Builder | ✓ | - | - |
| PDB Connector Query Only (XML Query) | - | ✓ | - |
| PDB Connector Query Only | ✓ | ✓ | - |
| PDB Connector Custom Report | - | - | ✓ |
Representations of the two fingerprint types available.
|
|
|
|
|---|---|---|
| Binary (BitVector) | Binary String | 100101010100111001101010 |
| Hexadecimal String | 954E6A | |
| String | {length=24, set bits=1,3,5,6,9,10,11,14,16, | |
| Set Bits List | [ | |
| Count (ByteVector) | String | {81,153,43,18,54,186,204, |
| Counts List | [81,153,43,18,54,186,204, |
Values are an array of integers, rather than text.
Examples of SMILES desalting behavior; aInvalid SMILES. See Supplementary Material for structural representations.
|
|
|
|
|
|
|---|---|---|---|---|
| 1 | C[N+](C)(C)C.CCC(=O)[O-] | C[N+](C)(C)C,CCC(=O)[O-] | C[N+](C)(C)C | C[N+](C)(C)C |
| 2 | C[N+](C)(C)CCCC.C[N+](C) | C[N+](C)(C)CCCC, | C[N+](C)(C)CCCC | [O-]C(=O)CCC |
| 3 | C1C.CC1 | C1C | C1C | C1C |
aInvalid SMILES.
Summary of reasons for removal of compounds from ChEMBL21 [88].
|
|
|
|---|---|
| HAC outside range 8-45 | 87849 |
| Non-neutral | 30725 |
| No Rings Present | 18211 |
| N+O count < 2 | 15892 |
| Missing SMILES String | 8294 |
| Gross charge > 4 | 970 |
| Carbon count < 2 | 7 |
| Broken bond during de-salting | 0 |