| Literature DB >> 30288997 |
Kristy A Carpenter1, David S Cohen1, Juliet T Jarrell1, Xudong Huang1.
Abstract
Current drug development is still costly and slow given tremendous technological advancements in drug discovery and medicinal chemistry. Using machine learning (ML) to virtually screen compound libraries promises to fix this for generating drug leads more efficiently and accurately. Herein, we explain the broad basics and integration of both virtual screening (VS) and ML. We then discuss artificial neural networks (ANNs) and their usage for VS. The ANN is emerging as the dominant classifier for ML in general, and has proven its utility for both structure-based and ligand-based VS. Techniques such as dropout, multitask learning and convolution improve the performance of ANNs and enable them to take on chemical meaning when learning about the drug-target-binding activity of compounds.Entities:
Keywords: artificial intelligence; artificial neural networks; convolutional neural networks; deep learning; drug discovery; machine learning; multitask learning; virtual screening
Mesh:
Substances:
Year: 2018 PMID: 30288997 PMCID: PMC6563286 DOI: 10.4155/fmc-2018-0314
Source DB: PubMed Journal: Future Med Chem ISSN: 1756-8919 Impact factor: 3.808
Internal versus external validation.
This is a visual representation of the conceptual difference between internal and external validation. The arrow points from the data to be validated to the data used for validation.
A block diagram of the optimal workflow for machine learning-based virtual screening.
Representation of the hierarchy and relationship between different artificial-related concepts.
These concepts exist as subsets of each other.
Stochastic gradient descent compared with gradient descent.
Example architecture of a Convolutional Neural Network.
Dropout would involve dropping certain nodes that are illustrated within the layers.
Visual representation of multitask learning.
Recent research utilizing artificial neural networks for drug discovery.
| Prakash | (2013) | QSAR-ANN | Found CID_73621, CID_16757497, CID_301751, CID_390666 and CID_46830222 to be cytotoxic to human breast cancer cells | [ |
| Bilsland | (2015) | QSAR-ANN | Screened for senescence-inducing compounds and found 147 hits from a screening library of 2 million compounds. CB-20903630 was viewed as most favorable for further development | [ |
| Korkmaz | (2015) | ANNs in addition to Naive Bayes, kNN, Decision Trees, Support Vector Machines and Random Forests | Online VS tool utilizing most accurate models from a tested broader selection | [ |
| Ashtawy and Mahapatra | (2018) | MTL, ANN | Proposed a novel multitask deep neural network capable of simultaneously predicting binding pose, binding affinity and classification of activity that performs better than conventional scoring functions | [ |
| Wallach | (2015) | DCNN | Introduced a DCNN designed to predict the bioactivity of small molecules for drug discovery applications while outperforming previous docking approaches | [ |
ANN: Artificial neural network; DCNN: Deep convolutional neural network; kNN: k-Nearest neighbor; MTL: Multitask learning; QSAR: Quantitative structure-activity relationship; VS: Virtual screening.