| Literature DB >> 33295676 |
Sezen Vatansever1,2,3, Avner Schlessinger4,5, Daniel Wacker4,5,6, H Ümit Kaniskan4,5,7, Jian Jin4,5,7, Ming-Ming Zhou4,7, Bin Zhang1,2,3,4.
Abstract
Neurological disorders significantly outnumber diseases in other therapeutic areas. However, developing drugs for central nervous system (CNS) disorders remains the most challenging area in drug discovery, accompanied with the long timelines and high attrition rates. With the rapid growth of biomedical data enabled by advanced experimental technologies, artificial intelligence (AI) and machine learning (ML) have emerged as an indispensable tool to draw meaningful insights and improve decision making in drug discovery. Thanks to the advancements in AI and ML algorithms, now the AI/ML-driven solutions have an unprecedented potential to accelerate the process of CNS drug discovery with better success rate. In this review, we comprehensively summarize AI/ML-powered pharmaceutical discovery efforts and their implementations in the CNS area. After introducing the AI/ML models as well as the conceptualization and data preparation, we outline the applications of AI/ML technologies to several key procedures in drug discovery, including target identification, compound screening, hit/lead generation and optimization, drug response and synergy prediction, de novo drug design, and drug repurposing. We review the current state-of-the-art of AI/ML-guided CNS drug discovery, focusing on blood-brain barrier permeability prediction and implementation into therapeutic discovery for neurological diseases. Finally, we discuss the major challenges and limitations of current approaches and possible future directions that may provide resolutions to these difficulties.Entities:
Keywords: Alzheimer's; CNS; Parkinson's; anesthesia; artificial intelligence; blood-brain barrier; depression; disease subtyping; drug design; drug discovery; machine learning; neurological diseases; pain treatment; schizophrenia; target identification
Year: 2020 PMID: 33295676 PMCID: PMC8043990 DOI: 10.1002/med.21764
Source DB: PubMed Journal: Med Res Rev ISSN: 0198-6325 Impact factor: 12.944
Figure 1AI/ML applications in the drug discovery pipeline. AI/ML approaches provide a range of tools that can be applied in all the three stages of early drug discovery to improve decision making and speed up the process. ADME, absorption, distribution, metabolism, and excretion; AI, artificial intelligence; ML, machine learning; QSAR, quantitative structure–activity relationship
AI‐related learning techniques used in drug discovery
| Category of learning | Definition | |
|---|---|---|
|
|
A predictive model trained on data points with known outcomes (“labeled data”) Two types of problems: | |
|
| ||
|
| ||
|
|
|
|
| Naïve Bayes | Classification |
A “probabilistic classifier” that determines the probability of the features occurring in each class by treating every feature independently to return the most likely class based on the Bayes rule. Particularly suited when the dimensionality of the inputs is high. |
| Support vector machines | Classification |
A discriminative classifier that outputs an optimal hyperplane to categorize new examples. The vectors that define the hyperplane are the support vectors. |
| Random Forest | Classification/Regression |
An ensemble of simple tree predictors that vote for the most popular class for classification problems. In the regression problems, the tree responses are averaged to obtain an estimate of the dependent variable. Overfitting is less likely to occur as more decision trees are added to the forest. |
| K‐nearest‐neighbors | Classification/Regression |
A nonparametric algorithm based on feature similarity by assuming that similar things exist in close proximity. Useful for a classification study when there is little or no prior knowledge about the distribution data. |
| Artificial neural networks | Classification/Regression |
A method that learns from input data based on layers of connected neurons consisting of input layers, hidden layers, and output layers. |
| Deep neural network | Classification/Regression |
A collection of neurons organized in a sequence of multiple layers. Type of artificial neural network with several advantages (i.e., shared weights [parameter sharing), spatial relations, and local receptive fields Learning can be supervised, unsupervised, or semisupervised. End‐to‐end learning and transfer learning are the major approaches performed by the deep neural network. Autoencoders and generative adversarial networks are the two specific forms of deep neural networks. |
| Multiple regression | Regression |
A statistical approach to find relationships between dependent variables and one or more independent variables. |
|
|
A self‐organized model that organizes the data in some way or describe its structure to learn underlying patterns of features directly from unlabeled data. | |
|
|
|
|
| K‐means clustering | Clustering |
A classification method that divides data into k groups by minimizing within‐group distances to the centroid |
| Fuzzy clustering | Clustering |
A form of clustering (Fuzzy C‐means clustering) in which each data point can belong to more than one cluster. It computes the coefficients of being in the clusters for each data point. |
| Hierarchical clustering | Clustering |
A classification method that builds a hierarchy of clusters by merging two close clusters into the same cluster. This algorithm ends when there is only one cluster left. |
| Principal component analysis | Dimensionality reduction |
A nonparametric statistical technique that uses an orthogonal procedure to transform a set of correlated features to new independent variables called principal components |
| Independent component analysis | Dimensionality reduction |
A statistical method that separates a multivariable output into statistical independent additive components |
| Autoencoders | Dimensionality reduction |
A deep neural network trained with backpropagation to reconstruct its original input |
| Deep belief nets | Dimensionality reduction |
Probabilistic generative models with many layers of stochastic, latent variables. Each layer is a Restricted Boltzmann machine. |
| Generative adversarial networks | Anomaly detection |
Deep generative models that use two neural networks, pitting one against the other (thus the “adversarial”) to generate new synthetic but realistic instances of data. |
| Self‐organizing map | Dimensionality reduction |
A competitive learning network that reduces the input dimensionality to represent its distribution as a map. |
|
|
A combination of supervised and unsupervised learning methods that uses a small amount of labeled data and also a large amount of unlabeled data during training to gain more understanding of the sample population. | |
|
|
A particular case of semisupervised learning, where the algorithm is allowed to query the user for the label of a subset of training instances Used to construct a high‐performance classifier while keeping the size of the training data set to a minimum by actively selecting the valuable data points | |
|
|
Dynamic programming that trains algorithms using a system of reward and punishment to maximize the performance. | |
|
|
A deep learning technique enables developers to harness a neural network used for one task and apply it to another domain. It allows the reuse of a pretrained deep neural network on a new task with only a small amount of data. Useful when the data is insufficient for a new domain to be handled by a neural network, and there is a big preexisting data pool that can be transferred | |
|
|
An approach to inductive transfer that improves generalization performance of multiple related tasks by leveraging useful information among them. Useful when there are multiple related tasks, each of which has limited training samples | |
|
|
A flexible learning method that use a predefined set of kernels and learn convex combinations of kernels over potentially different domains. Used when there are heterogeneous sources of data for the task at hand | |
|
|
A meta‐algorithm that combines decisions from multiple models into one predictive model to decrease variance (bagging), bias (boosting), or improve predictions (stacking). | |
|
|
A deep learning process in which all of the parameters are trained jointly, rather than step by step. It allows the training of a deep neural network based on raw data without descriptors. Since the pipeline is replaced with a single learning algorithm, it goes directly from the input to the desired output and thereby overcome limitations of the traditional approach. | |
Note: The rows with gray backgrounds show the basic learning categories and their definition, while the rows following supervised and unsupervised learning parts display the different algorithms used in these categories.
Figure 2The basic steps of building an artificial intelligence (AI) platform for drug discovery. The process for developing an AI model as follows: (1) Define the problem appropriately (objective, desired outputs, etc.), (2) prepare the data (collection, exploration and profiling, formatting, and improving the quality), (3) transform raw data into features and select meaningful features (a.k.a. feature engineering), (4) split data into training and validation sets, (5) develop a model, (6) train the model with a fraction of the data, test its performance (cross‐validation) and tune its parameters with the validation set (7) evaluate model performance on the validation set and refine the model, and (8) evaluate the model on independent data not used for method development
Figure 3AI‐guided target discovery. AI/ML methods can efficiently analyze all available information to speed up the discovery of disease‐related drug targets. Specifically, AI/ML methods are utilized for disease subtyping, identification of disease driver genes and microRNAs, alternative splicing prediction, triaging of novel drug targets, modeling of three‐dimensional target structures, and druggability assessment. AI, artificial intelligence; ML, machine learning
Figure 4AI/ML‐enabled improvements in the treatment of CNS diseases. DL is a subset of ML, which is a subset of AI and their applications address a wide range of challenges in CNS drug discovery and development. The application fields portrayed here are discussed in the Section 3. AI, artificial intelligence; CNS, central nervous system; DL, deep learning; ML, machine learning
The promise of AI/ML‐based drug discovery strategies in CNS disorders
| Application field | Schizophrenia | Autism spectrum disorder | Depression | PD | AD | Anesthesia | Pain treatment |
|---|---|---|---|---|---|---|---|
| Diagnosis/prognosis | ✓ | ✓ | ✓ | ✓ | ✓ | ‐ | ✓ |
| Subtyping | ✓ | ‐ | ‐ | ‐ | ‐ | ‐ | ‐ |
| Heterogeneity detection | ✓ | ‐ | ‐ | ‐ | ‐ | ‐ | ‐ |
| Target identification | ✓ | ‐ | ‐ | ‐ | ✓ | ‐ | ✓ |
| Inhibitor discovery | ✓ | ✓ | ✓ | ✓ | ✓ | ‐ | ‐ |
| Multitarget drug discovery | ‐ | ‐ | ‐ | ‐ | ✓ | ‐ | ✓ |
| Drug repositioning | ✓ | ✓ | ✓ | ✓ | ✓ | ‐ | ‐ |
| Drug response | ‐ | ✓ | ✓ | ‐ | ‐ | ‐ | ‐ |
| Variant effect | ‐ | ✓ | ‐ | ‐ | ‐ | ‐ | ‐ |
| Developmental neurotoxicants | ✓ | ✓ | ‐ | ‐ | ‐ | ‐ | ‐ |
| Pharmacological decision support | ‐ | ‐ | ✓ | ‐ | ‐ | ✓ | ✓ |
| Drug response monitoring | ‐ | ‐ | ‐ | ✓ | ‐ | ✓ | ‐ |
| Adverse drug effects | ‐ | ‐ | ‐ | ✓ | ‐ | ✓ | ‐ |
| Drug screening | ‐ | ‐ | ‐ | ✓ | ‐ | ‐ | ‐ |
| Overdose and misuse | ‐ | ‐ | ‐ | ‐ | ‐ | ‐ | ✓ |
Abbreviations: AD, Alzheimer's disease; AI, artificial intelligence; CNS, central nervous system; ML, machine learning; PD, Parkinson's disease.