| Literature DB >> 35064444 |
Osondu Everestus Oguike1,2, Chikodili Helen Ugwuishiwu1,2, Caroline Ngozi Asogwa1,2, Charles Okeke Nnadi3,4, Wilfred Ofem Obonga1,5, Anthony Amaechi Attama1,6.
Abstract
Malaria accounts for over two million deaths globally. To flatten this curve, there is a need to develop new and high potent drugs against Plasmodium falciparum. Some major challenges include the dearth of suitable animal models for anti-P. falciparum assays, resistance to first-line drugs, lack of vaccines and the complex life cycle of Plasmodium. Gladly, newer approaches to antimalarial drug discovery have emerged due to the release of large datasets by pharmaceutical companies. This review provides insights into these new approaches to drug discovery covering different machine learning tools, which enhance the development of new compounds. It provides a systematic review on the use and prospects of machine learning in predicting, classifying and clustering IC50 values of bioactive compounds against P. falciparum. The authors identified many machine learning tools yet to be applied for this purpose. However, Random Forest and Support Vector Machines have been extensively applied though on a limited dataset of compounds.Entities:
Keywords: Drug discovery; Machine learning; Plasmodium falciparum; QSAR
Year: 2022 PMID: 35064444 PMCID: PMC8782692 DOI: 10.1007/s11030-022-10380-1
Source DB: PubMed Journal: Mol Divers ISSN: 1381-1991 Impact factor: 3.364
Fig. 6Classification of objects into two different categories (A) and distribution of the best performing ML algorithms based on the relevant articles reviewed (B). In (a), datasets are classified based on similarity (blue or red color, circular or triangular shape) or dissimilarity (blue and red colors or circular and triangular shapes); Deep Learning (DL), Boosted Trees Regression (BTR), J48 classifier (JC), Discriminant Functions (DF), XGBoost (XGB), Graph Convolutional Neural Networks (GCNN), Multilinear Regression (MLR), General Regression Neural Network (GRNN), Multivariate Analysis (MVA), C5.0 and Artificial Neural Networks (ANN) represent (X), Random Forest (RF) represents (Y), and Support Vector Machine (SVM) represents (Z)
Fig. 1Workflow for ligand-based drug discovery (LMD—low mode dynamic; AM1—Austin model 1 Hamiltonian). Four major steps: dataset pretreatment, alignment, modeling and visualization are important here
Fig. 2Workflow for a structure-based drug discovery approach (binding modes and mechanisms of molecules to specific amino acid residues of the receptor targets are obtained in this approach)
Different AI tools used in various stages of malaria drug discovery
| S/N | AI tools name | Description | References |
|---|---|---|---|
| 1 | Chemputer | Structured format for procedure documentation for chemical synthesis | [ |
| 2 | DeepChem | Python-based AI platform for the prediction of drug discovery tasks | [ |
| 3 | DeepNeuralNet-QSAR | Use for prediction of molecular activity | [ |
| 4 | DeepTox | Use for prediction of toxicity | [ |
| 5 | DeltaVina | A scoring feature for protein–ligand binding affinity rescoring | [ |
| 6 | Hit Dexter | ML-based molecule prediction models for molecules that could react to biochemical assays | [ |
| 7 | Neural Graph Fingerprints | Predicts properties of novel molecules | [ |
| 8 | NN Score | Used for protein–ligand interaction and neural network-based scoring mechanism | [ |
| 9 | ODDT | Cheminformatic and molecular toolkit | [ |
| 10 | ORGANIC | Method for molecular generation to build molecule with desired characteristics | [ |
| 11 | PotentialNet | Ligand-binding prediction of affinity based on a convolution neural network (CNN) | [ |
| 12 | PPB2 | Polypharmacology prediction | [ |
| 13 | REINVENT | Using RNN (recurrent neural network) and RL (reinforcement learning), molecular de novo architecture | [ |
| 14 | SCScore | Scoring feature for the assessment of molecular synthesis complexity | [ |
| 15 | SIEVE-Score | An improved method of virtual screening based on structure via interaction energy-based learning | [ |
An overview of some studies that used AI for drug discovery [72]
| Technique | Application | Method | Accuracy |
|---|---|---|---|
| Deep learning | Drug screening | Using interactions between proteins and their corresponding ligands as DeepTox input values to predict drug toxicity | Very accurate |
| Neural networks | Drug design | Using a deep learning neural network to predict the structures of different proteins | Very accurate |
| Reinforcement learning | Drug screening | Using a machine learning approach to finding an inhibitor molecule for a specific protein (DDR1) | Accurate |
| Neural networks | Drug design | Used artificial neural networks and deep learning to predict interactions between drugs and their targets | Very accurate |
| Neural networks | Drug design | Used neural networks which were integrated into a neural computer to design new small organic molecules | Highly accurate |
Fig. 3Outcome of different novel approaches to AI
Fig. 4Example of exclusive clustering
Example of probabilistic clustering
| Instances | Cluster 1 | Cluster 2 | Cluster 3 |
|---|---|---|---|
| A | 0.4 | 0.1 | 0.5 |
| B | 0.1 | 0.8 | 0.1 |
| C | 0.3 | 0.3 | 0.4 |
| D | 0.1 | 0.1 | 0.8 |
| E | 0.4 | 0.2 | 0.4 |
| F | 0.1 | 0.4 | 0.5 |
Fig. 5Result of the clustering algorithm