| Literature DB >> 35328682 |
Giuseppe Floresta1, Chiara Zagni1, Davide Gentile1, Vincenzo Patamia1, Antonio Rescifina1.
Abstract
The recent covid crisis has provided important lessons for academia and industry regarding digital reorganization. Among the fascinating lessons from these times is the huge potential of data analytics and artificial intelligence. The crisis exponentially accelerated the adoption of analytics and artificial intelligence, and this momentum is predicted to continue into the 2020s and beyond. Drug development is a costly and time-consuming business, and only a minority of approved drugs generate returns exceeding the research and development costs. As a result, there is a huge drive to make drug discovery cheaper and faster. With modern algorithms and hardware, it is not too surprising that the new technologies of artificial intelligence and other computational simulation tools can help drug developers. In only two years of covid research, many novel molecules have been designed/identified using artificial intelligence methods with astonishing results in terms of time and effectiveness. This paper reviews the most significant research on artificial intelligence in de novo drug design for COVID-19 pharmaceutical research.Entities:
Keywords: COVID-19; artificial intelligence; drug design; ligand-based drug design; machine learning; structure-based drug design
Mesh:
Substances:
Year: 2022 PMID: 35328682 PMCID: PMC8949797 DOI: 10.3390/ijms23063261
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Selected top-scoring compounds by HTVS on the SARS-CoV-2 main protease.
|
| Structure | CmDock Docking Score |
|---|---|---|
|
|
| −32.51 |
|
|
| −29.02 |
|
|
| −26.80 |
|
|
| −25.58 |
|
|
| −25.53 |
|
|
| −25.05 |
|
|
| −24.76 |
|
|
| −24.51 |
|
|
| −24.17 |
|
|
| −24.01 |
|
|
| −23.98 |
|
|
| −23.61 |
|
|
| −23.53 |
|
|
| −23.26 |
|
|
| −23.18 |
Figure 1Ligand ID and structures of selected hit compounds.
The 5 compounds with predicted routes < 4 steps. The top 3 compounds from the training set, with potency and cytotoxicity measurements.
| Top 5 predicted compounds |
|
|
| Top 3 training set compounds |
|
|
24 different model systems.
| Protein/System (PDB Code) | ||
|---|---|---|
| S (spike) Protein Receptor Binding Domain | S Protein RBD/Complexed with ACE2 (PDB ID: 6W41) | MPro/monomer, CHARMM-GUI default protonation |
Structures and calculated free binding energies (MM-GBSA score, in kcal/mol) of four top-ranking compounds.
| Name | Structure | IUPAC Name | MM-GBSA Score |
|---|---|---|---|
| Compound |
| −42.21 | |
| Compound |
| [({[4-(dimethylamino)phenyl]- | −42.07 |
| Compound |
| −40.95 | |
| Compound |
| 2-{1-[({[(2-hydroxyphenyl)methyl]- | −39.5 |
Figure 2MNPs have been proposed as the most promising marine drug leads as inhibitors of SARS-CoV-2 Mpro.Combining a generative recurrent neural network model with transfer learning methods and active learning algorithms, R. Yassine et al. designed a novel set of small molecules capable of effectively inhibiting the 3CL protease in human cells [44]. The novelty of this work is the use of active learning methods with generative recurrent neural networks (RNNs) containing long-term memory cells (LSTM). The active learning method facilitates the selection process by focusing on the areas of the chemical space that have the best chance of success, considering structural novelties. The authors built a database consisting of multiple datasets such as FDA-approved drugs (from the ZINC database), natural products (from SuperNatural), and a manually developed database representing drug-like bioactive molecules. In the first phase of this study, by applying the RNN deep learning methodology, the LSTM-based RNN model was created to generate reliable and high-quality SMILES to design new drugs. Subsequently, molecules structurally similar to drugs with known activity against the specific SARS-CoV-2 target were generated. In this way, they were able to find a model capable of discovering new drugs using fragment-based drug discovery (FBDD) to create a library containing a series of SMILES inspired by the well-known compounds. The model generated 25,000 small molecules from the learned chemical space as described above. After removing duplicates and identical molecules from the database used for training, the remaining dataset consisted of 22,173 molecules. These molecules were then subjected to other filters such as physicochemical properties, drug similarity, and synthetic accessibility, resulting in a set of 6962 molecules. The generated molecules were then screened for affinity to the 3CL protease. After the virtual screening, a total of 41 molecules were obtained, with a virtual screening score of less than −7.0 kcal/mol. Among these, four molecules resulted in a binding affinity score lower than −18 kcal/mol (Figure 3).
Figure 3The generated molecules by R. Yassine et al., with the lowest binding affinity scores.
Figure 4The nine compounds reported by F. P. Silva-Jr et al., that showed similar binding positions to the experimentally validated inhibitors in X-ray crystal complexes with Mpro.
Figure 5NCEs with the highest virtual screening score and a remarkable similarity to existing protease inhibitors.