| Literature DB >> 36105994 |
Nannan Chen1, Lijuan Yang2,3, Na Ding1, Guiwen Li1, Jiajing Cai1, Xiaoli An2, Zhijie Wang1, Jie Qin1, Yuzhen Niu1.
Abstract
Metronidazole is a specific drug against trichomonas and anaerobic bacteria, and is widely used in the clinic. However, extensive clinical application is often accompanied by extensive side effects, so it is still of great significance to develop metronidazole derivatives with a new skeleton. Compared with other traditional receptor-based drug design methods, the computational model based on a neural network has higher accuracy and reliability. In this work, a Recurrent Neural Network (RNN) model is applied to the discovery of metronidazole drugs with a new skeleton. Firstly, the generation model based on a Gated Recurrent Unit (GRU) is trained to generate an effective Simplified Molecular-Input Line-Entry System (SMILES) string library with high precision. Then, transfer learning is introduced to fine-tune the GRU model, and many molecules with structures similar to known active drugs are generated. After cluster analysis of the structures of the new compounds, 20 small molecular compounds with metronidazole structures of all different categories were selected, of which 19 may not belong to any published patents or applications. Through prediction and personal experience, the difficulty of synthesizing these 20 new structures was analyzed, and compound 0001 was chosen as our synthetic target, and a series of structures (8a-l) similar to compound 0001 were synthesized. Finally, the inhibitory activities of these compounds against bacteria E. coli, P. aeruginosa, B. subtilis and S. aureus were determined. The results showed that compound 8a-l had obvious inhibitory activity against these four bacteria, which proved the accuracy of our compound generation model. This journal is © The Royal Society of Chemistry.Entities:
Year: 2022 PMID: 36105994 PMCID: PMC9377161 DOI: 10.1039/d2ra01807a
Source DB: PubMed Journal: RSC Adv ISSN: 2046-2069 Impact factor: 4.036
Fig. 1Pipeline of generative model for novel compound generation.
Fig. 4Synthesis process of compound 8a–l.
Fig. 2Distribution of the quantitative estimate of drug-likeness scores (QED) (A) and distribution of the synthetic accessibility scores (SAS) in two generator and existing metronidazole compounds.
Fig. 320 compounds with different skeleton obtained by clustering. SAS refers to the synthetic accessibility scores.
Inhibitory activity (IC50) of compound 8a–l against four kinds of bacteria
| Compound | IC50 (μM L−1) | |||
|---|---|---|---|---|
|
|
|
|
| |
| 8a | 72.64 | 99.52 | 45.65 | 28.21 |
| 8b | 70.11 | 78.07 | 37.71 | 37.16 |
| 8c | 61.46 | 69.61 | 29.15 | 20.8 |
| 8d | 74.01 | 42.82 | 33.43 | 24.59 |
| 8e | 60.63 | 49.95 | 37.92 | 19.97 |
| 8f | 51.03 | 40.29 | 33.91 | 16.73 |
| 8g | 59.11 | 15.99 | 26.02 | 11.98 |
| 8h | 48.89 | 16.32 | 21.20 | 9.89 |
| 8i | 42.51 | 18.69 | 18.97 | 6.85 |
| 8j | 30.00 | 28.38 | 37.80 | 23.73 |
| 8k | 38.64 | 37.51 | 53.15 | 25.90 |
| 8l | 41.07 | 53.12 | 48.35 | 22.23 |
| Streptomycin | 22.92 | 12.48 | 16.22 | 8.67 |
Abbreviations
| Abbreviations | Explanations |
|---|---|
| GRU | Gated recurrent unit |
| SMILES | Simplified molecular-input line-entry system |
| QSAR | Quantitative structure–activity relationship |
| AI | Artificial intelligence |
| TL | Transfer learning |
| RNN | Recurrent neural network |
| MTT | 3-(4,5-Dimethylthiazol-2-yl)-2,5-diphenyltetrazolium bromide |
| SAS | Synthetic accessibility score |
| QED | Weighted quantitative estimation of drug-likeness |
| log | Oil–water partition coefficient |
| TLC | Thin-layer chromatography |
| TMS | Tetramethylsilane |