| Literature DB >> 29319225 |
Daniel Merk1, Lukas Friedrich1, Francesca Grisoni1,2, Gisbert Schneider1.
Abstract
Generative artificial intelligence offers a fresh view on molecular design. We present the first-time prospective application of a deep learning model for designing new druglike compounds with desired activities. For this purpose, we trained a recurrent neural network to capture the constitution of a large set of known bioactive compounds represented as SMILES strings. By transfer learning, this general model was fine-tuned on recognizing retinoid X and peroxisome proliferator-activated receptor agonists. We synthesized five top-ranking compounds designed by the generative model. Four of the compounds revealed nanomolar to low-micromolar receptor modulatory activity in cell-based assays. Apparently, the computational model intrinsically captured relevant chemical and biological knowledge without the need for explicit rules. The results of this study advocate generative artificial intelligence for prospective de novo molecular design, and demonstrate the potential of these methods for future medicinal chemistry.Entities:
Keywords: Automation; drug discovery; machine learning; medicinal chemistry; nuclear receptor
Mesh:
Substances:
Year: 2018 PMID: 29319225 PMCID: PMC5838524 DOI: 10.1002/minf.201700153
Source DB: PubMed Journal: Mol Inform ISSN: 1868-1743 Impact factor: 3.353
Figure 1Concept of generative artificial intelligence (AI). A model of the training data (e. g., molecular structures) is obtained that can be used to emit new instances (new chemical entities) within the training domain by sampling.
Figure 2Chemical space analysis by multi‐dimensional scaling. Compounds were represented by Morgan substructure fingerprints (radius=0–4 bonds, length=1024 bit), and similarity was defined by the Jaccard‐Tanimoto index. Colored dots represent the training data (light grey), fine‐tuning set (green), known RXR (orange) and PPAR (blue) agonists, sampled molecules (dark grey), and the selected de novo designs 1–5 (red). Compounds 1, 2, 3 and 5 populate the same area as the known RXR and PPAR agonists, while 4 is similar to PPAR agonist but remote from known RXR actives.
Scheme 1Synthesis of designs 1–5. Reagents & conditions: (a) H2N−C6H4−COOH (7), EDC, 4‐DMAP, THF, reflux, 4 h; (b) C6H5−B(OH)2 (9), Pd(PPh3)4, Cs2CO3, dioxane, 100 °C, 16 h; (c) KOH, MeOH/THF/H2O, μw, 70 °C, 30 min; (d) HO‐C6H3F−B(OH)2 (12), Pd(PPh3)4, Cs2CO3, toluene/EtOH, 100 °C, 20 h; (e) F‐C6H4‐CH2‐Br (15), K2CO3, DMF, μw, 100 °C, 120 min; (f) MeOH, H2SO4cc, reflux, 4 h; (g) C5H9Br (18), K2CO3, DMF, μw, 100 °C, 6 h; (h) HO‐C6H4‐B(OH)2 (20), Pd(PPh3)4, Cs2CO3, toluene/EtOH, 100 °C, 16 h; (i) C6H4Cl‐C6H4‐COOH (24), EDC, 4‐DMAP, CHCl3, relux, 12 h; (j) C6H3Br(OH)2 (27), Pd(PPh3)4, Cs2CO3, dioxane/DMF, reflux, 4 h; (k) malonic acid, pyridine/piperidine, μw, 100 °C, 30 min.
In vitro activity of designs 1–5 on RXRs and PPARs (EC50 values ± SEM [μM]; n=2 (when inactive) or 4 (when active) independent experiments in duplicates; inactive, no statistically significant reporter transactivation at a compound concentration of 30 μM).
| Compound no. | RXRα | RXRβ | RXRγ | PPARα | PPARγ | PPARδ |
|---|---|---|---|---|---|---|
| 1 | 0.13±0.01 | 1.1±0.3 | 0.06±0.02 |
| 2.3±0.2 |
|
| 2 | 13.0±0.1 | 9±2 | 8.0±0.7 |
| 2.8±0.3 |
|
| 3 |
|
|
| 4.0±1.0 | 10.1±0.3 |
|
| 4 |
|
|
|
| 9±3 | 14±2 |
| 5 |
|
|
|
|
|
|
| reference agonistsa) | 0.033±0.002 | 0.024±0.004 | 0.025±0.002 | 0.006±0.002 | 0.6±0.1 | 0.5±0.1 |
a) Reference agonists, literature data: bexarotene17 for RXRs, GW764718 for PPARα, pioglitazone19 for PPARγ, L165,04119 for PPARδ