| Literature DB >> 30276245 |
Masato Sumita1,2, Xiufeng Yang1,3, Shinsuke Ishihara2, Ryo Tamura2,3,4, Koji Tsuda1,3,4.
Abstract
This work presents a proof-of-concept study in artificial-intelligence-assisted (AI-assisted) chemistry where a machine-learning-based molecule generator is coupled with density functional theory (DFT) calculations, synthesis, and measurement. Although deep-learning-based molecule generators have shown promise, it is unclear to what extent they can be useful in real-world materials development. To assess the reliability of AI-assisted chemistry, we prepared a platform using a molecule generator and a DFT simulator, and attempted to generate novel photofunctional molecules whose lowest excited states lie at desired energetic levels. A 10 day run on the 12-core server discovered 86 potential photofunctional molecules around target lowest excitation levels, designated as 200, 300, 400, 500, and 600 nm. Among the molecules discovered, six were synthesized, and five were confirmed to reproduce DFT predictions in ultraviolet visible absorption measurements. This result shows the potential of AI-assisted chemistry to discover ready-to-synthesize novel molecules with modest computational resources.Entities:
Year: 2018 PMID: 30276245 PMCID: PMC6161049 DOI: 10.1021/acscentsci.8b00213
Source DB: PubMed Journal: ACS Cent Sci ISSN: 2374-7943 Impact factor: 14.553
Figure 1Our AI-assisted chemistry platform for discovering new functional molecules. The Android robot is reproduced or modified from work created and shared by Google and used according to terms described in the Creative Commons 3.0 Attribution License.
Number of Molecules at Different Qualification Levels for Each Target Wavelength
| Target wavelength | |||||
|---|---|---|---|---|---|
| 200 nm | 300 nm | 400 nm | 500 nm | 600 nm | |
| Generated | 646 | 757 | 629 | 607 | 638 |
| Simulator-qualified | 34 | 26 | 13 | 12 | 1 |
| Synthesized | 2 | 2 | 1 | 1 | 0 |
| Functional | 1 | 2 | 1 | 1 | 0 |
The first row indicates the number of molecules generated by ChemTS.
The second row shows the number of simulator-qualified molecules whose absorption wavelength is predicted by DFT to be within 20 nm error from the target.
The third and fourth rows denote the number of synthesized molecules, and those experimentally confirmed by UV–vis measurement, respectively.
Simulator-Qualified Moleculesa as SMILES Strings Found by Our AI-Assisted Chemistry Platform
The synthesized molecules are shown with their chemical structural formula. Randomly sampled 24 molecules’ origins of excitation are shown in parentheses with excitation wavelengths.
Figure 2Experimental UV–vis absorption spectra and computational spectra at the B3LYP/3-21G* level of the compounds I–VI. The computational spectra are smoothed by a Gaussian function and arbitrarily scaled for comparison with the experimental spectra. The red dashed line in each spectrum indicates the target wavelength.
Figure 3Main Kohn–Sham orbitals involved in the first excited states of I–VI at the B3LYP/3-21G* level. HOMO and LUMO denote the highest occupied molecular orbital and the lowest unoccupied molecular orbital, respectively. λ and f denote the computational absorption wavelength and oscillator strength, respectively.
Figure 4Workflow of a molecule generator (ChemTS[4]) coupled with electronic structure theory (DFT). Molecules in SMILES string generated by ChemTS (MCTS+RNN) are converted to those in a three-dimensional structure with RDKit.[36] Then, the computation for each molecule is performed with electronic structure theory to obtain the value of λ (absorption wavelength). Reward (r) is calculated by eq . Rewards of molecules whose wavelengths are not available because of the failure in DFT calculation are set to −1. The calculated reward (r) reflects MCTS as back-propagation.