| Literature DB >> 35664624 |
Zheng Tan1, Yan Li2, Ziying Zhang3, Xin Wu2, Thomas Penfold4, Weimei Shi1, Shiqing Yang1.
Abstract
Adversarial generative models are becoming an essential tool in molecular design and discovery due to their efficiency in exploring the desired chemical space with the assistance of deep learning. In this article, we introduce an integrated framework by combining the modules of algorithmic synthesis, deep prediction, adversarial generation, and fine screening for the purpose of effective design of the thermally activated delayed fluorescence (TADF) molecules that can be used in the organic light-emitting diode devices. The retrosynthetic rules are employed to algorithmically synthesize the D-A complex based on the empirically defined donor and acceptor moieties, which is followed by the high-throughput labeling and prediction with the deep neural network. The new D-A molecules are subsequently generated via the adversarial autoencoder, with the excited-state property distributions perfectly matching those of the original samples. Fine screening of the generated molecules, including the spin-orbital coupling calculation and the excited-state optimization, is eventually implemented to select the qualified TADF candidates within the novel chemical space. Further investigation shows that the created structures fully mimic the original D-A samples by maintaining a significant charge transfer characteristic, a minimal adiabatic singlet-triplet gap, and a moderate spin-orbital coupling that are desirable for the delayed fluorescence.Entities:
Year: 2022 PMID: 35664624 PMCID: PMC9161419 DOI: 10.1021/acsomega.2c02253
Source DB: PubMed Journal: ACS Omega ISSN: 2470-1343
Figure 1Integrated modules for the generation and screening of TADF molecules. (a) Algorithmic synthesis for the D–A complexes using combinatorial chemistry. (b) Deep prediction for the excited-state properties based on molecular ECFP fingerprints. (c) Adversarial generation for the TADF chemical space. (d) Fine screening of the generated molecules based on high-throughput TDDFT calculations.
Figure 2DNN predictions of the excited-state properties of TADF molecules. S1, S2, S3, T1, T2, and T3 represent the first three singlet and triplet excited-state energies, respectively, while f denotes the corresponding oscillator strength and ΔEST the first singlet–triplet gap. Note that the true data indicate the high-throughput calculations of molecular properties. The blue line is the identity mapping. The singlet and triplet energies and the energy difference are in the unit of eV, while f is dimensionless.
Performance Metrics for the Adversarial Autoencoder Model Applied on the D1A1 Data Set: Fraction of Valid, Novel, Unique Molecules and the Overall Generation Efficiency Fulfilling the above Three Criteria Given Different Generation Sizes
| generation size | validity | novelty | uniqueness | overall efficiency |
|---|---|---|---|---|
| 1 K | 0.8280 | 0.6490 | 0.9990 | 0.4750 |
| 10 K | 0.8079 | 0.6587 | 0.9671 | 0.4460 |
| 100 K | 0.8135 | 0.6503 | 0.8070 | 0.3555 |
| 200 K | 0.8143 | 0.6537 | 0.7228 | 0.3114 |
Figure 3(a) Distributions of the excited-state properties of the original and generated D1A1 molecules. Note that both the original and generated properties are computed via the DNN model predictions. The singlet and triplet energies and the energy difference are in the unit of eV, while f is dimensionless. (b) Distributions of synthetic accessibility of the original and generated D1A1 molecules.
Adiabatic Singlet–Triplet Energy Gaps Computed via the B3LYP Functional with and without TDA and the T1S1 and T1S0 SOC Constants at the S0 Geometry for the Eventually Screened TADF Candidates
| name | SMILES | B3LYP gap (eV) | B3LYP-TDA gap (eV) | T1S1 SOC (cm–1) | T1S0 SOC (cm–1) |
|---|---|---|---|---|---|
| mol_1 | c1cnc2c(c1)[SiH2:2]c1ccnc(-n3c4ccccc4c4ccncc43)c1-2 | 0.1333 | 0.0106 | 0.3239 | 1.8620 |
| mol_2 | O=C(Cc1cccc2[nH]c3[nH]c4ccccc4c3c12)c1cccc2c1[nH]c1ccccc12 | 0.0163 | 0.0117 | 1.8729 | 7.1232 |
| mol_3 | c1ccc2c(c1)[nH]c1c2c2ccccc2n1-c1cc2[nH]c3ccccc3c2cn1 | 0.1223 | 0.0980 | 0.1136 | 0.7360 |
| mol_4 | c1ccc2c(c1)[nH]c1ccc3c(c4ccccc4n3-c3cncc4[nH]c5ccccc5c34)c12 | 0.1759 | 0.0798 | 0.1606 | 0.1746 |
| mol_5 | c1cc2c(cc1)[nH]c1ccc3c(c4ccccc4n3-c3cccc4c3ncc3ccccc34)c12 | 0.0227 | 0.0144 | 0.1703 | 1.3593 |
| mol_6 | c1ccc2c(c1)[nH]c1ccc3c(c4ccccc4n3-c3ccc4c5ccccc5c5nccnc5c4c3)c12 | 0.0130 | 0.0117 | 0.1225 | 1.0532 |
| mol_7 | c1ccc2c(c1)-c1cnc(-n3c4ccccc4c4ccccc43)cc1c1nccnc21 | 0.0995 | 0.0926 | 0.1396 | 1.6184 |
| mol_8 | O=C(c1ccc(C=O)c(-c2ccc3c(c2)Nc2ccccc2S3)c1)c1ccccc1 | 0.0483 | 0.0391 | 0.1507 | 8.6579 |
| mol_9 | O=Cc1ccc(-c2cccc3c2c2ccccc2n3-c2ccccc2)c(C#N)c1C#N | 0.1280 | 0.0367 | 0.2102 | 0.7563 |
| mol_10 | c1cnc2c(c1)-c1ccc(-n3c4ccccc4c4c5ccccc5[nH]c43)cc1c1[nH]cnc21 | 0.0082 | 0.0078 | 0.2818 | 0.8635 |
| mol_11 | O=C(c1ccc(-n2c3ccccc3c3nc[nH]c23)cc1)C(F)(F)F | 0.0070 | 0.0065 | 0.1691 | 1.7473 |
| mol_12 | N#Cc1ccc(C=O)c(-c2ccc(-n3c4ccccc4c4ccccc43)cc2)c1 | 0.0084 | –0.0066 | 0.1288 | 9.6389 |
| mol_13 | CN1c2ccccc2Sc2ccc(-c3cccc4Nc5ccccc5c(=O)c34)cc21 | 0.2109 | 0.1751 | 0.5778 | 6.8733 |
| mol_14 | c1ccc2c(c1)[nH]c1ccc3c(c4ccccc4n3-c3cnc4ncccc4n3)c12 | 0.0239 | 0.0204 | 0.1868 | 1.7842 |
| mol_15 | CN1c2ccccc2Sc2cc(-c3cccc4[nH]c5ccccc5c(=O)c34)ccc21 | 0.1956 | 0.1594 | 0.8683 | 14.9823 |
| mol_16 | O=Cc1ccccc1N(c1cccc2[nH]c3ccccc3c12)c1ccccc1 | 0.1579 | 0.1168 | 0.4755 | 6.4195 |
| mol_17 | c1ccc2c(c1)[nH]c1c2c2ccccc2n1-c1ncc2c(c1)[SiH2:2]c1cnccc1-2 | 0.0761 | 0.0616 | 0.1962 | 0.7084 |
| mol_18 | CC(=O)c1ccc(-c2cccc3c4ccccc4c4nc[nH]c4c23)c(C#N)c1 | 0.2286 | 0.1081 | 0.1208 | 1.2150 |
| mol_19 | c1cc2c(cn1)-c1c(ccnc1-n1c3ccccc3c3ccccc31)c1ccccc12 | 0.2335 | 0.0899 | 0.5408 | 1.8974 |
Figure 4Generated molecules that fulfill the fine screening criteria with satisfactory quantum chemical properties for the delayed fluorescence.
Figure 5Frontier orbitals for two of the fine screened molecules analyzed at the optimized S0, S1, and T1 geometries.