| Literature DB >> 26309399 |
Watshara Shoombuatong1, Veda Prachayasittikul2, Nuttapat Anuwongcharoen1, Napat Songtawee1, Teerawat Monnor1, Supaluk Prachayasittikul1, Virapong Prachayasittikul3, Chanin Nantasenamat2.
Abstract
This study represents the first large-scale study on the chemical space of inhibitors of dipeptidyl peptidase-4 (DPP4), which is a potential therapeutic protein target for the treatment of diabetes mellitus. Herein, a large set of 2,937 compounds evaluated for their ability to inhibit DPP4 was compiled from the literature. Molecular descriptors were generated from the geometrically optimized low-energy conformers of these compounds at the semiempirical AM1 level. The origins of DPP4 inhibitory activity were elucidated from computed molecular descriptors that accounted for the unique physicochemical properties inherently present in the active and inactive sets of compounds as defined by their respective half maximal inhibitory concentration values of less than 1 μM and greater than 10 μM, respectively. Decision tree analysis revealed the importance of molecular weight, total energy of a molecule, topological polar surface area, lowest unoccupied molecular orbital, and number of hydrogen-bond donors, which correspond to molecular size, energy, surface polarity, electron acceptors, and hydrogen bond donors, respectively. The prediction model was subjected to rigorous independent testing via three external sets. Scaffold and chemical fragment analysis was also performed on these active and inactive sets of compounds to shed light on the distinguishing features of the functional moieties. Docking of representative active DPP4 inhibitors was also performed to unravel key interacting residues. The results of this study are anticipated to be useful in guiding the rational design of novel and robust DPP4 inhibitors for the treatment of diabetes.Entities:
Keywords: QSAR; antidiabetic; decision tree; fragment analysis; molecular docking; rational drug design; scaffold analysis
Mesh:
Substances:
Year: 2015 PMID: 26309399 PMCID: PMC4539085 DOI: 10.2147/DDDT.S86529
Source DB: PubMed Journal: Drug Des Devel Ther ISSN: 1177-8881 Impact factor: 4.162
Figure 1Schematic representation of the computational workflow.
Abbreviation: QSAR, quantitative structure–activity relationship.
Exploratory data analysis of actives and inactives using the six-term descriptive statistics
| Statistics | MW | RBN | nCIC | nHDon | nHAcc | ALogP | TPSA | Energy | Dipole moment | HOMO | LUMO | HOMO–LUMO | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Actives | |||||||||||||
| Min | 167.2 | 0.000 | 0.000 | 0.000 | 1.000 | −2.936 | 29.260 | 0.137 | −0.908 | 0.747 | −0.572 | −0.298 | 0.217 |
| Q1 | 340.9 | 4.000 | 3.000 | 2.000 | 5.000 | 0.586 | 72.800 | 0.202 | −0.260 | 4.075 | −0.354 | −0.039 | 0.301 |
| Median | 386.5 | 5.000 | 3.000 | 3.000 | 7.000 | 1.571 | 85.250 | 0.217 | −0.123 | 5.831 | −0.343 | −0.025 | 0.314 |
| Mean | 385.8 | 5.008 | 3.155 | 2.735 | 6.897 | 1.566 | 89.490 | 0.222 | −0.144 | 9.842 | −0.352 | −0.035 | 0.318 |
| Q3 | 430.5 | 6.000 | 4.000 | 3.000 | 8.000 | 2.586 | 103.660 | 0.236 | −0.017 | 8.111 | −0.331 | −0.008 | 0.332 |
| Max | 753.8 | 16.000 | 6.000 | 9.000 | 16.000 | 6.598 | 234.780 | 0.535 | 0.488 | 284.562 | −0.286 | 0.047 | 0.386 |
| Inactives | |||||||||||||
| Min | 128.2 | 1.000 | 0.000 | 0.000 | 1.000 | −2.485 | 3.240 | 0.142 | −1.281 | 0.629 | −0.490 | −0.154 | 0.242 |
| Q1 | 238.4 | 3.000 | 2.000 | 1.000 | 4.000 | 0.848 | 47.720 | 0.193 | −0.172 | 2.890 | −0.344 | −0.022 | 0.310 |
| Median | 303.9 | 4.000 | 2.000 | 2.000 | 5.000 | 1.806 | 72.350 | 0.209 | −0.097 | 3.961 | −0.338 | −0.005 | 0.331 |
| Mean | 315.1 | 4.605 | 2.609 | 2.375 | 4.991 | 1.859 | 72.002 | 0.213 | −0.119 | 4.443 | −0.337 | −0.007 | 0.331 |
| Q3 | 359.5 | 6.000 | 3.000 | 3.000 | 6.000 | 2.949 | 88.840 | 0.231 | −0.048 | 5.262 | −0.329 | 0.011 | 0.347 |
| Max | 1,174.6 | 36.000 | 6.000 | 11.000 | 25.000 | 7.528 | 351.810 | 0.346 | 0.139 | 42.433 | −0.290 | 0.100 | 0.414 |
Abbreviations: ALogP, Ghose-Crippen octanol-water partition coefficient; HOMO, highest occupied molecular orbital; HOMO-LUMO, energy gap between the HOMO and LUMO states; LUMO, lowest unoccupied molecular orbital; Max, maximum; Min, minimum; MW, molecular weight; nCIC, number of rings; nHAcc, number of hydrogen bond acceptors; nHDon, number of hydrogen bond donors; Q1, first quartile; Q3, third quartile; Qm, mean absolute charge; RBN, rotatable bond number; TPSA, topological polar surface area.
Figure 2Histograms of the molecular descriptors for actives/inactives (A) and active I/active II DPP4 inhibitors (B).
Notes: Actives/active I and inactives/active II are shown in red and blue, respectively; purple regions represent their overlap.
Abbreviations: ALogP, Ghose–Crippen octanol–water partition coefficient; HOMO, highest occupied molecular orbital; HOMO–LUMO, energy gap between the HOMO and LUMO states; LUMO, lowest unoccupied molecular orbital; MW, molecular weight; nCIC, number of rings; nHAcc, number of hydrogen bond acceptors; nHDon, number of hydrogen bond donors; Qm, mean absolute charge; RBN, rotatable bond number; TPSA, topological polar surface area.
Figure 3PCA scores plots of actives/inactives (A) and active I/active II (B) DPP4 inhibitors.
Note: The scores and loadings plots are shown in the left and right panels, respectively, where actives/active I and inactives/active II DPP4 inhibitors are shown in the top and bottom rows, respectively.
Abbreviations: ALogP, Ghose–Crippen octanol–water partition coefficient; HOMO, highest occupied molecular orbital; HOMO–LUMO, energy gap between the HOMO and LUMO states; LUMO, lowest unoccupied molecular orbital; MW, molecular weight; nCIC, number of rings; nHAcc, number of hydrogen bond acceptors; nHDon, number of hydrogen bond donors; PCA, principle component analysis; Qm, mean absolute charge; RBN, rotatable bond number; TPSA, topological polar surface area.
Exploratory data analysis of subclasses of actives (I and II) using the six-term descriptive statistics
| Statistics | MW | RBN | nCIC | nHDon | nHAcc | ALogP | TPSA | Energy | Dipole moment | HOMO | LUMO | HOMO–LUMO | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Actives I | |||||||||||||
| Min | 167.2 | 0.000 | 0.000 | 0.000 | 1.000 | −2.936 | 29.260 | 0.137 | −0.908 | 0.747 | −0.572 | −0.298 | 0.217 |
| Q1 | 333.5 | 3.000 | 3.000 | 2.000 | 5.000 | 0.637 | 73.250 | 0.203 | −0.264 | 3.998 | −0.353 | −0.036 | 0.303 |
| Median | 381.0 | 5.000 | 3.000 | 3.000 | 7.000 | 1.587 | 85.250 | 0.219 | −0.133 | 5.675 | −0.343 | −0.023 | 0.317 |
| Mean | 381.9 | 5.000 | 3.103 | 2.636 | 6.961 | 1.585 | 89.980 | 0.224 | −0.154 | 7.976 | −0.350 | −0.029 | 0.320 |
| Q3 | 429.4 | 6.000 | 4.000 | 3.000 | 8.000 | 2.577 | 104.670 | 0.237 | −0.041 | 7.635 | −0.331 | −0.006 | 0.334 |
| Max | 753.8 | 16.000 | 6.000 | 9.000 | 16.000 | 6.598 | 234.780 | 0.535 | 0.488 | 80.233 | −0.286 | 0.047 | 0.386 |
| Actives II | |||||||||||||
| Min | 202.4 | 2.000 | 1.000 | 1.000 | 3.000 | −2.163 | 47.950 | 0.162 | −0.695 | 1.035 | −0.492 | −0.155 | 0.243 |
| Q1 | 375.5 | 4.000 | 3.000 | 3.000 | 5.000 | 0.350 | 69.460 | 0.200 | −0.216 | 4.861 | −0.411 | −0.127 | 0.293 |
| Median | 403.4 | 5.000 | 3.000 | 3.000 | 6.000 | 1.504 | 84.660 | 0.210 | −0.034 | 7.328 | −0.349 | −0.035 | 0.305 |
| Mean | 407.0 | 5.052 | 3.431 | 3.263 | 6.557 | 1.463 | 86.870 | 0.217 | −0.093 | 19.814 | −0.367 | −0.062 | 0.304 |
| Q3 | 435.4 | 6.000 | 4.000 | 4.000 | 8.000 | 2.641 | 101.040 | 0.224 | 0.068 | 33.395 | −0.332 | −0.023 | 0.316 |
| Max | 658.7 | 16.000 | 6.000 | 7.000 | 14.000 | 4.985 | 188.480 | 0.390 | 0.271 | 284.562 | −0.307 | 0.037 | 0.386 |
Abbreviations: ALogP, Ghose–Crippen octanol–water partition coefficient; HOMO, highest occupied molecular orbital; HOMO–LUMO, energy gap between the HOMO and LUMO states; LUMO, lowest unoccupied molecular orbital; Max, maximum; Min, minimum; MW, molecular weight; nCIC, number of rings; nHAcc, number of hydrogen bond acceptors; nHDon, number of hydrogen bond donors; Q1, first quartile; Q3, third quartile; Qm, mean absolute charge; RBN, rotatable bond number; TPSA, topological polar surface area.
Summary of prediction performance of internal and external sets
| Dataset | Details | N | Acc (%) | Sen (%) | Spec (%) | MCC |
|---|---|---|---|---|---|---|
| Internal set (DPP4-TRN) | Full training | 1,122 | 96.43 | 98.30 | 94.38 | 0.929 |
| Ten-fold CV | 1,122 | 82.26 | 84.69 | 79.59 | 0.644 | |
| External set 1 (DPP4-TEST1) | External validation | 149 | 91.28 | – | – | – |
| External set 2 (DPP4-TEST2) | External validation | 160 | 95.63 | – | – | – |
| External set 3 (DPP4-TEST3) | External validation | 167 | 72.25 | – | – | – |
Note: N is the number of compounds.
Abbreviations: Acc, accuracy; CV, cross-validation; MCC, Matthews correlation coefficient; Sen, sensitivity; Spec, specificity.
Figure 4Plot of the descriptor usage derived from the J48 algorithm.
Note: The descriptor with the largest descriptor usage value is the most important.
Abbreviations: ALogP, Ghose–Crippen octanol–water partition coefficient; HOMO, highest occupied molecular orbital; HOMO–LUMO, energy gap between the HOMO and LUMO states; LUMO, lowest unoccupied molecular orbital; MW, molecular weight; nCIC, number of rings; nHAcc, number of hydrogen bond acceptors; nHDon, number of hydrogen bond donors; Qm, mean absolute charge; RBN, rotatable bond number; TPSA, topological polar surface area.
Summary of the top ten fragments in the active set of DPP4 inhibitors
| Rank | IUPAC name | Structure | Fragment count |
|---|---|---|---|
| 1 | 1-ethyl-2-fluorobenzene |
| 617 |
| 2 | 2-amino-1-(pyrrolidin-1-yl)ethan-1-one |
| 597 |
| 3 | 1-(1,3-thiazolidin-3-yl)propan-1-one |
| 136 |
| 4 | 1-(pyrrolidin-1-yl)propan-1-one |
| 101 |
| 5 | Propylbenzene |
| 52 |
| 6 | 2-amino- |
| 50 |
| 7 | 2,3,6-trimethylpyridine |
| 45 |
| 8 | (1-formylpyrrolidin-2-yl)boronic acid |
| 43 |
| 9 | 1-chloro-2-ethenylbenzene |
| 36 |
| 10 | 4-(1-ethylhydrazin-1-yl)-1-methylpiperazine |
| 32 |
Abbreviation: IUPAC, International Union of Pure and Applied Chemistry.
Summary of the top ten fragments in the inactive set of DPP4 inhibitors
| Rank | IUPAC name | Structure | Fragment count |
|---|---|---|---|
| 1 | Benzyl(ethyl)amine |
| 102 |
| 2 | 2-methyl-2,3-dihydro-1H-isoindole |
| 77 |
| 3 | 1-(pyrrolidin-1-yl)propan-1-one |
| 64 |
| 4 | 1-(piperidin-1-yl)ethan-1-one |
| 45 |
| 5 | Propylbenzene |
| 35 |
| 6 | 3-ethyl-4-methylpyrrolidin-2-one |
| 19 |
| 7 | 2-amino-1-(pyrrolidin-1-yl)ethan-1-one |
| 17 |
| 8 | 3-ethyl-2,4-dimethylpyridine |
| 14 |
| 9 |
| 14 | |
| 10 | (Pyrrolidin-2-yl)phosphonic acid |
| 13 |
Figure 5Summary of top 20 molecular frameworks for actives (1a–20a) and inactives (1b–20b).
Figure 6Three different binding modes of interaction of DPP4 inhibitors in the active site of the enzyme.
Notes: The identified anchors HB1, HB2, and vdW from the SiMMap server are labeled and shown in cyan and yellow spheres, respectively. Docking poses of two selected inhibitors are visualized herein: the compound with the best SiMMap score (A) and the compound with the lowest half maximal inhibitory concentration values (B). Residues at the active site are shown in green sticks while key interacting residues are labeled and shown in dark grey lines.
PCA loadings score for active and inactive DPP4 inhibitors
| Descriptor | Active
| Inactive
| ||||
|---|---|---|---|---|---|---|
| PC1 | PC2 | PC3 | PC1 | PC2 | PC3 | |
| MW | 0.195 | 0.402 | −0.376 | 0.224 | ||
| RBN | 0.431 | −0.025 | −0.017 | 0.476 | ||
| nCIC | 0.297 | −0.241 | 0.489 | − | 0.077 | |
| nHDon | −0.231 | −0.204 | 0.341 | 0.289 | ||
| nHAcc | 0.527 | 0.013 | 0.160 | 0.112 | ||
| ALogP | 0.061 | 0.267 | 0.164 | − | 0.203 | |
| TPSA | 0.268 | −0.239 | 0.405 | 0.094 | ||
| 0.445 | 0.410 | − | 0.483 | −0.144 | ||
| Energy | −0.242 | − | 0.102 | − | −0.264 | −0.234 |
| Dipole moment | 0.361 | − | −0.261 | 0.439 | 0.091 | − |
| HOMO | −0.361 | 0.380 | −0.211 | −0.352 | ||
| LUMO | −0.585 | 0.073 | −0.559 | 0.344 | ||
| HOMO–LUMO | − | 0.452 | −0.413 | −0.463 | 0.258 | |
Notes: The bold values represent the highest loadings scores at the current PC, compared to other PCs. For instance, MW has a higher loading score of 0.849 at PC1, compared to PC2 (0.195), and PC3 (0.402).
Abbreviations: ALogP, Ghose–Crippen octanol–water partition coefficient; HOMO, highest occupied molecular orbital; HOMO–LUMO, energy gap between the HOMO and LUMO states; LUMO, lowest unoccupied molecular orbital; MW, molecular weight; nCIC, number of rings; nHAcc, number of hydrogen bond acceptors; nHDon, number of hydrogen bond donors; PCA, principal component analysis; Qm, mean absolute charge; RBN, rotatable bond number; TPSA, topological polar surface area.
Contribution value of each descriptor to principal component for active and inactive DPP4 inhibitors
| Descriptor | Active
| Inactive
| ||||
|---|---|---|---|---|---|---|
| PC1 | PC2 | PC3 | PC1 | PC2 | PC3 | |
| MW | 1.257 | 8.880 | 5.236 | 2.627 | ||
| RBN | 6.106 | 0.035 | 9.681 | 0.011 | ||
| nCIC | 2.421 | 1.914 | 5.246 | 0.308 | ||
| nHDon | 1.751 | 2.302 | 2.544 | 4.380 | ||
| nHAcc | 9.159 | 0.009 | 0.947 | 0.655 | ||
| ALogP | 0.101 | 2.341 | 0.589 | 2.160 | ||
| TPSA(Tot) | 2.371 | 3.139 | 6.069 | 0.466 | ||
| 5.461 | 5.526 | 7.933 | 1.091 | |||
| Energy | 1.612 | 0.569 | 2.579 | 2.876 | ||
| Dipole | 3.594 | 3.757 | 4.220 | 0.306 | ||
| HOMO | 3.585 | 7.945 | 0.974 | 4.565 | ||
| LUMO | 9.419 | 0.291 | 6.851 | 4.380 | ||
| HOMO–LUMO | 9.114 | 6.738 | 4.701 | 3.472 | ||
Note: The bold values show the highest loadings scores at the current PC, compared to other PCs.
Abbreviations: ALogP, Ghose–Crippen octanol–water partition coefficient; HOMO, highest occupied molecular orbital; HOMO–LUMO, energy gap between the HOMO and LUMO states; LUMO, lowest unoccupied molecular orbital; MW, molecular weight; nCIC, number of rings; nHAcc, number of hydrogen bond acceptors; nHDon, number of hydrogen bond donors; PC, principal component; Qm, mean absolute charge; RBN, rotatable bond number; TPSA, topological polar surface area.
PCA loadings score for active I and active II DPP4 inhibitors
| Descriptor | Active I
| Active II
| ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| PC1 | PC2 | PC3 | PC4 | PC5 | PC1 | PC2 | PC3 | PC4 | PC5 | |
| MW | 0.241 | 0.412 | −0.002 | −0.047 | 0.338 | −0.026 | 0.458 | 0.154 | ||
| RBN | 0.473 | −0.001 | −0.166 | 0.319 | 0.410 | 0.344 | −0.493 | 0.266 | −0.192 | |
| nCIC | 0.281 | −0.217 | 0.332 | −0.267 | −0.101 | 0.501 | 0.382 | −0.066 | ||
| nHDon | −0.148 | −0.213 | 0.113 | 0.492 | −0.388 | 0.506 | −0.010 | −0.257 | ||
| nHAcc | 0.463 | 0.007 | −0.111 | −0.487 | 0.796 | 0.253 | −0.327 | 0.203 | 0.002 | |
| ALogP | 0.112 | 0.230 | −0.581 | 0.329 | 0.456 | − | −0.067 | 0.367 | 0.332 | |
| TPSA | 0.257 | −0.257 | 0.488 | 0.055 | 0.309 | 0.063 | −0.322 | −0.089 | ||
| 0.448 | 0.383 | − | 0.243 | 0.023 | 0.449 | 0.460 | −0.439 | − | −0.097 | |
| Energy | −0.284 | − | 0.104 | 0.389 | 0.184 | − | 0.183 | 0.408 | −0.076 | −0.174 |
| Dipole moment | 0.515 | − | −0.285 | −0.235 | −0.101 | − | 0.035 | −0.443 | 0.182 | −0.196 |
| HOMO | −0.428 | 0.362 | 0.427 | 0.136 | 0.078 | 0.594 | −0.073 | −0.073 | ||
| LUMO | −0.649 | 0.014 | 0.169 | 0.045 | −0.155 | 0.488 | −0.146 | −0.046 | ||
| HOMO–LUMO | − | 0.378 | −0.469 | −0.284 | −0.105 | −0.555 | −0.022 | −0.215 | 0.037 | |
Note: The bold values show the highest loadings scores at the current PC, compared to other PCs.
Abbreviations: ALogP, Ghose–Crippen octanol–water partition coefficient; HOMO, highest occupied molecular orbital; HOMO–LUMO, energy gap between the HOMO and LUMO states; LUMO, lowest unoccupied molecular orbital; MW, molecular weight; nCIC, number of rings; nHAcc, number of hydrogen bond acceptors; nHDon, number of hydrogen bond donors; PCA, principal component analysis; Qm, mean absolute charge; RBN, rotatable bond number; TPSA, topological polar surface area.
Contribution value of each descriptor to principal components for active I and active II DPP4 inhibitors
| Descriptor | Active I
| Active II
| ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| PC1 | PC2 | PC3 | PC4 | PC5 | PC1 | PC2 | PC3 | PC4 | PC5 | |
| MW | 2.080 | 8.743 | 0.000 | 0.259 | 2.787 | 0.035 | 18.959 | 3.269 | ||
| RBN | 10.046 | 8.042 | 0.000 | 2.121 | 4.100 | 4.257 | 6.411 | 5.086 | ||
| nCIC | 2.000 | 1.688 | 8.507 | 8.443 | 0.247 | 9.049 | 13.157 | 0.612 | ||
| nHDon | 8.749 | 0.790 | 2.352 | 0.989 | 3.664 | 9.219 | 0.005 | 5.945 | ||
| nHAcc | 11.573 | 7.711 | 0.003 | 0.945 | 2.310 | 5.355 | 3.723 | 0.000 | ||
| ALogP | 0.316 | 1.898 | 19.748 | 12.864 | 5.070 | 10.926 | 0.225 | 12.185 | ||
| TPSA | 12.370 | 2.369 | 3.408 | 0.356 | 2.336 | 0.201 | 9.384 | 1.108 | ||
| 5.091 | 5.278 | 4.540 | 0.065 | 4.910 | 7.636 | 9.651 | 1.291 | |||
| Energy | 2.045 | 0.563 | 11.688 | 4.003 | 1.202 | 8.359 | 0.518 | 4.212 | ||
| Dipole moment | 6.731 | 4.200 | 4.271 | 1.203 | 9.800 | 0.044 | 2.981 | 5.291 | ||
| HOMO | 4.648 | 6.761 | 14.085 | 2.189 | 11.906 | 0.220 | 0.477 | 0.737 | ||
| LUMO | 10.690 | 0.010 | 2.190 | 0.243 | 0.863 | 11.962 | 1.929 | 0.296 | ||
| HOMO–LUMO | 8.103 | 5.131 | 6.234 | 1.308 | 7.647 | 0.023 | 4.179 | 0.193 | ||
Note: The bold values show the highest loadings scores at the current PC, compared to other PCs.
Abbreviations: ALogP, Ghose–Crippen octanol–water partition coefficient; HOMO, highest occupied molecular orbital; HOMO–LUMO, energy gap between the HOMO and LUMO states; LUMO, lowest unoccupied molecular orbital; MW, molecular weight; nCIC, number of rings; nHAcc, number of hydrogen bond acceptors; nHDon, number of hydrogen bond donors; PC, principal component; Qm, mean absolute charge; RBN, rotatable bond number; TPSA, topological polar surface area.
Summary of molecular framework generated from active DPP4 inhibitors
| Number | SMILES | Member size |
|---|---|---|
| 1 | C(CCC1CCC2CCCC2C1)CC1CCCCC1 | 93 |
| 2 | C(CCC1CCCCC1)CC1CCCC1CCCC1CCCCC1 | 87 |
| 3 | C(CC1CCCC1)CC1CCC(CC1)C1CCCCC1 | 85 |
| 4 | C1CCCC1 | 65 |
| 5 | C(CC1CCC(CCC2CCCCC2)CC1)C1CCCC1 | 50 |
| 6 | C(C1CCCC1)C1CCCC1 | 43 |
| 7 | C(CCCC1CCCCC1)CCC1CCCC1 | 37 |
| 8 | C(C1CCCC1)C1CCC(C1)C1CCC(CC1)C1CCCCC1 | 33 |
| 9 | C(C1CCCC1)C1CCC(CC2CCCCC2)C1 | 32 |
| 10 | C(CC1CCCC1)CC1CCC(C1)C1CCCCC1 | 32 |
| 11 | C(C1CCCC1)C1CCC(CC2CCC(CC2)C2CCCC2)C1 | 31 |
| 12 | C(CCC1CCCCCC1)CC1CCCCC1 | 31 |
| 13 | C(C1CCCCC1)C1CCCCC1C1CCCCC1 | 30 |
| 14 | C(CC1C2CC(CC12)C1CCCCC1)CC1CCCC1 | 29 |
| 15 | C1C2CCCCC2C2CCC(CC12)C1CCCCC1 | 27 |
| 16 | C(CC1CCCC1)CC1CCCCC1 | 27 |
| 17 | C(CCC1CCC(CC2CCCCC2)CC1)CC1CCCCC1 | 27 |
| 18 | C(CCC1CCCCC1)CC1CCCC1 | 25 |
| 19 | C(CC1CCC(CC1)C1CCCCC1)C1CCCCC1 | 24 |
| 20 | C(CCCC1CC1C1CCCCC1)CCCC1CCCC1 | 24 |
| 21 | C(C1CCCC1)C1CCC(C1)C1CCCCC1 | 23 |
| 22 | C1CCC(CC1)C1CCC2C(CCC3CCCCC23)C1 | 22 |
| 23 | C(CCCC1CCCC1)CCCC1CCCCC1 | 22 |
| 24 | C1CCC(CC1)C1CCCC2CCCCC12 | 21 |
| 25 | C1CCC(CC1)C1CCCCC1 | 21 |
| 26 | C(CC1CCCC1)CC1CCC(CC1)C1CCC2CCCC2C1 | 20 |
| 27 | C1CCC(CC1)C1CCCC(C1)C1CCCCC1 | 20 |
| 28 | C(C1C2CCCCC2CC1C1CCCCC1)C1CCCCC1 | 19 |
| 29 | C(CC1CCCC1)CC1CC2CCCC2C1 | 18 |
| 30 | C(C1CCC2CC(CC2C1)C1CCCCC1)C1CCCC2CCCCC12 | 18 |
| 31 | C1CC2CCC(CC2C1)C1CCCCC1 | 18 |
| 32 | C(CC1CCCC1)CC12CC3CC(CC(C3)C1)C2 | 17 |
| 33 | C(CC1CCCC1)CC1CCC(CCC2CCCCC2)CC1 | 17 |
| 34 | C(CC1CCCCC1)C1CCC(CC2CCCC2)C1 | 17 |
| 35 | C(CCC1CCC2CCC(C2C1)C1CCCCC1)CC1CCCCC1 | 16 |
| 36 | C(CC1CCC2CC(CC2C1)C1CCCCC1)C1CCCCC1 | 16 |
| 37 | C(CC1CCC(CCC2CCCCC2)CC1)C1CCCCC1 | 16 |
| 38 | C(C1CCCC1)C1CCC(CC2CC3CCCCC3C2)C1 | 15 |
| 39 | C(CC1CCCC1)CC1CCC(CCCC2CCCCC2)CC1 | 15 |
| 40 | C(CCC1CCCCC1)CC1CCCC1CCC1CCCCC1 | 15 |
| 41 | C(CC1CCCCC1)C1CCCCC1 | 14 |
| 42 | C(CC1CCCC1)CC1CCCC1 | 14 |
| 43 | C(CCC1CCCCC1)CCC1(CCCCC1)C1CC2CCCCC2C1 | 14 |
| 44 | C(C1CCCCC1)C1CC2CCCCC2CC1C1CCCCC1 | 13 |
| 45 | C(CC1CCC(CCC2CCCCC2)C1)C1CCCC1 | 13 |
| 46 | C(CCC1CCCC(CC2CCCCC2)CC1)CC1CCCCC1 | 13 |
| 47 | C(CCC1CCCCC1)CC1CCCC1C1CCC(C1)C1CCCCC1 | 13 |
| 48 | C(CCC1CCC2CCCC2C1CC1CCCCC1)CC1CCCCC1 | 13 |
| 49 | C1CCC(CC1)C1CCC(CC1)C1CCCCC1 | 13 |
| 50 | C(CC1CCCCC1)C1CCCC1 | 12 |
| 51 | C(CCC1CCCC1)CCC1CCC(C1)C1CCCCC1 | 12 |
| 52 | C(C1CCCC1)C1CCC(C1)C1CCC(CC1)C1CC2CCCCC2C1 | 11 |
| 53 | C(CCC1CCCCC1)CC1CCCCC1 | 11 |
| 54 | C1CC2CCC(CC2C1)C1CCC(CC1)C1CCCCC1 | 11 |
| 55 | C(CCC1CCC2CC(CC2C1)C1CCCCC1)CC1CCCCC1 | 11 |
| 56 | C(CCCCC1CCCCC1)CCCC1CCCC1 | 11 |
| 57 | C(C(CC1CCCCC1)C1CCCC1CC1CCCC1)C1CCCCC1 | 10 |
| 58 | C(C1CCC2CC(CC2C1)C1CCCCC1)C1CCC2CCCCC2C1 | 10 |
| 59 | C(CCC1CCC(CCC2CCCC2)CC1)CC1CCCCC1 | 10 |
| 60 | C(CCCC1CCCC1)CCC1CCCC1 | 10 |
| 61 | C(CCC1CCCC1)CCC1CCC2CCCC2C1 | 10 |
| 62 | C(CCC1CCCCC1CC1CCCCC1)CC1CCCCC1 | 10 |
| 63 | C(CCCC1CCCC1)CCCC1CCC2CCCC2C1 | 10 |
| 64 | C1CCC(CC1)C1CCC(CC1)C1CCC2CCCCC2C1 | 9 |
| 65 | C(CCCCC1CCCC1)CCCCC1CCCCC1 | 9 |
| 66 | C(CCC1CCCC1)CCC1CCCC1 | 9 |
| 67 | C(CC1CCC(CCC2CCCC2)C1)CC1CCCCC1 | 8 |
| 68 | C(C1CCCC1)C1CCC(CC2CCC3CCCCC3C2)C1 | 8 |
| 69 | C(CC1CCCC1)CC1CCC(CC1)C1CCCC(C1)C1CCCC1 | 8 |
| 70 | C(CCC1CCCCC1)CC1CCCC1CCCC1CC1 | 8 |
| 71 | C(C1CCCC1)C1CCC(C1)C1CCC(CC1)C1CCCC2CCCCC12 | 8 |
| 72 | C(CC1CCCCC1C1CCCCC1)C1CCCC1 | 8 |
| 73 | C(CCCC1CCC2CCCCC2C1)CCC1CCCC1 | 8 |
| 74 | C(CCC1CCC(CC1)C1CCCCC1)CC1CCCCC1 | 8 |
| 75 | C(C1CCCCC1)C1CC2CCCC2CC1C1CCCCC1 | 8 |
| 76 | C(CC1CCCC1)CC1CCC(CC2CCCCC2)CC1 | 8 |
| 77 | C1CCC(C1)C1CCC2C(CCC3CCCCC23)C1 | 7 |
| 78 | C(CCC1CCCC1)CCC1CCCCC1 | 7 |
| 79 | C(C1CCC2CC(CC2C1)C1CCCCCC1)C1CCC2CCCCC2C1 | 7 |
| 80 | C(CCC1CCCCCC1CC1CCCCC1)CC1CCCCC1 | 7 |
| 81 | C(CCC1CCC2CCCCC2C1)CC1CCCC1 | 7 |
| 82 | C(CC1CCCCC1)CC1CCCCC1C1CCCCC1 | 7 |
| 83 | C(C1CCCC1)C1CCC(C1)C1CCC2CCCCC12 | 7 |
| 84 | C1CC(CC1C1CCCCC1)C1CCCC(C1)C1CCCCC1 | 6 |
| 85 | C(CCC1CCC(CCCC2CCCCC2)CC1CC1CCCCC1)CC1CCCCC1 | 6 |
| 86 | C(CC1CCCCC1)CC1CCCC(CCC2CCCC2)C1 | 6 |
| 87 | C(CCC1CCCCC1)CC1CCCC1C1CCC(C1)C1CC1 | 6 |
| 88 | C(C1CCC2CC(CC2C1)C1CCCCCC1)C1CCCC2CCCCC12 | 6 |
| 89 | C(CC1CCCC1CC1CCCC1)CC12CC3CC(CC(C3)C1)C2 | 6 |
| 90 | C(CC1CCCC1)C(CC1CC1)C1CCC(C1)C1CCCCC1 | 6 |
| 91 | C(CCC1CCC2CCC(C3CCCC3)C2C1)CC1CCCCC1 | 5 |
| 92 | C(CCC1CCC2CC(CC3CCCCC3)CC2C1)CC1CCCCC1 | 5 |
| 93 | C1CCC(C1)C1CCCC(C1)C1CCC(C1)C1CCCCC1 | 5 |
| 94 | C1CC2CCCC2C1 | 5 |
| 95 | C(CC1CCC(C1)C1CCCCC1)CC1CCCC2CCCCC12 | 5 |
| 96 | C1C(CC2CCCCC12)C1CCCCC1 | 5 |
| 97 | C(CCC1CCC2CC(CC2C1)C1CC1)CC1CCCCC1 | 5 |
| 98 | C(CCC1CCCCC1)CC1CCCC1CC1CCCCC1 | 4 |
| 99 | C(CC1CCCC1)CC1CCC(CC1)C1CCCC2CCCC12 | 4 |
| 100 | C(CC1CCC(CC2CCCC2)C1)CC1CCCCC1 | 4 |
| 101 | C1CC1C1CCCC2CCC(CC12)C1CCC(C1)C1CCCCC1 | 4 |
| 102 | C(CCC1CCC2CCCC2C1CC1CCCC1)CC1CCCCC1 | 4 |
| 103 | C(CC1CCC1)CC1CCC(CC1)C1CCCCC1 | 4 |
| 104 | C(C1CCCCC1)C1CC(CCC1C1CCCCC1)C1CCCC1 | 4 |
| 105 | C(CC1CCC(CC2CCCCC2)CC1)C1CCCCC1 | 4 |
| 106 | C(CC1CCCC1)CC1C2CC3CC(C2)CC1C3 | 4 |
| 107 | C(C1CCCCC1)C1CC(CCC1C1CCCCC1)C1CCCCC1 | 4 |
| 108 | C(CCC1CCC2C(CCC2C2CCCCC2)C1)CC1CCCCC1 | 4 |
| 109 | C(CCC1CCC(CC2CC3CCCCC3C2)CC1)CC1CCCCC1 | 4 |
| 110 | C(CCCC1CCCCC1)CCC1CCCCC1 | 4 |
| 111 | C(CC1CCC2CC(C(CC3CCCCC3)C2C1)C1CCCCC1)C1CCCCC1 | 4 |
| 112 | C1CCC(CC1)C1CCC2CCCC(C3CCCCC3)C2C1 | 4 |
| 113 | C(CC1CCC(CCC2CCCC2)C1)CC1CCCC2CCCCC12 | 4 |
| 114 | C(CC1CCCCC1)CC1CCC(CCC2CCCC2)CC1 | 4 |
| 115 | C(CCC1CCCC1)CCC1CC2CCCCC2C1 | 4 |
| 116 | C(CC1CCC(C(CC2CCCCC2)C1)C1CCCCC1)C1CCCCC1 | 3 |
| 117 | C(CCC1CC2CCCCC2C1)CC1CCCC1 | 3 |
| 118 | C(C1CCC1)C1CCC(C1)C1CCC(CC1)C1CCCCC1 | 3 |
| 119 | C(CCC1CCC(CC2CCC3CCCCC3C2)CC1)CC1CCCCC1 | 3 |
| 120 | C1CC2CC(CC2C1)C1CCCCC1 | 3 |
| 121 | C(CCC1CCCC1)CCC1CCC2CCCCC2C1 | 3 |
| 122 | C(CCC1CCCCC1)CCC1(CCCCC1)C1CCCCC1 | 3 |
| 123 | C(CCC1CCCCC1)CC1CCCC1C1CCCC1 | 3 |
| 124 | C(C1CCCCC1)C1CCC2CC(CC2C1)C1CCCCC1 | 3 |
| 125 | C(CC1CCCCC1)CC1CCC(CC1)C1CCCCC1 | 3 |
| 126 | C(CCC1CCCCC1)CC1CCCC1C1CCC(C1)C1CCC1 | 3 |
| 127 | C(CC1CCCC1)CC12CC3CC(C1)CC(CCCC1CCCCC1)(C3)C2 | 3 |
| 128 | C(CC1CCCC1)C1CCCC1 | 3 |
| 129 | C(CC1CC1)CC1CCCC1 | 3 |
| 130 | C(CC1CCC(CCC2CCCC3CCCCC23)CC1)C1CCCC1 | 3 |
| 131 | C(CC1CCC(CCC2CCC3CCCCC3C2)CC1)C1CCCC1 | 3 |
| 132 | C | 3 |
| 133 | C(CC1CCC1)CC1CCCC1 | 3 |
| 134 | C(CC1CCCC1)CC1CCCCCCC1 | 3 |
| 135 | C(CCCC1CC2CCCCC2C1)CCC1CCCC1 | 3 |
| 136 | C(C1CCCC1)C1CCCC(C1)C1CCCCC1 | 2 |
| 137 | C(CCC1CCCC(CC1)C1CCCCC1)CC1CCCCC1 | 2 |
| 138 | C(C1CC1)C1CCCC(C1)C1CCC(C1)C1CCCCC1 | 2 |
| 139 | C(CCCC1CCCC1)CCCC1CCC2CC(CC2C1)C1CCCCC1 | 2 |
| 140 | C(CC1CCCC(C1)C1CCCCC1)C1CC1 | 2 |
| 141 | C(CCC1CCCC(CC1)C1CC1)CC1CCCCC1 | 2 |
| 142 | C(CC1C2CC(CC12)C1CC2CCCCC2C1)CC1CCCC1 | 2 |
| 143 | C(CC1CCC(C1)C1CCCCC1)CC1CCCCC1 | 2 |
| 144 | C1CCC(C1)C1CCC2CCCC(C3CCCCC3)C2C1 | 2 |
| 145 | C(CCCC1CCCC1)CCCC1CCCC1 | 2 |
| 146 | C(CCCCCC1CCCCC1)CCCCC1CCCC1 | 2 |
| 147 | C(CC1CCCC1)CC1(CCC2CCCCC2)CCCC1 | 2 |
| 148 | C(CC1CCC(CCC2CCCC3CCCCC23)C1)C1CCCC1 | 2 |
| 149 | C(CCC1CCC(CC2CCC(CC2)C2CCCCC2)CC1)CC1CCCCC1 | 2 |
| 150 | C(CCC1CCCC1)CC1CCCC1 | 2 |
| 151 | C(CC1CCCC1)CC1CCCCC1C1CCCCC1 | 2 |
| 152 | C(CCC1CCCCC1)CC1CCCC1CCCC1CCCC1 | 2 |
| 153 | C(C1CC2CCCC2C1)C1CCCCC1 | 2 |
| 154 | C1CC2CCCCCCCCCCCCCCC3CCCC(C3)CCC2C1 | 2 |
| 155 | C(CC1CCCC1)CC1CC2CCCCC2C1 | 2 |
| 156 | C(CC1CC1)CC1CCC(CCCC2CCCC2)CC1 | 2 |
| 157 | C(CC1CCCC1)CC1CCC(CCCC2CCCC2)CC1 | 2 |
| 158 | C(CC1CCCC1)CC1CC2CC(CC3CCCCC3)CC2C1 | 2 |
| 159 | C1CCC2CCCCC2C1 | 2 |
| 160 | C(CC1CCC(CC1)C1CCC2CCCC2C1)C1CCCCC1 | 2 |
| 161 | C(CC1CCC(CC1)C1CCCC1)C1CCCCC1 | 2 |
| 162 | C1CC2CCC(CC2C1)C1CCCC(C1)C1CCC(C1)C1CCCCC1 | 2 |
| 163 | C(CC1CCC(CCC2CCC(CC2)C2CCCC2)CC1)C1CCCCC1 | 2 |
| 164 | C(CC1CCCC1)CC1CCCCCC1 | 2 |
| 165 | C(C1CCCCC1)C1CCCC(C1)C1CCCCC1 | 2 |
| 166 | C(CC1CCCC1)CC1CC2CCC1C2 | 2 |
| 167 | C(CCC1CCCCC1)CC1CCCC1CCCC1CCC2CCCC2C1 | 2 |
| 168 | C(CCC1CCCCC1)CC1CCCC1CCCC1CCC2CCCCC2C1 | 2 |
| 169 | C(C1CCCC1)C1CCCCC1 | 2 |
| 170 | C(C1CCCC1)C1CCC(C1)C1CCCC2CCCCC12 | 2 |
| 171 | C(CCC1CCC2CCCC2C1CC1CC1)CC1CCCCC1 | 2 |
| 172 | C(C1CCCC1)C1CCC(CC2CCCC2)C1 | 2 |
| 173 | C(CCC1CCCC1CC1CCCC1)CCC1CCCCC1 | 2 |
| 174 | C(C1CCC1)C1CCC(CC2CCCC2)C1 | 2 |
| 175 | C(CCCC1CCCC1)CCCC1CCC2CCCCC2C1 | 2 |
| 176 | C(CCC1CCC2C(CCC2C2CC2)C1)CC1CCCCC1 | 2 |
| 177 | C(C1CCC2CCCC2C1)C1CCC2CCCCC2C1 | 2 |
| 178 | C(C1CCC2CC(CC2C1)C1CCCCC1)C1CC2CCCCC2CC2CCCCC12 | 2 |
| 179 | C1C(CC2CCCCC12)C1CCC(CC1)C1CCCCC1 | 2 |
| 180 | C(C(CC1CCCCC1)C1CCC(CC2CCCC2)C1)C1CCCCC1 | 2 |
| 181 | C(C1C2CC(CC3CCCC4CCCCC34)CCC2CC1C1CCCCC1)C1CCCCC1 | 2 |
| 182 | C(CC1CCCC1)CC1CCC(CC1)C1CCCC(C1)C1CCCCC1 | 1 |
| 183 | C(C1CCC1)C1CCCC1 | 1 |
| 184 | C(C1CCCCC1)C1CC2CC(CC2CC1C1CCCCC1)C1CCCCC1 | 1 |
| 185 | C(CC1CCC(CC1)C1CCC(CC1)C1CCC(CC1)C1CCCCC1)C1CCCCC1 | 1 |
| 186 | C(C1CCCCC1)C1CCCCC1CC1CCCCCC1 | 1 |
| 187 | C(C1CCCC1)C1CCCCC1C1CCCCC1 | 1 |
| 188 | C(CC1CCC(CC2CCCCC2)C1)C1CCCC1 | 1 |
| 189 | C(CC1CCCC1)CC1CCC(CCC2CC2)CC1 | 1 |
| 190 | C(C1CCCCC1)C1CC2CCC(CC2CC1C1CCCCC1)C1CCCCC1 | 1 |
| 191 | C(C1CCCCC1)C1CCC(CC1C1CCCCC1)C1CCCCC1 | 1 |
| 192 | C(CCCC1CCCCCC1)CCC1CCCC1 | 1 |
| 193 | C(CCCC1C2CCC1CC2)CCC1CCCC1 | 1 |
| 194 | C(CCC1CCC(CC2CCCC(C2)C2CCCCC2)CC1)CC1CCCCC1 | 1 |
| 195 | C(CCCC1CCCC1)CCC1CC1C1CCCCC1 | 1 |
| 196 | C(CCC1CCC(CC1)C(C1CCCCC1)C1CCCCC1)CC1CCCCC1 | 1 |
| 197 | C(CC1CCCC1)CC1CCC(CCCC2CCC3CCCCC3C2)CC1 | 1 |
| 198 | C(CCC1CCC(CC1)C1CCCCC1C1CCCCC1)CC1CCCCC1 | 1 |
| 199 | C(CC1CCCC1)CC1CCC(CCCC2CC3CCCCC3C2)CC1 | 1 |
| 200 | C(CC1CCCC1)CC1CC2CC(CC3CCCC3)CC2C1 | 1 |
| 201 | C1CC2CCCCCCCCCCCCCCCCCCCCC3CCCC(C3)CCC2C1 | 1 |
| 202 | C(CCCC1CCC(CC1)C1CCCCC1)CCC1CCCC1 | 1 |
| 203 | C(C1CCCC1)C1CCC(C1)C1CCC(CC1)C1CCC2CCCCC2C1 | 1 |
| 204 | C(CCC1CCC(CC2CCCC2)CC1)CC1CCCCC1 | 1 |
| 205 | C(CC1CCCC1)CC1CC2CC(CCCC3CCCCC3)CC2C1 | 1 |
| 206 | C(CCCC1CCC(CC1)C1CC2CCCCC2C1)CCC1CCCC1 | 1 |
| 207 | C(CCC1CCCCC1)CCC1CCC(CC(CC2CCC(CCCCCC3CCCCC3)CC2)C2CCCC2CC2CCCC2)CC1 | 1 |
| 208 | C(CC1CC2CCCCC2C1)C1CCC(CC2CCCC2)C1 | 1 |
| 209 | C(C1CC2CCCCC2C1)C1CCC2CC(CC2C1)C1CCCCC1 | 1 |
| 210 | C(CC1CCCCC1)C1CCCC1CC1CCCC1 | 1 |
| 211 | C(C1CCCC1)C1CCCC1CC1CC2CCCCC2C1 | 1 |
| 212 | C(C1CCCC1)C1CCC(CC2CCC(CC2)C2CCC(C2)C2CCCCC2)C1 | 1 |
| 213 | C(CCC1CCC(CC2CCC3CCCCC23)CC1)CC1CCCCC1 | 1 |
| 214 | C(CC1CCCCC1)C1CCCC1CC1CC2CCCCC2C1 | 1 |
| 215 | C(CCC1CCCC1CCC1CCCCC1)CC1CCCC1 | 1 |
| 216 | C(CC1CCCCC1)C1CCC1CC1CCCC1 | 1 |
| 217 | C(CC1CCCCC1)C1CCC1CC1CC2CCCCC2C1 | 1 |
| 218 | C(CCC1CCC(CCC2CCCCC2)CC1)CC1CCCCC1 | 1 |
| 219 | C(C1C2CC(CC3CCCC4CCCCC34)CCC2CC1C1CCCCCC1)C1CCCCC1 | 1 |
| 220 | C(CC1CCC2CC(CC2C1)C1CCCCCC1)C1CCCCC1 | 1 |
| 221 | C(CCCC1CCC(CC2CCCCC2)CC1)CCC1CCCC1 | 1 |
| 222 | C(CCC1CCCCCC1CC1CCCC1)CC1CCCCC1 | 1 |
| 223 | C(CC1CCCC1)CC1CCC(CC1)C1CCCC(CC2CC2)C1 | 1 |
| 224 | C(C1CCCC1)C1CCC(CC2CCCC(CC2)C2CCCC2)C1 | 1 |
| 225 | C(C1CC2CCCCC2C1)C1CCC2CC(CC2C1)C1CCCCCC1 | 1 |
| 226 | C(CCC1CCCCC1CCC1CCCC1)CC1CCCCC1 | 1 |
| 227 | C(CC1CCC(CCC2CCCCC2)CC1)C1CCC2CCCCC12 | 1 |
| 228 | C(CC1CCC(CC1)C(CC1CCCC1)CC1CCCCC1)C1CCCCC1 | 1 |
| 229 | C1C(CC2CC(CCC12)C1CCCCC1)C1CCCCC1 | 1 |
| 230 | C1CCC(C1)C1CCC(C1)C1CCCC(C1)C1CCCCC1 | 1 |
| 231 | C(C1CCCC1)C1CCC2CC(CCC2C1)C1CCCC(C1)C1CCCC1 | 1 |
| 232 | C(CCC1CCCCC1)CC1CCCC1CCCC1CCC(CCCC2CCCCC2)CC1 | 1 |
| 233 | C(C1CCC2CCC(CC12)C1CCCCC1)C1CCCCC1 | 1 |
| 234 | C(CCC1CCCC2CCCC12)CC1CCCCC1 | 1 |
| 235 | C(CC1CCC1)CC1CCC(CC1)C1CCC2CCCC2C1 | 1 |
| 236 | C(CC1CCCC(C1)C1CCCCC1)C1CCCC1 | 1 |
| 237 | C(C1CC2CCCCC2C1)C1CCCC1CC1CC2CCCCC2C1 | 1 |
| 238 | C(CCC1CCCC1CCCC1CCCCC1)CC1CCCC1 | 1 |
| 239 | C(CCC1CCCC1)CC(CCC1CCCC1)CC1CCCCC1 | 1 |
| 240 | C(CC1CCCC1)C1CC1 | 1 |
| 241 | C1CCC(C1)C1CCC(C1)C1CCCC(C1)C1CCC2CCCC2C1 | 1 |
| 242 | C(C1CCCC1)C1CC2CCCCC2C1 | 1 |
| 243 | C(CCC1CCCC(CCCCC2CCCCC2)CC1)CC1CCCCC1 | 1 |
| 244 | C(CCCC1CCCC1CCCCC1CCCCC1)CCC1CCCC1 | 1 |
| 245 | C(CCC1CCC(CC2CCCCC2)CC1)CC1CCCC1 | 1 |
| 246 | C(C1CCCC1)C1CCC(CC2CCC3CCCC3C2)C1 | 1 |
| 247 | C(CCC1CCCCC1CC1CCCC1)CC1CCCCC1 | 1 |
| 248 | C(CCC1CCC(CC2CCCCC2)CC1CC1CCCCC1)CC1CCCCC1 | 1 |
| 249 | C(CC12CC3CC(CC(C3)C1)C2)C1CCC2CC12 | 1 |
| 250 | C(CCC1CCC(CC1)C1CCCCC1)CC1CCCC1 | 1 |
| 251 | C(CCC(CC1CCCCC1)C1CCCCC1)CCC1CCCC1 | 1 |
| 252 | C(CCC1CCC2CCCC2C1C1CC1)CC1CCCCC1 | 1 |
| 253 | C(CCC1CCC2C(CC3CCCCC3)CCC2C1)CC1CCCCC1 | 1 |
| 254 | C(CCC1CCC2CCC(CC3CCCCC3)C2C1)CC1CCCCC1 | 1 |
| 255 | C(CC1CCCC1)C(CC1CCC1)C1CCC(CC1)C1CCCCC1 | 1 |
| 256 | C(CCCC(C1CCCCC1)C1CCCCC1)CCCC1CCCC1 | 1 |
| 257 | C(CC1CCCC1)C(CCC1CCCC1)C1CCC(CC1)C1CCCCC1 | 1 |
| 258 | C(C1CCCC1)C1CCC(C1)C1CCC(CC1)C1CCC2CCCCC12 | 1 |
| 259 | C(CCCC1CC1C1CCCC1)CCCC1CCCC1 | 1 |
| 260 | C1CCC(C1)C1CCC2CCCCC12 | 1 |
| 261 | C(C1CCCC1)C1CCC2CCCCC2C1 | 1 |
| 262 | C(CCC1CCCCC1C1CCCCC1)CC1CCCCC1 | 1 |
| 263 | C(CC1CCCC1)C(CC1CCCC1)C1CCC(CC1)C1CCCCC1 | 1 |
| 264 | C1C2CCCCC2C2CCCC(C12)C1CCCCC1 | 1 |
| 265 | C(CCC(CC1CCCCC1)CC1CCCCC1)CCC1CCCC1 | 1 |
| 266 | C(C1CCCC1)C1CCC(C1)C1CCCC1 | 1 |
| 267 | C(C1CCCC1)C1CCC(C1)C1CCC(CC1)C(C1CCCCC1)C1CCCCC1 | 1 |
| 268 | C(C1CCCC1)C1CCC(C1)C1CCCC(CC1)C1CCCCC1 | 1 |
| 269 | C(CC1CCCC1)CC1(CCC1)C1CCCCC1 | 1 |
| 270 | C(C1CCCC1)C1CCC(C1)C1CCC(CC2CCCCC2)CC1 | 1 |
| 271 | C(CCC1CCC2CCCCC2C1)CC1CCCCC1 | 1 |
| 272 | C(CCCC1CCCC1)CCCC1CCC(CC1)C1CCCCC1 | 1 |
| 273 | C(C1CCCC1)C1CCC(C1)C1CC2CCCCC2C1 | 1 |
| 274 | C(C1CCCC1)C1CCC(C1)C1C2CC3CC(C2)CC1C3 | 1 |
| 275 | C(C1CCCC1)C1CCC(CC2CCCCCCC2)C1 | 1 |
| 276 | C(C1CCCC1)C1CCC(CC2CCCCCC2)C1 | 1 |
| 277 | C1CC(C2CC(CCC12)C1CCC(CC1)C1CCCCC1)C1CCCCC1 | 1 |
| 278 | C(CC1CCCC1)CC1CCCCCCCCC1 | 1 |
| 279 | C1CC1C1CCCC2CCC(CC12)C1CCC(CC1)C1CCCCC1 | 1 |
| 280 | C(CCC1CCC2CCCCC2C1CC1CCCCC1)CC1CCCC1 | 1 |
| 281 | C1CC1C1CC2CCC(CC2C1)C1CCC(CC1)C1CCCCC1 | 1 |
| 282 | C(CC1CCCC1)CC12CCC(CC1)CC2 | 1 |
| 283 | C1CCC2C(C1)CCC1CCCCC21 | 1 |
| 284 | C(CC1CCCC1)CC1CCC(CC1)C1CCCC(C1)C1CCC(C1)C1CC1 | 1 |
| 285 | C(CC1CCCC1)CC12CC3CC1CC(C2)C3 | 1 |
| 286 | C(CC1CCCC1)CC1CC2CC1CCC2 | 1 |
| 287 | C1CC1C1CCC2CC(CC2C1)C1CCC(CC1)C1CCCCC1 | 1 |
| 288 | C(CC1CC1)C(CCC1CCCC1)C1CCC(CC1)C1CCCCC1 | 1 |
| 289 | C(CC1CCCCC1)C1CC2CCCC2C1 | 1 |
| 290 | C1CCC(C1)C1CC2CCCC2C1 | 1 |
| 291 | C(CC1CCCC1)C(C1CCCC1)C1CCC(CC1)C1CCCCC1 | 1 |
| 292 | C(CCC1CCCCC1)CC1CCCC1CCCC1CCC1 | 1 |
| 293 | C(CCC1CCCCC1)CC1CCCC1CCC1CC1 | 1 |
| 294 | C(CC1CCCC1)CC1CC2CC(C2)C1 | 1 |
| 295 | C(CC1CCC(CC2CCC3CCCCC23)CC1)C1CCCCC1 | 1 |
| 296 | C(CC1CCCC1)CC1CCCCCCCCCCC1 | 1 |
| 297 | C(CC1CCC(CC2CCC3CCCC3C2)CC1)C1CCCCC1 | 1 |
| 298 | C(CC1CCC(CC2CCC3CCCCC3C2)CC1)C1CCCCC1 | 1 |
| 299 | C(CCC1CC2CC(C2)C1)CC1CCCC1 | 1 |
| 300 | C(CC1CCCC1)CC1CCC(CCCC23CC4CC(CC(C4)C2)C3)CC1 | 1 |
| 301 | C1CC1C1CCCC(C1)C1CCC2C(CCC3CCCCC23)C1 | 1 |
| 302 | C(CCC1CCCC1)CCC1CCC(CCCC2CCCC2)CC1 | 1 |
| 303 | C(CC1CCCC1)CC1CCC(CCCC2CCC3CCCC3C2)CC1 | 1 |
| 304 | C(CCC1CCC(CCCC2CCCC2)CC1)CC1CCCCC1 | 1 |
| 305 | C1CCC2C(C1)CCC1CC(CCC21)C1CCCCCC1 | 1 |
| 306 | C(CCC12CC3CC(CC(C3)C1)C2)CC1CCCC1 | 1 |
| 307 | C(CC1CCCC1)CC1CCC(CC2CC3CCCCC3C2)CC1 | 1 |
| 308 | C(CCC1CCCCC1)CC1CCCC1C1CCC(CC2CC2)C1 | 1 |
| 309 | C(CC1CCCC1)CC1CCC(CC1)C1CC2CCCCC2C1 | 1 |
| 310 | C(CCC1CCCCC1)CC1CCCC1C1CCC(C1)C1CCCC1 | 1 |
| 311 | C(CCC1CCC2CCCCC2C1)CC1CCC(CCC2CCCC2)CC1 | 1 |
| 312 | C(CCC1CCCC2CCCCC12)CC1CCC(CCC2CCCC2)CC1 | 1 |
| 313 | C(CCC1CCCC1)CCC1CCC(CC1)C1CCCCC1 | 1 |
| 314 | C(CCC1CCCC1)CCC1CCC2CC(CC2C1)C1CCCCC1 | 1 |
| 315 | C(CCC(C1CCCCC1)C1CCCCC1)CCC1CCCC1 | 1 |
| 316 | C(CCC1CCCC(CC2CCCC2)CC1)CC1CCCCC1 | 1 |
| 317 | C(CC1C2CC(CC12)C1CCC2CCCCC2C1)CC1CCCC1 | 1 |
| 318 | C(CC1C2CC(CC12)C1CCCC1)CC1CCCC1 | 1 |
| 319 | C(CCC1CCCC(CC2CCC3CCCC3C2)CC1)CC1CCCCC1 | 1 |
| 320 | C(CCC1CCCC(CCC2CCCCC2)CC1)CC1CCCCC1 | 1 |
| 321 | C(CCC1CCCC(CCCC2CCCCC2)CC1)CC1CCCCC1 | 1 |
| 322 | C(CCC1CCCC2CCCCC12)CC1CCCC1 | 1 |
| 323 | C(CCC1CCCCC1)CCC1(CCCCC1)C1CCCC1 | 1 |
| 324 | C(CCC1CCCCC1)CCC1(CCCCC1)C1CC2CCC(CCCC3CCCCC3)CC2C1 | 1 |
| 325 | C(CC1CCC2CC(CC2C1)C1CCC2CCCC12)C1CCCCC1 | 1 |
| 326 | C(CC1CCCC1)C(C1CCCCC1)C1CCCCC1 | 1 |
| 327 | C(CCC1CCCCC1)CCC1(CCC(CC2CCCCC2)CC1)C1CCCCC1 | 1 |
| 328 | C1CCCCCC1 | 1 |
| 329 | C1CCCCC1 | 1 |
| 330 | C(CC1CCC2CCCC2C1)C1CCCC1 | 1 |
| 331 | C(C1CCCC1)C1CC2CCC1C2 | 1 |
| 332 | C(CCC1CCCC1)CCC1CCC2CCCCC12 | 1 |
Abbreviation: SMILEs, simplified molecular-input line-entry system.
Summary of molecular framework generated from inactive DPP4 inhibitors
| Number | SMILES | Member size |
|---|---|---|
| 1 | C1CCCC1 | 43 |
| 2 | C1CCCCC1 | 32 |
| 3 | C(CCCC1CCCCC1)CCC1CCCCC1 | 29 |
| 4 | C(CCC1CCCCC1)CC1CCCC1 | 24 |
| 5 | C1C(CC2CCCCC12)C1CCCCC1 | 20 |
| 6 | C(CC1CCCC1)CC1CCCCC1 | 20 |
| 7 | C1CC2CCCCC2C1 | 18 |
| 8 | C1CCC2CCCCC2C1 | 15 |
| 9 | C1CCC(CC1)C1CCCC(C1)C1CCCCC1 | 14 |
| 10 | C1CCC(CC1)C1CCCCC1 | 12 |
| 11 | C(CC1CC2CCCCC2C1)C1CCC(CC2CCCC2)C1 | 12 |
| 12 | C(CCC1CCC(CC1)C(C1CCCCC1)C1CCCCC1)CC1CC2CCCCC2C1 | 9 |
| 13 | C(CCC1CCCC2CCCCC12)CC1CCCC1 | 9 |
| 14 | C(CC1CCCCC1)CC1CCCCC1 | 8 |
| 15 | C | 8 |
| 16 | C(CC1CCCC1)CC12CC3CC(CC(C3)C1)C2 | 8 |
| 17 | C(CCC1CCCCC1)CC1CCCCC1 | 8 |
| 18 | C(C(CC1CCCCC1)C1CCCC1)C1CCCCC1 | 7 |
| 19 | C1CCC(CC1)C1CCCC2CCCCC12 | 7 |
| 20 | C(C1CCCC1)C1CCCC1 | 6 |
| 21 | C(CC1CCCCC1)C1CCCC1 | 6 |
| 22 | C(C(CC1CCCCC1)C1CCC2CCCCC12)C1CCCCC1 | 6 |
| 23 | C(CCC1CCCCC1)CCC1CCCCC1 | 6 |
| 24 | C(CCC1CCCC1)CCC1CCCCC1 | 6 |
| 25 | C(C1CCCCC1)C1CCCC(C1)C1CCCCC1 | 6 |
| 26 | C(C1CCCC1)C1CCCC(C1)C1CCCCC1 | 6 |
| 27 | C(CC1CCCCC1)C1CCCCC1 | 5 |
| 28 | C1CCC(CC1)C1CCC2CCCCC2C1 | 5 |
| 29 | C(CC1CCCC1)CC1CCCC1 | 4 |
| 30 | C(C(CC1CCCCC1)C1CCCC1CC1CCCC1)C1CCCCC1 | 4 |
| 31 | C(C1CCCCC1)C1CCCCC1 | 4 |
| 32 | C(CCC1CCC2CCCC2C1)CC1CCCCC1 | 4 |
| 33 | C(CC1CCC(CC1)C1CCCCC1)C1CCC(CC2CCCC2)C1 | 3 |
| 34 | C(CC1CCCCC1)CC1CCCC(C1)C(CCC1CCCCC1)C1CCCCC1 | 3 |
| 35 | C(CCC1CCC(CC2CCCCC2)CC1)CC1CC2CCCCC2C1 | 3 |
| 36 | C(CCCC1CCCCC1)CCC1CCCC1 | 3 |
| 37 | C1CCCCCC1 | 3 |
| 38 | C(CC1CCCCC1)C(C1CCCCC1)C1CCCCC1CCC1CCCCC1 | 3 |
| 39 | C(CCC1CCC(CCCC2CCCCC2)CC1CC1CCCCC1)CC1CCCCC1 | 3 |
| 40 | C(CC1CCCC(C1)C1CCCCC1)C1CC1 | 3 |
| 41 | C1CCC1 | 3 |
| 42 | C(CC1CCC2CCCC2C1)C1CCC(CC2CCCC2)C1 | 3 |
| 43 | C(CC1CCCC1)CC1CCC(C1)C1CCCCC1 | 3 |
| 44 | C(CCC1CCCCC1)CC1CCCC1CC1CCCC1 | 3 |
| 45 | C(CCC1CCC2CCCC2C1)CC1CCCC1 | 3 |
| 46 | C(CC1CCC2CCCCC2C1)C1CCCCC1 | 3 |
| 47 | C(CCC1CCCC1)CC1CCCC1 | 2 |
| 48 | C(CCC(C1CCCCC1)C1CCCCC1)CCC1CC2CCCCC2C1 | 2 |
| 49 | C1C2CC(CCC2C2CCC3C(CCC4CCCCC34)C12)C1CCCCC1 | 2 |
| 50 | C1CCC(CC1)C1CCC2C(CCC3CCCCC23)C1 | 2 |
| 51 | C(CC1CCCCC1)C1CCC(CC2CC3CCCCC3C2)C1 | 2 |
| 52 | C1CC2CCC(CC2C1)C1CCCCC1 | 2 |
| 53 | C(CCCC1CCCCC1)CCC1CC2CCCCC2C1C(CC1CCCCC1)CC1CCCCC1 | 2 |
| 54 | C(C1CCC(CC2CCC3CCCCC3C2)C1)C1CC2CCCCC2C1 | 2 |
| 55 | C(CC1CCCC2CCCCC12)C1CCC(CC2CC3CCCCC3C2)C1 | 2 |
| 56 | C(CCC1CCCC1)CCC1CCC(C1)C1CCCCC1 | 2 |
| 57 | C(CC1CCCCC1)CC1CCC(CC2CCCCC2)CC1 | 2 |
| 58 | C(CC1CCCC1CC1CCCC1)CC1CCCCC1 | 2 |
| 59 | C(CCC1CCC2CCCCC2C1)CC1CC2CCCCC2C1 | 2 |
| 60 | C(CC1CCCCC1)CC12CC3CC(CC(C3)C1)C2 | 2 |
| 61 | C(CCC1CCCCC1)CC(CC1CCCCC1)CC1CCCCC1 | 2 |
| 62 | C(CCC1CCC(CC2CCC3CCCCC3C2)CC1)CC1CC2CCCCC2C1 | 1 |
| 63 | C(CCC1CCC(CC2CCC(CC2)C2CCCCC2)CC1)CC1CC2CCCCC2C1 | 1 |
| 64 | C(CCC1CCC(CC2CCCC(C2)C2CCCCC2)CC1)CC1CC2CCCCC2C1 | 1 |
| 65 | C(CCC1CC2CC1CC2C(C1CCCCC1)C1CCCCC1)CC1CC2CCCCC2C1 | 1 |
| 66 | C(CCC1CCCC(CC1)C(C1CCCCC1)C1CCCCC1)CC1CC2CCCCC2C1 | 1 |
| 67 | C(CCC1CCC(CC(C2CCCCC2)C2CCCCC2)CC1)CC1CC2CCCCC2C1 | 1 |
| 68 | C(CC1CCCCC1)C1CC2CCCCC2C1 | 1 |
| 69 | C(CCC1CCC(CC2C3CCCCC3C3CCCCC23)CC1)CC1CC2CCCCC2C1 | 1 |
| 70 | C(CCC1CCC(CC1)C1CCCCC1)CC1CC2CCCCC2C1 | 1 |
| 71 | C(CC1CCC2C1CCC1C2CCC2CCCCC12)CC1CCCCC1 | 1 |
| 72 | C(CC1CCC(CC2CCCC2)C1)CC1CCCCC1 | 1 |
| 73 | C(CC1CCCC1)CC1CCC(CC2CCCCC2)CC1 | 1 |
| 74 | C(CC1CCCCC1)C1CCC(CC2CCCC2)C1 | 1 |
| 75 | C(CC1CCCCC1)CC1CCC(CCCC2CCCCC2)CC1 | 1 |
| 76 | C(CC1CCC(CC2CCCC2)C1)C1CCCC1 | 1 |
| 77 | C1CC2CCC3C(CCC4CCCCC34)C2C1 | 1 |
| 78 | C(CC1CCCCC1)C1CC2CCC3C(CCC4CCCCC34)C2C1 | 1 |
| 79 | C(CCC1CCCC(C1)C1CCCCC1)CC1CCCCC1 | 1 |
| 80 | C(CC1CCCCC1)CC1CCCC(C1)C1CCCCC1 | 1 |
| 81 | C(CC1CCCC(C1)C1CCCCC1)C1CCCC1 | 1 |
| 82 | C(CCC1CCC(CC1)C1CCCC1)CC1CCCC1 | 1 |
| 83 | C(CCC1CCCC(C1)C1CCCCC1)CC1CCCC1 | 1 |
| 84 | C(CCC1CCC2CCCCC2C1)CC1CCCC1 | 1 |
| 85 | C(CCC1CCCC1)CCC1CCC(CC1)C(C1CCCCC1)C1CCCCC1 | 1 |
| 86 | C(CCC1CCCC1)CCC1CCCC1 | 1 |
| 87 | C(CCC1CCCC(CC2CCCC2)C1)CC1CCCC1 | 1 |
| 88 | C(CCCC1CCCC1)CCCC1CCC(CC1)C(C1CCCCC1)C1CCCCC1 | 1 |
| 89 | C1CCC(C1)C1CCCCC1 | 1 |
| 90 | C(CCC1CCCC2CCCCC12)CC1CCCCC1 | 1 |
| 91 | C(CCCCC1CCCC1)CCCCC1CCCCC1 | 1 |
| 92 | C(CCCCC1CCCC1)CCCC(C1CCCCC1)C1CCCCC1 | 1 |
| 93 | C(CCCC1CCCC1)CCCC1CCC(CC2CCCCC2)CC1 | 1 |
| 94 | C(CC1CCC2CCCCC12)CC1CCC2CCCCC2C1 | 1 |
| 95 | C(CCC1CCC(C1)C(C1CCCCC1)C1CCCCC1)CCC1CC2CCCCC2C1 | 1 |
| 96 | C(C1CCCC1)C1CCC2CCCCC2C1 | 1 |
| 97 | C(CCC1CC2CCCCC2C1)CCC1CCC(CC1)C(C1CCCCC1)C1CCCCC1 | 1 |
| 98 | C(CCC1CCCC1CCC(C1CCCCC1)C1CCCCC1)CC1CC2CCCCC2C1 | 1 |
| 99 | C(CCC1CC2CCCCC2C1)CCC1CCC(CC2CCCCC2)CC1 | 1 |
| 100 | C1CC2CCC(CC2C1)C1CCCC(C1)C1CCCCC1 | 1 |
| 101 | C(CC1CCC(CC1)C1CCC(CC1)C1CCCC1)C1CCCCC1 | 1 |
| 102 | C(CCC1CCCC1)CC(C1CCCCC1)C1CCCCC1 | 1 |
| 103 | C(CCC1CCCC1CC1CCC(CC1)C(C1CCCCC1)C1CCCCC1)CC1CC2C1CCCC2C1 | 1 |
| 104 | C(CCC1CC2CCCCC2C1)CCC12CC3CC(CC(C3)C1)C2 | 1 |
| 105 | C(C1CCCC1)C1CCCC1CC1CC2CCCCC2C1 | 1 |
| 106 | C(CC1CCC1)CC12CC3CC(CC(C3)C1)C2 | 1 |
| 107 | C(CC1CCCC1)CC1CCCC1C(CC1CCCCC1)CC1CCCCC1 | 1 |
| 108 | C(CC1CCCCC1)C(C1CCCCC1)C1CCC(CCC2CCCCC2)CC1 | 1 |
| 109 | C(CC1CCCCC1)C(C1CCCCC1)C1CCC2CC(CCC2C1)C1CCCCC1 | 1 |
| 110 | C(CC1CCCC1)CC1CC2CCCCC2C1 | 1 |
| 111 | C(CCCC1CCCC1)CCCC1CCCCC1 | 1 |
| 112 | C(C1CCCCC1)C1CCC2CCC(CC2C1)C1CCCCC1 | 1 |
| 113 | C(CCCCCC1CCCCC1)CCCCC1CC2CCCCC2C1 | 1 |
| 114 | C(CC1CCCC1C(CC1CCCCC1)CC1CCCCC1)CC1CCCCC1 | 1 |
| 115 | C(C(CC1CCCCC1)C1CCCCC1)C1CCCC1 | 1 |
| 116 | C(CC1CCC2CCCCC2C1)CC12CC3CC(CC(C3)C1)C2 | 1 |
| 117 | C(CCC1CCCC1)CC(C(CC1CCCCC1)CC1CCCCC1)C1CCCCC1 | 1 |
| 118 | C(CC1CCCCC1)CC1CCC(CCCC2CCCCC2)C(CC2CCCCC2)C1 | 1 |
| 119 | C(CC1CCCC1)CC1CCCC1C1CCCCC1 | 1 |
| 120 | C(CCCCC1CCCCC1)CCCC1CCCCC1 | 1 |
| 121 | C(CCCC1CCCCC1CCC1CCCCC1)CCC1CCCCC1 | 1 |
| 122 | C(CCCC1CCC2CCCCC2C1)CCC1CCCCC1 | 1 |
| 123 | C1CC2CCCC2C1 | 1 |
| 124 | C(CCCC1CCCCC1)CCCC1CCCCC1 | 1 |
| 125 | C(CCCC1CCC(CCC2CCCCC2)CC1)CCC1CCCCC1 | 1 |
| 126 | C(CCCC1CCCC2CCCCC12)CCC1CCCCC1 | 1 |
| 127 | C1CCC(CC1)C(C1CCCCC1)C1CCCCC1 | 1 |
| 128 | C(C1CCCCC1)C1CCC(CC1)C1CCCCC1 | 1 |
| 129 | C(CCCCC1CCCCC1)CCCC1CC2CCCCC2C1 | 1 |
| 130 | C(CC1CCCC(CC2CC3CCCCC3CC2C2CCCCC2)C1)C1CCCCC1 | 1 |
| 131 | C(C1CCCC1)C1CC2CCCC2C1 | 1 |
| 132 | C(CC1CCC2CCCCC2C1)C1CCCC1 | 1 |
| 133 | C(CC1CC2CCCCC2C1)CC1CCC2CCCCC2C1 | 1 |
| 134 | C(CCC1CCCCC1)CC1CC2CCCCC2C1 | 1 |
| 135 | C(CC1CC2CCCC2C1)CC1CCCCC1 | 1 |
| 136 | C(CC1CCC2CCCCC2C1)C1CCC(CC2CCCC2)C1 | 1 |
| 137 | C(CCC1CCCCCC1)CC1CCCCC1 | 1 |
| 138 | C(CC1CCCCC1)C1CCC(C1C1CCCCC1)C1CCCCC1 | 1 |
| 139 | C1CCC(C1)C1CCC2C(CCC3CCCCC23)C1 | 1 |
| 140 | C(CC1CCCC1)CC1CCC(CCCC2CCCCC2)CC1 | 1 |
| 141 | C(CC1C2CC(CC12)C1CCCCC1)CC1CCCC1 | 1 |
| 142 | C(CC1CCCCC1C1CCCCC1)C1CCCC1CC1CCCC1 | 1 |
| 143 | C(C1CCCC1)C1CCCC1C1CC2CCCCC2C2CCCCC2C1 | 1 |
| 144 | C(CCCC1CC2CCCCC2C1)CCC(C1CCC2CCCCC2C1)C1CCC2CCCC1C2C1 | 1 |
| 145 | C(CCCC1CC2CCCCC2C1)CCC(CC1CCCCC1)CC1CCCCC1 | 1 |
| 146 | C(CCCC1CC2CCCCC2C1)CCCC1CCC2CCCCC2C1 | 1 |
| 147 | C(C1CCCC1)C1CC2CCCCC2C1 | 1 |
| 148 | C(CCCCC(CCCCC1CCCCC1)CC1CCCC1)CCCCC1CCCCC1 | 1 |
| 149 | C(CCCCCC1CCCC2CCCCC12)CCCCC1CCCCC1 | 1 |
| 150 | C(CCCCCC1CCCCC1)CCCCC1CCCC1 | 1 |
| 151 | C(C(CC1CCCCC1)C1CCC(CC2CC3CCCCC3C2)C1)C1CCCCC1 | 1 |
| 152 | C(CC1CCCC1)CC1CCC2CCCCC12 | 1 |
Abbreviation: SMILEs, simplified molecular-input line-entry system.
Summary of important structural fingerprints ranked by the MDGI
| Rank | Fingerprint | Structure | Fingerprint occurrence
| MDGI | |
|---|---|---|---|---|---|
| Actives | Inactives | ||||
| 1 | KRFP4541 |
| 98 | 43 | 9.197 |
| 2 | KRFP2428 |
| 84 | 56 | 3.610 |
| 3 | KRFP3668 |
| 15 | 2 | 2.312 |
| 4 | KRFP0610 |
| 362 | 1,685 | 2.227 |
| 5 | KRFP3616 |
| 16 | 10 | 1.813 |
| 6 | KRFP3405 |
| 31 | 332 | 1.563 |
| 7 | KRFP0223 |
| 3 | 134 | 1.400 |
| 8 | KRFP2650 |
| 5 | 0 | 1.119 |
| 9 | KRFP1945 |
| 9 | 1 | 1.021 |
| 10 | KRFP0018 |
| 7 | 29 | 0.727 |
| 11 | KRFP0605 |
| 284 | 1,182 | 0.588 |
| 12 | KRFP1144 |
| 5 | 80 | 0.587 |
| 13 | KRFP0566 |
| 55 | 84 | 0.581 |
| 14 | KRFP0344 |
| 301 | 935 | 0.572 |
| 15 | KRFP3025 |
| 458 | 1,874 | 0.511 |
| 16 | KRFP3561 |
| 2 | 58 | 0.407 |
| 17 | KRFP3713 |
| 29 | 247 | 0.391 |
| 18 | KRFP0496 |
| 0 | 49 | 0.382 |
| 19 | KRFP2200 |
| 2 | 0 | 0.381 |
| 20 | KRFP0621 |
| 246 | 776 | 0.341 |
| 21 | KRFP3152 |
| 1 | 62 | 0.320 |
| 22 | KRFP3966 |
| 7 | 13 | 0.302 |
| 23 | KRFP3081 |
| 0 | 70 | 0.278 |
| 24 | KRFP3920 |
| 5 | 9 | 0.214 |
| 25 | KRFP4261 |
| 3 | 73 | 0.185 |
| 26 | KRFP3369 |
| 189 | 789 | 0.161 |
| 27 | KRFP0677 |
| 236 | 1,026 | 0.155 |
| 28 | KRFP0508 |
| 1 | 1 | 0.137 |
| 29 | KRFP2264 |
| 85 | 167 | 0.131 |
| 30 | KRFP3602 |
| 4 | 52 | 0.123 |
Abbreviation: MDGI, mean decrease of Gini index.
Applying Lipinski’s rule of five on investigated data sets
| Data sets | Total | Actives | Inactives |
|---|---|---|---|
| DPP4-TRN | 2,339/2,609 (89.651%) | 1,961/2,075 (94.506%) | 478/534 (89.513%) |
| DPP4-TEST1 | 222/325 (68.308%) | ||
| DPP4-TEST2 | 215/325 (66.154%) | ||
| DPP4-TEST3 | 301/325 (92.615%) |
Note: Values shown are for compounds passing the Lipinski’s rule of five/in relation to the total number of compounds (values in parentheses are percentages passing the Lipinski’s rule of five).