| Literature DB >> 30700728 |
Modest von Korff1, Thomas Sander2.
Abstract
Molecular complexity is an important characteristic of organic molecules for drug discovery. How to calculate molecular complexity has been discussed in the scientific literature for decades. It was known from early on that the numbers of substructures that can be cut out of a molecular graph are of importance for this task. However, it was never realized that the cut-out substructures show self-similarity to the parent structures. A successive removal of one bond and one atom returns a series of fragments with decreasing size. Such a series shows self-similarity similar to fractal objects. Here we used the number of distinct fragments to calculate the fractal dimension of the molecule. The fractal dimension of a molecule is a new matter constant that incorporates all features that are currently known to be important for describing molecular complexity. Furthermore, this is the first work that reveals the fractal nature of organic molecules.Entities:
Year: 2019 PMID: 30700728 PMCID: PMC6353876 DOI: 10.1038/s41598-018-37253-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1n-Hexane and four distinct alkanes obtained by successively removing one bond and one atom, followed by saturation with hydrogen.
Figure 2Heptanoic acid and its distinct substructures. Grouped by bond counts.
Figure 3Glucose and some of its distinct subgraphs. Subgraphs A and B are shown twice in both bent and elongated forms.
Figure 4Koch curve.
Figure 5n-Hexane in line notation, covered by boxes for the box counting algorithm.
Box counting algorithm applied to n-hexane in Fig. 5.
| Side length square ε | Number of squares N | log(N)/log(1/ε) |
|---|---|---|
| 1 | 1 | Not defined |
| 0.5 | 2 | 1 |
| 0.25 | 4 | 1 |
| 0.125 | 8 | 1 |
Figure 6The number of distinct subgraphs (N) for five example organic molecules versus the bond count of the subgraphs (γ). The numbers of distinct subgraphs on the y-axis are given in logarithmic scale.
The maximum number of distinct fragments (Nmax), the corresponding bond count (γmax), and the fractal dimension (dim(M)) of five example molecules.
| Name | Structure | Nmax | γmax | dim(M) |
|---|---|---|---|---|
| Adamantane |
| 11 | 8 | 1.2 |
| Anandamide |
| 14 | 5 | 1.6 |
| Glucose |
| 89 | 8 | 2.2 |
| PubChemCS |
| 45,973 | 22 | 3.5 |
| Strychnine |
| 2,022,462 | 21 | 4.8 |