| Literature DB >> 31387220 |
Abstract
Fragment-based drug discovery (FBDD) has become a major strategy to derive novel lead candidates for various therapeutic targets, as it promises efficient exploration of chemical space by employing fragment-sized (MW < 300) compounds. One of the first challenges in implementing a FBDD approach is the design of a fragment library, and more specifically, the choice of its size and individual members. A diverse set of fragments is required to maximize the chances of discovering novel hit compounds. However, the exact diversity of a certain collection of fragments remains underdefined, which hinders direct comparisons among different selections of fragments. Based on structural fingerprints, we herein introduced quantitative metrics for the structural diversity of fragment libraries. Structures of commercially available fragments were retrieved from the ZINC database, from which libraries with sizes ranging from 100 to 100,000 compounds were selected. The selected libraries were evaluated and compared quantitatively, resulting in interesting size-diversity relationships. Our results demonstrated that while library size does matter for its diversity, there exists an optimal size for structural diversity. It is also suggested that such quantitative measures can guide the design of diverse fragment libraries under different circumstances.Entities:
Keywords: diversity; fragment-based drug discovery; library design; library size
Year: 2019 PMID: 31387220 PMCID: PMC6696339 DOI: 10.3390/molecules24152838
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Structural diversity vs size of fragment libraries, with the former measured by: (a) Average of the similarity of each compound to its closest neighbor; (b) total number of unique fingerprints (richness); (c) true diversity calculated by equation 1. Dash curves are generated from cubic spline fitting. Metrics for random selections are average values of triplicates (Table S2).
Figure 2Efficiency in adding diversity: (a) average number of unique fingerprints (richness) per compound; (b) average value of true diversity per compound. Metrics for random selections are average values of triplicates.
Library sizes (diversity-based selection) required to achieve certain values of structural diversity.
| Structural Diversity (Value) | Minimum Size (Ratio of Total 227,787 Fragments) 1 |
|---|---|
| 5% total richness 2 (33,834) | 1,715 (0.75%) |
| 10% total richness 2 (67,669) | 4,103 (1.80%) |
| Overall true diversity (6,662.4) | 2,052 (0.90%) |
| Maximum true diversity 1 (9,097.6) | 17,666 (7.76%) |
1 Values are estimated by cubic spline fitting with 99,901 segments; 2 Total richness (number of unique fingerprints) is 676,686.
Figure 3Structural diversity vs size of fluorinated fragment libraries, with the former measured by: (a) Average of the similarity of each compound to its closest neighbor; (b) total number of unique fingerprints (richness); (c) true diversity calculated by equation 1. Dash curves are generated from cubic spline fitting. Metrics for random selections are average values of triplicates (Table S2).
Figure 4Efficiency in adding diversity: (a) average number of unique fingerprints (richness) per fluorinated compound; (b) average value of true diversity per fluorinated compound. Metrics for random selections are average values of triplicates.
Fluorinated library sizes (diversity-based selection) required to achieve certain values of structural diversity.
| Structural Diversity (Value) | Minimum Size (Ratio of Total 47,708 Fluorinated Fragments) 1 |
|---|---|
| 5% total richness 2 (8,992) | 675 (1.41%) |
| 10% total richness 2 (17,983) | 1,616 (3.39%) |
| Overall true diversity (3,621.9) | 1,203 (2.52%) |
| Maximum true diversity 1 (4,485.5) | 7,483 (15.69%) |
1 Values are estimated by cubic spline fitting with 19,901 segments; 2 Total richness (number of unique fingerprints) is 179,833.