| Literature DB >> 33177514 |
Hitesh Patel1, Wolf-Dietrich Ihlenfeldt2, Philip N Judson3, Yurii S Moroz4, Yuri Pevzner1,5, Megan L Peach6, Victorien Delannée1, Nadya I Tarasova7, Marc C Nicklaus8.
Abstract
We have made available a database of over 1 billion compounds predicted to be easily synthesizable, called Synthetically Accessible Virtual Inventory (SAVI). They have been created by a set of transforms based on an adaptation and extension of the CHMTRN/PATRAEntities:
Year: 2020 PMID: 33177514 PMCID: PMC7658252 DOI: 10.1038/s41597-020-00727-4
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Transforms initially chosen from existing LHASA knowledgebase.
| ID | Name | Ring Forming |
|---|---|---|
| Paal-Knorr Pyrrole Synthesis | Yes | |
| Feist Synthesis of Pyrroles | Yes | |
| Hantzsch Thiazole Synthesis | Yes | |
| Allene 2 + 2 Cycloaddition | Yes | |
| Pyrazoles from Beta Carbonyl Carboxylic Acid Derivatives | Yes | |
| Fused Arylpyridines via o-Aminocarbonyls | Yes | |
| Tetrazoles from Azide and Nitriles | Yes | |
| Phthalazin-1-ones from 2-Acylbenzoic Acids | Yes | |
| Fused Aryl(2,3-H/R)Pyridines (Pictet-Spengler) | Yes | |
| Sonogashira Coupling | No | |
| Kabbe Synthesis of 4-Chromanones | Yes | |
| Benzazepin-2-ones by Pictet-Spengler Reaction | Yes | |
| Benzo[b]furans from 2-Hydroxyphenyl Acetylenes | Yes |
Newly developed transforms.
| ID | Name | Ring Forming |
|---|---|---|
| Copper[I]-catalyzed azide-alkyne cycloaddition | Yes | |
| Buchwald-Hartwig Ether Formation | No | |
| Suzuki-Miyaura Cross-Coupling (Bromo) | No | |
| Suzuki-Miyaura Cross-Coupling (Iodo) | No | |
| Suzuki-Miyaura Cross-Coupling (Chloro) | No | |
| Suzuki-Miyaura Cross-Coupling with Alkene | No | |
| Suzuki-Miyaura Cross-Coupling of Alkenes | No | |
| Hiyama Aryl-Alkenyl Cross-Coupling | No | |
| Hiyama Non-Aromatic Cross-Coupling | No | |
| Hiyama Allyl Cross-Coupling | No | |
| Hiyama Carbonylative Cross-Coupling | No | |
| Hiyama Cross-Coupling with Arylhydrazine | No | |
| Liebeskind-Srogl Thioamide Coupling | No | |
| Liebeskind-Srogl Nitrile Formation | No | |
| Liebeskind-Srogl Heterocyclic Coupling | No | |
| Sulfonamide Schotten-Baumann | No | |
| Sulfonamide Schotten-Baumann from Sulfonate | No | |
| Sulfonamide Schotten-Baumann from Thiol | No | |
| Sulfonamide Schotten-Baumann from Aryl Bromide | No | |
| Mitsunobu Reaction | No | |
| Mitsunobu carbon-carbon bond formation | No | |
| Mitsunobu SN2’ Reaction | No | |
| Mitsunobu Imide Reaction | No | |
| Mitsunobu Aryl Ether Formation | No | |
| Mitsunobu Sulfonamide Reaction | No | |
| Ester or Amide or Thiolester Formation | No | |
| Williamson Ether Synthesis | No | |
| Buchwald-Hartwig Reaction | No | |
| Buchwald-Hartwig Reaction | No | |
| Benzimidazoles from o-Phenylenediamines | Yes | |
| Acylsulfonamide from Sulfonamide and Carboxylic Acid | No | |
| Benzimidazoles from o-Phenylenediamines and Aldehydes | Yes | |
| Benzimidazoles from o-Phenylenediamines and Aldehydes | Yes | |
| Sulfonamide from sulfonic acid and amine | No | |
| Sulfonamide alkylation with a cyclic ether | No | |
| Sulfonamide acylation | No | |
| Wittig Reaction | No | |
| Wittig via Methoxy-Ylide | No | |
| Horner-Wadsworth-Emmons Olefination | No | |
| Chan-Lam coupling | No |
Fig. 1SAVI workflow describing adaptation of retrosynthetic transforms for forward synthesis.
Reactants found and reactant pairs generated for each transform.
| ID | Name | R1 | R2 | Pairs |
|---|---|---|---|---|
| Paal-Knorr Pyrrole Synthesis | 38,317 | 4 | 153,268 | |
| Feist Synthesis of Pyrroles | 167 | 55 | 9,185 | |
| Hantzsch Thiazole Synthesis | 505 | 920 | 464,600 | |
| Allene 2 + 2 Cycloaddition | 17 | 7,792 | 132,464 | |
| Pyrazoles from Beta Carbonyl Carboxylic Acid Derivatives | 33 | 1,691 | 55,803 | |
| Fused Arylpyridines via o-Aminocarbonyls | 17,257 | 218 | 3,762,026 | |
| Tetrazoles from Azide and Nitriles | 7,089 | 1 | 7,089 | |
| Phthalazin-1-ones from 2-Acylbenzoic Acids | 55 | 1,690 | 92,950 | |
| Fused Aryl(2,3-H/R)Pyridines (Pictet-Spengler) | 1,309 | 14,095 | 18,450,355 | |
| Sonogashira Coupling | 1,200 | 22,839 | 27,406,800 | |
| Kabbe Synthesis of 4-Chromanones | 5,750 | 35 | 201,250 | |
| Benzazepin-2-ones by Pictet-Spengler Reaction | 14 | 5,092 | 71,288 | |
| Benzo[b]furans from 2-Hydroxyphenyl Acetylenes | 12 | 314 | 3,768 | |
| Copper[I]-catalyzed azide-alkyne cycloaddition | 960 | 1,646 | 1,580,160 | |
| Buchwald-Hartwig Ether Formation | 14,834 | 6,519 | 96,702,846 | |
| Suzuki-Miyaura Cross-Coupling (Bromo) | 10,857 | 543 | 5,895,351 | |
| Suzuki-Miyaura Cross-Coupling (Iodo) | 1,490 | 543 | 809,070 | |
| Suzuki-Miyaura Cross-Coupling (Chloro) | 13,115 | 534 | 7,003,410 | |
| Suzuki-Miyaura Cross-Coupling with Alkene | 543 | 91 | 49,413 | |
| Suzuki-Miyaura Cross-Coupling of Alkenes | 88 | 5,150 | 453,200 | |
| Hiyama Aryl-Alkenyl Cross-Coupling | 1,491 | 2 | 2,982 | |
| Hiyama Non-Aromatic Cross-Coupling | 76 | 151 | 11,476 | |
| Hiyama Allyl Cross-Coupling | 2 | 80 | 160 | |
| Hiyama Carbonylative Cross-Coupling | 12,039 | 2 | 24,078 | |
| Hiyama Cross-Coupling with Arylhydrazine | 553 | 2 | 1,106 | |
| Liebeskind-Srogl Thioamide Coupling | 164 | 543 | 89,052 | |
| Liebeskind-Srogl Nitrile Formation | 1 | 583 | 583 | |
| Liebeskind-Srogl Heterocyclic Coupling | 330 | 539 | 177,870 | |
| Sulfonamide Schotten-Baumann | 1,981 | 62,784 | 124,375,104 | |
| Sulfonamide Schotten-Baumann from Sulfonate | 62,994 | 108 | 6,803,352 | |
| Sulfonamide Schotten-Baumann from Thiol | 62,491 | 2,313 | 144,541,683 | |
| Sulfonamide Schotten-Baumann from Aryl Bromide | 59,213 | 7,159 | 423,905,867 | |
| Mitsunobu Reaction | 28,743 | 5,860 | 168,433,980 | |
| Mitsunobu carbon-carbon bond formation | 11,858 | 18 | 213,444 | |
| Mitsunobu SN2’ Reaction | 29,672 | 3 | 89,016 | |
| Mitsunobu Imide Reaction | 6,073 | 6,146 | 37,324,658 | |
| Mitsunobu Aryl Ether Formation | 11,848 | 4,631 | 54,868,088 | |
| Mitsunobu Sulfonamide Reaction | 10,317 | 2,130 | 21,975,210 | |
| Ester or Amide or Thiolester Formation | 38,104 | 21,136 | 805,366,144 | |
| Williamson Ether Synthesis | 14,159 | 24,371 | 345,068,989 | |
| Buchwald-Hartwig Reaction - Amines | 17,755 | 36,712 | 651,821,560 | |
| Buchwald-Hartwig Reaction - Sulfonamides | 10,619 | 3,516 | 37,336,404 | |
| Benzimidazoles from o-Phenylenediamines | 190 | 29,659 | 5,635,210 | |
| Acylsulfonamide from Sulfonamide and Carboxylic Acid | 1,690 | 29,508 | 49,868,520 | |
| Benzimidazoles from o-Phenylenediamines and Aldehydes - Iodine | 346 | 5,092 | 1,761,832 | |
| Benzimidazoles from o-Phenylenediamines and Aldehydes - Boronic Acid | 142 | 5,092 | 723,064 | |
| Sulfonamide from sulfonic acid and amine | 62,983 | 108 | 6,802,164 | |
| Sulfonamide alkylation with a cyclic ether | 1,475 | 7,500 | 11,062,500 | |
| Sulfonamide acylation | 3,930 | 314 | 1,234,020 | |
| Wittig Reaction | 8,435 | 58,616 | 494,425,960 | |
| Wittig via Methoxy-Ylide | 12 | 14,175 | 170,100 | |
| Horner-Wadsworth-Emmons Olefination | 5,070 | 7 | 35,490 | |
| Chan-Lam coupling | 521 | 58,891 | 30,682,211 | |
| 3,588,136,173 |
Percentage of total SAVI products and unique molecules saved per scoring class.
| Class | SAVI products | Unique within the class | Percentage of total SAVI products |
|---|---|---|---|
| 1,094,782,440 | 976,051,945 | 62.61% | |
| 609,262 | 579,532 | 0.03% | |
| 54,775,204 | 48,036,148 | 3.13% | |
| 82,180,372 | 80,366,188 | 4.7% | |
| 516,116,725 | 457,508,945 | 29.52% | |
| 1,748,464,003 | 1,526,316,392(a) | 100% |
(a)The unique-structure numbers for the individual classes do not add up to the unique structures for all classes combined since some products are present in more than one class.
Fig. 2Reaction success rate (percentage of saved reactions out of tested reactant pairs). (Counts were adjusted for duplication in products due to alkene reactivity at both ends of the bond (ID 6009) or tautomerism (IDs 7005, 7013, 7014)).
Overlap of SAVI with other large databases.
| Database | Access date | Database size | Overlap with SAVI |
|---|---|---|---|
| REAL[ | February-2020 | ~1.2 B | 142,806,769 |
| iRL 2017Q4[ | December-2017 | ~132 M | 10,777,739 |
| PubChem[ | February-2020 | ~102 M | 5,390,125 |
| SAVI BBs | December-2019 | ~152 K | 34,241 |
Ring systems overlap of SAVI with other large databases.
| Database | Access date | Database size | No. of unique ring systems | Overlap with SAVI |
|---|---|---|---|---|
| REAL[ | February-2020 | ~1.2 B | 3,389 | 2,145 |
| iRL 2017Q4[ | December-2017 | ~132 M | 56,144 | 2,883 |
| PubChem[ | February-2020 | ~102 M | 521,946 | 3,295 |
Fig. 3Distributions of drug-design relevant properties calculated for the Plus subset of SAVI (a) Molecular weight. (b) XlogP2[94]. (c) Total Polar Surface Area (2). (d) Fraction of sp3 hybridized carbons. (e) Number of rotatable bonds. (f) QED (Quantitative Estimate of Druglikeness) score[71]. (g) PAINS (Pan Assay Interference Compounds) counts. (h) Bruns & Watson demerits for Identifying Potentially Reactive or Promiscuous Compounds[66].
| Measurement(s) | synthetic accessibility of small molecules • small molecule • Compound |
| Technology Type(s) | computational modeling technique • in silico |
| Factor Type(s) | chemical structure |