| Literature DB >> 30321422 |
Thomas Duigou1, Melchior du Lac1, Pablo Carbonell2, Jean-Loup Faulon1,2,3.
Abstract
RetroRules is a database of reaction rules for metabolic engineering (https://retrorules.org). Reaction rules are generic descriptions of chemical reactions that can be used in retrosynthesis workflows in order to enumerate all possible biosynthetic routes connecting a target molecule to its precursors. The use of such rules is becoming increasingly important in the context of synthetic biology applied to de novo pathway discovery and in systems biology to discover underground metabolism due to enzyme promiscuity. Here, we provide for the first time a complete set containing >400 000 stereochemistry-aware reaction rules extracted from public databases and expressed in the community-standard SMARTS (SMIRKS) format, augmented by a rule representation at different levels of specificity (the atomic environment around the reaction center). Such numerous representations of reactions expand natural chemical diversity by predicting de novo reactions of promiscuous enzymes.Entities:
Mesh:
Year: 2019 PMID: 30321422 PMCID: PMC6323975 DOI: 10.1093/nar/gky940
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
RetroRules database content regarding the number of distinct reaction rules available at each diameter and depending whether or not stereochemistry is expressed in the reaction rules. The ‘All reaction rules’ column is the number of unique reaction rules from the two datasets
| Diameter | Number of distinct reaction rules | ||
|---|---|---|---|
| Dataset not expressing stereochemistry | Dataset expressing stereochemistry | All reaction rules | |
| 2 | 18 607 | 23 391 | 39 016 |
| 4 | 24 241 | 28 906 | 49 933 |
| 6 | 27 686 | 31 734 | 56 544 |
| 8 | 29 276 | 33 128 | 59 770 |
| 10 | 30 158 | 33 944 | 61 627 |
| 12 | 30 762 | 34 519 | 62 854 |
| 14 | 31 219 | 34 932 | 63 786 |
| 16 | 31 559 | 35 258 | 64 453 |
Figure 1.SQL schema used by RetroRules. The tables in white are the parsed meta-information from various datasets and their cross-references to others databases. The reactions and chemical_species tables are linked together through the bridge tables reaction_products and reaction_substrates. In blue are the tables associated with reaction rules. The rules table uniquely identifies a given rule with single reaction_id and substrate_id at a given diameter. The isStereo parameter determines if the rule describes the stereochemistry of the reaction or not. Each rule is associated with a single SMARTS and SMILES description of that reaction, and to link the two, a smarts_smiles table assures that one is able to easily find the corresponding SMILES description of a SMARTS and vice-versa. The rule_products table describes the products of a given rule. Lastly in yellow are the tables that were used to generate the score (also in yellow in the rules table) from the rules table (see the Reaction rule scoring section).
Figure 3.Reaction rules querying process from the RetroRules web interface using as entry point EC number 5.5.1.6 (chalcone–flavanone isomerase) and minimum diameter d = 6. (A) Home page is the go-to page to start a search for specific reactions and associated rules using EC number, reaction ID, substrate name of structure query. (B) Reaction rules matching queries are ranked according to their score in tables that may be exported as CSV or JSON for downstream programmatic analysis. (C) Each rule can be further investigated from contextual information such as associated reactions and enzyme sequences using the single reaction rule view.
Figure 2.Calculation of reaction rules for EC 2.6.1.5. (A) The atom mapping highlights reacting atoms labelled 8, 11, 16 and 19. (B) Calculation of the EC 2.6.1.5 rules for diameters D = ∞, 4 and 0 modeling promiscuity on substrate 4-hydroxyphenylpyruvate. (C) Calculation of the EC 2.6.1.5 rules for diameters D = ∞, 4 and 0 modelling promiscuity on substrate l-glutamate.