| Literature DB >> 29086180 |
Philipp-Maximilian Jacob1, Tian Lan1, Jonathan M Goodman2, Alexei A Lapkin3.
Abstract
The algorithmic, large-scale use and analysis of reaction databases such as Reaxys is currently hindered by the absence of widely adopted standards for publishing reaction data in machine readable formats. Crucial data such as yields of all products or stoichiometry are frequently not explicitly stated in the published papers and, hence, not reported in the database entry for those reactions, limiting their usefulness for algorithmic analysis. This paper presents a possible extension to the IUPAC RInChI standard via an auxiliary layer, termed ProcAuxInfo, which is a standardised, extensible form in which to report certain key reaction parameters such as declaration of all products and reactants as well as auxiliaries known in the reaction, reaction stoichiometry, amounts of substances used, conversion, yield and operating conditions. The standard is demonstrated via creation of the RInChI including the ProcAuxInfo layer based on three published reactions and demonstrates accurate data recoverability via reverse translation of the created strings. Implementation of this or another method of reporting process data by the publishing community would ensure that databases, such as Reaxys, would be able to abstract crucial data for big data analysis of their contents.Entities:
Year: 2017 PMID: 29086180 PMCID: PMC5388667 DOI: 10.1186/s13321-017-0210-6
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Analysis of reaction data content in Reaxys, based on a sample set of 15.4 million reactions
| Property | Percentage of reactions with value for property |
|---|---|
| Yield | 46.0 |
| Temperature | 46.1 |
| Pressure | 1.6 |
| pH-value | 1.0 |
| Reaction time | 48.4 |
| Solvent ID | 70.9 |
| Reagent ID | 67.8 |
| Catalyst ID | 4.3 |
Fig. 1Plot of the number of records added to Reaxys in a given year and their information content when analysing a fixed set of properties. The “# of Records” line is plotted against the right-hand side y-axis
Scheme 1A case study of C–H activation reaction
The amounts of substances fed in the example 1
| Compound | Amount fed (mol s−1) |
|---|---|
| 3,3,5,5-Tetramethylmorpholin-2-one | 8.3 × 10−7 |
| (Diacetoxyiodo)benzene | 8.3 × 10−7 |
| Acetic acid | 8.3 × 10−6 |
| Palladium(II) acetate | 4.2 × 10−9 |
| Acetic anhydride | 1.7 × 10−6 |
| Toluene | 1.5 × 10−4 |
Conditions of reaction 1
| Property | Value |
|---|---|
| Reaction temperature | 393 K |
| Reaction pressure | 6 × 106 Pa |
| Yield | 0.90 |
Residence time: conversion pairs for the reaction 1
| Residence time (s) | Conversion |
|---|---|
| 60 | 0.06 |
| 120 | 0.14 |
| 180 | 0.20 |
| 240 | 0.32 |
| 300 | 0.40 |
| 360 | 0.52 |
| 420 | 0.70 |
| 480 | 0.90 |
| 540 | 1.00 |
| 600 | 1.00 |
Scheme 2A case study of benzalcohol oxidation
Amounts of substances fed into the reaction 2
| Compound | Amount fed |
|---|---|
| Benzyl alcohol | 3.3 × 10−5 mol s−1 |
| Toluene | 3.1 × 10−4 mol s−1 |
| Oxygen | 4.9 × 10−6 mol s−1 |
| Ruthenium | 9 × 10−3 g |
| Aluminium oxide | 0.991 g |
Conditions of the reaction 2
| Property | Value |
|---|---|
| Reaction temperature | 388 K |
| Reaction pressure | 8 × 106 Pa |
| Yield | 0.25 |
| Reactor volume | 9 × 10−4 m3 |
Residence time: conversion pairs for the reaction 2
| Residence time (s) | Conversion |
|---|---|
| 9 | 0.25 |
Scheme 3An example of a Suzuki–Miyaura reaction
Amounts of substances fed into the reaction 3
| Compound | Amount fed |
|---|---|
| Phenylboronic acid | 1.1 × 10−3 mol |
| 4-bromotoluene | 1.0 × 10−3 mol |
| Phosphine ligand | 2.2 × 10−5 mol |
|
| 3.1 × 10−2 mol |
| Palladium(II) acetylacetonate | 2.2 × 10−5 mol |
| Sodium carbonate | Not reported |
Conditions of the reaction 3
| Property | Value |
|---|---|
| Reaction temperature | 363 K |
| Reaction pressure | Not reported |
| Yield | 0.89 |
| Reactor volume | Not reported |
Residence time: conversion pairs for the reaction 3
| Residence time (s) | Conversion |
|---|---|
| . | . |
Reverse lookup results for the reaction 1 species
| Group | Species |
|---|---|
| Reactants | (Diacetoxyiodo)benzene |
| 3,3,5,5-Tetramethyl-2-morpholinone | |
| Products | Acetic acid |
| Iodobenzene | |
| n/a | |
| Auxiliaries | Palladium(II) acetate |
| Acetic acid | |
| Acetic anhydride | |
| Toluene |
Reverse lookup results for the reaction 2 species
| Group | Species |
|---|---|
| Products | Benzaldehyde |
| Water | |
| Reactants | Benzyl alcohol |
| Molecular oxygen | |
| Auxiliaries | Aluminum oxide |
| Toluene | |
| Ruthenium |
Reverse lookup results for the reaction 3 species
| Group | Species |
|---|---|
| Products | 4-Phenyltoluene |
| Unknown | |
| Reactants | Phenylboronic acid |
| 4-Bromotoluene | |
| Auxiliaries | Palladium acetylacetonate |
| Unknown | |
|
| |
| Sodium carbonate |