| Literature DB >> 30143048 |
Carlos Vazquez-Hernandez1, Antonio Loza1, Rosa-Maria Gutierrez-Rios2.
Abstract
OBJECTIVES: Improvements in bioinformatics applications for the enzyme identification of biochemical reactions, enzyme classifications, mining for specific inhibitors and pathfinding require the accurate computational detection of reaction similarity. We provide a set of substrate-product pairs, clustered by reactions that share similar chemical transformation patterns, for which accuracy was calculated, comparing this set with manually curated data sets. DATA DESCRIPTION: The data were analyzed by a new method that naturally split each reaction into compound pairs and loner compounds, which we called architectures (Vazquez-Hernandez et al. in BMC Syst Biol 12:63, 2018). The data include a set of 7491 curated reactions from the KEGG-Ligand data set. The data are presented in two formats, a string format and a tree structure, both of which reflect the splitting process and the final architectures of each reaction. We are also reporting sets of reactions that show similar splitting patterns naturally grouped into clusters of tree structures. The compound pairs in each cluster were compared with the reactant pairs proposed by the KEGG-RCLASS data set, and a match precision value is also provided. These data were collected with the aim of providing research with a confident set of reactant pairs that is useful for selecting between alternative substrate-product pairs predicted by pathfinders.Entities:
Keywords: Compound pairs; Metabolic reaction; Reactant pairs; Reaction patterns
Mesh:
Substances:
Year: 2018 PMID: 30143048 PMCID: PMC6109353 DOI: 10.1186/s13104-018-3724-8
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Overview of the data files
| Label | Name of data file/data set | File types (file extension) | Data repository identifier (DOI) |
|---|---|---|---|
| Data file1 | Compound pairs with a precision value [ | Text file (.txt) | Figshare |
| Data file 2 | Compound pairs without a precision value [ | Text file (.txt) | Figshare |
| Data file 3 | Reaction splitting using the balance rule [ | Text file (.txt) | Figshare |
| Data file 4 | Reaction splitting using the count rule [ | Text file (.txt) | Figshare |
| Data file 5 | Reaction splitting using the both rules [ | Text file (.txt) | Figshare |
| Data file 6 | RPAIR/RCLASS [ | Text file (.txt) | Figshare |
| Data file 7 | reaCTS software [ | Perl library (.pm) | Figshare |
| Data file 8 | CurateKEGG [ | Perl library (.pm) | Figshare |