| Literature DB >> 29556520 |
Mathilde Koch1, Amir Pandi1, Baudoin Delépine1,2,3, Jean-Loup Faulon1,2,3,4.
Abstract
The aim of this dataset is to identify and collect compounds that are known for being detectable by a living cell, through the action of a genetically encoded biosensor and is centred on bacterial transcription factors. Such a dataset should open the possibility to consider a wide range of applications in synthetic biology. The reader will find in this dataset the name of the compounds, their InChI (molecular structure), the publication where the detection was reported, the organism in which this was detected or engineered, the type of detection and experiment that was performed as well as the name of the biosensor. A comment field is also provided that explains why the compound was included in the dataset, based on quotes from the reference publication or the database it was extracted from. Manual curation of ACS Synthetic Biology abstracts (Volumes 1 to 6 and Volume 7 issue 1) was performed as well as extraction from the following databases: Bionemo v6.0 (Carbajosa et al., 2009) [1], RegTransbase r20120406 (Cipriano et al., 2013) [2], RegulonDB v9.0 (Gama-Castro et al., 2016) [3], RegPrecise v4.0 (Novichkov et al., 2013) [4] and Sigmol v20180122 (Rajput et al., 2016) [5].Entities:
Year: 2018 PMID: 29556520 PMCID: PMC5854866 DOI: 10.1016/j.dib.2018.02.061
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Contribution of each data source.
| RegPrecise | 136 | 418 | 73 |
| BioNemo | 5 | 499 | 8 |
| RegTransBase | 683 | 2057 | 63 |
| RegulonDB | 12 | 245 | 23 |
| Sigmol | 2 | 175 | 135 |
| ACS Synthetic Biology | 44 | 287 | 73 |
The first column contains the data source, the second column the number of compounds found without a structure in that source, the third column the number of compounds with a structure (InChI) and the last column the number of compounds with a structure found only in that source.
Fig. 1Type of experiment and biosensor type in the full dataset and the manually curated dataset. A: Full dataset – detection method. B: Full dataset – biosensor type. C: ACS dataset – detection method. D: ACS dataset – biosensor type. A and C: other in detection method corresponds to in silico, in vivo and cell-free detections. C and D: ACS dataset is the dataset obtained from manual curation of ACS Synthetic Biology with compounds that have available structures.
| Subject area | |
| More specific subject area | |
| Type of data | |
| How data was acquired | |
| Data format | |
| Experimental factors | |
| Experimental features | |
| Data source location | |
| Data accessibility |