| Literature DB >> 35350192 |
Tomasz Zieliński1, Johnny Hay2, Andrew Romanowski1, Anja Nenninger1, Alistair McCormick1, Andrew J Millar1.
Abstract
Practical delivery of Findable, Accessible, Reusable and Interoperable principles for research data management requires expertise, time resource, (meta)data standards and formats, software tools and public repositories. The Synthetic Biology Open Language (SBOL2) metadata standard enables FAIR sharing of the designs of synthetic biology constructs, notably in the repository of the SynBioHub platform. Large libraries of such constructs are increasingly easy to produce in practice, for example, in DNA foundries. However, manual curation of the equivalent libraries of designs remains cumbersome for a typical lab researcher, creating a barrier to data sharing. Here, we present a simple tool SynBio2Easy, which streamlines and automates operations on multiple Synthetic Biology Open Language (SBOL) designs using Microsoft Excel® tables as metadata inputs. The tool provides several utilities for manipulation of SBOL documents and interaction with SynBioHub: for example, generation of a library of plasmids based on an original design template, bulk deposition into SynBioHub, or annotation of existing SBOL component definitions with notes and authorship information. The tool was used to generate and deposit a collection of 3661 cyanobacterium Synechocystis plasmids into the public SynBioHub repository. In the process of developing the software and uploading these data, we evaluated some aspects of the SynBioHub platform and SBOL ecosystem, and we discuss proposals for improvement that could benefit the user community. With software such as SynBio2Easy, we aim to deliver a user-driven tooling to make FAIR a reality at all stages of the project lifecycle in synthetic biology research. Graphical Abstract.Entities:
Keywords: SBOL; SynBioHub; research data management; software tools; spreadsheet; synthetic biology
Year: 2022 PMID: 35350192 PMCID: PMC8944294 DOI: 10.1093/synbio/ysac002
Source DB: PubMed Journal: Synth Biol (Oxf) ISSN: 2397-7000
Figure 1.SynBio2Easy Command Line Interface with a guided prompt.
Available operations of SynBio2Easy
| Command | Behavior and example use case |
|---|---|
| GENERATE | Generates a series of designs based on an SBOL template and ‘concrete’ instance parameters (including sub-components’ sequences) specified in an Excel table |
| ANNOTATE | Adds information (e.g. descriptions, authors) to multiple component definitions in an SBOL document using details defined in an Excel table |
| FLATTEN | Converts a tree of SBOL sub-components in a design into a ‘flattened’ component definition with an annotated linear sequence |
| DEPOSIT | Deposits files from a folder(s) into SynBioHub collection(s) |
| UPDATE | Adds information (e.g. notes) as well as attachment files to multiple records in SynBioHub using details defined in an Excel table. Unlike ANNOTATE it is an online operation on a server. |
| CLEAN | Removes annotations and namespaces specific to SynBioHub from an SBOL document, so it can be re-uploaded to SynBioHub |
| SYNBIO2TABLE | Retrieves identity details of all members of a collection and saves them to an Excel file with headings for metadata columns supported by SynBio2Easy |
Figure 4.Library definition file (top), for each row a new ComponentDefinition is generated. The metadata are provided in the corresponding columns (e.g. ‘author’), and field values can be dynamically calculated using excel formulas and simple templating. The example of a generated design after it is uploaded to SynBioHub (bottom). Arrows link excel cells to corresponding fields in the SynBioHub record.