| Literature DB >> 26425437 |
Christina M Powers1, Karmann A Mills2, Stephanie A Morris3, Fred Klaessig4, Sharon Gaheen5, Nastassja Lewinski6, Christine Ogilvie Hendren7.
Abstract
There is a critical opportunity in the field of nanoscience to compare and integrate information across diverse fields of study through informatics (i.e., nanoinformatics). This paper is one in a series of articles on the data curation process in nanoinformatics (nanocuration). Other articles in this series discuss key aspects of nanocuration (temporal metadata, data completeness, database integration), while the focus of this article is on the nanocuration workflow, or the process of identifying, inputting, and reviewing nanomaterial data in a data repository. In particular, the article discusses: 1) the rationale and importance of a defined workflow in nanocuration, 2) the influence of organizational goals or purpose on the workflow, 3) established workflow practices in other fields, 4) current workflow practices in nanocuration, 5) key challenges for workflows in emerging fields like nanomaterials, 6) examples to make these challenges more tangible, and 7) recommendations to address the identified challenges. Throughout the article, there is an emphasis on illustrating key concepts and current practices in the field. Data on current practices in the field are from a group of stakeholders active in nanocuration. In general, the development of workflows for nanocuration is nascent, with few individuals formally trained in data curation or utilizing available nanocuration resources (e.g., ISA-TAB-Nano). Additional emphasis on the potential benefits of cultivating nanomaterial data via nanocuration processes (e.g., capability to analyze data from across research groups) and providing nanocuration resources (e.g., training) will likely prove crucial for the wider application of nanocuration workflows in the scientific community.Entities:
Keywords: curation; informatics; nanoinformatics; nanomaterials; workflows
Year: 2015 PMID: 26425437 PMCID: PMC4578434 DOI: 10.3762/bjnano.6.189
Source DB: PubMed Journal: Beilstein J Nanotechnol ISSN: 2190-4286 Impact factor: 3.649
Figure 1Common steps in nanocuration. The steps commonly included in nanocuration workflows are illustrated, including: 1) Identification of publications relative to the intended scientific purpose; 2) Preliminary assessment of data quality and completeness of selected in-house or publication data for data quality and completeness (with assumption that any in-house data would be pre-identified within a project prior to the wider publication search referred to in Step 1); 3) Data extraction of raw data and/or data from the publication; 4) Communication with publication authors; 5) Curation of data into the intended repository and/or data format (e.g., ISA-TAB-Nano) leveraging common data elements (CDEs) from relevant ontological resources (e.g., NanoParticle Ontology [NPO]); 6) Review of curated data for data quality and completeness; 7) Release of curated data; 8) Update of curated data as additional information is received from the authors. Though shown here in linear fashion, the order of these common steps for an individual process may be flexible and iteration is expected. The specific steps in a workflow may also differ across repositories depending on the intended purpose of the nanomaterial resource.
Figure 2Stakeholder responses regarding sourcing Data. Stakeholder responses to questions related to sourcing nanomaterial data in a workflow for a data repository. Full text of stakeholder responses is available in Supporting Information File 1.
Figure 3Stakeholder responses regarding data entry and review. Stakeholder responses to questions related to entering and reviewing data in a workflow. Full text of stakeholder responses is available in Supporting Information File 1.
Figure 4Stakeholder responses regarding creation and revision. Stakeholder responses to questions related to creating and revising a written workflow. Full text of stakeholder responses is available in the Supporting Information File 1.
Figure 5Stakeholder responses regarding working with other organizations. Stakeholder responses to questions related to interacting with other organizations to develop a workflow or populate a data repository. In each panel, the response categories (e.g., “yes”, “no”, “N/A”) for each question are provided in the legend. Questions are listed on the x-axis and the number of stakeholders responding in each category is on the y-axis. Full text of stakeholder responses is available in Supporting Information File 1.
Figure 6Scientific and nanomaterial curation challenges. Nanomaterial curation challenges expand on curation challenges inherent in general scientific curation.