Literature DB >> 35965468

FAIR data pipeline: provenance-driven data management for traceable scientific workflows.

Sonia Natalie Mitchell1,2, Andrew Lahiff3, Nathan Cummings3, Jonathan Hollocombe3, Bram Boskamp4, Ryan Field5, Dennis Reddyhoff6, Kristian Zarebski3, Antony Wilson7, Bruno Viola3, Martin Burke4, Blair Archibald8, Paul Bessell9, Richard Blackwell10, Lisa A Boden9, Alys Brett3, Sam Brett, Ruth Dundas5, Jessica Enright2,8, Alejandra N Gonzalez-Beltran7, Claire Harris2,4, Ian Hinder11, Christopher David Hughes10, Martin Knight4, Vino Mano10, Ciaran McMonagle2,5, Dominic Mellor2,12, Sibylle Mohr1,2, Glenn Marion2,4, Louise Matthews1,2, Iain J McKendrick2,4, Christopher Mark Pooley4, Thibaud Porphyre13, Aaron Reeves14, Edward Townsend, Robert Turner6, Jeremy Walton15, Richard Reeve1,2.   

Abstract

Modern epidemiological analyses to understand and combat the spread of disease depend critically on access to, and use of, data. Rapidly evolving data, such as data streams changing during a disease outbreak, are particularly challenging. Data management is further complicated by data being imprecisely identified when used. Public trust in policy decisions resulting from such analyses is easily damaged and is often low, with cynicism arising where claims of 'following the science' are made without accompanying evidence. Tracing the provenance of such decisions back through open software to primary data would clarify this evidence, enhancing the transparency of the decision-making process. Here, we demonstrate a Findable, Accessible, Interoperable and Reusable (FAIR) data pipeline. Although developed during the COVID-19 pandemic, it allows easy annotation of any data as they are consumed by analyses, or conversely traces the provenance of scientific outputs back through the analytical or modelling source code to primary data. Such a tool provides a mechanism for the public, and fellow scientists, to better assess scientific evidence by inspecting its provenance, while allowing scientists to support policymakers in openly justifying their decisions. We believe that such tools should be promoted for use across all areas of policy-facing research. This article is part of the theme issue 'Technical challenges of modelling real-life epidemics and examples of overcoming these'.

Entities:  

Keywords:  COVID-19; FAIR; data management; epidemiology; modelling; provenance

Mesh:

Year:  2022        PMID: 35965468      PMCID: PMC9376726          DOI: 10.1098/rsta.2021.0300

Source DB:  PubMed          Journal:  Philos Trans A Math Phys Eng Sci        ISSN: 1364-503X            Impact factor:   4.019


  13 in total

Review 1.  Models of foot-and-mouth disease.

Authors:  Matt J Keeling
Journal:  Proc Biol Sci       Date:  2005-06-22       Impact factor: 5.349

2.  The SEIRS model for infectious disease dynamics.

Authors:  Ottar N Bjørnstad; Katriona Shea; Martin Krzywinski; Naomi Altman
Journal:  Nat Methods       Date:  2020-06       Impact factor: 28.547

3.  FAIRDOMHub: a repository and collaboration environment for sharing systems biology research.

Authors:  Katherine Wolstencroft; Olga Krebs; Jacky L Snoep; Natalie J Stanford; Finn Bacall; Martin Golebiewski; Rostyk Kuzyakiv; Quyen Nguyen; Stuart Owen; Stian Soiland-Reyes; Jakub Straszewski; David D van Niekerk; Alan R Williams; Lars Malmström; Bernd Rinn; Wolfgang Müller; Carole Goble
Journal:  Nucleic Acids Res       Date:  2016-11-28       Impact factor: 16.971

4.  The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update.

Authors:  Enis Afgan; Dannon Baker; Bérénice Batut; Marius van den Beek; Dave Bouvier; Martin Cech; John Chilton; Dave Clements; Nate Coraor; Björn A Grüning; Aysam Guerler; Jennifer Hillman-Jackson; Saskia Hiltemann; Vahid Jalili; Helena Rasche; Nicola Soranzo; Jeremy Goecks; James Taylor; Anton Nekrutenko; Daniel Blankenberg
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

5.  Association of tiered restrictions and a second lockdown with COVID-19 deaths and hospital admissions in England: a modelling study.

Authors:  Nicholas G Davies; Rosanna C Barnard; Christopher I Jarvis; Timothy W Russell; Malcolm G Semple; Mark Jit; W John Edmunds
Journal:  Lancet Infect Dis       Date:  2020-12-24       Impact factor: 25.071

6.  Ten simple rules for making a vocabulary FAIR.

Authors:  Simon J D Cox; Alejandra N Gonzalez-Beltran; Barbara Magagna; Maria-Cristina Marinescu
Journal:  PLoS Comput Biol       Date:  2021-06-16       Impact factor: 4.475

7.  The FAIR Guiding Principles for scientific data management and stewardship.

Authors:  Mark D Wilkinson; Michel Dumontier; I Jsbrand Jan Aalbersberg; Gabrielle Appleton; Myles Axton; Arie Baak; Niklas Blomberg; Jan-Willem Boiten; Luiz Bonino da Silva Santos; Philip E Bourne; Jildau Bouwman; Anthony J Brookes; Tim Clark; Mercè Crosas; Ingrid Dillo; Olivier Dumon; Scott Edmunds; Chris T Evelo; Richard Finkers; Alejandra Gonzalez-Beltran; Alasdair J G Gray; Paul Groth; Carole Goble; Jeffrey S Grethe; Jaap Heringa; Peter A C 't Hoen; Rob Hooft; Tobias Kuhn; Ruben Kok; Joost Kok; Scott J Lusher; Maryann E Martone; Albert Mons; Abel L Packer; Bengt Persson; Philippe Rocca-Serra; Marco Roos; Rene van Schaik; Susanna-Assunta Sansone; Erik Schultes; Thierry Sengstag; Ted Slater; George Strawn; Morris A Swertz; Mark Thompson; Johan van der Lei; Erik van Mulligen; Jan Velterop; Andra Waagmeester; Peter Wittenburg; Katherine Wolstencroft; Jun Zhao; Barend Mons
Journal:  Sci Data       Date:  2016-03-15       Impact factor: 6.444

8.  Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy.

Authors:  Giulia Giordano; Franco Blanchini; Raffaele Bruno; Patrizio Colaneri; Alessandro Di Filippo; Angela Di Matteo; Marta Colaneri
Journal:  Nat Med       Date:  2020-04-22       Impact factor: 87.241

9.  Early dynamics of transmission and control of COVID-19: a mathematical modelling study.

Authors:  Adam J Kucharski; Timothy W Russell; Charlie Diamond; Yang Liu; John Edmunds; Sebastian Funk; Rosalind M Eggo
Journal:  Lancet Infect Dis       Date:  2020-03-11       Impact factor: 25.071

10.  Factors associated with COVID-19-related death using OpenSAFELY.

Authors:  Elizabeth J Williamson; Alex J Walker; Krishnan Bhaskaran; Seb Bacon; Chris Bates; Caroline E Morton; Helen J Curtis; Amir Mehrkar; David Evans; Peter Inglesby; Jonathan Cockburn; Helen I McDonald; Brian MacKenna; Laurie Tomlinson; Ian J Douglas; Christopher T Rentsch; Rohini Mathur; Angel Y S Wong; Richard Grieve; David Harrison; Harriet Forbes; Anna Schultze; Richard Croker; John Parry; Frank Hester; Sam Harper; Rafael Perera; Stephen J W Evans; Liam Smeeth; Ben Goldacre
Journal:  Nature       Date:  2020-07-08       Impact factor: 49.962

View more
  1 in total

1.  Visualization for epidemiological modelling: challenges, solutions, reflections and recommendations.

Authors:  Jason Dykes; Alfie Abdul-Rahman; Daniel Archambault; Benjamin Bach; Rita Borgo; Min Chen; Jessica Enright; Hui Fang; Elif E Firat; Euan Freeman; Tuna Gönen; Claire Harris; Radu Jianu; Nigel W John; Saiful Khan; Andrew Lahiff; Robert S Laramee; Louise Matthews; Sibylle Mohr; Phong H Nguyen; Alma A M Rahat; Richard Reeve; Panagiotis D Ritsos; Jonathan C Roberts; Aidan Slingsby; Ben Swallow; Thomas Torsney-Weir; Cagatay Turkay; Robert Turner; Franck P Vidal; Qiru Wang; Jo Wood; Kai Xu
Journal:  Philos Trans A Math Phys Eng Sci       Date:  2022-08-15       Impact factor: 4.019

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.