Literature DB >> 27433158

Publishing FAIR Data: An Exemplar Methodology Utilizing PHI-Base.

Alejandro Rodríguez-Iglesias1, Alejandro Rodríguez-González2, Alistair G Irvine3, Ane Sesma1, Martin Urban4, Kim E Hammond-Kosack4, Mark D Wilkinson1.   

Abstract

Pathogen-Host interaction data is core to our understanding of disease processes and their molecular/genetic bases. Facile access to such core data is particularly important for the plant sciences, where individual genetic and phenotypic observations have the added complexity of being dispersed over a wide diversity of plant species vs. the relatively fewer host species of interest to biomedical researchers. Recently, an international initiative interested in scholarly data publishing proposed that all scientific data should be "FAIR"-Findable, Accessible, Interoperable, and Reusable. In this work, we describe the process of migrating a database of notable relevance to the plant sciences-the Pathogen-Host Interaction Database (PHI-base)-to a form that conforms to each of the FAIR Principles. We discuss the technical and architectural decisions, and the migration pathway, including observations of the difficulty and/or fidelity of each step. We examine how multiple FAIR principles can be addressed simultaneously through careful design decisions, including making data FAIR for both humans and machines with minimal duplication of effort. We note how FAIR data publishing involves more than data reformatting, requiring features beyond those exhibited by most life science Semantic Web or Linked Data resources. We explore the value-added by completing this FAIR data transformation, and then test the result through integrative questions that could not easily be asked over traditional Web-based data resources. Finally, we demonstrate the utility of providing explicit and reliable access to provenance information, which we argue enhances citation rates by encouraging and facilitating transparent scholarly reuse of these valuable data holdings.

Entities:  

Keywords:  FAIR data; Linked Data; PHI-base; Pathogen-Host Interactions; SPARQL; Semantic PHI-base; Semantic Web; data integration

Year:  2016        PMID: 27433158      PMCID: PMC4922217          DOI: 10.3389/fpls.2016.00641

Source DB:  PubMed          Journal:  Front Plant Sci        ISSN: 1664-462X            Impact factor:   5.753


  15 in total

1.  RNAcentral: A vision for an international database of RNA sequences.

Authors:  Alex Bateman; Shipra Agrawal; Ewan Birney; Elspeth A Bruford; Janusz M Bujnicki; Guy Cochrane; James R Cole; Marcel E Dinger; Anton J Enright; Paul P Gardner; Daniel Gautheret; Sam Griffiths-Jones; Jen Harrow; Javier Herrero; Ian H Holmes; Hsien-Da Huang; Krystyna A Kelly; Paul Kersey; Ana Kozomara; Todd M Lowe; Manja Marz; Simon Moxon; Kim D Pruitt; Tore Samuelsson; Peter F Stadler; Albert J Vilella; Jan-Hinnerk Vogel; Kelly P Williams; Mathew W Wright; Christian Zwieb
Journal:  RNA       Date:  2011-09-22       Impact factor: 4.942

2.  Modeling sample variables with an Experimental Factor Ontology.

Authors:  James Malone; Ele Holloway; Tomasz Adamusiak; Misha Kapushesky; Jie Zheng; Nikolay Kolesnikov; Anna Zhukova; Alvis Brazma; Helen Parkinson
Journal:  Bioinformatics       Date:  2010-03-03       Impact factor: 6.937

3.  The Ontology Lookup Service: bigger and better.

Authors:  Richard Côté; Florian Reisinger; Lennart Martens; Harald Barsnes; Juan Antonio Vizcaino; Henning Hermjakob
Journal:  Nucleic Acids Res       Date:  2010-05-11       Impact factor: 16.971

4.  Modeling biomedical experimental processes with OBI.

Authors:  Ryan R Brinkman; Mélanie Courtot; Dirk Derom; Jennifer M Fostel; Yongqun He; Phillip Lord; James Malone; Helen Parkinson; Bjoern Peters; Philippe Rocca-Serra; Alan Ruttenberg; Susanna-Assunta Sansone; Larisa N Soldatova; Christian J Stoeckert; Jessica A Turner; Jie Zheng
Journal:  J Biomed Semantics       Date:  2010-06-22

5.  BioBenchmark Toyama 2012: an evaluation of the performance of triple stores on biological data.

Authors:  Hongyan Wu; Toyofumi Fujiwara; Yasunori Yamamoto; Jerven Bolleman; Atsuko Yamaguchi
Journal:  J Biomed Semantics       Date:  2014-07-10

6.  BioPortal: enhanced functionality via new Web services from the National Center for Biomedical Ontology to access and use ontologies in software applications.

Authors:  Patricia L Whetzel; Natalya F Noy; Nigam H Shah; Paul R Alexander; Csongor Nyulas; Tania Tudorache; Mark A Musen
Journal:  Nucleic Acids Res       Date:  2011-06-14       Impact factor: 16.971

7.  The Gene Ontology (GO) project in 2006.

Authors: 
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

8.  The Semanticscience Integrated Ontology (SIO) for biomedical research and knowledge discovery.

Authors:  Michel Dumontier; Christopher Jo Baker; Joachim Baran; Alison Callahan; Leonid Chepelev; José Cruz-Toledo; Nicholas R Del Rio; Geraint Duck; Laura I Furlong; Nichealla Keath; Dana Klassen; Jamie P. McCusker; Núria Queralt-Rosinach; Matthias Samwald; Natalia Villanueva-Rosales; Mark D Wilkinson; Robert Hoehndorf
Journal:  J Biomed Semantics       Date:  2014-03-06

9.  Araport: the Arabidopsis information portal.

Authors:  Vivek Krishnakumar; Matthew R Hanlon; Sergio Contrino; Erik S Ferlanti; Svetlana Karamycheva; Maria Kim; Benjamin D Rosen; Chia-Yi Cheng; Walter Moreira; Stephen A Mock; Joseph Stubbs; Julie M Sullivan; Konstantinos Krampis; Jason R Miller; Gos Micklem; Matthew Vaughn; Christopher D Town
Journal:  Nucleic Acids Res       Date:  2014-11-20       Impact factor: 16.971

10.  EDAM: an ontology of bioinformatics operations, types of data and identifiers, topics and formats.

Authors:  Jon Ison; Matús Kalas; Inge Jonassen; Dan Bolser; Mahmut Uludag; Hamish McWilliam; James Malone; Rodrigo Lopez; Steve Pettifer; Peter Rice
Journal:  Bioinformatics       Date:  2013-03-11       Impact factor: 6.937

View more
  6 in total

1.  The health care and life sciences community profile for dataset descriptions.

Authors:  Michel Dumontier; Alasdair J G Gray; M Scott Marshall; Vladimir Alexiev; Peter Ansell; Gary Bader; Joachim Baran; Jerven T Bolleman; Alison Callahan; José Cruz-Toledo; Pascale Gaudet; Erich A Gombocz; Alejandra N Gonzalez-Beltran; Paul Groth; Melissa Haendel; Maori Ito; Simon Jupp; Nick Juty; Toshiaki Katayama; Norio Kobayashi; Kalpana Krishnaswami; Camille Laibe; Nicolas Le Novère; Simon Lin; James Malone; Michael Miller; Christopher J Mungall; Laurens Rietveld; Sarala M Wimalaratne; Atsuko Yamaguchi
Journal:  PeerJ       Date:  2016-08-16       Impact factor: 2.984

2.  PHI-base: a new interface and further additions for the multi-species pathogen-host interactions database.

Authors:  Martin Urban; Alayne Cuzick; Kim Rutherford; Alistair Irvine; Helder Pedro; Rashmi Pant; Vidyendra Sadanadan; Lokanath Khamari; Santoshkumar Billal; Sagar Mohanty; Kim E Hammond-Kosack
Journal:  Nucleic Acids Res       Date:  2016-12-03       Impact factor: 16.971

3.  SCALEUS-FD: A FAIR Data Tool for Biomedical Applications.

Authors:  Arnaldo Pereira; Rui Pedro Lopes; José Luís Oliveira
Journal:  Biomed Res Int       Date:  2020-08-26       Impact factor: 3.411

Review 4.  It's Hard to Avoid Avoidance: Uncoupling the Evolutionary Connection between Plant Growth, Productivity and Stress "Tolerance".

Authors:  Albino Maggio; Ray A Bressan; Yang Zhao; Junghoon Park; Dae-Jin Yun
Journal:  Int J Mol Sci       Date:  2018-11-20       Impact factor: 5.923

5.  BioHackathon 2015: Semantics of data for life sciences and reproducible research.

Authors:  Rutger A Vos; Toshiaki Katayama; Hiroyuki Mishima; Shin Kawano; Shuichi Kawashima; Jin-Dong Kim; Yuki Moriya; Toshiaki Tokimatsu; Atsuko Yamaguchi; Yasunori Yamamoto; Hongyan Wu; Peter Amstutz; Erick Antezana; Nobuyuki P Aoki; Kazuharu Arakawa; Jerven T Bolleman; Evan Bolton; Raoul J P Bonnal; Hidemasa Bono; Kees Burger; Hirokazu Chiba; Kevin B Cohen; Eric W Deutsch; Jesualdo T Fernández-Breis; Gang Fu; Takatomo Fujisawa; Atsushi Fukushima; Alexander García; Naohisa Goto; Tudor Groza; Colin Hercus; Robert Hoehndorf; Kotone Itaya; Nick Juty; Takeshi Kawashima; Jee-Hyub Kim; Akira R Kinjo; Masaaki Kotera; Kouji Kozaki; Sadahiro Kumagai; Tatsuya Kushida; Thomas Lütteke; Masaaki Matsubara; Joe Miyamoto; Attayeb Mohsen; Hiroshi Mori; Yuki Naito; Takeru Nakazato; Jeremy Nguyen-Xuan; Kozo Nishida; Naoki Nishida; Hiroyo Nishide; Soichi Ogishima; Tazro Ohta; Shujiro Okuda; Benedict Paten; Jean-Luc Perret; Philip Prathipati; Pjotr Prins; Núria Queralt-Rosinach; Daisuke Shinmachi; Shinya Suzuki; Tsuyosi Tabata; Terue Takatsuki; Kieron Taylor; Mark Thompson; Ikuo Uchiyama; Bruno Vieira; Chih-Hsuan Wei; Mark Wilkinson; Issaku Yamada; Ryota Yamanaka; Kazutoshi Yoshitake; Akiyasu C Yoshizawa; Michel Dumontier; Kenjiro Kosaki; Toshihisa Takagi
Journal:  F1000Res       Date:  2020-02-24

6.  medna-metadata: an open-source data management system for tracking environmental DNA samples and metadata.

Authors:  M Kimble; S Allers; K Campbell; C Chen; L M Jackson; B L King; S Silverbrand; G York; K Beard
Journal:  Bioinformatics       Date:  2022-08-12       Impact factor: 6.931

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.