Literature DB >> 25621321

Provenance Context Entity (PaCE): Scalable Provenance Tracking for Scientific RDF Data.

Satya S Sahoo1, Olivier Bodenreider2, Pascal Hitzler, Amit Sheth, Krishnaprasad Thirunarayan.   

Abstract

The Resource Description Framework (RDF) format is being used by a large number of scientific applications to store and disseminate their datasets. The provenance information, describing the source or lineage of the datasets, is playing an increasingly significant role in ensuring data quality, computing trust value of the datasets, and ranking query results. Current provenance tracking approaches using the RDF reification vocabulary suffer from a number of known issues, including lack of formal semantics, use of blank nodes, and application-dependent interpretation of reified RDF triples. In this paper, we introduce a new approach called Provenance Context Entity (PaCE) that uses the notion of provenance context to create provenance-aware RDF triples. We also define the formal semantics of PaCE through a simple extension of the existing RDF(S) semantics that ensures compatibility of PaCE with existing Semantic Web tools and implementations. We have implemented the PaCE approach in the Biomedical Knowledge Repository (BKR) project at the US National Library of Medicine. The evaluations demonstrate a minimum of 49% reduction in total number of provenance-specific RDF triples generated using the PaCE approach as compared to RDF reification. In addition, performance for complex queries improves by three orders of magnitude and remains comparable to the RDF reification approach for simpler provenance queries.

Entities:  

Keywords:  Biomedical knowledge repository; Context theory; Provenance context entity; Provenir ontology; RDF reification

Year:  2010        PMID: 25621321      PMCID: PMC4303908          DOI: 10.1007/978-3-642-13818-8_32

Source DB:  PubMed          Journal:  Sci Stat Database Manag


  3 in total

1.  The KEGG resource for deciphering the genome.

Authors:  Minoru Kanehisa; Susumu Goto; Shuichi Kawashima; Yasushi Okuno; Masahiro Hattori
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  The Unified Medical Language System (UMLS): integrating biomedical terminology.

Authors:  Olivier Bodenreider
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  Relations in biomedical ontologies.

Authors:  Barry Smith; Werner Ceusters; Bert Klagges; Jacob Köhler; Anand Kumar; Jane Lomax; Chris Mungall; Fabian Neuhaus; Alan L Rector; Cornelius Rosse
Journal:  Genome Biol       Date:  2005-04-28       Impact factor: 13.583

  3 in total
  5 in total

1.  The Translational Medicine Ontology and Knowledge Base: driving personalized medicine by bridging the gap between bench and bedside.

Authors:  Joanne S Luciano; Bosse Andersson; Colin Batchelor; Olivier Bodenreider; Tim Clark; Christine K Denney; Christopher Domarew; Thomas Gambet; Lee Harland; Anja Jentzsch; Vipul Kashyap; Peter Kos; Julia Kozlovsky; Timothy Lebo; Scott M Marshall; Jamie P. McCusker; Deborah L McGuinness; Chimezie Ogbuji; Elgar Pichler; Robert L Powers; Eric Prud'hommeaux; Matthias Samwald; Lynn Schriml; Peter J Tonellato; Patricia L Whetzel; Jun Zhao; Susie Stephens; Michel Dumontier
Journal:  J Biomed Semantics       Date:  2011-05-17

2.  Don't Like RDF Reification? Making Statements about Statements Using Singleton Property.

Authors:  Vinh Nguyen; Olivier Bodenreider; Amit Sheth
Journal:  Proc Int World Wide Web Conf       Date:  2014-04-11

3.  Insight: Semantic Provenance and Analysis Platform for Multi-center Neurology Healthcare Research.

Authors:  Priya Ramesh; Annan Wei; Elisabeth Welter; Yvan Bamps; Shelley Stoll; Ashley Bukach; Martha Sajatovic; Satya S Sahoo
Journal:  Proceedings (IEEE Int Conf Bioinformatics Biomed)       Date:  2015-11

4.  A unified framework for managing provenance information in translational research.

Authors:  Satya S Sahoo; Vinh Nguyen; Olivier Bodenreider; Priti Parikh; Todd Minning; Amit P Sheth
Journal:  BMC Bioinformatics       Date:  2011-11-29       Impact factor: 3.169

5.  Representing annotation compositionality and provenance for the Semantic Web.

Authors:  Kevin M Livingston; Michael Bada; Lawrence E Hunter; Karin Verspoor
Journal:  J Biomed Semantics       Date:  2013-11-22
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.