Literature DB >> 33407816

An ontology-based approach for developing a harmonised data-validation tool for European cancer registration.

Nicholas Charles Nicholson1, Francesco Giusti2, Manola Bettio2, Raquel Negrao Carvalho2, Nadya Dimitrova2, Tadeusz Dyba2, Manuela Flego2, Luciana Neamtiu2, Giorgia Randi2, Carmen Martos2.   

Abstract

BACKGROUND: Population-based cancer registries constitute an important information source in cancer epidemiology. Studies collating and comparing data across regional and national boundaries have proved important for deploying and evaluating effective cancer-control strategies. A critical aspect in correctly comparing cancer indicators across regional and national boundaries lies in ensuring a good and harmonised level of data quality, which is a primary motivator for a centralised collection of pseudonymised data. The recent introduction of the European Union's general data-protection regulation (GDPR) imposes stricter conditions on the collection, processing, and sharing of personal data. It also considers pseudonymised data as personal data. The new regulation motivates the need to find solutions that allow a continuation of the smooth processes leading to harmonised European cancer-registry data. One element in this regard would be the availability of a data-validation software tool based on a formalised depiction of the harmonised data-validation rules, allowing an eventual devolution of the data-validation process to the local level.
RESULTS: A semantic data model was derived from the data-validation rules for harmonising cancer-data variables at European level. The data model was encapsulated in an ontology developed using the Web-Ontology Language (OWL) with the data-model entities forming the main OWL classes. The data-validation rules were added as axioms in the ontology. The reasoning function of the resulting ontology demonstrated its ability to trap registry-coding errors and in some instances to be able to correct errors.
CONCLUSIONS: Describing the European cancer-registry core data set in terms of an OWL ontology affords a tool based on a formalised set of axioms for validating a cancer-registry's data set according to harmonised, supra-national rules. The fact that the data checks are inherently linked to the data model would lead to less maintenance overheads and also allow automatic versioning synchronisation, important for distributed data-quality checking processes.

Entities:  

Keywords:  Cancer registry; Data federation; Data harmonisation; Data validation; Ontology; Semantic web

Year:  2021        PMID: 33407816      PMCID: PMC7789225          DOI: 10.1186/s13326-020-00233-x

Source DB:  PubMed          Journal:  J Biomed Semantics


  8 in total

1.  Toward semantic interoperability of electronic health records.

Authors:  Idoia Berges; Jesús Bermúdez; Arantza Illarramendi
Journal:  IEEE Trans Inf Technol Biomed       Date:  2011-12-30

2.  A federated semantic metadata registry framework for enabling interoperability across clinical research and care domains.

Authors:  A Anil Sinaci; Gokce B Laleci Erturkmen
Journal:  J Biomed Inform       Date:  2013-06-07       Impact factor: 6.317

3.  The cancer registry in cancer control: an overview.

Authors:  C S Muir; E Démaret; P Boyle
Journal:  IARC Sci Publ       Date:  1985

4.  The Protégé Project: A Look Back and a Look Forward.

Authors:  Mark A Musen
Journal:  AI Matters       Date:  2015-06

5.  Federated ontology-based queries over cancer data.

Authors:  Alejandra González-Beltrán; Ben Tagger; Anthony Finkelstein
Journal:  BMC Bioinformatics       Date:  2012-01-25       Impact factor: 3.169

6.  Building a model for disease classification integration in oncology, an approach based on the national cancer institute thesaurus.

Authors:  Vianney Jouhet; Fleur Mougin; Bérénice Bréchat; Frantz Thiessard
Journal:  J Biomed Semantics       Date:  2017-02-07

7.  The FAIR Guiding Principles for scientific data management and stewardship.

Authors:  Mark D Wilkinson; Michel Dumontier; I Jsbrand Jan Aalbersberg; Gabrielle Appleton; Myles Axton; Arie Baak; Niklas Blomberg; Jan-Willem Boiten; Luiz Bonino da Silva Santos; Philip E Bourne; Jildau Bouwman; Anthony J Brookes; Tim Clark; Mercè Crosas; Ingrid Dillo; Olivier Dumon; Scott Edmunds; Chris T Evelo; Richard Finkers; Alejandra Gonzalez-Beltran; Alasdair J G Gray; Paul Groth; Carole Goble; Jeffrey S Grethe; Jaap Heringa; Peter A C 't Hoen; Rob Hooft; Tobias Kuhn; Ruben Kok; Joost Kok; Scott J Lusher; Maryann E Martone; Albert Mons; Abel L Packer; Bengt Persson; Philippe Rocca-Serra; Marco Roos; Rene van Schaik; Susanna-Assunta Sansone; Erik Schultes; Thierry Sengstag; Ted Slater; George Strawn; Morris A Swertz; Mark Thompson; Johan van der Lei; Erik van Mulligen; Jan Velterop; Andra Waagmeester; Peter Wittenburg; Katherine Wolstencroft; Jun Zhao; Barend Mons
Journal:  Sci Data       Date:  2016-03-15       Impact factor: 6.444

8.  Analysis and visualization of disease courses in a semantically-enabled cancer registry.

Authors:  Angel Esteban-Gil; Jesualdo Tomás Fernández-Breis; Martin Boeker
Journal:  J Biomed Semantics       Date:  2017-09-29
  8 in total
  2 in total

Review 1.  Ontologies and Knowledge Graphs in Oncology Research.

Authors:  Marta Contreiras Silva; Patrícia Eugénio; Daniel Faria; Catia Pesquita
Journal:  Cancers (Basel)       Date:  2022-04-10       Impact factor: 6.575

2.  A multipurpose TNM stage ontology for cancer registries.

Authors:  Nicholas Charles Nicholson; Francesco Giusti; Manola Bettio; Raquel Negrao Carvalho; Nadya Dimitrova; Tadeusz Dyba; Manuela Flego; Luciana Neamtiu; Giorgia Randi; Carmen Martos
Journal:  J Biomed Semantics       Date:  2022-02-22
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.