Literature DB >> 29852316

Complex overlapping concepts: An effective auditing methodology for families of similarly structured BioPortal ontologies.

Ling Zheng1, Yan Chen2, Gai Elhanan3, Yehoshua Perl4, James Geller4, Christopher Ochs5.   

Abstract

In previous research, we have demonstrated for a number of ontologies that structurally complex concepts (for different definitions of "complex") in an ontology are more likely to exhibit errors than other concepts. Thus, such complex concepts often become fertile ground for quality assurance (QA) in ontologies. They should be audited first. One example of complex concepts is given by "overlapping concepts" (to be defined below.) Historically, a different auditing methodology had to be developed for every single ontology. For better scalability and efficiency, it is desirable to identify family-wide QA methodologies. Each such methodology would be applicable to a whole family of similar ontologies. In past research, we had divided the 685 ontologies of BioPortal into families of structurally similar ontologies. We showed for four ontologies of the same large family in BioPortal that "overlapping concepts" are indeed statistically significantly more likely to exhibit errors. In order to make an authoritative statement concerning the success of "overlapping concepts" as a methodology for a whole family of similar ontologies (or of large subhierarchies of ontologies), it is necessary to show that "overlapping concepts" have a higher likelihood of errors for six out of six ontologies of the family. In this paper, we are demonstrating for two more ontologies that "overlapping concepts" can successfully predict groups of concepts with a higher error rate than concepts from a control group. The fifth ontology is the Neoplasm subhierarchy of the National Cancer Institute thesaurus (NCIt). The sixth ontology is the Infectious Disease subhierarchy of SNOMED CT. We demonstrate quality assurance results for both of them. Furthermore, in this paper we observe two novel, important, and useful phenomena during quality assurance of "overlapping concepts." First, an erroneous "overlapping concept" can help with discovering other erroneous "non-overlapping concepts" in its vicinity. Secondly, correcting erroneous "overlapping concepts" may turn them into "non-overlapping concepts." We demonstrate that this may reduce the complexity of parts of the ontology, which in turn makes the ontology more comprehensible, simplifying maintenance and use of the ontology.
Copyright © 2018 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Abstraction network; Family-based ontology quality assurance; National Cancer Institute thesaurus; Ontology auditing; Ontology quality assurance; SNOMED CT

Mesh:

Year:  2018        PMID: 29852316      PMCID: PMC6609446          DOI: 10.1016/j.jbi.2018.05.015

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  44 in total

1.  Structural methodologies for auditing SNOMED.

Authors:  Yue Wang; Michael Halper; Hua Min; Yehoshua Perl; Yan Chen; Kent A Spackman
Journal:  J Biomed Inform       Date:  2006-12-24       Impact factor: 6.317

2.  Special issue on auditing of terminologies.

Authors:  J Geller; Y Perl; M Halper; R Cornet
Journal:  J Biomed Inform       Date:  2009-06       Impact factor: 6.317

3.  Auditing the semantic completeness of SNOMED CT using formal concept analysis.

Authors:  Guoqian Jiang; Christopher G Chute
Journal:  J Am Med Inform Assoc       Date:  2008-10-24       Impact factor: 4.497

4.  Using the UMLS Semantic Network to validate NCI Thesaurus structure and analyze its alignment with the OBO relations ontology.

Authors:  Sherri de Coronado; Mark S Tuttle; Harold R Solbrig
Journal:  AMIA Annu Symp Proc       Date:  2007-10-11

5.  Formal ontologies in biomedical knowledge representation.

Authors:  S Schulz; L Jansen
Journal:  Yearb Med Inform       Date:  2013

6.  Scalable quality assurance for large SNOMED CT hierarchies using subject-based subtaxonomies.

Authors:  Christopher Ochs; James Geller; Yehoshua Perl; Yan Chen; Junchuan Xu; Hua Min; James T Case; Zhi Wei
Journal:  J Am Med Inform Assoc       Date:  2014-10-21       Impact factor: 4.497

7.  Harmonization and semantic annotation of data dictionaries from the Pharmacogenomics Research Network: a case study.

Authors:  Qian Zhu; Robert R Freimuth; Zonghui Lian; Scott Bauer; Jyotishman Pathak; Cui Tao; Matthew J Durski; Christopher G Chute
Journal:  J Biomed Inform       Date:  2012-11-29       Impact factor: 6.317

8.  A unified software framework for deriving, visualizing, and exploring abstraction networks for ontologies.

Authors:  Christopher Ochs; James Geller; Yehoshua Perl; Mark A Musen
Journal:  J Biomed Inform       Date:  2016-06-23       Impact factor: 6.317

9.  NCI Thesaurus: using science-based terminology to integrate cancer research results.

Authors:  Sherri de Coronado; Margaret W Haber; Nicholas Sioutos; Mark S Tuttle; Lawrence W Wright
Journal:  Stud Health Technol Inform       Date:  2004

10.  Knowledge Discovery from Biomedical Ontologies in Cross Domains.

Authors:  Feichen Shen; Yugyung Lee
Journal:  PLoS One       Date:  2016-08-22       Impact factor: 3.240

View more
  1 in total

1.  Outlier concepts auditing methodology for a large family of biomedical ontologies.

Authors:  Ling Zheng; Hua Min; Yan Chen; Vipina Keloth; James Geller; Yehoshua Perl; George Hripcsak
Journal:  BMC Med Inform Decis Mak       Date:  2020-12-15       Impact factor: 2.796

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.